Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*The author of this computation has been verified*

R Software Module

rwasp_bidensity.wasp

Title produced by software

Bivariate Kernel Density Estimation

Date of computation

Wed, 12 Nov 2008 12:22:46 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/12/t1226517889eblf0q1acqcscou.htm/, Retrieved Sat, 18 May 2024 18:53:49 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=24405, Retrieved Sat, 18 May 2024 18:53:49 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Estimated Impact

150

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

-     [Bivariate Kernel Density Estimation] [task 8.1.1] [2008-11-11 17:07:52] [1eab65e90adf64584b8e6f0da23ff414]
F         [Bivariate Kernel Density Estimation] [task 8.1 biv density] [2008-11-12 19:22:46] [0458bd763b171003ec052ce63099d477] [Current]

Feedback Forum

2008-11-23 14:33:53 [Nathalie Daneels] [reply] 
Evaluatie opdracht 3 - Blok 8 (Q1): 
 
De conclusie zou aangevuld moeten worden met het volgende: 
 
De student vermeldde dat de correlatie tussen de variabelen x en y positief is en afgerond 0,68 bedraagt. Er is dus sprake van een positieve correlatie, dit kunnen we ook op de grafiek zien. De hoogtelijnen op de grafiek vormen ellipsen, die wijzen op een verband tussen de variabelen. Als de hoogtelijnen cirkels moesten vormen, dan zouden we niet kunnen spreken van een verband. Aangezien de hoogtelijnen van links onder naar rechts boven gaan, is er sprake van een positief verband. Dit wordt dan weer bevestigd door de tabel. We moeten eveneens vermelden dat de hoogtelijnen niet rechtstreeks iets te maken hebben met de derde dimensie, maar wel met de dichtheid/de concentratie van de punten: Ze hebben te maken met de waarschijnlijkheid dat een bepaald verband tussen variabelen zich bevindt waar de hoogtelijnen de hoogste waarde aannemen.  
De bivariate density wordt gevormd aan de hand van de puntenwolk van de scatterplot. Als er op deze scatterplot op bepaalde plaatsen heel veel punten (samen/in groep) bevinden, dan gaat dit duidelijk weerspiegeld worden in de bivariate density: Dat is de plaats waarop de hoogtelijnen de hoogste waarde hebben. Op die plaats is de dichtheid/de concentratie van de variabelen het grootste. Dit is dan ook de uitleg over wat de hoogtelijnen precies weergeven: Zij geven de dichtheid/de concentratie van de punten weer. De hoogste concentratie op de grafiek wordt aangeduid met een witte kleur en deze hoogste concentratie bevindt zich net onder de rechte. Deze recht geeft het gemiddelde van de gegevens weer/ dat is de 'beste' rechte die men door de puntenwolk van de scatterplot kan tekenen.  
We kunnen eveneens vaststellen uit de grafiek dat er zich geen clusters hebben gevormd: dit zou het geval zijn als we verschillende groepen konden waarnemen. Er zouden dus op verschillende plaatsen in de grafiek een hoge waarde van de hoogtelijnen waar te nemen zijn. 
2008-11-24 12:31:15 [Anouk Greeve] [reply] 
aanvullend: In de bivariate Kernel density plot worden er dimensies gesuggereerd door de hoogtelijnen.  
de hoogtelijnen hebben te maken met de dichtheid van de punten. wanneer deze hoog is, is de waarschijnlijkheid in het midden hoger dan die van de buitenrand. 

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

Dataseries Y:

Download CSV

Histogram

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	3 seconds
R Server	'George Udny Yule' @ 72.249.76.132

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 3 seconds \tabularnewline
R Server & 'George Udny Yule' @ 72.249.76.132 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24405&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]3 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'George Udny Yule' @ 72.249.76.132[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24405&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24405&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	3 seconds
R Server	'George Udny Yule' @ 72.249.76.132

Bandwidth
x axis	2.99006311411283
y axis	8.57073381946338
Correlation
correlation used in KDE	0.684629200251958
correlation(x,y)	0.684629200251958

\begin{tabular}{lllllllll}
\hline
Bandwidth \tabularnewline
x axis & 2.99006311411283 \tabularnewline
y axis & 8.57073381946338 \tabularnewline
Correlation \tabularnewline
correlation used in KDE & 0.684629200251958 \tabularnewline
correlation(x,y) & 0.684629200251958 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24405&T=1

[TABLE]
[ROW][C]Bandwidth[/C][/ROW]
[ROW][C]x axis[/C][C]2.99006311411283[/C][/ROW]
[ROW][C]y axis[/C][C]8.57073381946338[/C][/ROW]
[ROW][C]Correlation[/C][/ROW]
[ROW][C]correlation used in KDE[/C][C]0.684629200251958[/C][/ROW]
[ROW][C]correlation(x,y)[/C][C]0.684629200251958[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24405&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24405&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Bandwidth
x axis	2.99006311411283
y axis	8.57073381946338
Correlation
correlation used in KDE	0.684629200251958
correlation(x,y)	0.684629200251958

Figure 1

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = 50 ; par2 = 50 ; par3 = 0 ; par4 = 0 ; par5 = 0 ; par6 = Y ; par7 = Y ;

Parameters (R input):

par1 = 50 ; par2 = 50 ; par3 = 0 ; par4 = 0 ; par5 = 0 ; par6 = Y ; par7 = Y ;

R code (references can be found in the software module):

par1 <- as(par1,'numeric')
par2 <- as(par2,'numeric')
par3 <- as(par3,'numeric')
par4 <- as(par4,'numeric')
par5 <- as(par5,'numeric')
library('GenKern')
if (par3==0) par3 <- dpik(x)
if (par4==0) par4 <- dpik(y)
if (par5==0) par5 <- cor(x,y)
if (par1 > 500) par1 <- 500
if (par2 > 500) par2 <- 500
bitmap(file='bidensity.png')
op <- KernSur(x,y, xgridsize=par1, ygridsize=par2, correlation=par5, xbandwidth=par3, ybandwidth=par4)
image(op$xords, op$yords, op$zden, col=terrain.colors(100), axes=TRUE,main=main,xlab=xlab,ylab=ylab)
if (par6=='Y') contour(op$xords, op$yords, op$zden, add=TRUE)
if (par7=='Y') points(x,y)
(r<-lm(y ~ x))
abline(r)
box()
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Bandwidth',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'x axis',header=TRUE)
a<-table.element(a,par3)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'y axis',header=TRUE)
a<-table.element(a,par4)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Correlation',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'correlation used in KDE',header=TRUE)
a<-table.element(a,par5)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'correlation(x,y)',header=TRUE)
a<-table.element(a,cor(x,y))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code