Examples
Filter by
R1a & R1b distribution - heatmap
heatmappointsregion clusteringhaplogroups
R1a & R1b distribution - heatmap
Here we show an example use-case for heatmap layer type visualization. We have showed a well known distribution of R1a and R1b Y-DNA haplogroups, that divides Europe to the Western and Eastern parts.
1. Load blank session to start from scratch.
2. We are going to work on Allen Ancient DNA Resource v54.1.p1 dataset, so please select it.
3. Fetch a dataset with determined R1a Y-DNA haplogroup and with location in the Europe. To do that, use query builder.
4. Do the same for R1b haplogroup. You may want to change names of the datasets for convenience.
5. Because the datasets differ in the samples count and we want would like only to compare the spatial distributions, one quick way to solve this is to use sampling option. To even both datasets we have sampled 36.6% of the R1b dataset. Another way would be using "Data weight" settings of the map layer.
6. Create a new map layer. Set it to use the R1a haplogroup dataset and set it to appear as a heatmap. You may also change the name of the layer correspondingly, as well as play with the appearance settings to make the layer more appealing. We have changed the color and the radius and blur of heatmap.<br>You can investigate all the applied changes and values later on while manually exploring this session's settings.
7. Next use the clone button to create another heatmap. For the new layer, change the dataset to R1b one and apply different color for the heatmap. Don't forget to change the layer name.
8. To create labels we have cloned both layers and changed their type to points. We have also hid the body of points leaving only samples counts and we position R1b dataset labels layer to the bottom to declutter data.
9. In the end, we have changed clustering settings to regional and distance clustering. We have adjusted the clustering distance to properly declutter the samples without big reduction of data location resolution.
10. For readability, we have also filtered out all regions except Europe. This leaves us with thick border drawn on the map that clearly indicates, that this is the only region we are focusing on in our visual analysis.
Comparsion of periods
piechartsmultiple periodsregion clusteringhaplogroups
Comparsion of periods
Here we show that is is possible to visualize changes of spatio-temporal distribution and composition of the samples attributes on a single map view, so without manipulating the timeline position. We have prepared an example presenting comparsion of three arbitrary chosen time periods.
1. Load blank session to start from scratch.
2. We are going to work on Allen Ancient DNA Resource v54.1.p1 dataset, so please select it.
3. Fetch a dataset of archeogenomic samples dated to originate from 6000 BC – 3500 BC time period. Do the same for the 3500 BC – 0 and 0 – 1000 AD periods. You may want to change names of the datasets for convenience.
4. Create a new map layer and set it to use the 6000 BC – 3500 BC dataset. Also change the layer type to piecharts, attribute to haplogroup Y and name the layer properly.
5. This time, we want the layer to indicate which dataset it presents by using labels, not colors. To achieve that, head on to the layer appearance settings. Then, turn on the "show layer name" option and turn off the "show layer color".
6. To make comparsion of periods more readable, we have limited the data presentation resolution to the whole regions by using clustering with UN subregions collection and turning off the distance clustering.
7. Time to add data of the remaing periods to the map. To quickly do that, you can clone twice the created map layer and change the dataset option of the newly created clones.
8. Since the layers' data points share the same position of the region centroid, you can declutter them by setting different rotation angle around their position. For readability, you can remove the cluster pointers for the extreme periods in the layer appearance options.
9. Even after taking all these steps, it is hard to analyse the haplogroup data as a whole. We recommend you to always limit the visual analysis to a samller subset of attribute values. Here, for example we have performed automatic clustering of the Y-DNA haplogroups by grouping them to the name of their parent haplogroup positioned at a given tree level. You can also perform manual clustering and further filtering of the haplogroups if you want.
Distribution of admixtures
piechartsdistance clusteringadmixture
Distribution of admixtures
Here we wanted to investigate, whether is it possible to correlate given admixture analysis components to any distinct geographical location or human population. To do that, we have visualized spatial distribution of the samples with high genomic affinity to any of the admixture components. We created this example for admixture k=7 analysis.
1. Load blank session to start from scratch.
2. We are going to work on Allen Ancient DNA Resource v54.1.p1 dataset, so please select it.
3. To obtain samples with highly homogenic admixture composition we have queried for samples with affinity higher than 80% (with exception for component #6, as there are no such samples) to any of the admixture components. That was done in relation to the results of admixture k=7 analysis. To prepare the query, we have created a list of statements for every component and we joined them with OR operation.
4. Next we have created a map layer and set it to display the newly created dataset samples in a form of piecharts.
5. For readability, we have removed the boundary color by using corresponding option in a layer appearance section.
6. We have also disabled grouping attributes of low frequency to the OTHER special group. Moreover, we have adjusted features size range and data weight options to distinguish clusters with higher samples count by making them bigger.
7. At the end, we have changed the basemap appearance setting, to make manually picked colors of the admixture components more visible.
Males with Y-DNA and mt-DNA haplogroups determined
tagspointsdistance clusteringhaplogroupsmultiple attributes
Males with Y-DNA and mt-DNA haplogroups determined
Analysing samples' haplogroups on a map can be difficult, as there are many unique values to present at the same time. To make it easier, you can use tags cloud layer type, which enables to analyse both haplogroup composition of the samples cluster as well as their counts proportions. We have also showed here an example of presenting several attributes of the same samples at once. We have visualized Y-DNA and mt-DNA haplogroups of the males that have had determined both of them.
1. Load blank session to start from scratch.
2. We are going to work on Allen Ancient DNA Resource v54.1.p1 dataset, so please select it.
3. Create a query for samples which have had determined both Y-DNA and mt-DNA haplogroups.
4. Then create a new map layer and set it's dataset to the one previously fetched from Human AGEs database. Also, set it's layer type to tags and choose haplogroup Y as an attribute to present.
5. You may play a bit with features size, layer appearance and tags appearance settings until you are satisfied with the results. Don't forget to name you layer.
6. Next, clone the layer and change the presented attribute to mt-DNA haplogroup. Also, change the color, layer name and try to reposition both layers, so the tag clouds are close to each other, but they do not overlap much.
7. Because both tag clouds present information on the same samples, the data weight normalization and scaling options won't be necessary, so we disabled it.
8. At the end we have created a third layer to present common information for both tag clouds about samples count. To do that, we have cloned one of the layers, changed it's type to points, we have enabled showing weight as a value and we have adjusted the layer appearance as position, to make it appear simplistic and next to the both tag clouds.
9. Additionally, you can adjust a distance of the clustering, to declutter the map.
Admixture values for samples laying in the range of Epigravettian culture
culturespiechartsadmixturedistance clusteringregion clustering
Admixture values for samples laying in the range of Epigravettian culture
Here we show an example of visualization and clustering by archeological cultures. For simplicity, we have limited the visual analysis to Epigravettian culture and we have showed the distribution of samples among it and their genomic admixture composition. You can manipulate the timeline to see how the genomic composition changed in time for the region of Epigravettian culture occurence.
1. Load blank session to start from scratch.
2. We are going to work on Allen Ancient DNA Resource v54.1.p1 dataset, so please select it.
3. Fetch the samples that are localized in the region of Epigravettian or Late Epigravettian culture occurence.
4. Create a new map layer and set it for the loaded dataset, piechart layer type and admixture k=7 attributes.
5. Apart from distance clustering, set on clustering by archeological cultures regions.
6. Filter out all cultures except for Epigravettian and Late Epigravettian.
7. You may now adjust appearance of the culture regions (by clicking their symbols on the map legend) and the map layer to you liking.
8. To better visualize physiogeographical context of the culture spatial occurence, we have changed the basemap to Imagery.
Comparsion of past and contemporary times data of mt-DNA haplogroup distribution
piechartsregion clusteringmultiple periods
Comparsion of past and contemporary times data of mt-DNA haplogroup distribution
Here we show an example of presenting data from multiple dataset at the same time. For this purpose we have used archeogenomic data from Allen Ancient DNA Resource v54.1.p1 dataset and contemporary genomic data from the EMPOP mt-DNA v4/R13 dataset. You can use the timeline to explore how the regional distribution of the mt-DNA haplogroups changed to meet the state that we see today.
1. Load blank session to start from scratch.
2. First, we are going to fetch data from Allen Ancient DNA Resource v54.1.p1 dataset, so please select it.
3. Query for a dataset that consists of samples with determined mt-DNA haplogroup.
4. Second, switch to the EMPOP mt-DNA v4/R13 dataset.
5. Fetch whole dataset. After a dataset is loaded, sample it down to 25%. It is a big dataset, thus without sampling it would take a lot of time to show up on the map. When you are done working on the map and layers appearance, you can switch back sampling to 100%.
6. Before we create map layers, please change the clustering options to region clustring by UN subregions and disable distance clustering. This way we can normalize the spatial distributuon of the different dataset clusters, as they will show up at the region centroid.
7. Create a new piechart layer and set it for Allen Ancient DNA Resource and mt-DNA haplogroup.
8. Do the same for the EMPOP dataset. Please be patient, as it may take some time.
9. Next, you may want to adjust both layers appearance, to make them distinguishable from each other and to make them appear next to each other.
10. For the last thing, may want to limit the number of visualized haplogroups for easier analysis. We have performed manual clustering to U, J, T and X parental haplogroups by grouping together samples that contain any descendant mt-DNA haplogroup. Any other haplogroup was filtered out.