HomozgosityMapper pedigree

HomozygosityMapper

Tutorial

 

best viewed with Mozilla Firefox

This tutorial is as brief as possible, if you are interested in more detailed information or in the implementation, please consult the documentation or the technical documentation, respectively. Unless mentioned here, there is no need to change any settings on the interfaces.

Please note that projects created without a personal account can be deleted by anyone unless you decide to make them private. In this case, you will have to use special URLs to work with your data. In any case, we delete data uploaded as guest (i.e. without a login) periodically (they are kept for at least 2 months). If you want to work with real data, you should hence consider to create a personal account.

If you are only interested in the output of HomozygosityMapper, you don't have to upload genotypes first. Simply open the query projects page and select one of the existing public projects and move to section 3 of this tutorial. The sample files used in this tutorial are also there, under the intuitive names 'tutorial', or 'tutorial_VCF', respectively. Click here for SNP genotypes or here for NGS (VCF) genotypes to access the project directly and continue with section 4.

 

1. Upload genotypes

First, you must upload genotypes. Please open the upload page.

SNP genotypes

For this tutorial, sample genotypes (with a homozygous region on chromosome 7) exist - download them here.

  1. Under new project name, choose a unique project name ('test' is already in use!). This name must not contain any non-alphanumeric characters; only letters, digits and undescores are allowed.
  2. Set chip to 'Affymetrix 50K Hind240'.
  3. Under genotype file, select the file you want to upload (i.e. this file).
  4. Click on submit and have a cup of tea (or a pint of beer, respectively) - the upload will take about two minutes. You can watch the process in the browser (if no coffee or beer is available).
  5. When the upload is complete, click on the link 'Analyse your genotypes'.

NGS genotypes (VCF)

For this tutorial, a sample VCF (variant call format) file (with a homozygous region on chromosome 15) exist - download it here.

  1. Under new project name, choose a unique project name ('test' is already in use!). This name must not contain any non-alphanumeric characters; only letters, digits and undescores are allowed.
  2. Set chip to 'VCF'.
  3. Under genotype file, select the file you want to upload (i.e. this file).
  4. Click on submit and have a cup of tea (or a pint of beer, respectively) - the upload will take about two minutes. You can watch the process in the browser (if no coffee or beer is available).
  5. When the upload is complete, click on the link 'Analyse your genotypes'.

Don't check the access restriction button unless you work with a personal account.

Please delete your genotypes when you've finished the tutorial.

 

2. Analyse your genotypes

Open the analysis page (unless it's already open).

SNP genotypes

  1. Select your project under project (if you followed the link after the upload it should already be selected) or click here to use the sample genotypes already in the database.
  2. Under analysis name, give your analysis a name (again, 'test' is not a good choice).
  3. In the analysis description field, enter some text explaining the analysis to you and, maybe, others.
  4. Enter the affected samples from your file under cases: NA07019, NA06993, NA07034, NA07056, NA07357
  5. Enter the unaffected samples under controls: NA10835, NA10846, NA10854
  6. Select 'from controls' under allele frequencies to see the observed vs. the expected homozygosity.
  7. Click on submit and watch the screen - 30 seconds won't be enough for another coffee or beer.
  8. When the analysis is accomplished, click on the 'Show results' link.

NGS genotypes (VCF)

Open the analysis page (unless it's already open).

  1. Select your project under project (if you followed the link after the upload it should already be selected) or click here to use the sample genotypes already in the database.
  2. Under analysis name, give your analysis a name (again, 'test' is not a good choice).
  3. In the analysis description field, enter some text explaining the analysis to you and, maybe, others.
  4. Enter the affected samples from your file under cases: Sample1, Sample2
  5. Enter the unaffected samples under controls: Sample3, Sample4
  6. Click on submit and watch the screen - sadly, one minute won't be enough for another coffee or beer.
  7. When the analysis is accomplished, click on the 'Show results' link.

 

3. Query a project

Open the query projects page (unless it's already open) or click here for SNP genotypes or here for NGS (VCF) genotypes to access the sample data directly.

  1. Select your analysis in the list (if you followed the link after the analysis it should already be selected).
  2. Click on submit.

 

4. Genome-wide homozygosity

After selecting the respective analysis, the homozygosity scores will be displayed as a genome-wide bar chart. Scores higher than 80% of the maximum score will be displayed as red bars to highlight them. Above the homozygosity scores, the ratio of observed vs, expected homozygous genotypes will be displayed (see the documentation for details). Below the bar chart, all scores above 80% of the maximum are listed (sorted by their score) and direct links to these regions are provided. On top of this table, 'broad' regions are displayed - in these, smaller decreases of the score within a homozygous region are neglected. At the bottom of the table, 'narrow' regions with sharp limits follow. If you expect some degree of heterozygosity, you should consider the 'broad' regions.

Below the regions, links to GeneDistiller and export options are displayed:
  • With GeneDistiller, candidate genes in all long homozygous regions can be queried at once.
  • It is also possible to generate files to perform a fine-mapping of the potential disease regions via Alohomora. A short manual is provided after the export.
  • For the enrichment needed for deep sequencing approaches, HomozygosityMapper can produce *.bed files containing either all regions or the complete genes/only the exons contained within them.
  1. Move the mouse pointer on the highest score in the bar chart (on chromosome 7 with the sample genotypes) and left-click.

Click here for SNP genotypes or here for NGS (VCF) genotypes to access the sample data directly.

 

5. Chromosomal homozygosity

After clicking on a chromosome, the view is reduced to this chromosome. In our sample project, you can clearly see a relatively wide and high homozygosity score and, above this, in the small observed/expected ratio a clear surplus of homozygous genotypes (only red bars on top of the baseline). You can further zoom into the region by

  1. Left-click on a position left of the homozygous region.
  2. Left-click on a position right of the homozygous region.
  3. In box that now opens, click on 'zoom in'.
  4. Now it's time to take a look at the underlying genotypes. Click on the first 'genotypes' link in the table below the bar chart.

 

6. Genotypes view

In this view, the single genotypes within your selected region are shown as colour-coded boxes. Each column stands for a SNP (sorted from left to right by its position), each line for a sample. Note that affected individuals are displayed on top and controls at the bottom, with a small free space between them. A grey box stands for an unknown genotype, blue for heterozygosity and different shades of red reflect the length of the homozygous strech if a genotype is homozygous. Genotypes homozygous for the minor allele (based on this study!) have a black diagonal line. A box is drawn around the homozygous region and the limiting markers are displayed below the genotypes. Further below, a link to our candidate gene database, GeneDistiller, is included. If you want to search for a candidate gene within your region, simply click on GeneDistiller. If you want to change the region, follow the instructions below.

  1. Left-click on the first SNP left of the homozygous region and select 'start' in the pop-up.
  2. Left click on the first SNP right of the homozygous region and select 'end' in the pop-up.
  3. Now click on the GeneDistiller button.

The GeneDistiller start page will now open with your limiting markers filled in as limits. To learn about the functions of GeneDistiller, either play around (it's quite intuitive) or read its manual.

 

7. Remove projects or analysis

Go to the delete your data page. All proejcts and analyses owned by you will be displayed.

  1. Select an analysis or project you want to delete.
  2. Click on the Delete selected projects and/or analyses button.
  3. On the page that now opens, click on the Confirm button.

Congratulations!

You have now completed the tutorial. If you want to analyse 'real' genotypes and want to hide them from the world (and make them public when your manuscript has been accepted), you should create a personal account.

 

8. Create a personal account

Go to the create profile page and enter your name, an e-mail address, your organisation and select your login name and a password. You will be logged in automatically (your browser must accept cookies for this function). If you return to HomozygosityMapper (we hope you will), go to the login page to create and access your private data. More about the publication and the excahnge of your data can be found in the documentation.


This tutorial is meant only as a brief introduction to HomozygosityMapper. Please read the documentation if you are interested in the elements not mentioned here. And if any questions remain unanswered, don't hesitate to ask us (e-mail to dominik.seelow (at) charite.de). We would also appreciate hearing of your experiences, bugs you discovered, and suggestions to improve this application.