Friday, April 30, 2010

1000 Genomes - Data

HOW TO ACCESS 1000 GENOMES DATA

Download data: The sequence and alignment data generated by the 1000genomes project is made available as quickly as possible via our mirrored ftp sites.

EBI FTP: ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/
NCBI FTP: ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/

Users in the Americas should use the NCBI ftp site and users in Europe and the rest of the world should use the EBI ftp site

The data is also available via an aspera server from both sites. To be able to use this service you need to download the Aspera connect software. This provides both a firefox plug in for downloading data and a bulk download client called ascp

The plugin should automatically start when you visit either the EBI Aspera site or the NCBI Aspera site.


An example commandline for the ascp command looks like

ascp -i bin/aspera/etc/asperaweb_id_dsa.putty -Tr -Q -l 100M -L- fasp-g1k@fasp.1000genomes.ebi.ac.uk:vol1/ftp/data/NA12878/alignment/NA12878.chrom10.SLX.SRP000032.2009_04.bam ./


(this Post content was reproduced from: http://www.1000genomes.org/page.php?page=data)