SMOLNET PORTAL home about changes
README file for rgadb of RiceGAAS                                 [2002.11.21]
-----------------------------------------------------------------------------

RiceGAAS (Rice Genome Automated Annotation System) collects the rice genome 
entries in GenBank.

Please see the following URLs about RiceGAAS.

   http://ricegaas.dna.affrc.go.jp
   http://ricegaas.dna.affrc.go.jp/rgadb/
   http://ricegaas.dna.affrc.go.jp/RiceGAAS_system.html

RiceGAAS collects the rice genome entries from the daily-update in GenBank 
database. The collection rules are as follows;

      SOURCE  =  "oryza  sativa"
     and
      9999bp  <  [sequence  size]  <  1Mbp
     not
      DIVISION=  "GSS"  or  "STS"

Each entry from GenBank is categorized into the anchored chromosome, or 
"unknown" if it dose not have any chromosomal information.

Two data set as shown below are made every week.

    GenBank  files
    FASTA  files

-  FTP  files

   ftp://ftp.dna.affrc.go.jp/pub/RiceGAAS/
     +--  20020918   ..   data directory (made at 2002-09-18)
     +--  yyyymmdd   ..   data directory (made at yyyy-mm-dd)
     +--  README     ..   this document
     +--  current    ..   link to the latest data directory

   Data directories in the ftp file above have GenBank files and FASTA files  
   containing the following files below.

-  GenBank files contain the following files.

     RiceGAAS.GenBank.chr01.tar.gz   ..  chromosome   1  genbank  files
     RiceGAAS.GenBank.chr02.tar.gz   ..  chromosome   2  genbank  files
     RiceGAAS.GenBank.chr03.tar.gz   ..  chromosome   3  genbank  files
     RiceGAAS.GenBank.chr04.tar.gz   ..  chromosome   4  genbank  files
     RiceGAAS.GenBank.chr05.tar.gz   ..  chromosome   5  genbank  files
     RiceGAAS.GenBank.chr06.tar.gz   ..  chromosome   6  genbank  files
     RiceGAAS.GenBank.chr07.tar.gz   ..  chromosome   7  genbank  files
     RiceGAAS.GenBank.chr08.tar.gz   ..  chromosome   8  genbank  files
     RiceGAAS.GenBank.chr09.tar.gz   ..  chromosome   9  genbank  files
     RiceGAAS.GenBank.chr10.tar.gz   ..  chromosome  10  genbank  files
     RiceGAAS.GenBank.chr11.tar.gz   ..  chromosome  11  genbank  files
     RiceGAAS.GenBank.chr12.tar.gz   ..  chromosome  12  genbank  files
     RiceGAAS.GenBank.unknown.tar.gz   ..  unknown  genbank  files

   % gunzip RiceGAAS.GenBank.chr01.tar.gz
   % tar tvf RiceGAAS.GenBank.chr01.tar|more
   drwxr-xr-x  100/100        0  2002-04-25  19:51  chr01/
   -rw-r--r--  100/100   145408  2000-03-01  22:40  chr01/10A19I
   -rw-r--r--  100/100   195764  2002-03-26  01:33  chr01/B1085F09
   -rw-r--r--  100/100   193497  2002-03-26  01:34  chr01/OJ1174_D05
   -rw-r--r--  100/100   201668  2002-03-26  01:35  chr01/OSJNBa0025P13
   -rw-r--r--  100/100   209261  2002-03-26  01:25  chr01/OSJNBa0004B13
   -rw-r--r--  100/100   212063  2002-03-26  01:29  chr01/P0666G04
     ::


-  FASTA files contain only the sequence data picked up from GenBank files,
   with comment lines as follows;

   >[Accession No.] ([clone name]) [chromosome No.] [location(cM)] [size(bp)]

   The "location" means the position on the genetic map by RGP shown in the 
   web pages.

     http://rgp.dna.affrc.go.jp/cgi-bin/statusdb/statassign.pl

   FATSA files;
     RiceGAAS.chr01.fasta.gz   ..  chromosome   1  fasta  file
     RiceGAAS.chr02.fasta.gz   ..  chromosome   2  fasta  file
     RiceGAAS.chr03.fasta.gz   ..  chromosome   3  fasta  file
     RiceGAAS.chr04.fasta.gz   ..  chromosome   4  fasta  file
     RiceGAAS.chr05.fasta.gz   ..  chromosome   5  fasta  file
     RiceGAAS.chr06.fasta.gz   ..  chromosome   6  fasta  file
     RiceGAAS.chr07.fasta.gz   ..  chromosome   7  fasta  file
     RiceGAAS.chr08.fasta.gz   ..  chromosome   8  fasta  file
     RiceGAAS.chr09.fasta.gz   ..  chromosome   9  fasta  file
     RiceGAAS.chr10.fasta.gz   ..  chromosome  10  fasta  file
     RiceGAAS.chr11.fasta.gz   ..  chromosome  11  fasta  file
     RiceGAAS.chr12.fasta.gz   ..  chromosome  12  fasta  file
     RiceGAAS.unknown.fasta.gz   ..  unknown   fasta  file

   % gunzip RiceGAAS.chr01.fasta.gz
   % grep "^>" RiceGAAS.chr01.fasta |more
   >10A19I  (10A19I)  chr01  99587bp
   >AP003103  (B1085F09)  chr01  52.7cM  132713bp
   >AP003118  (OJ1174_D05)  chr01  20.2cM  128525bp
   >AP003140  (OSJNBa0025P13)  chr01  58.1cM  133242bp
   >AP003018  (OSJNBa0004B13)  chr01  142268bp
   >AP003047  (P0666G04)  chr01  22.6cM  141983bp
   >AP003074  (OSJNBa0004G10)  chr01  42.4-43.2cM  150379bp
   >AP003104  (OSJNBa0038J17)  chr01  30.5cM  180186bp
     ::


If you have any question, please send e-mail to "support@dna.affrc.go.jp".


.
Response: text/plain
Original URLgopher://bitreich.org/0/gopher2007/archive/gopher.dna.aff...
Content-Typetext/plain; charset=utf-8