Rfam annotation
Rfam annotation file can be directly downloaded from ftp://ftp.sanger.ac.uk/pub/databases/Rfam Rfam database [5]. Since the reference genomes of some species might changes in different version of Rfam database, make sure that you are using the right Rfam annotation file for your genome assembly. For example, the latest Rfam annotation file for NCBI37/mm9 mouse genome is in version 9.0, but not version 10 where you can find the annotation file for GRCh37/hg19 human genome. If there is a conflict between the genome assembly used by Rfam and the one you are analyzing, we offer a simple and quick way to solve it by generating new Rfam annotation file for your genome assembly. Basically, we extract Rfam sequences based on the genome annotation in Rfam, then blast them to the new genome, finally use rfam_scan.pl from Rfam database and custom script to create new Rfam annotation file from blast results. And another way to avoid the conflict is: if you have the chain file specifying differences between two different genome assemblies, you can use liftOver to get new rfam gff file.
In Rfam annotation file, the value in "Alias" tag in the "attributes" column indicates the ncRNA family information. Example of "attributes" column:
ID=RF00026.1;Name=RF00026;Alias=U6;Note=AC157543.8/131368-131259
Chongjian Chen
2012-01-26