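The Rfam database is a collection of RNA families, each represented by a multiple sequence alignment, a consensus secondary structure, and a covariance model (CM).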
conda install -c bioconda infernal
https://ftp.ebi.ac.uk/pub/databases/Rfam/CURRENT/
# Rfam.cm
wget -c https://ftp.ebi.ac.uk/pub/databases/Rfam/CURRENT/Rfam.clanin
wget -c https://ftp.ebi.ac.uk/pub/databases/Rfam/CURRENT/Rfam.cm.gz
gunzip Rfam.cm.gz
cmpress Rfam.cm
## Working... done.
## Pressed and indexed 4178 CMs and p7 HMM filters (4178 names and 4178 accessions).
## Covariance models and p7 filters pressed into binary file: Rfam.cm.i1m
## SSI index for binary covariance model file: Rfam.cm.i1i
## Optimized p7 filter profiles (MSV part) pressed into: Rfam.cm.i1f
## Optimized p7 filter profiles (remainder) pressed into: Rfam.cm.i1p
mkdir infernal
cd infernal
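# Genome FASTA to annotate
genome=../../genome.renamed.fa
# --cut_ga: report hits using each family's Rfam gathering (GA) threshold
# --rfam: use the fast, Rfam-level heuristic filters
# --nohmmonly: never fall back to HMM-only mode, always use the CM
# --fmt 2 with --tblout: tabular output that also flags overlapping hits
# --clanin: file listing which models belong to the same clan
# stdout (the hit alignments) is redirected to a file instead of the terminal
cmscan \
--cut_ga \
--rfam \
--nohmmonly \
--fmt 2 \
--tblout genome.tblout \
--clanin ../Rfam.clanin \
../Rfam.cm \
"${genome}" > genome.cmscan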
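With `--fmt 2`, the table has an `olp` column that marks a hit with `=` when it overlaps a better-scoring hit, and these are usually dropped before annotation. The following is a minimal sketch of that filter. It assumes `olp` is the 20th whitespace-separated field, which should be checked against the header line of your own `genome.tblout`. The table here is a mock with hypothetical family names and placeholder values, not real cmscan output, so that the snippet runs on its own.

```shell
# Mock --fmt 2 table: hypothetical families, placeholder values (NOT real cmscan output)
tblout=demo.tblout
cat > "$tblout" <<'EOF'
#idx target_name accession query_name accession clan_name mdl mdl_from mdl_to seq_from seq_to strand trunc pass gc bias score E-value inc olp anyidx afrct1 afrct2 winidx wfrct1 wfrct2 description
1 famA RFxxxx1 chr1 - CLxxxx1 cm 1 71 100 170 + no 1 0.60 0.0 70.1 1e-15 ! ^ - - - - - - -
2 famB RFxxxx2 chr1 - CLxxxx1 cm 1 90 95 180 + no 1 0.55 0.0 30.2 1e-05 ! = - - - - - - -
3 famC RFxxxx3 chr2 - - cm 1 119 618 500 - no 1 0.50 0.0 88.0 1e-20 ! * - - - - - - -
EOF

# Keep comment lines and every hit whose olp field (assumed column 20) is not "="
awk '/^#/ || $20 != "="' "$tblout" > demo.deoverlapped.tblout

# Count the hits that survived the filter
kept=$(grep -vc '^#' demo.deoverlapped.tblout)
echo "hits kept: $kept"
```

For a real run, set `tblout=genome.tblout` and skip the `cat` step.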
https://github.com/nawrockie/jiffy-infernal-hmmer-scripts/blob/master/infernal-tblout2gff.pl
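The script linked above converts the tabular output to GFF; see its usage message for the exact options. As a rough illustration of what such a conversion involves (this is an awk stand-in, not the script itself), the sketch below maps `--fmt 2` rows to GFF3 lines. The column positions (sequence name 4, family 2, accession 3, coordinates 10 and 11, strand 12, score 17, E-value 18, `olp` 20) are assumptions to verify against your own header line. The `cmscan` source, the `ncRNA` type and the attribute names are choices made for this example, and the input is a mock table with hypothetical families.

```shell
# Mock --fmt 2 table: hypothetical families, placeholder values (NOT real cmscan output)
tblout=demo.tblout
cat > "$tblout" <<'EOF'
#idx target_name accession query_name accession clan_name mdl mdl_from mdl_to seq_from seq_to strand trunc pass gc bias score E-value inc olp anyidx afrct1 afrct2 winidx wfrct1 wfrct2 description
1 famA RFxxxx1 chr1 - CLxxxx1 cm 1 71 100 170 + no 1 0.60 0.0 70.1 1e-15 ! ^ - - - - - - -
2 famB RFxxxx2 chr1 - CLxxxx1 cm 1 90 95 180 + no 1 0.55 0.0 30.2 1e-05 ! = - - - - - - -
3 famC RFxxxx3 chr2 - - cm 1 119 618 500 - no 1 0.50 0.0 88.0 1e-20 ! * - - - - - - -
EOF

awk 'BEGIN { OFS = "\t"; print "##gff-version 3" }
     /^#/        { next }   # skip header and comment lines
     $20 == "="  { next }   # skip hits that overlap a better-scoring hit
     {
       # GFF requires start <= end; minus-strand hits have seq_from > seq_to
       s = $10 + 0; e = $11 + 0
       if (s > e) { t = s; s = e; e = t }
       print $4, "cmscan", "ncRNA", s, e, $17, $12, ".", \
             "Name=" $2 ";accession=" $3 ";evalue=" $18
     }' "$tblout" > demo.gff

cat demo.gff
```

Swapping the coordinates on the minus strand matters because GFF viewers and parsers generally reject features whose start exceeds their end.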
References
Rfam website: https://rfam.xfam.org/