------测序原理------ - 一些图表 - 《组学分析》

分类	子程序	功能
NCBI数据库下载	`<font style="color:rgba(0, 0, 0, 0.9);">update_blastdb.pl</font>`	下载 NCBI 中的公共数据库
本地数据库操作	`<font style="color:rgba(0, 0, 0, 0.9);">makeblastdb</font>`	使用 FASTA 数据创建 BLAST数据库
	`<font style="color:rgba(0, 0, 0, 0.9);">makembindex</font>`	为已有核酸数据库创建索引，与 megablast 一起进行 blast
	`<font style="color:rgba(0, 0, 0, 0.9);">makeprofiledb</font>`	从由`<font style="color:rgba(0, 0, 0, 0.9);">psiblast</font>`生成的特定于输入位置的评分矩阵（记分板）的列表中创建一个保守结构域数据库
	`<font style="color:rgba(0, 0, 0, 0.9);">dustmasker</font>`	主要用于数据库准备过程中，屏蔽输入核苷酸序列中的低复杂度区域
	`<font style="color:rgba(0, 0, 0, 0.9);">windowmasker</font>`	屏蔽输入核苷酸序列中的重复区域
	`<font style="color:rgba(0, 0, 0, 0.9);">segmasker</font>`	主要用于数据库准备过程中，屏蔽输入氨基酸序列中的低复杂度区域
	`<font style="color:rgba(0, 0, 0, 0.9);">convert2blastmask</font>`	将屏蔽的小写字母转换为`<font style="color:rgba(0, 0, 0, 0.9);">makeblastdb</font>`可读数据
	`<font style="color:rgba(0, 0, 0, 0.9);">blastdb_aliastool</font>`	创建数据库别名
	`<font style="color:rgba(0, 0, 0, 0.9);">blastdbcheck</font>`	检测数据库的完整性
	`<font style="color:rgba(0, 0, 0, 0.9);">blastdbcmd</font>`	从 BLAST 数据库中检索序列或其它信息
	`Cleanup-blastdb-volumes.py`	用于清理 BLAST 数据库卷（volumes）和删除未引用卷的 Python 脚本（请谨慎使用）
blast搜索程序	`<font style="color:rgba(0, 0, 0, 0.9);">blastn</font>`	在【核苷酸】数据库搜索待检索【核苷酸】序列
	`<font style="color:rgba(0, 0, 0, 0.9);">blastp</font>`	在【氨基酸】数据库搜索待检索【氨基酸】序列
	`<font style="color:rgba(0, 0, 0, 0.9);">blastx</font>`	将待检索【核苷酸】序列基于六种翻译框翻译为氨基酸序列，再在【氨基酸】数据库中 BLAST
	`<font style="color:rgba(0, 0, 0, 0.9);">tblastn</font>`	【核苷酸】数据库基于六种翻译框翻译为氨基酸序列后，待检索【氨基酸】序列再与其 BLAST
	`<font style="color:rgba(0, 0, 0, 0.9);">tblastx</font>`	将待检索【核苷酸】序列和【核苷酸】数据库基于六种翻译框翻译为氨基酸序列后，再进行 BLAST
专业蛋白质blast程序	`<font style="color:rgba(0, 0, 0, 0.9);">deltablast</font>`	使用更敏感算法，在氨基酸数据库搜索待检索氨基酸序列
	`<font style="color:rgba(0, 0, 0, 0.9);">psiblast</font>`	查找蛋白质家族的成员，识别与待检索氨基酸序列有远近关系的蛋白质，或为待检索氨基酸序列构建特定位置的评分矩阵
保守结构域blast程序	`<font style="color:rgba(0, 0, 0, 0.9);">rpsblast</font>`	在保守域数据库搜索待检索氨基酸序列，以标识序列中存在的功能域
	`<font style="color:rgba(0, 0, 0, 0.9);">rpstblastn</font>`	将待检索核酸序列基于六种翻译框翻译为氨基酸序列，再在保守结构域数据库中检索
Command line translator	`<font style="color:rgba(0, 0, 0, 0.9);">legacy_blast.pl</font>`	将 BLAST 搜索命令行转换为对应 BLAST+ 命令行并执行
Result formatting tool	`<font style="color:rgba(0, 0, 0, 0.9);">blast_formatter</font>`	使用已分配的请求 ID（RID）或保存的归档文件格式化 BLAST 结果

`qaccver`	查询的AC版本（与此类似的还有`qseqid`、`qgi`、`qacc`，与序列命名有关）
`saccver`	目标的AC版本（于此类似的还有`sseqid`、`sallseqid`、`sgi`、`sacc`、`sallacc`，也是序列命名相关)
`pident`	完全匹配百分比（响应的nident则是匹配数）
`length`	联配长度(另外`slen`表示查询序列总长度，`qlen`表示目标序列总长度)
`mismatch`	错配数目
`gapopen`	gap 的数目
`qstart`	查询序列起始
`qsend`	查询序列结束
`sstart`	目标序列起始
`send`	目标序列结束
`evalue`	期望值
`bitscore`	Bit 得分

十进制	十六进制	Description	描述
`1`	0x1	read paired	该read是成对的Paired reads中的一个
`2`	0x2	read mapped in proper pair	Paired reads中每个都正确比对到参考序列
`4`	0x4	read unmapped	该read没有比对到参考序列上
`8`	0x8	mate unmapped	与该read成对的另一端的read没有比对到参考序列上
`16`	0x10	read reverse strand	该read和参考序列相比，是反向互补的
`32`	0x20	mate reverse strand	与该read成对的另一端的read是反向互补的
`64`	0x40	first in pair	在Paired reads中，该read是第1条
`128`	0x80	second in pair	在Paired reads中，该read是第2条
`256`	0x100	not primary alignment	非最优的比对结果
`512`	0x200	read fails platform/vendor quality checks	没有通过质量控制
`1024`	0x400	read is PCR or optical duplicate	PCR重复或光学重复
`2048`	0x800	supplementary alignment

	iSeq 100	MiniSeq	MiSeq	NextSeq 550	NextSeq 1000/1000-CN NextSeq 2000/2000-CN	NovaSeq 6000	NovaSeq X

每个流动槽最大通量	1.2Gb	7.5Gb	15Gb	120Gb	540Gb	3Tb	8Tb
运行时间（范围）	约 9.5-19h	约 2-24h	约 5-56h	约 11-29h	约 8-44h	约 13-44h	约 17-48h
每次运行的最大 reads 数（单端 reads）	4M	25M	25M	400M	1.8B	10B（单流动槽） 20B（双流动槽）	26B（单流动槽） 52B（双流动槽）
最大读长	2×150bp	2×150bp	2×300bp	2×150bp	2×300bp	2×250bp	2×150bp
大型全基因组测序（人类、动物、植物）	- [ ]	- [ ]	- [ ]	- [ ]	- [ ]	- [x]	- [x]
小型全基因组测序（微生物、病毒）	- [x]	- [x]	- [x]	- [x]	- [x]	- [x]	- [x]
外显子组和大型 panel 测序（基于富集）	- [ ]	- [ ]	- [ ]	- [x]	- [x]	- [x]	- [x]
靶向基因测序（基于扩增子、基因 panel）	- [x]	- [x]	- [x]	- [x]	- [x]	- [x]	- [x]
单细胞分析（scRNA-seq、scDNA-seq、寡核苷酸标记检测）	- [ ]	- [ ]	- [ ]	- [x]	- [x]	- [x]	- [x]
转录组测序（总 RNA-seq、mRNA-seq、基因表达图谱分析）	- [ ]	- [ ]	- [ ]	- [x]	- [x]	- [x]	- [x]
miRNA 和小 RNA 分析	- [x]	- [x]	- [x]	- [x]	- [x]	- [x]	- [x]
染色质分析（ATAC-seq、ChIP-seq）	- [ ]	- [ ]	- [x]	- [x]	- [x]	- [x]	- [x]
甲基化测序	- [ ]	- [ ]	- [ ]	- [x]	- [x]	- [x]	- [x]
16S宏基因组测序	- [ ]	- [x]	- [x]	- [x]	- [x]	- [x]	- [x]
宏基因组学分析（鸟枪法宏基因组学、宏转录组学）	- [ ]	- [ ]	- [ ]	- [x]	- [x]	- [x]	- [x]
游离核酸测序和液体活检分析	- [ ]	- [ ]	- [ ]	- [x]	- [x]	- [x]	- [x]

https://emea.illumina.com/systems/sequencing-platforms.html