Genome informatics is a scientific discipline that encompasses all aspects of genome information acquisition, processing, storage, distribution, analysis, and interpretation.它是一个学科领域,包含着基因组信息的获取、处理、存储、分配 、分析和解释的所有方面。
A contig (from contiguous) is a set of overlapping DNA segments that together represent a consensus region of DNA.
重叠群(来自连续的)是一组重叠的、完整连续的DNA片断
2. scaffold**:**
A scaffold is a portion of the genome sequence reconstructed from end-sequenced whole-genome shotgun clones. Scaffolds are composed of contigs and gaps.
八、【计算题】Nearest Neighbor Model for Watson-Crick Base Pairs
Watson-Crick****碱基对的最近邻模型,描述了 DNA 中碱基对之间的相互作用力和稳定性
Nearest Neighbor Model for Free Energy of a Sample Hairpin Loop:
通过计算样品发夹环中碱基对的相互作用能量,可以评估该环状结构的稳定性和热力学特性
九、人类基因组相关概念(黄色的是上课说的考点)
1.genotype (基因型): thegenetic constitution of the organism.生物体的遗传构成。
2.phenotype (表型): thecomposite of an organism's observable characteristics or traits.生物体可观察到的特点或特征的总和。
3.Trait (性状):is adistinct variant of a phenotype characteristic of an organism that may beinherited, be environmentally determined or be a combination of the two.性状是生物的表型特征的不同变异,可能是遗传,环境决定或两者共同作用的结果。
5.Alleles(等位基因):refersto any one of two or more genes that may occur alternatively at a given site(locus) on a chromosome.一般指位于一对同源染色体的相同位置上、控制着相对性状的一对基因。
6.Mutation(突变):a changein the genetic material, usually rare and pathological.遗传物质的改变,通常是罕见的和致病的。
7.Polymorphism(多态性) :a changein the genetic material, usually common and not pathological.遗传物质的改变,是个体之间正常的差异,但比较普遍,且不致病。
8.Homozygote (纯合子) :an organism with identical pair of alleles for a specific trait.同源染色体上相同位点等位基因相同的基因型个体。
9.Heterozygote (杂合子):an organism that has different alleles at a particular gene locus onhomologous chromosomes. 同源染色体上相同位点等位基因不同的基因型个体。
10.Hemizygote (半合子): an organism that has only one allele for a specific trait. 对一种特定的性状只有一个等位基因的基因型个体。
Males are hemizygous for most genes on the sexchromosomes.雄性在性染色体上的大多数基因是半合子的。
11.Dominant trait (显性性状): a traitthat shows in a heterozygote.在杂合子中显示的性状。
12.Recessive trait (隐性性状): a traitthat is hidden in a heterozygote.隐藏在杂合子中的性状。
13.karyotype(核型分析):Akaryotype is the number and appearance of chromosome in the nucleus of aneukaryotic cell.核型是指真核细胞细胞核中染色体的数目和出现情况。
14.Population Genetics(群体遗传学):Population genetics is the study of variation within populations ofindividuals, and the forces which shape it. This involves studying changes inthe frequencies of genetic variation in populations over space and time.群体遗传学是研究群体内个体的变异,以及形成变异的力量。这涉及到研究种群在空间和时间上遗传变异频率的变化。
15. Population(种群):Aninterbreeding group of the same species in a given geographical area. 同一物种在特定地理区域内的杂交群。
16.Population genetics (种群遗传学):The study of the genetics of a population and how the alleles vary withtime.对一个种群的遗传学以及等位基因如何随时间变化的研究。
17.Gene pool(基因库):Thecollection of all alleles in the members of the population. 种群成员中所有等位基因的集合。所以计算时每个个体的两个等位基因都要考虑在内。
18.Gene flow(基因流动):Movementof alleles between populations when people migrate and mate. 当种群迁移和交配时,等位基因在种群之间的移动。
**19.Allele frequency (等位基因频率):**特定等位基因/所有等位基因。每个个体的两条染色体都要考虑。【2*纯合子+杂合子】
**20.Cohort (队列):**a groupof people who share a common characteristic or experience within a definedperiod (e.g., are born, are exposed to a drug or vaccine or pollutant, orundergo a certain medical procedure). 在一定时期内具有共同特征或经历的一个群体。(例如,出生日期,使用同种药物、接种同种疫苗或接受特定的医学治疗)。
21. Pedigree chart (谱系图):A chartshowing the genetic connections between individuals. 显示个体之间基因联系的图表。
22.Microevolution(微进化):Thesmall genetic changes due to changing allelic frequencies in populations. 由于群体中等位基因频率的变化引起的小的遗传变化。五个因素:非随机交配、迁移、遗传漂变、突变和自然选择。
23.Genetic drift(基因漂移):is thechange in the frequency of a gene variant (allele) in a populationdue to random sampling of organisms. 是由于机体的随机抽样,导致一个种群中等位基因频率的变化。
24.Linked Genes(连锁基因): Geneslocated together on a chromosome. Linked genes can assort separately from oneanother only through crossing over.位于染色体上一起的基因,连锁基因只有通过交叉互换才能进行单独的组合。在染色体上,两个基因越接近,它们越有可能是连锁基因。分为完全连锁和不完全连锁。
**25. Missense mutation (错义突变):**A point mutation in which a single nucleotide change results in a codonthat codes for a different amino acid. 一种点突变,其中单个核苷酸的变化导致编码不同氨基酸的密码子的改变。会改变蛋白质序列,一般发生在遗传密码子的第一位或第二位。
**26. Nonsense mutation (无义突变):**A pointmutation that results in a stop codon (nonsense codon) in a transcribed mRNA,which encodes incomplete and usually nonfunctional protein product.在转录的mRNA中引入终止密码子的点突变。会导致翻译提前终止,得到不完整、无生物功能的蛋白质。更易造成疾病。
29. Copy number variation (CNV)(拷贝数变异):Copy number variation (CNV) of DNA sequencesconstitute large segments of DNA ranging from 1 Kb that have copy numberdifferences when compared to a reference genome.与参考基因组相比,DNA序列的拷贝数变异(CNV)构成了从1 Kb不等的DNA大段,具有拷贝数差异。
基因拷贝数异变:删除、复制、插入,倒置……
CNV的致病性更强,比SNP更容易被发现,CNVs可以通过细胞遗传学研究检测到。
30. Genome Wide Association Study (GWAS)(全基因组关联分析):GWAS isthe study of genetic variation across the entire genome that is designed toassociate genetic variations (SNPs) with traits or with the presence or absenceof disease or condition. GWAS是研究整个基因组的遗传变异,旨在将遗传变异(SNPs)与性状或疾病或状态的存在或不存在联系起来,全基因组信息,结合临床和其他表型数据,提供了潜力:增加了对影响人类健康的基本生物过程的了解;疾病预测和病人护理方面的改进;实现个性化医疗的承诺.