• Article
  • | |
  • Metrics
  • |
  • Reference [16]
  • |
  • Related [20]
  • |
  • Cited by
  • | |
  • Comments
    Abstract:

    The problems of haplotyping and haplotype frequency estimation on trio genotype data under the Mendelian law of inheritance and the assumption of Hardy-Weinberg equilibrium are studied in this paper. Since most past efforts only focused on haplotyping on genotype data of unrelated individuals and data with general pedigrees, but gave insufficient efforts to the special case of trio genotype data, there is coming an increasing demand in analyzing them in particular, especially when taking into account that part of HAPMAP database is exactly trio data. This paper presents a two-staged method to estimate haplotype frequencies in trios: i) haplotyping stage, find haplotype configurations without recombinant for each trio; ii) frequency estimation stage, use the expectation-maximization (EM) algorithm to estimate haplotype frequencies based on these inferred haplotype configurations. Both the haplotyping algorithm and the EM algorithm are implemented in software package TRIOHAP using C language. Its effectiveness and efficiency and tested on simulated and real data sets as well. The experimental results show that, TRIOHAP runs much faster than a popular frequency estimation software which discards trio information. Moreover, because TRIOHAP utilizes such information, its estimation is more reliable.

    Reference
    [1]Clark AG.Inference of haplotypes from PCR-amplified samples of diploid populations.Mol.Biol.Evol.,1990,7(2):111-122.
    [2]Zhang QF,Che HY,Chen GL,Sun G.A practical algorithm for haplotyping by maximum parsimony.Journal of Software,2005,16(10):1699-1707 (in English with Chinese abstract).http://www.jos.org.cn/1000-9825/16/1699.htm
    [3]Excoffier L,Slatkin M.Maximum-Likelihood estimation of molecular haplotype frequencies in a diploid population.Molecular Biology and Evolution,1995,12(5):921-927.
    [4]Stephens M,Smith NJ,Donnelly P.A new statistical method for haplotype reconstruction for population data.American Journal of Human Genetics,2001,68(4):978-989.
    [5]Niu T,Qin ZS,Xu X,Liu JS.Bayesian haplotype inference for multiple linked single nucleotide polymorphisms.American Journal of Human Genetics,2002,70(1):157-169.
    [6]Lin S,Speed TP.An algorithm for haplotype analysis.Journal Computational Biology,1997,4(4):35-46.
    [7]Tapadar T,Ghosh S,Majumder PP.Haplotyping in pedigrees via a genetic algorithm.Human Heredity,2000,50(1):43-56.
    [8]Qian D,Beckmann L.Minimum-Recombinant haplotyping in pedigrees.American Journal of Human Genetics,2002,70(6):1434-1445.
    [9]Li J,Jiang T.Efficient rule-based haplotyping algorithms for pedigree data.In:Proc.of the RECOMB 2003.2003.197-206.
    [10]Li J,Jiang T.An exact solution for finding minimum recombinant haplotype configurations on pedigrees with missing data by integer linear programming.In:Proc.of the RECOMB 2004.2004.20-29.
    [11]Elston RC,Stewart J.A general model for the genetic analysis of pedigree data.Human Heredity 1971,21(6):523-542.
    [12]Griffiths A,Gelbart W,Lewontin R,Miller J.Modern Genetic Analysis:Integrating Genes and Genomes.New York:W.H.Freeman and Company,2002.
    [13]Fallin D,Schork NJ.Accuracy of haplotype frequency estimation for biallelic loci,via the expectation-maximization algorithm for unphased diploid genotype data.American Journal of Human Genetics,2000,67(4):947-959.
    [14]Zhao HY,Zhang SL,Merikangas KR,Trixler M,Wildenauer DB,Sun FZ,Kidd KK.Transmission/Disequilibrium tests using multiple tightly linked markers.American Journal of Human Genetics,2000,67(4):936-946.
    [15]O'Connell JR.Zero-Recombinant haplotyping:applications to fine mapping using SNPs.Genetic Epidemiology,2000,19(Suppl.1):s64-s70.
    [2]张强锋,车皓阳,陈国良,孙广中.最大节约原则下单倍型推导问题的实用算法.软件学报,2005,16(10):1699-1707.http://www.jos.org.cn/ 1000-9825/16/1699.htm
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

张强锋,徐云,陈国良,车皓阳.三元家庭基因数据的单体分型和单体型频率估计.软件学报,2007,18(9):2090-2099

Copy
Share
Article Metrics
  • Abstract:4148
  • PDF: 5268
  • HTML: 0
  • Cited by: 0
History
  • Received:December 21,2004
  • Revised:March 31,2006
You are the first2051453Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063