Presence / Absence Marker Discovery in RAD Markers for Multiplexed Samples in the Context of Next-Generation Sequencing

Nikooienejad, Amir

dc.contributor.advisor	Yoon, Byung-Jun
dc.contributor.advisor	Dabney, Alan
dc.creator	Nikooienejad, Amir
dc.date.accessioned	2013-12-16T20:04:15Z
dc.date.available	2015-08-01T05:48:33Z
dc.date.created	2013-08
dc.date.issued	2013-07-24
dc.date.submitted	August 2013
dc.identifier.uri	https://hdl.handle.net/1969.1/151135
dc.description.abstract	Recent improvements in sequencing technologies have given rise to a variety of interesting problems. The availability of millions of read sequences as the final product of genome sequencing, at a lower cost than in the microarray era, has encouraged scientists to enhance earlier methods in many areas of bioinformatics. One such area is genotyping and the construction of genetic maps, which are used to study inherited genotypes and analyze specific traits in a population; it involves generating different genetic markers and identifying polymorphisms among the individuals of a population. Presence/absence markers are the main focus of this thesis. A presence/absence marker is a type of Restriction site Associated DNA (RAD) marker that is present in some samples and absent in others, indicating variation in the cut site of a restriction enzyme. However, the counts of markers in an experiment are highly correlated, and calling true presence and absence is not straightforward: a marker with a zero count is not necessarily absent from the sample under study, and a marker with a non-zero count is not necessarily present. A model that fits the data well can make accurate calls. We propose two different contexts for designing such models and investigate their performance. In addition, using next-generation sequencing even more efficiently requires the ability to multiplex a large number of samples in a single experimental run. In that case, barcoding that is robust to the various sources of noise in the sequencing machine becomes paramount. Designing such barcodes efficiently is a challenging task, which this thesis addresses in detail as a second problem. We make two contributions. First, we propose an algorithm for barcoding multiplexed RADSeq samples. Second, we propose an algorithm for the statistical selection of presence/absence markers on the basis of RADSeq data from two related individuals. The operating characteristics of our methods are explored using both simulated and real data.	en
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject	NGS	en
dc.subject	Barcode	en
dc.subject	Markers	en
dc.subject	RADSeq	en
dc.subject	PAV	en
dc.title	Presence / Absence Marker Discovery in RAD Markers for Multiplexed Samples in the Context of Next-Generation Sequencing	en
dc.type	Thesis	en
thesis.degree.department	Electrical and Computer Engineering	en
thesis.degree.discipline	Electrical Engineering	en
thesis.degree.grantor	Texas A & M University	en
thesis.degree.name	Master of Science	en
thesis.degree.level	Masters	en
dc.contributor.committeeMember	Dougherty, Edward R.
dc.contributor.committeeMember	Chamberland, Jean-Francois
dc.type.material	text	en
dc.date.updated	2013-12-16T20:04:15Z
local.embargo.terms	2015-08-01

Files in this item

Name: NIKOOIENEJAD-THESIS-2013.pdf
Size: 860.0Kb
Format: PDF

This item appears in the following Collection(s)

Electronic Theses, Dissertations, and Records of Study (2002– )
Texas A&M University Theses, Dissertations, and Records of Study (2002– )
