Show simple item record

dc.contributor.advisorReece, Julia
dc.contributor.advisorMisra, Siddharth
dc.creatorCarpp, Timothy Ryan
dc.date.accessioned2023-10-12T14:41:49Z
dc.date.created2023-08
dc.date.issued2023-08-10
dc.date.submittedAugust 2023
dc.identifier.urihttps://hdl.handle.net/1969.1/200007
dc.description.abstractIn the digital age, leveraging the enormous amounts of data available is critical to understanding the petroleum system. One example is chemometric analysis (the application of multivariate statistics to the analysis of chemical data), which has been applied to the Permian Basin as a whole. However, previously published studies have lacked data from the Delaware Basin. The standard chemometric methodology is principal component analysis (PCA) and hierarchical clustering (HCA). The objective of this study is to apply Uniform Manifold Approximation and Projection (UMAP) and mean-shift methodology as well as the traditional methodology to the same proprietary dataset from the Delaware Basin and compare their performances to gain new insights into the petroleum genetic groups in the basin. Thirty-three geochemical parameters were selected from the dataset and were given to both algorithms for analysis. UMAP identified seven different groups while PCA and HCA identified four different groups. The groups generated by both models were compared to source rock extracts and oil samples from the Midland Basin that were published by Echegu (2013) for external validation. The comparison revealed that a first order control on the characteristics of the oil families is whether their source rock was deposited in the Permian or Tobosa Basin. Furthermore, PCA and HCA created one group for oils originating from the Tobosa Basin whereas UMAP differentiates the Tobosa Basin oils into two groups. These two oil groups match the geochemical characteristics of the Woodford and Barnett Formations’ source rock extracts from the dataset in Echegu (2013). For the Permian Basin petroleum systems this study shows that the most probable source rocks for the groups produced by both models are carbonate and shale sources in the Bone Spring and Wolfcamp Formations respectively. Furthermore, the differences between the results from both models are not contradictory but are complimentary. The data analytics supports that these differences are the result of UMAP & mean-shift emphasizing geochemical parameters that are related to thermal maturity whereas PCA & HCA emphasizes the depositional environment.
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectGeochemistry
dc.subjectChemometric Analysis
dc.subjectData Analytics
dc.subjectPetroleum Systems
dc.subjectMachine Learning
dc.titleApplication of Data Analytics to Chemometric Analysis of Conventionally Produced Oil Samples From the Delaware Basin
dc.typeThesis
thesis.degree.departmentGeology and Geophysics
thesis.degree.disciplineGeology
thesis.degree.grantorTexas A&M University
thesis.degree.nameMaster of Science
thesis.degree.levelMasters
dc.contributor.committeeMemberBecker, Mauro
dc.type.materialtext
dc.date.updated2023-10-12T14:41:53Z
local.embargo.terms2025-08-01
local.embargo.lift2025-08-01
local.etdauthor.orcid0009-0007-1286-6919


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record