Graduate Theses & Dissertations

Pages

Modelling Request Access Patterns for Information on the World Wide Web
In this thesis, we present a framework to model user object-level request patterns in the World Wide Web.This framework consists of three sub-models: one for file access, one for Web pages, and one for storage sites. Web Pages are modelled to be made up of different types and sizes of objects, which are characterized by way of categories. We developed a discrete event simulation to investigate the performance of systems that utilize our model.Using this simulation, we established parameters that produce a wide range of conditions that serve as a basis for generating a variety of user request patterns. We demonstrated that with our framework, we can affect the mean response time (our performance metric of choice) by varying the composition of Web pages using our categories. To further test our framework, it was applied to a Web caching system, for which our results showed improved mean response time and server load. Author Keywords: discrete event simulation (DES), Internet, performance modelling, Web caching, World Wide Web
Application of Data Science to Paramedic Data
Paramedic data has significant potential for research. Paramedics see many patients every year and collect a wide variety of crucial data at each encounter. This data is rarely used for good reason: it’s messy and hard to work with. But like theunderdog character in a classic movie, with a little bit of work and a lot of understanding, paramedic data has significant potential to change the world of medical research. Paramedics throughout the world are involved in research every day, but most of this research uses purpose-built data structures and never takes advantage of the existing data that paramedics create as part of their everyday work. Through a project-based approach grounded in developing a better understanding of the opioid crisis, this thesis will examine the quantity and structure of the existing paramedic data, the complexities of its current design, the steps necessary to access it, and the processes necessary to clean existing data to a point where it can be easily modelled. Once we have our dataset, we will explore the challenges of choosing key metrics by examining the effectiveness of metrics currently employed to monitor the opioid crisis and the influences public health programs and changing policies have had on these metrics. Next, we will explore the temporal distributions of opioid and other intoxicant use with an eye to providing data to support public health in their harm reduction efforts. And lastly, we will look at the effect of fixed- and floating-point temporal influences on intoxicant-related calls with an eye to how these temporal points can affect call volumes. By using this exploration of the opioid crisis, this thesis will show that with a more thorough understanding of what paramedic data is, what data points are available, and the processes needed to transform it, paramedic data has the potential to greatly expand the limits of health care data science into a more precise and more all-encompassing discipline. Author Keywords: Ambulance, Data Science, Opioid, Overdose, Paramedic, Pre-hospital
Machine Learning for Aviation Data
This thesis is part of an industry project which collaborates with an aviation technology company on pilot performance assessment. In this project, we propose utilizing the pilots' training data to develop a model that can recognize the pilots' activity patterns for evaluation. The data will present as a time series, representing a pilot's actions during maneuvers. In this thesis, the main contribution is focusing on a multivariate time series dataset, including preprocessing and transformation. The main difficulties in time series classification is the data sequence of the time dimension. In this thesis, I developed an algorithm which formats time series data into equal length data. Three classification and two transformation methods were used. In total, there are six models for comparison. The initial accuracy was 40%. By optimization through resampling, we increased the accuracy to 60%. Author Keywords: Data Mining, K-NN, Machine Learning, Multivariate Time Series Classification, Time Series Forest
Particulate Matter Component Analyses in Relation to Public Health in Canada
This thesis explores the shot-term relationship between exposure to ambient air pollution and human health through metrics such as mortality and hospitalization in Canada. We begin by detailing the organization and interpolation of air pollution data from its partially quality-controlled source form. Analyses of seasonal, regional and temporal trends of all major components of PM2.5, was performed, showing a seasonal variation across most regions and validating the dataset. A one-pollutant statistical Generalized Additive Model was applied to the data, estimating the health risk associated with exposure to thirteen different components of PM2.5. The selected components were based on those that compromised the majority of the mass and included: sulphate, nitrate, zinc, silicon, iron, nickel, vanadium, potassium, organic carbon, organic matter, elemental carbon, total carbon. Trends based on annual estimates of the association for PM2.5, and its constituents,were compared, showing that carbonaceous compounds, sulphate and nitrate had similar estimates of association. Many estimates, as is common in population ecologic epidemiology, had association estimates statistically indistinguishable from zero, but with clear features of interest, including evident differences between cold and warm season associations in Canada's temperate climate. A method to model two correlated pollutants (in this case, PM2.5 and O3) was developed using thin plate splines. In this approach, the location of the response surface (after accounting for the temperature, a smooth function of time and day of week) that corresponds to the average pollutant concentration and the average plus one unit was used as the estimate of the joint contribution of pollutants due to a unit increase. The estimates from the thin plate spline (TPS) approach were compared to the single pollutant models, with large increases and decreases in PM2.5 and O3 being captured in the TPS estimates. However, this approach indicated significantly larger error in the estimates than would be expected, indicating a possible future area for refinement. Author Keywords: Air pollution, Environmental Epidemiology, Generalized Additive Models, Human Health, Multivariate Models, Thin Plate Splines
ADHD Symptomatology Across Adulthood
Objective: To improve on several methodological issues and research gaps regarding current literature investigating the stability of ADHD symptomatology across adulthood and relationships between the two core ADHD symptom dimensions (i.e., inattention and hyperactivity-impulsivity) and multiple life outcomes in adults. Method: A large sample of postsecondary students were initially assessed for ADHD symptomatology using the Conners’ Adult ADHD Rating Scale (CAARS). Six years later, academic success was assessed using students’ official academic records (e.g., final GPAs and degree completion status), and fifteen years later, participants were re-assessed using the CAARS and several measures of life success (e.g., relationship satisfaction, career satisfaction, and stress levels). Results: Inattention and hyperactivity-impulsivity symptoms showed strong stability across the 15-year period. Additionally, greater inattention symptoms during emerging adulthood and early middle adulthood were consistently associated with poorer life success (e.g., lower GPAs, poorer relationship and career satisfaction), particularly for men. Associations for hyperactivity-impulsivity symptoms were less consistent. Conclusion: ADHD symptomatology can be conceptualized as a stable, dimensional trait across adulthood, with robust associations with measures of life success. Author Keywords: academic success, ADHD, adults, job satisfaction, relationship satisfaction, stability
Assessing factors associated with wealth and health of Ontario workers after permanent work injury
I drew on Bourdieu’s theory of capital and theorized that different forms of economic, cultural and social capital which injured workers possessed and/or acquire over their disability trajectory may affect certain outcomes of permanent impairments. Using data from a cross-sectional survey of 494 Ontario workers with permanent impairments, I measured workers’ different indicators of capital in temporal order. Hierarchical regression analyses were used to test the unique association of workers’ individual characteristics, pre-injury capital, post-injury capital, and the outcomes of permanent impairments. The results show that factors related to individual characteristics, pre-injury and post-injury capital were associated with workers’ perceived health change, whereas pre-injury and post-injury capital were most relevant factors in explaining workers’ post-injury employment status and income recovery. When looking at the significance of individual predictors, post-injury variables were most relevant in understanding the outcomes of permanent impairment. The findings suggest that many workers faced economic and health disadvantages after permanent work injury. Author Keywords: Bourdieu, hierarchical regression, theory of capital, work-related disability, workers with permanent impairments
Prescription Drugs
Medication used to treat human illness is one of the greatest developments in human history. In Canada, prescription drugs have been developed and made available to treat a wide variety of illnesses, from infections to heart disease and so on. Records of prescription drug fulfillment at coarse Canadian geographic scales were obtained from Health Canada in order to track the use of these drugs by the Canadian population. The obtained prescription drug fulfillment records were in a variety of inconsistent formats, including a large selection of years for which only paper tabular records were available (hard copies). In this work, we organize, digitize, proof and synthesize the full available data set of prescription drug records, from paper to final database. Extensive quality control was performed on the data before use. This data was then analyzed for temporal and spatial changes in prescription drug use across Canada from 1990-2013. In addition, one of major research areas in environmental epidemiological studies is the study of population health risk associated with exposure to ambient air pollution. Prescription drugs can moderate public health risk, by reducing the drug user's physiological symptoms and preventing acute health effects (e.g., strokes, heart attacks, etc.). The cleaned prescription drug data was considered in the context of a common model to examine its influence on the association between air pollution exposure and various health outcomes. Since, prescription drug data were available only at the provincial level, a Bayesian hierarchical model was employed to include the prescription drugs as a covariate at regional level, which were then combined to estimate the association at national level. Although further investigations are required, the study results suggest that the prescription drugs influenced the air pollution related public health risk. Author Keywords: Data, Error checking, Population health, Prescriptions
Capital Ratios and Liquidity Creation
Using quarterly data from the six largest Canadian banks, we investigate the relationship between regulatory capital ratio and on-balance sheet liquidity created in the Canadian economy by “Big Six”. We find a significant positive relationship between Tier 1 capital ratio and on-balance sheet liquidity creation for Canadian big six banks, implying that large banks in Canada favor risks and rely on capital to fund illiquid assets. In contrast, for smaller banks, the relationship is significantly negative. Our results are robust to dynamic panel regression using 2-Step GMM, two exogenous shocks - COVID-19 crisis and the Global Financial Crisis (2007-2009), mergers & acquisitions activities in the banking industry, and core deposits financing. The COVID-19 pandemic and core deposits adversely impact the Tier 1 capital ratio’s relationship with on-balance-sheet liquidity creation, while the global financial crisis (2007-2009) effect on the association is insignificant. Author Keywords: Big Six, COVID -19, Deposits, Liquidity Creation, Tier 1 Capital Ratio,
Time Series Algorithms in Machine Learning - A Graph Approach to Multivariate Forecasting
Forecasting future values of time series has long been a field with many and varied applications, from climate and weather forecasting to stock prediction and economic planning to the control of industrial processes. Many of these problems involve not only a single time series but many simultaneous series which may influence each other. This thesis provides methods based on machine learning of handling such problems. We first consider single time series with both single and multiple features. We review the algorithms and unique challenges involved in applying machine learning to time series. Many machine learning algorithms when used for regression are designed to produce a single output value for each timestamp of interest with no measure of confidence; however, evaluating the uncertainty of the predictions is an important component for practical forecasting. We therefore discuss methods of constructing uncertainty estimates in the form of prediction intervals for each prediction. Stability over long time horizons is also a concern for these algorithms as recursion is a common method used to generate predictions over long time intervals. To address this, we present methods of maintaining stability in the forecast even over large time horizons. These methods are applied to an electricity forecasting problem where we demonstrate the effectiveness for support vector machines, neural networks and gradient boosted trees. We next consider spatiotemporal problems, which consist of multiple interlinked time series, each of which may contain multiple features. We represent these problems using graphs, allowing us to learn relationships using graph neural networks. Existing methods of doing this generally make use of separate time and spatial (graph) layers, or simply replace operations in temporal layers with graph operations. We show that these approaches have difficulty learning relationships that contain time lags of several time steps. To address this, we propose a new layer inspired by the long-short term memory (LSTM) recurrent neural network which adds a distinct memory state dedicated to learning graph relationships while keeping the original memory state. This allows the model to consider temporally distant events at other nodes without affecting its ability to model long-term relationships at a single node. We show that this model is capable of learning the long-term patterns that existing models struggle with. We then apply this model to a number of real-world bike-share and traffic datasets where we observe improved performance when compared to other models with similar numbers of parameters. Author Keywords: forecasting, graph neural network, LSTM, machine learning, neural network, time series
Assessing the Cost of Reproduction between Male and Female Sex Functions in Hermaphroditic Plants
The cost of reproduction refers to the use of resources for the production of offspring that decreases the availability of resources for future reproductive events and other biological processes. Models of sex-allocation provide insights into optimal patterns of resource investment in male and female sex functions and have been extended to include other components of the life history, enabling assessment of the costs of reproduction. These models have shown that, in general, costs of reproduction through female function should usually exceed costs through male function. However, those previous models only considered allocations from a single pool of shared resources. Recent studies have indicated that the type of resource currency can differ for female and male sex functions, and that this might affect costs of reproduction via effects on other components of the life history. Using multiple invasibility analysis, this study examined resource allocation to male and female sex functions, while simultaneously considering allocations to survival and growth. Allocation patterns were modelled using both shared and separate resource pools. Under shared resources, allocation patterns to male and female sex function followed the results of earlier models. When resource pools were separate, however, allocations to male function often exceeded allocations to female function, even if fitness gains increased less strongly with investment in male function than with investment in female function. These results demonstrate that the costs of reproduction are affected by (1) the types of resources needed for reproduction via female or male function and (2) via trade-offs with other components of the life history. Future studies of the costs of reproduction should examine whether allocations to reproduction via female versus male function usually entail the use of different types of resources. Author Keywords: Cost of Reproduction, Gain Curve, Life History, Resource Allocation Patterns, Resource Currencies
Modelling Depressive Symptoms in Emerging Adulthood
Depression during the transition into adulthood is a growing mental health concern, with overwhelming evidence linking the developmental risk for depressive symptoms with maternal depression. In addition, there is a lack of research on the protective role of socioemotional competencies in this context. This study examines independent and joint effects of maternal depression and trait emotional intelligence (TEI) on the longitudinal trajectory of depressive symptoms during emerging adulthood. A series of latent growth models was applied to three biennial cycles of data from a nationally representative sample (N=933) from the Canadian National Longitudinal Survey of Children and Youth. We assessed the trajectory of self-reported depressive symptoms from age 20 to 24 years, as well as whether it was moderated by maternal depression at age 10 to 11 and TEI at age 20, separately by gender. The results indicated that mean levels of depression declined during the emerging adulthood in females, but remained relatively stable in males. Maternal depressive symptoms significantly positively predicted depressive symptoms across the entire emerging adulthood in females, but only at age 20-21 for males. In addition, likelihood of developing depressive symptoms was attenuated by higher global TEI in both females and males, and additionally by higher interpersonal skills in males. Our findings suggest that interventions for depressive symptoms in emerging adulthood should consider development of socioemotional competencies. Author Keywords: Depression, Depressive Symptoms, Emerging Adulthood, Intergenerational Risk, Longitudinal, Trait Emotional Intelligence
Academic Efficiency
Universities produce a significant and increasing share of basic research that is later commercialized by firms. We argue that the university's prominence as a producer of basic research is the result of a differential efficiency in research production that cannot be replicated by firms or individual agents - teaching. By using research accomplishments to signal knowledge and attract tuition-paying students, universities are uniquely positioned to undertake certain types of research projects. However, in a market for innovation without patent rights, a significant and increasing number of basic research projects, that are social welfare improving, cannot be initiated by firms or universities. The extension of patent rights to university-generated research elegantly redresses this issue and leaves us to ponder important questions about the future of our innovation-driven economies. Author Keywords: Innovation, Intellectual Property Rights, Research, Science Technology and Innovation Policy

Pages

Search Our Digital Collections

Query

Enabled Filters

  • (-) ≠ Farell
  • (-) = Applied Modeling and Quantitative Methods
  • (-) = Trent University Graduate Thesis Collection
  • (-) ≠ Avusuglo

Filter Results

Date

2004 - 2024
(decades)
Specify date range: Show
Format: 2024/03/19