Home » Research Overview » Applied Data Science

Applied Data Science

This page highlights a selection of my work in applied data science, where I explore how data can be transformed into insightful conclusions and knowledge discovery. My work combines statistical analysis, machine learning, and domain expertise to identify patterns, predict outcomes, and bridge the gap between theory and real-world applications.

Castor, Pollux, and Gemini

Castor, Pollux, and Gemini are three different implementations of a computational pipeline that projects high-dimensional data into an interactive visualization, combining both dimension reduction and clustering algorithms in the projection. In addition to DR and clustering, semantic interactions are implemented within the interface, allowing an analyst to provide feedback to an underlying learning component. With this feedback, an analyst can interact directly with the data to express their interests, incrementally training those underlying models and causing future updates to the visualization to better reflect the dimensions of interest to the analyst. The overarching goal of this research is to explore both the design and interaction space at the intersection of dimension reduction algorithms, clustering algorithms, and semantic interaction.

Related publications:

Google Scholar icon LinkedIn icon orcID icon