Seth Chatterton

Master of Business Analytics Student at MIT

Seth Chatterton

About Me

I am a current student in the Master of Business Analytics program at MIT. I have experience in data science, machine learning, optimization, and management consulting. My current research with Professor Dimitris Bertsimas is on using large language models for document understanding and structured information extraction. Previously, I was a Consultant at Sage Analysis Group, where I used computer vision, natural language processing, and simulation modeling to solve complex client problems. I'm interested in Data Scientist positions and similar roles, especially in roles where I can have direct business impact.

Resume

Email: setcha@mit.edu

LinkedIn: seth-chatterton

GitHub: setcha

Research Advisor: Dimitris Bertsimas

Projects

The Future of Work - Navigating Workforce Dynamics in the Age of GenAI

8/2024

In this project, I worked in collaboration with Accenture and my partner Pranav to develop a comprehensive simulation framework designed to help companies navigate the workforce changes driven by Generative AI (GenAI). Our solution models individual career paths, skill development, job transitions, and task demands to provide deep insights into the evolving landscape of work. By simulating the impact of different GenAI adoption strategies, the framework enables companies to experiment with various scenarios and identify the most effective approaches for their unique needs. This tool empowers organizations to make data-driven decisions on reskilling and workforce optimization, directly linking these strategies to potential business outcomes like increased productivity, faster GenAI integration, and revenue growth. By aligning workforce capabilities with strategic technology initiatives, our framework positions companies to not only adapt to the transformative power of GenAI but also to leverage it as a key driver of competitive advantage and economic value.

See the poster

Technologies: AgentPy, PyTorch, scikit-learn, LangChain, pandas

Using Diffraction Patterns to Diagnose Glaucoma

5/2024

Example synthetic data from glaucoma and vasculature diffraction dataset

Glaucoma is an eye disease which can lead to blindness. One possible biomarker of this disease is reduced branching of the blood vessel vasculature. High resolution vasculature imaging using optical coherence tomography is technically challenging and expensive, limiting widespread clinical viability. Instead of directly imaging vasculature, lasers could be used to scatter light off of vasculature and produce unique network-specific speckle patterns. We use a synthetic dataset of vasculature and simulated diffraction patterns of this vasculature to try to differentiate between healthy and diseased eyes.

See the presentation

Technologies: PyTorch, scikit-learn, diffractio

Does Hospital Managment Save Lives?

5/2024

Using data from the World Management Survey we can answer the question, “How does the management of a hospital affect patient outcomes?” Through a cross-sectional study of hospitals from 5 countries, surveyed over 3 years, we can analyze the relationship between management quality and acute myocardial infarction (heart attack) mortality rates. We implemented two linear regression models to try to estimate the causal relationship between management practices on heart attack mortality rates: fixed-effects and instrumental variable estimation.

View the R markdown

Read the report

See the presentation

Technologies: R

Optimal Routing for Electric Vehicle Last-Mile Delivery

12/2023

Vehicle Paths

Developed an optimal routing algorithm for electric vehicle last-mile delivery using Gurobi and Julia. The project focused on minimizing delivery time and maximizing efficiency, while respecting battery range and recharging time constraints.

Read the full report

Technologies: Gurobi, Julia

Language Modeling using Tree-Based Methods

12/2023

Language Modeling Tree

Implemented tree-based models for natural language processing tasks. The project involved extensive data analysis and model training using Python libraries.

Read the full report

Technologies: Python, scikit-learn, pandas

Predicting Views of Viral Youtube Videos

12/2023

Viral Video Correlations

Created a predictive model to estimate the views of YouTube videos. The model was trained on various features such as video category, time published, colors in the thumbnail, and attributes of the title.

Read the full report

Technologies: Python, scikit-learn, pandas

Particle Swarm Optimization with Flocking and Genetic Programming

8/2018

Implemented a particle swarm optimization algorithm using the flocking behavior of birds for improved information diffusion, and genetic programming to choose flocking parameters and behaviors. The project aimed to create an algorithm capable of solving complex optimization problems.

View the poster

Technologies: Java, Python

U.S. Patent No. 10,118,670 - Strain Gauge Device and Approach

8/2016

Involved in the development of a novel strain gauge device for measuring the rigging tension on sailing vessels. The project resulted in a successful patent filing.

Read the patent

Technologies: Arduino, 3D printing