Machine Learning

Need help with Machine Learning?

Visit Drop-in Hours or Schedule a Consultation: <link to an embedded google calendar OB widget or google form widget>

Below are the consultant we have available with Machine Learning and other expertise listed.

R Machine Learning with tidymodels: Parts 1-2

February 24, 2025, 3:00pm

Machine learning often evokes images of Skynet, self-driving cars, and computerized homes. However, these ideas are less science fiction as they are tangible phenomena that are predicated on description, classification, prediction, and pattern recognition in data. During this two part workshop, we will discuss basic features of supervised machine learning algorithms including k-nearest neighbor, linear regression, decision tree, random forest, boosting, and ensembling using the tidymodels framework. To social scientists, such methods might be critical for investigating evolutionary relationships, global health patterns, voter turnout in local elections, or individual psychological diagnoses.

Read more about R Machine Learning with tidymodels: Parts 1-2

Python Machine Learning Fundamentals: Parts 1-2

April 8, 2025, 12:00pm

This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Read more about Python Machine Learning Fundamentals: Parts 1-2

Python Machine Learning Fundamentals: Parts 1-2

February 24, 2025, 2:00pm

Read more about Python Machine Learning Fundamentals: Parts 1-2

Language Models in Mental Health Conversations – How Empathetic Are They Really?

December 3, 2024

Sohail Khan

by Sohail Khan. Language models are becoming integral to daily life as trusted sources of advice. While their utility has expanded from simple tasks like text summarization to more complex interactions, the empathetic quality of their responses is crucial. This article explores methods to assess the emotional appropriateness of these models, using metrics such as BLEU, ROUGE, and Sentence Transformers. By analyzing models like LLaMA in mental health dialogues, we learn that while they suffer through traditional word-based metrics, LLaMA's performance in capturing empathy through semantic similarity is promising. In addition, we must advocate for continuous monitoring to ensure these models support their users' mental well-being effectively.

Read more about Language Models in Mental Health Conversations – How Empathetic Are They Really?

A Recipe for Reliable Discoveries: Ensuring Stability Throughout Your Data Work

November 19, 2024

Jaewon Saw

by Jaewon Saw. Imagine perfecting a favorite recipe, then sharing it with others, only to find their results differ because of small changes in tools or ingredients. How do you ensure the dish still reflects your original vision? This challenge captures the principle of stability in data science: achieving acceptable consistency in outcomes relative to reasonable perturbations of conditions and methods. In this blog post, I reflect on my research journey and share why grounding data work in stability is essential for reproducibility, adaptability, and trust in the final results.

Read more about A Recipe for Reliable Discoveries: Ensuring Stability Throughout Your Data Work

Python Deep Learning: Parts 1-2

September 24, 2024, 2:00pm

The goal of this workshop is to build intuition for deep learning by building, training, and testing models in Python. Rather than a theory-centered approach, we will evaluate deep learning models through empirical results.

Read more about Python Deep Learning: Parts 1-2

Python Deep Learning: Parts 1-2

November 18, 2024, 9:00am

Read more about Python Deep Learning: Parts 1-2

Python Machine Learning Fundamentals: Parts 1-2

November 19, 2024, 1:00pm

Read more about Python Machine Learning Fundamentals: Parts 1-2

Leveraging Large Language Models for Analyzing Judicial Disparities in China

October 8, 2024

Nanqin Ying

by Nanqin Ying. This study analyzes over 50 million judicial decisions from China’s Supreme People’s Court to examine disparities in legal representation and their impact on sentencing across provinces. Focusing on 290 000 drug-related cases, it employs large language models to differentiate between private attorneys and public defenders and assess their sentencing outcomes. The methodology combines advanced text processing with statistical analysis, using clustering to categorize cases by province and representation, and regression models to isolate the effect of legal representation from factors like drug quantity and regional policies. Findings reveal significant regional disparities in legal access driven by economic conditions, highlighting the need for reforms in China’s legal aid system to ensure equitable representation for marginalized groups and promote transparent judicial data for systemic improvements.

Read more about Leveraging Large Language Models for Analyzing Judicial Disparities in China

R Machine Learning with tidymodels: Parts 1-2

October 14, 2024, 1:00pm

Read more about R Machine Learning with tidymodels: Parts 1-2

« first View: Taxonomy term
‹ previous View: Taxonomy term
1 of 12 View: Taxonomy term
2 of 12 View: Taxonomy term
3 of 12 View: Taxonomy term (Current page)
4 of 12 View: Taxonomy term
5 of 12 View: Taxonomy term
6 of 12 View: Taxonomy term
7 of 12 View: Taxonomy term
8 of 12 View: Taxonomy term
9 of 12 View: Taxonomy term
…
next › View: Taxonomy term
last » View: Taxonomy term