Machine Learning

Need help with Machine Learning?

Visit Drop-in Hours or Schedule a Consultation: <link to an embedded google calendar OB widget or google form widget> 

Below are the consultant we have available with Machine Learning and other expertise listed.

R Introduction to Deep Learning: Parts 1-2

November 17, 2021, 10:00am
This workshop introduces the basic concepts of Deep Learning — the training and performance evaluation of large neural networks, especially for image classification, natural language processing, and time-series data. Like many other machine learning algorithms, we will use deep learning algorithms to map input data to their appropriately classified outcome labels.

CANCELED: Python Machine Learning Fundamentals: Parts 1-2

November 14, 2022, 4:00pm
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Skyler Yumeng Chen

Data Science for Social Justice Fellow 2024
Haas School of Business

Skyler is a Ph.D. student in Behavioral Marketing at the Haas School of Business. Her research centers on consumer behavior and judgment and decision-making, with a keen interest in both experimental methods and data science techniques. She holds a B.A. in Economics and a B.S. in Data Science from New York University Shanghai.

Grace Hu

Data Science for Social Justice Fellow 2024
Bioengineering

Grace is a 3rd year Bioengineering PhD candidate in the joint UC Berkeley-UCSF Graduate Program. Her research lies at the nexus of computational design and 3D-bioprinting to advance tissue engineering for regenerative medicine. She previously studied Materials Science and Engineering (B.S.) and Computer Science (M.S.) at Stanford University, where she investigated printable batteries to power an ultra-affordable scanning electron microscope and explored computer science education research by developing AI models to augment teaching ability.

In her free time she...

Hugh Kadhem

Mathematics

Hugh Kadhem is a Ph.D. student in Applied Mathematics, with broad research interests in computational quantum physics and high-performance scientific computing.

Sand Mining - Plugging a Critical Data Gap

May 14, 2024
by Suraj Nair. Excessive sand mining is causing a global ecological crisis. In this blog post, I present why sand mining is one of the most pressing challenges facing the planet, and why persistent data gaps hinder accountability and monitoring. I also discuss an ongoing research project of mine where we combine freely available satellite imagery and machine learning models to build open-source sand mine detection tools that can plug some of these data gaps.

Tactics for Text Mining non-Roman Scripts

April 15, 2024
by Hilary Faxon, Ph.D. & Win Moe. Non-Roman scripts pose particular challenges for text mining. Here, we reflect on a project that used text mining alongside qualitative coding to understand the politicization of online content following Myanmar’s 2021 military coup.

Computational Social Science in a Social World: Challenges and Opportunities

March 26, 2024
by José Aveldanes. The rise of AI, Machine Learning, and Data Science are harbingers of the need for a significant shift in social science research. Computational Social Science enables us to go beyond traditional methods such as Ordinary Least Squares, which face challenges in addressing complexities of social phenomena, particularly in modeling nonlinear relationships and managing high-dimensionality data. This paradigmatic shift requires that we embrace these new tools to understand social life and necessitates understanding methodological and ethical challenges, including bias and representation. The integration of these technologies into social science research calls for a collaborative approach among social scientists, technologists, and policymakers to navigate the associated risk and possibilities of these new tools.

Using Big Data for Development Economics

March 18, 2024
by Leïla Njee Bugha. The proliferation of new sources of data emerging from 20th and 21st century technologies such as social media, internet, and mobile phones offers new opportunities for development economics research. Where such research was limited or impeded by existing data gaps or limited statistical capacity, big data can be used as a stopgap and help accurately quantify economic activity and inform policymaking in many different fields of research. Reduced cost and improved reliability are some key benefits of using big data for development economics, but as with all research designs, it requires thoughtful consideration of potential risks and harms.

Dive into the Future of AI with the LLM Working Group at D-Lab

February 7, 2024
by Tom van Nuenen, Celebrating 10 years of innovation in data-intensive social science, D-Lab in collaboration with Grad Div is excited to introduce the LLM Working Group, an initiative focused on the exploration and discussion of Large Language Models (LLMs) like ChatGPT within academic research and teaching. This group aims to unite scholars, students, and data scientists to address crucial questions about AI's role in academia, including access, impact, creativity, and learning in the age of information automation. Through a series of interactive sessions, participants will gain insights into LLM capabilities, discuss ethical considerations, and explore innovative approaches to utilizing these tools in their work. Whether you're an AI veteran or a novice curious about the potentials of GenAI, the LLM Working Group offers a collaborative platform to learn, share, and shape the future of academic inquiry. Join us in navigating the world of LLMs together.