Web Scraping

Sahiba Chopra

Data Science Fellow 2024-2025
Haas

I'm a PhD student in the Management and Organizations (Macro) group at Berkeley Haas. I have a diverse professional background, primarily as a data scientist across numerous industries, including fintech, cleantech, and media. I hold a BA in Economics from the University of Maryland, an MS in Applied Economics from the University of San Francisco, and an MS in Business Administration from UC Berkeley.

My research focuses on the intersection of inequality, technology, and the labor market. I am particularly interested in understanding how to reduce inequality in...

Stephanie Andrews

Data Science for Social Justice Senior Fellow 2024
Info & Data Science MIDS

Stephanie Andrews is currently studying data science in the MIDS program, having previously majored in Social Welfare as an undergraduate at Cal. After graduating, she worked as an advocate for survivors of gender-based violence, as a public policy analyst focusing on anti-trafficking initiatives, and as a software engineer for progressive and social impact organizations. She is now conducting research with the Human Rights Center's Investigations Lab, using OSINT and data science methods to investigate human rights violations.

Kurt Soncco Sinchi

Data Science for Social Justice Fellow 2024
Civil Engineering

A first-generation student looking to improve and apply core data science concepts in socially impactful projects, and to leverage information from previous cases for better insights into society. Focused on infrastructure and its impact under natural disasters.

Chirag Manghani

Consultant
School of Information

Chirag is a second-year graduate student at the I-School. Proficient in Python, Java, R, and SQL, he works across software application development, machine learning, and data science. His keen interest lies in data analysis and statistical methods, driving him to bridge theory and practice seamlessly. Chirag's dedication to excellence, adaptable mindset, and innate curiosity define him as a dynamic problem solver in the ever-evolving tech landscape.

Aaron Culich

Deputy Director of D-Lab; Cyberinfrastructure Architect and Consulting Lead

Aaron Culich is a staff member at the D-Lab with expertise in Cloud Computing, High Performance Computing (HPC), Databases (SQL and NoSQL), JupyterHub and BinderHub infrastructure, and a variety of programming languages (Python, R, Java, C, C++, and more). His ongoing mission is to explore new compute possibilities, discover useful tools and practices, and make them more accessible to researchers on campus and beyond.

How can we use big data from iNaturalist to address important questions in Entomology?

February 26, 2024
by Leah Lee. Large-scale geographic data over time on insect diversity can be used to answer important questions in Entomology. Open-source, open-access citizen science platforms like iNaturalist generate huge amounts of data on species diversity and distribution at accelerating rates. However, unstructured citizen science data contain inherent biases and need to be used with care. One way to validate big data from iNaturalist is to cross-check it against systematically collected data, such as museum specimens.

Chirag Manghani

Consulting Drop-In Hours: Wed 1pm-3pm

Consulting Areas: Python, R, SQL, Stata, SAS, LaTeX, HTML / CSS, Javascript, C++, APIs, Cloud & HPC Computing, Cybersecurity & Data Security, Databases & SQL, Data Manipulation and Cleaning, Data Science, Data Sources, Data Visualization, Deep Learning, Machine Learning, Natural Language Processing, Python Programming, R Programming, Software Tools, Text Analysis, Web Scraping, Regression Analysis, Software Output Interpretation, Bash or Command Line, Excel, Git or Github, Qualtrics, RStudio, RStudio...

Lauren Chambers

Consulting Drop-In Hours: Wed 11am-1pm

Consulting Areas: Python, R, HTML / CSS, APIs, Data Manipulation and Cleaning, Data Science, Data Visualization, Python Programming, R Programming, Software Tools, Web Scraping, Regression Analysis, Software Output Interpretation, Bash or Command Line, Git or Github, OCR, RStudio

Quick-tip: the fastest way to speak to a consultant is to first ...

Addison Pickrell

IUSE Undergraduate Advisory Board
Mathematics
Sociology

Addison is an aspiring mathematician and social scientist (Class of '27). He loves collecting books he'll never read, is an open-source and open-access advocate, and an aspiring community organizer and systems disrupter. Ask him about community-based participatory action research (CBPAR), critical pedagogy, applied mathematics, and social science.

Searching for Other Solar Systems

November 21, 2023
by Emma Turtelboom. Over the last three decades, we have discovered more than 5,000 exoplanets, which are planets outside of our Solar System. With these observations, we can try to answer many questions we have about the universe. For example, how unique is the Solar System? How do planets form? Is there life elsewhere in the Milky Way? We can query the NASA Exoplanet Archive to compare multi-planet systems to the Solar System and see how similar (or dissimilar!) they are.
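
As a rough illustration of the kind of query mentioned in this teaser, the sketch below builds a request URL for the archive's TAP service and tallies how many host stars have a given number of known planets. It is not taken from the blog post. It assumes the `pscomppars` table with the `hostname` and `sy_pnum` columns and CSV output from the synchronous endpoint; the function names are hypothetical, and the network call is kept in its own function so the parsing logic can be tried on any CSV text.

```python
import csv
import io
from collections import Counter
from urllib.parse import urlencode
from urllib.request import urlopen

# Synchronous TAP endpoint of the NASA Exoplanet Archive (assumed here).
TAP_URL = "https://exoplanetarchive.ipac.caltech.edu/TAP/sync"


def build_query_url(min_planets=2):
    """Build a TAP URL asking for hosts with at least `min_planets` planets.

    Assumption: the `pscomppars` table has one row per planet, with
    `hostname` (host star) and `sy_pnum` (planets in the system).
    """
    adql = (
        "select hostname, sy_pnum from pscomppars "
        f"where sy_pnum >= {int(min_planets)}"
    )
    return TAP_URL + "?" + urlencode({"query": adql, "format": "csv"})


def fetch_csv(url):
    """Download the query result as text (requires network access)."""
    with urlopen(url, timeout=60) as response:
        return response.read().decode("utf-8")


def count_systems_by_size(csv_text):
    """Return a Counter mapping planet count -> number of host stars."""
    planets_per_host = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        # One row per planet, so keep a single entry per host star.
        planets_per_host[row["hostname"]] = int(row["sy_pnum"])
    return Counter(planets_per_host.values())
```

With network access, `count_systems_by_size(fetch_csv(build_query_url()))` would give a tally that can be set against the Solar System's eight planets, for instance by looking up the entry for 8.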