Data Sources

Tactics for Text Mining non-Roman Scripts

April 15, 2024
by Hilary Faxon, Ph.D. & Win Moe. Non-Roman scripts pose particular challenges for text mining. Here, we reflect on a project that used text mining alongside qualitative coding to understand the politicization of online content following Myanmar’s 2021 military coup.

Elijah Mercer

School of Information

Elijah Mercer is a Master's student in the School of Information. He is particularly interested in using data to drive results for marginalized communities. His interests are in the field of criminal justice, policy and juvenile justice.

Chirag Manghani

School of Information

Chirag is a 2nd year graduate at the I-School. Proficient in Python, Java, R, and SQL, he navigates software application development, machine learning and data science. His keen interest lies in data analysis and statistical methods, driving him to bridge theory and practice seamlessly. Chirag's dedication to excellence, adaptable mindset, and innate curiosity define him as a dynamic problem solver in the ever-evolving tech landscape.

GPT Fundamentals

April 17, 2024, 3:00pm
This workshop offers a general introduction to the GPT (Generative Pretrained Transformers) model. We will explore how they reflect and shape our cultural narratives and social interactions, and which drawbacks and constraints they have.

What Are Vowels Made Of? Graphing a Classic Dataset with R

February 13, 2024
by Anna Björklund. Vowels are all around us. Mainstream US English has around twelve unique vowels. How can our brains tell these sounds apart? This blog post will help you answer this question by plotting vowel data from a classic American English dataset by Peterson and Barney (1952).

Jailynne Estevez

Consulting Drop-In Hours: Fri 3pm-5pm

Consulting Areas: Python, SQL, Stata, HTML / CSS, Javascript, Google AppScripts, Databases & SQL, Data Manipulation and Cleaning, Data Science, Data Sources, Data Visualization, Python Programming, Surveys, Sampling & Interviews, Text Analysis, , Bash or Command Line, Excel, Git or Github, Stata

Quick-tip: the fastest way to speak to a consultant is to first ...

Anna Björklund

Data Science Fellow

I am a fifth-year PhD student in the Department of Linguistics with an areal interest in the Wintuan languages, traditionally spoken in the northern Sacramento Valley and now undergoing revitalization. My primary research interests are in leveraging archival recordings for the phonetic analysis of these under-documented languages, as well as designing tools to assist in their revitalization. I have worked as a linguistic consultant for the Paskenta Band of Nomlaki Indians since 2020 and the Wintu Tribe of Northern California since 2022. I received my MA in linguistics from UC...

Covidence: Getting Started

February 29, 2024, 12:00pm
Covidence, a web-based tool licensed by the UC Berkeley Library, helps with your systematic and other literature reviews, which are popular processes to summarize and synthesize literature in your topic of interest. Covidence helps you organize and track progress on your review, from search results to extraction. This interactive workshop will take you through how to use Covidence. How to add reviewers or make changes mid-review, how to develop exclusion criteria, and how to get help will be covered. There will be plenty of time for Q & A during this session; you are welcome to raise questions about your specific review or review process.

Chirag Manghani

Consulting Drop-In Hours: Wed 1pm-3pm

Consulting Areas: Python, R, SQL, Stata, SAS, LaTeX, HTML / CSS, Javascript, C++, APIs, Cloud & HPC Computing, Cybersecurity & Data Security, Databases & SQL, Data Manipulation and Cleaning, Data Science, Data Sources, Data Visualization, Deep Learning, Machine Learning, Natural Language Processing, Python Programming, R Programming, Software Tools, Text Analysis, Web Scraping, Regression Analysis, Software Output Interpretation, Bash or Command Line, Excel, Git or Github, Qualtrics, RStudio, RStudio...

Elijah Mercer

Consulting Drop-In Hours: Mon 3pm-5pm

Consulting Areas: Python, R, Data Sources, Mixed Methods, Qualitative methods, Surveys, Sampling & Interviews, Excel, Qualtrics

Quick-tip: the fastest way to speak to a consultant is to first submit a request and then ...