Digital Humanist aims to run OCR over a terabyte of rare book scans

Posted: Apr, 26, 2017

By: Anonymous (not verified)

Since his college days at Brigham Young University (BYU), Adam Anderson has been measuring evenings and weekends in pages, rather than hours. “You can scan about 400 pages an hour, once you get in the groove,” he explains. Anderson, a Mellon Postdoctoral Fellow in Digital Humanities at UC Berkeley, has spent his career scanning texts in order to draw upon secondary literature in archaeology and computational linguistics.

D-Lab is hiring instructors and consultants!

Posted: Apr, 12, 2017

By: Evan Muzzall

The D-Lab is hiring instructors and consultants beginning Summer 2017 to teach introductory, advanced beginner, and intermediate workshops and/or provide consulting. If you feel inspired to help fellow researchers and students learn about and work with research tools, methods, and techniques, we're interested in hearing from you. D-Lab has an open, inclusive culture that values diversity and open exchange. We will let you develop leadership in your particular area while working collaboratively with other students and staff.

Cloud Working Group mid-semester update: Spring 2017

Posted: Apr, 12, 2017

By: Aaron Culich

This semester’s Cloud Working Group (CloudWG) has focused on a researcher’s ability to move her data and execute her research computation on multiple platforms, including commercial cloud (e.g., Amazon AWS), national infrastructure (e.g., XSEDE’s Jetstream), and a local workstation or laptop.

D-Lab is Looking for a Qualitative Data Analysis Topic Area Lead

Posted: Mar, 23, 2017

By: Patty Frontiera

D-Lab is seeking a new graduate student team member to implement our qualitative data analysis (QDA) vision for a spectrum of services, including workshops, consulting and working groups. If you feel inspired to help fellow researchers and students learn about and work with research methods and tools, we're interested in hiring you.

D-Lab is looking for a blogger!

Posted: Mar, 16, 2017

By: Willie Calvin

Are you a passionate writer with a creative eye and a love for all things data? If so, we’d love to hear from you!

D-Lab is looking for a UC Berkeley graduate student to serve as a blog and grant writer who can help both the D-Lab and Digital Humanities keep their respective blogs active and assist with grant applications. The position would be a GSR appointment at 50%.

About D-Lab:

Go from Analog to Digital Texts with OCR

Posted: Mar, 10, 2017

By: Quinn Dombrowski

A collection of digitized texts marks the start of a research project. Or does it?

For many social sciences and humanities researchers, creating searchable, editable, and machine-readable digital texts out of heaps of paper in archival boxes or from books painstakingly sourced from overlooked corners of the library can be a tedious, time-consuming process.

Getting Started with Machine Learning at the D-Lab

Posted: Jan, 12, 2017

By: Patty Frontiera

January 8, 2017

Evan Muzzall, Chris Hench, Chris Kennedy

Machine learning is a high-value concept for the social sciences and humanities. Classification and regression models are becoming more common in these disciplines and student interest is rapidly growing. However, machine learning has a steep (if not frightening) learning curve that dissuades many from pursuing these interests.

D-Lab is Hiring Instructors and Consultants

Posted: Dec, 01, 2016

By: Patty Frontiera

The D-Lab is hiring instructors and consultants for the Spring semester to teach introductory, advanced beginner, and intermediate workshops and/or provide consulting. If you feel inspired to help fellow researchers and students learn about and work with research tools, methods, and techniques, we're interested in hearing from you. D-Lab has an open, inclusive culture that values diversity and open exchange. We will let you develop leadership in your particular area while working collaboratively with other students and staff.

Online Access to OCR Software Now Available

Posted: Dec, 01, 2016

By: Patty Frontiera

Data comes in many formats, some more useful than others. Many researchers, particularly those who work in archives, have to convert images or PDFs of text into usable, editable text using optical character recognition (OCR). Adobe Acrobat has very basic OCR functionality, which may be sufficient for clearly typed text in English or a small handful of other languages. If your document has a complex layout (e.g., tables or columns) or text formatting, or uses languages not supported by Adobe Acrobat, ABBYY FineReader may be a better option.

D-Lab is hiring work-study students to join the D-Lab Team

Posted: Sep, 15, 2016

By: Susan Ayasse Gr...

We are a vibrant workplace with lots of people coming through and asking lots of questions, so there are lots of moving parts. We thrive on problem solving, open exchange, and diverse experiences.

If:
You are service-oriented and detail-oriented & You are interested in helping students & You like to ask questions & You are interested in data science…

Then:
You’ll like D-Lab.
