Programming Languages

Organized Code Repositories Accelerate Science and Facilitate Reproducubility

March 2, 2021

Computational and data-driven research increasingly requires developing complex codebases. At the same time, many scientists don’t receive training in software engineering practices, resulting in, for some, the perception that scientists write terrible software. As scientists, good software should accelerate our work and facilitate its reproducibility. While building good coding practices takes some time and experience, it doesn’t require a...

Visuals for Everyone: An Exercise on the Importance of Intuitive Data Visualization

March 30, 2021

A couple years ago, I took an undergraduate biostatistics course here at UC Berkeley and vividly remember one of the first discussion section activities on interpreting data and visualizations. From this activity, I learned about why, as data consumers, we must always be aware of not only what visualizations are really representing but also understanding where the data is really coming from. While this might seem obvious, this has been one of the most valuable lessons as an aspiring data scientist/enthusiast. I learned the importance of analyzing and understanding data with...