Categories
updates

PUDL awarded NSF POSE grant

Introducing POSE

We are excited to share that the Public Utility Data Liberation Project (PUDL) and Catalyst Cooperative have been awarded a Pathways to Open Source Ecosystems (POSE) Phase I grant by the National Science Foundation (NSF)! This grant will fund a slate of community building and infrastructure projects to expand the PUDL community and facilitate contributions.

Why we pursued the POSE grant

Over the past few years, we’ve made substantial technical improvements to PUDL thanks to generous support from RMI, the Sloan Foundation, Climate Change AI, and the Mozilla Foundation. These improvements have made accessing PUDL data and adding new datasets easier than ever before.

We’ve spent time on community-building activities like developing relationships with open energy modelers, presenting at conferences, hosting office hours, and responding to questions on Github Discussions. We applied for the NSF POSE grant so that we can spend more time fostering the PUDL community and improving people’s experience working with public energy data.

Getting to know our community

Are you a researcher or analyst working with energy data or models? An environmental non-profit, clean energy advocate or data journalist working on the U.S. energy transition? A data engineer or open-source expert interested in contributing to the energy transition?

If so, we would love to talk to you! For the first step of our POSE grant, we’re conducting a series of half-hour interviews over the next month to better understand how people find, prepare, and work with energy data, the different contexts they’re working in, and what their biggest data pain points and challenges are. You can sign up using this link. Please spread the word and forward this link to anyone you think might be interested!

Our Focus Areas

With POSE funding, we’ll be working to get PUDL data into more hands and creating new opportunities to contribute back to the PUDL ecosystem. Here’s a glimpse into what’s in the works:

  • Exploring new front-end tools to make PUDL data easier to access: We’re busy prototyping an alternative to our existing UI tool. Stay tuned, we’ll be looking for users to give us feedback on our beta tool!
  • Creating new resources for PUDL users: We’ll be hosting a webinar aimed at nonprofits and developing new data access tutorials to make accessing our data easier than ever before.
  • Supporting PUDL’s contributors: We’ll be developing new resources and coordination practices for external contributors, and creating a contributor onboarding workshop. 
  • Addressing technical barriers to contribution: Whether refactoring memory-intensive tests, or improving our data validation framework using Pandera, Pydantic, and Dagster asset checks, we’re excited to implement some long-awaited improvements to support more distributed development.
  • Coming to a town near you!: We’ll be traveling to academic conferences, university brown-bags, FOSS meetups and more in order to present on the PUDL project and connect with other clean energy advocates.
  • Developing organizational models and governance practices to sustain our growing ecosystem: In conversation with our downstream users, we’ll be developing strategies to keep PUDL free, accessible and maintained in the long-term.

We’ll be sharing updates on POSE-funded projects on our socials, blog and newsletter over the coming months. If you want to learn more about any of these projects, get in touch via hello@catalyst.coop or drop by our office hours.

Categories
updates

We hired a technical writer for PUDL!

Catalyst is very excited to announce that we have hired Nancy Amandi as a technical writer for PUDL’s Google Season of Docs project (full proposal here). The project will run from June through October, during which time Nancy will work on improving our documentation to make it easier for PUDL users to navigate and find the data they need.

Currently, it’s difficult for new (and long-time) PUDL users and contributors to quickly jump in and start using PUDL because our documentation is extensive and spread out between multiple repositories and websites. We have a data dictionary page in our docs, a Datasette deployment for exploring the data, and a set of example notebooks hosted on Kaggle, but none do a particularly good job of shepherding users to the data they want. The goal of this project is to create a better, more nested, system of table/column documentation so users aren’t overwhelmed by PUDL and know where to find the latest versions of the tables that are most relevant to them! 

If you’ve ever struggled to navigate the PUDL docs and have feedback, please send an email to hello@catalyst.coop, and we will incorporate suggestions into our plan for the project.

About Nancy

Nancy is a data engineer and technical writer living in Nigeria. She’s passionate about helping data-driven businesses write clear, concise documentation to convey complex technical concepts to a diverse range of audiences. In addition to her writing, Nancy has extensive experience in creating scalable data pipelines, exhaustive data mining, explanatory datasets, analytical models, and business reporting solutions with structured, semi-structured, and unstructured data. In 2023, Nancy and her team members won the Nigeria Energy Forum Tertiary Institutions Energy Pitch Challenge for their work on OneGrid Energies, a clean tech startup working towards closing the energy affordability gap in Nigeria.

Learn more about Nancy: LinkedIn, X, GitHub