# Module 5.1: Asking Data Driven Questions

## Introduction

As library workers, we may want to draw insights from open civic data data through computational methods. To do this, we need to develop a question that our data can answer. In this module, we explore framing data driven questions.

#### Guiding Questions

* How do we frame a research question that we can answer computationally?

#### Vignette

The librarian wants to draw insights from the library WiFi usage data. They begin developing a question to ask of the data: “Is WiFi being effectively used by library patrons?” But they realize that they need to build greater specificity to the question, defining what "effective" means in the context of this analysis. They decide that one measure of service effectiveness is the number of patrons who use the Wifi. They reframe their question to be a more data driven one, asking: "Are WiFi services effective, measured by achieving the goal of annual WiFi sessions being greater than or equal to 25% of annual patron counts?”

## Instructional Materials

### Video

{% embed url="<https://files.gitbook.com/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FiHhyQLuCq600FlDt9jWx%2Fuploads%2FgCa4niv1aGE8xyrzJQMg%2FModule5.1.mp4?alt=media&token=c0e3faea-baac-4203-b544-89ca3cd01a97>" %}

#### Video Files

{% file src="/files/5Mwj7CbrW3XRKyTKHssz" %}
**Module 5.1 Uncaptioned Video:** *Asking Data Driven Questions* \
MP4 File   &#x20;
{% endfile %}

### Scripts and Slide Deck

{% tabs %}
{% tab title="Module 5.1 Script Preview" %}
{% embed url="<https://docs.google.com/document/d/1xtxv5wv_lz3KA2lAngXnpx74EhbcVOmV505BtL4fHgo/edit?usp=sharing>" %}
{% endtab %}

{% tab title="Module 5.1 Slides Preview" %}
{% embed url="<https://docs.google.com/presentation/d/1qko8VeHhfR6m5_bs-XedbSPYtpemVRGHdtpX7UoyTwc/edit?usp=sharing>" %}
{% endtab %}
{% endtabs %}

#### Script and Slide Deck Files

{% file src="/files/yFAOzPLCovS9wsxzkSFA" %}
**Module 5.1 Script:** *Asking Data Driven Questions* \
Word Document File &#x20;
{% endfile %}

{% file src="/files/xbAxcSTpBOESZDoH49yW" %}
**Module 5.1 Script:** *Asking Data Driven Questions* \
PDF File &#x20;
{% endfile %}

{% file src="/files/lJfEW8k3vbjjc9tYfvFT" %}
**Module 5.1 Slides:** *Asking Data Driven Questions* \
PowerPoint File &#x20;
{% endfile %}

{% file src="/files/DnXsRcivwWr9LJFAtQVD" %}
**Module 5.1 Slides:** *Asking Data Driven Questions* \
PDF File &#x20;
{% endfile %}

## Activity

**Overview**:&#x20;

Look at this data activity from data basic: <https://databasic.io/en/culture/ask-questions>

This exercise can be done individually or in pairs.

**Supplies**: An Internet-connected device and paper and a pen/pencil for notetaking

**Time**: 20 minutes

**Activity**:

1. Select a dataset&#x20;

Choose a dataset that is published on the Western Pennsylvania Regional Data Center or a data portal that is local to your region.

2\. Interrogate the data&#x20;

Apply these questions to your dataset and the metadata available on the data portal. Why was the data collected? Who collected the data? What tools were used to collect the data? When was the data collected? What might have been the motivations or perspectives of the individuals and/or groups collecting the data? What data wasn’t collected (or, at least, is not available to us)?

3\. Share-Out

Data have context. Share what you learned about the context of your data! Note what you were unable to address, inferred, or found a clear answer to.

#### Activity Files

{% file src="/files/cNUvHx2FKr7jOcBp2rs7" %}
**Module 5.1 Activity File:** *Asking Data Driven Questions* \
Word Document File &#x20;
{% endfile %}

{% file src="/files/uoYlCMXSGFBNHCV6LumR" %}
**Module 5.1 Activity File:** *Asking Data Driven Questions* \
PDF File &#x20;
{% endfile %}

## Resources

{% embed url="<http://d-scholarship.pitt.edu/33891/1/Shifting%20to%20Data%20Savvy.pdf>" %}
The project explores the challenges associated with implementing data science within diverse library environments by examining two specific perspectives framed as ‘the skills gap,’ i.e. where librarians are perceived to lack the technical skills to be effective in a data-rich research environment; and ‘the management gap,’ i.e. the ability of library managers to understand and value the benefits of in-house data science skills and to provide organizational and managerial support.\
\
**Citation**: Burton, Matt and Lyon, Liz and Erdmann, Chris and Tijerina, Bonnie (2018). *Shifting to Data Savvy: The Future of Data Science In Libraries.* Project Report. University of Pittsburgh, Pittsburgh, PA. <https://d-scholarship.pitt.edu/33891/>
{% endembed %}

{% embed url="<https://vimeo.com/236119180>" %}


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://civic-switchboard.gitbook.io/education-series/segment-5-using-civic-data/module-5.1-asking-data-driven-questions.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
