Day Two
π NYC High School Directory Exploration
Section titled βπ NYC High School Directory ExplorationβDatasets: daily_tasks/day_2/day_2_datasets
π― Objective
Section titled βπ― ObjectiveβExplore a real-world dataset using Python and Jupyter Notebook. Youβll clean the data, perform basic analysis, create visualizations, and share your insights.
β Instructions
Section titled ββ Instructionsβ-
Download the dataset from the
day_2_datasets
folder. -
Open a new Jupyter Notebook and:
- Load the dataset using
pandas
- Clean the column names (make lowercase, replace spaces with
_
, remove special characters)
- Load the dataset using
-
Filter the dataset to include only schools located in Brooklyn
-
Answer the following:
- How many total schools are in Brooklyn?
- How many offer Grade 9 entry?
-
Group and summarize:
- Count of schools per borough
- Average number of students per borough
- Summary of
grade_span_max
grouped by borough
-
Create visualizations:
- Bar chart: Number of schools per borough
-
Write 2β3 key insights based on your findings at the end of the notebook
π What to Submit
Section titled βπ What to Submitβ-
In your sub-branch (
[your-name]/day-2
), navigate to thedaily_tasks/day_2/
folder. -
Add your Jupyter Notebook named:
day2_analysis.ipynb
-
Ensure the notebook contains:
- Code
- Answers to the questions
- Visualizations
- A short summary of observations
-
Create a PR, and comment on this task with the PR link.
π Related GitHub Issue
Section titled βπ Related GitHub IssueβTrack the task, post questions, and link your work here:
π Day 2 Task
π Example: Task Completion Comment
Section titled βπ Example: Task Completion CommentβTask completed in PR #124
π‘ You donβt need to aim for perfection β this is about practicing Python and sharpening your analytical thinking.