From Couch to Jupyter: A Beginner's Guide to Data Science Tools and Concepts
Introduction
Host: Manogna, Senior Data Scientist at Slalom.
Presenter: Kiko K., Analytic Scientist at FICO on the Scores Predictive Analytics team.
Background:
Graduated from UC Berkeley in 2019 with a degree in Applied Mathematics and Data Science.
Led teams integrating data science into non-traditional curricula.
Passionate about data science's power and community.
Workshop Overview
Title: "From Couch to Jupyter—A Beginner's Guide to Data Science Tools and Concepts"
Objective: Provide foundational knowledge and tools for beginners in data science.
Structure:
Introduction to Jupyter Notebook.
Basics of Python programming.
Understanding data structures and statistical concepts.
Interactive code demonstrations.
Resources:
GitHub repository with tutorial notebooks and datasets.
Anaconda installation guide for environment setup.
Key Topics Covered
Using Jupyter Notebook
Understanding markdown and code cells.
Running cells and writing code.
Python Basics
Data types: integers, floats, strings, booleans.
Variables and functions.
Arithmetic operations and function calls.
Data Structures
Arrays with NumPy.
Pandas Series and DataFrames.
Indexing and slicing data.
Data Manipulation and Analysis
Importing libraries and reading data files.
Handling missing data (NaN values).
Filtering and selecting data.
Basic statistical calculations: mean, median, standard deviation.
Practical Demonstrations
Working with a stroke prediction dataset from Kaggle.
Visualizing data distributions.
Imputing missing values.
Additional Resources
Anaconda Installation Guide: For setting up the Python environment.
Tutorial Notebooks: Covering various topics in more depth.
External Links: Videos and other learning materials for further study.
Conclusion
Q&A Session: Addressed audience questions on topics like:
Differences between Jupyter Notebook and JupyterLab.
Handling missing data and NaN values.
Differences between arrays and series.
Recommendations for beginners starting with data sets.
Final Remarks:
Encouraged attendees to explore provided resources.
Emphasized continuous learning in data science.
Thanked the audience for participation.
Note: The workshop aims to make data science accessible to beginners by providing hands-on experience with tools like Jupyter Notebook and Python, using practical examples and interactive code demonstrations.
Integrating AI: Applying AI in professional practice.
Tool Development: Making decisions about AI tool development.
Self-Evaluation Tool:
Created for students and staff to assess their AI proficiency.
Helps identify current competency level before engaging with AI tools.
Proposed Framework for Tutorials
Integration of ChatGPT:
Recommended using ChatGPT as a companion in tutorial sessions.
Applicable across various session types (feedback, problem-solving, etc.).
Implementation Process:
Self-Evaluation:
Students assess their initial proficiency with AI.
Facilitates personalized support from the tutor.
Prompting Practice:
Focus on developing effective communication with AI.
Emphasizes the importance of prompt language and structure.
Reflection and Awareness:
Encourage students to document their AI interaction process.
Discuss successes and areas for improvement.
Self-Monitoring:
Promote autonomy in controlling AI usage.
Foster critical thinking about AI's role in learning.
Objective:
Enhance critical thinking skills.
Empower students to use AI tools effectively and responsibly.
Student Perspective
Quote: Emphasized taking control over AI tools rather than allowing AI to dictate the learning process.
Insight: Highlights the importance of maintaining critical oversight when using AI.
Ongoing Work
Canvas Course Development:
Creating online resources for academics and students.
Aimed at educating users about AI integration in learning.
Courses are currently under development and not yet widely available.
Conclusion
Acknowledgments:
Thanked the audience for their attention.
Noted that the proposed framework is a starting point for discussion.
Future Considerations:
Recognized the need for ongoing dialogue about AI's role in education.
Invited feedback and collaboration to refine approaches.
Note: The presenters emphasized that the framework and recommendations are preliminary and subject to further refinement based on collective input and evolving understanding of AI in educational contexts.
Students with part-time jobs or varying schedules.
Technological Solutions:
Collaborative platforms accessible to external participants.
Features like collaborative document editing, version history, security measures.
Workload and Time Allocation:
Discrepancy Noted: Actual time spent often exceeds allocated time.
Examples: Some allocated 80 hours but spent 200 hours.
Lack of Formal Allocation: Many lacked official time allotments for curriculum design.
Use of AI in Curriculum Design:
High Interest: 95% would use AI tools.
Applications:
Brainstorming ideas.
Generating content and learning outcomes.
Image generation.
AI Tools Mentioned: Generative text models (e.g., ChatGPT), AI image generators, subject-specific AI like Math GPT and Music LLM.
Barriers and Challenges:
Top Barriers:
Limited time to learn and implement new technologies.
Licensing and subscription issues for preferred tools.
Other Challenges:
Technical difficulties.
Lack of training and support.
Resistance to change among staff.
Reward and Recognition:
Concerns:
Time allocation for curriculum design tasks.
Recognition in promotions and leadership opportunities.
Compensation methods for student involvement.
No Clear Solutions: Highlighted as areas needing attention.
Next Steps
Interviews: Conducting in-depth interviews to build on survey findings (two completed so far).
Focus Areas:
Use of digital technology and AI in curriculum design.
Strategies for inclusivity and flexibility.
Invitation: Open call for participation from other institutions and individuals.
Discussion Questions
Examples Sought:
Digital technologies that have made curriculum design more inclusive, flexible, or collaborative.
How these technologies were implemented.
AI Usage:
Do you use AI tools like ChatGPT in your curriculum design?
What are the opportunities and challenges associated with AI in this context?
Conclusion
Project Status: Ongoing with evolving insights.
Collaborative Effort: Involvement of both staff and students enriches perspectives.
Community Engagement: Encouraged attendees to share experiences and insights.
Note: The presenters emphasized the importance of technology in enhancing the curriculum design process and are actively seeking collaborations and discussions to further this research.