"

Tools and Software Resources

This textbook employs three primary software tools that represent different approaches to data science work: Microsoft Excel for foundational data manipulation and visualization, JASP for statistical analysis, and KNIME Analytics Platform for workflow automation. These tools were selected to provide readers with practical experience across the spectrum of data science applications, from individual analysis tasks to enterprise-scale automated processing.

Microsoft Excel

Excel serves as the foundation for data manipulation, basic statistical analysis, and initial visualization work throughout this text. The methodologies presented assume access to Excel 2019 or later versions, including the Data Analysis ToolPak and Power Query functionality. Readers using earlier versions may find some advanced features unavailable, though core concepts remain applicable across versions.

Access and Resources:

Microsoft 365 subscriptions include the latest Excel features referenced in this text

Educational institutions often provide free access through Microsoft’s academic licensing programs

Official documentation and tutorials: support.microsoft.com/excel

Data Analysis ToolPak installation guides available through Microsoft Support

JASP Statistical Software

JASP provides an accessible interface for the statistical analyses demonstrated in this textbook, particularly for hypothesis testing, correlation analysis, and regression modeling. This open-source statistical package offers point-and-click functionality while maintaining transparency in analytical procedures.

Access and Resources:

Free download for Windows, macOS, and Linux: jasp-stats.org

No licensing fees or institutional subscriptions required

Comprehensive user guide and tutorial library available at the official website

Regular updates ensure compatibility with current operating systems

KNIME Analytics Platform

KNIME enables the visual workflow development and automation concepts central to advanced data science practice. The visual programming environment allows readers to construct complex data processing pipelines without traditional coding requirements, making advanced analytics accessible while teaching systematic thinking about data workflows.

Access and Resources:

KNIME Analytics Platform (free version): knime.com/knime-analytics-platform

Academic licensing provides additional features for educational use

Extensive node documentation and workflow examples in the KNIME Community Hub

Video tutorials and learning paths available through KNIME Learning Center

Integration and Alternative Tools

While this textbook focuses on these three platforms, the conceptual frameworks and methodologies transfer readily to alternative tools. Readers familiar with R, Python, Tableau, or other data science environments will find the underlying principles directly applicable to their preferred software ecosystem. The emphasis throughout remains on systematic thinking and methodological rigor rather than tool-specific procedures.

Important: Software interfaces and specific features evolve continuously. Readers should consult the official documentation links provided above for the most current installation procedures, system requirements, and feature availability. The analytical concepts and methodological approaches presented in this textbook remain stable across software versions, ensuring long-term value regardless of interface updates.

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Introduction to Data Science Copyright © by GORAN TRAJKOVSKI is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.