How to install a Data Science Stack? Easy as 3 commands with Canonical's DSS
Data Science Stack: Your Out-of-the-Box Solution for ML Environments
Canonical, the company behind Ubuntu, has released Data Science Stack (DSS), a ready-to-use solution designed to simplify the setup of machine learning (ML) environments. This open-source tool is available on various platforms, including Linux distributions, Windows Subsystem for Linux (WSL), and macOS with Multipass.
[
Multipass orchestrates virtual Ubuntu instances
Multipass is a CLI to launch and manage VMs on Windows, Mac and Linux that simulates a cloud environment with support for cloud-init. Get Ubuntu on-demand with clean integration to your IDE and version control on your native platform.
](https://multipass.run/?ref=sredevops.org)
The Need for Speed in AI Development
The adoption of AI is rapidly increasing, but so are the challenges associated with its implementation. Deloitte's statistics reveal that:
- 51% of organizations using AI consider cybersecurity to be their highest risk.
- 36% cite regulatory compliance as a major concern.
These challenges highlight the need for efficient and secure ML development environments.
Data Science Stack: A Three-Command Solution
DSS addresses these challenges by enabling quick setup with just three commands:
- Set up your container orchestration layer (microk8s)
- Install the DSS CLI (with snap)
- Initialize the Data Science Stack.
This streamlined process can be completed in 10-30 minutes, depending on your experience level.
Features and Benefits
DSS comes pre-loaded with essential tools for ML development:
- Jupyter Notebook: For model development and experimentation.
- MLflow: For experiment tracking and model registry.
- Popular ML Frameworks: PyTorch and TensorFlow are included by default, allowing users to choose the most suitable framework for their needs.
Furthermore, DSS is highly customizable, enabling users to add new libraries based on their specific use cases.
Hardware Optimization
DSS is designed to work seamlessly with any hardware type, ensuring optimal performance regardless of your setup. It leverages optimized ML frameworks from various vendors and integrates advanced extensions like AVX, VNNI, and AMX for accelerated model training and experimentation.
Security and Support
Canonical's commitment to security extends to DSS through Ubuntu Pro, which offers enterprise-grade support and security maintenance for your ML solution. This ensures timely issue resolution and adherence to Canonical's Service Level Agreements (SLAs).
Now what?
[
GitHub - canonical/data-science-stack: Stack with machine learning tools needed for local development.
Stack with machine learning tools needed for local development. - canonical/data-science-stack
GitHubcanonical
](https://github.com/canonical/data-science-stack?ref=sredevops.org)
Data Science Stack provides a comprehensive and user-friendly solution for setting up efficient and secure ML environments. Its three-command setup, pre-installed tools, hardware optimization, and enterprise support make it an ideal choice for both individual developers and organizations looking to accelerate their AI initiatives.
Further Reading
[
Data science tools on Ubuntu: how can you quickly get started? | Ubuntu
Learn how Data Science Stack will accelerate your learning curve
Ubuntu
](https://ubuntu.com/engage/data-science-tools?ref=sredevops.org)
Subscribe to my newsletter
Read articles from Nicolás Georger directly inside your inbox. Subscribe to the newsletter, and don't miss out.
Written by
Nicolás Georger
Nicolás Georger
Cybernetics, Linux and Kubernetes enthusiast. Site Reliability Engineering/DevOps culture, self educated IT professional, social sciences academic background.