AutoML Framework with ArkhamPy

Arkham's AI Platform provides a code-first AutoML framework designed to balance automation with developer control. It is accessed programmatically through **ArkhamPy**, our Python SDK, and is built upon a library of transparent **Model Classes**. This approach lets you build strong baseline models quickly and then move on to fine-grained custom development.

The Pro-Code Workflow

The entire ML lifecycle is managed within a code-driven environment, giving data scientists maximum control and reproducibility. You use `ArkhamPy` to orchestrate data access, model configuration, and training.

The core of the framework combines the Model Class library with the automated training pipeline. A typical run follows four steps:

  • Select a Model Class: Choose from a curated library of over 10 pre-built, production-grade templates for common use cases (e.g., forecasting, classification). These are "glass-box" templates, not black boxes.
  • Configure in Code: Using `ArkhamPy`, you connect your dataset from the Lakehouse and configure the model's parameters.
  • Launch the AutoML Pipeline: Trigger the automated pipeline, which handles the most complex parts of the ML lifecycle: data preprocessing, feature engineering, algorithm selection, and hyperparameter tuning.
  • Review and Deploy: The pipeline produces a leaderboard of the best-performing models. You can review their metrics, inspect their configurations, and deploy the chosen model to the Model Registry with a single command.
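
The four steps above can be sketched in plain Python. This section does not show ArkhamPy's actual API, so the sketch deliberately avoids it: every name below (`Candidate`, `SEARCH_SPACE`, `run_automl`, the two toy forecasters, and the dict standing in for the Model Registry) is a hypothetical stand-in chosen for illustration, and the search is a simple exhaustive loop rather than whatever strategy the real pipeline uses.

```python
# Illustrative stand-in only: none of these names are ArkhamPy APIs.
from dataclasses import dataclass
from statistics import mean


@dataclass
class Candidate:
    """One algorithm + hyperparameter combination tried by the pipeline."""
    algorithm: str
    params: dict
    score: float  # walk-forward mean absolute error; lower is better


def moving_average(history, window):
    """Forecast the next point as the mean of the last `window` points."""
    return mean(history[-window:])


def seasonal_naive(history, lag):
    """Forecast the next point as the value observed `lag` steps earlier."""
    return history[-lag]


def walk_forward_mae(series, predict, param, holdout):
    """Score one candidate on the last `holdout` points, one step at a time."""
    errors = [abs(series[t] - predict(series[:t], param))
              for t in range(len(series) - holdout, len(series))]
    return mean(errors)


# Steps 1-2: pick a (hypothetical) forecasting template and configure it in code.
SEARCH_SPACE = {
    "moving_average": (moving_average, "window", [2, 4]),
    "seasonal_naive": (seasonal_naive, "lag", [1, 2, 4]),
}


def run_automl(series, search_space, holdout=8):
    """Step 3: try every candidate and return a leaderboard, best first."""
    leaderboard = []
    for algorithm, (predict, param_name, values) in search_space.items():
        for value in values:
            score = walk_forward_mae(series, predict, value, holdout)
            leaderboard.append(Candidate(algorithm, {param_name: value}, score))
    return sorted(leaderboard, key=lambda c: c.score)


# Toy dataset with a repeating pattern of period 4.
series = [10.0, 20.0, 30.0, 20.0] * 5

leaderboard = run_automl(series, SEARCH_SPACE)

# Step 4: review the leaderboard, then "deploy" the winner to a toy registry.
for row in leaderboard:
    print(f"{row.algorithm:15s} {row.params} MAE={row.score:.2f}")
best = leaderboard[0]
registry = {"demand_forecast": best}
```

Because the toy series repeats every four points, the seasonal-naive candidate with a lag of 4 tops the leaderboard. The point of the sketch is the shape of the loop (configure, search, rank, register), not the specific algorithms.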

How It Improves Your ML Lifecycle

This integrated, code-first approach provides several key advantages for technical builders:

  • Accelerated Baselines: Generate high-performing baseline models in hours, not weeks. The baseline delivers value immediately and sets a clear performance benchmark for any further custom tuning or research.
  • Enforces Best Practices: Model Classes encapsulate proven architectures, preventing teams from "reinventing the wheel." This ensures all projects start from a robust, scalable, and governable foundation.
  • Unified MLOps: Whether a model is the direct output of the AutoML pipeline or a heavily customized solution, it is registered, versioned, deployed, and monitored through the same unified MLOps infrastructure, ensuring consistency and reliability from experimentation to production.
  • Maximum Flexibility: You are never locked into a restrictive UI. The `ArkhamPy` SDK allows you to extend, override, or build completely bespoke solutions when a Model Class doesn't fit your needs.
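
The last two points, overriding part of a Model Class and still registering the result through the same path, can be illustrated with ordinary subclassing. This is a minimal sketch under stated assumptions, not ArkhamPy code: `BaselineForecaster`, `ClippedForecaster`, and `ModelRegistry` are hypothetical names, and the real SDK's extension mechanism may look different.

```python
# Illustrative stand-in only: none of these names are ArkhamPy APIs.
from dataclasses import dataclass, field


class BaselineForecaster:
    """Hypothetical stand-in for a pre-built Model Class."""
    name = "baseline_forecaster"

    def preprocess(self, series):
        # Default step: pass the data through unchanged.
        return list(series)

    def fit(self, series):
        cleaned = self.preprocess(series)
        self.level_ = sum(cleaned) / len(cleaned)  # forecast = historical mean
        return self

    def predict(self, horizon):
        return [self.level_] * horizon


class ClippedForecaster(BaselineForecaster):
    """Custom variant: override one step, inherit everything else."""
    name = "clipped_forecaster"

    def __init__(self, upper):
        self.upper = upper

    def preprocess(self, series):
        # Cap outliers before fitting.
        return [min(x, self.upper) for x in series]


@dataclass
class ModelRegistry:
    """Toy registry: every model, stock or custom, goes through register()."""
    versions: dict = field(default_factory=dict)

    def register(self, model):
        history = self.versions.setdefault(model.name, [])
        history.append(model)
        return len(history)  # 1-based version number


registry = ModelRegistry()
series = [10.0, 12.0, 11.0, 95.0]  # the last value is an outlier

v_base = registry.register(BaselineForecaster().fit(series))
v_custom = registry.register(ClippedForecaster(upper=15.0).fit(series))

base_forecast = registry.versions["baseline_forecaster"][0].predict(2)
custom_forecast = registry.versions["clipped_forecaster"][0].predict(2)
```

The stock and customized models differ only in the overridden `preprocess` step, yet both are versioned by the same `register` call, which is the consistency the Unified MLOps point describes.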