Analytics platforms, data pipelines, BI tools, and data science frameworks with AI.
100 tools
Databricks is a data insights solution for unified analytics platform. Key capabilities include collaborative notebooks for data science and engineering, real-time data processing with apache spark, and machine learning model management and deployment.
Modern business intelligence platform built on AI-native architecture. Query data with natural language, build interactive dashboards, and share insights across teams.
Data research and preparation service for improving large language model performance. The LLM Data Company specializes in curating and validating post-training datasets to enhance model accuracy and reduce hallucinations.
HelixDB combines graph and vector capabilities in a single database, enabling both relationship queries and semantic search. Built for teams building AI applications, recommendation engines, and knowledge graphs at scale.
Snowflake is a cloud-native data warehouse that separates storage and compute for flexible scaling. Teams query structured and semi-structured data in real-time, share datasets across organizations securely, and integrate machine learning models without moving data.
Data security platform that monitors third-party GenAI tools for unauthorized data exposure. Scans LLM usage for compliance violations, audits data transfers, and blocks high-risk AI tool usage in real-time.
Automorphic is an AI framework for training domain-specific language models with minimal data. Embed specialized knowledge into models without large labeled datasets. Includes real-time collaboration and performance analytics.
Neural network platform for brain research and clinical applications. Processes neuroimaging data (fMRI, EEG, MEG) with deep learning models to generate brain maps, predict treatment responses, and support clinical decision-making. Used by neuroscience labs, hospitals, and research institutions.
Basedash is a database client that generates BI dashboards from your existing SQL data without code. Query any PostgreSQL, MySQL, or MongoDB database to build custom reports, track metrics, and collaborate with non-technical stakeholders in minutes.
Airtrain AI provides a no-code platform for data curation and annotation at scale. It automates dataset preparation, labeling, and quality evaluation for LLM fine-tuning. Teams can integrate with major LLM frameworks and track model performance with built-in evaluation metrics.
BIOS Health decodes neural signals from implantable and wearable devices, converting real-time nerve data into actionable clinical insights. It uses machine learning to interpret neural patterns, integrates with EHRs, and provides a dashboard for clinicians to monitor patient status and adjust treatment.
Turntable is an analytics OS that automates data visualization and real-time reporting. It connects to SQL databases and data warehouses to deliver collaborative dashboards without custom code. Teams use it to explore data, track KPIs, and share insights across departments.
LanceDB is an open-source vector database engineered for similarity search at scale. It provides serverless compute, sub-millisecond latency, and native integration with Python ML frameworks. AI engineers use it to store embeddings, power semantic search, and serve RAG and recommendation pipelines without managing infrastructure.
Data labeling and annotation for LLM training datasets. Provides crowdsourced and expert labeling, quality control, and integration with Hugging Face. Used by ML teams to prepare domain-specific training data for fine-tuning.
High-resolution spatial data platform for robotics and machine learning. Collects 3D environmental data, processes it in real-time, and provides customizable datasets via API. Powers perception models for autonomous systems and robotic applications.
dbt is a data insights solution for data transformation tool. Key capabilities include sql-based data transformation, version control for data models, and automated testing of data transformations.
Great Expectations is an open-source data quality framework that validates data pipelines through testable expectations. Define checks as code, run them continuously against production data, and get documentation and data profiling reports. Used by data engineers to prevent bad data from reaching ML models and dashboards.
Fivetran is a data insights solution for automated data integration. Key capabilities include automated data connectors, real-time data replication, and schema migration.
Monte Carlo is a data observability platform that monitors data pipelines for quality issues, freshness delays, and schema changes. Detects anomalies in volume, distribution, and integrity, then alerts teams before broken data reaches downstream systems.
Sciloop is an AI co-scientist that automates machine learning experiment management and analysis. It tracks hyperparameters, metrics, and datasets across experiments, compares model performance side-by-side, and provides automated insights on what drives accuracy. Integrates with TensorFlow, PyTorch, and scikit-learn. Built for data scientists to eliminate manual tracking and speed up iteration.
Prefect orchestrates data pipelines and workflows with Python-native syntax, automatic error handling, and observability. Execute ETL jobs, schedule dependencies, and monitor data processing with real-time insights.
Unsiloed AI parses unstructured data (documents, images, audio, video) using multimodal AI APIs. Extract structured data from messy sources in real time. Integrates directly into your data pipeline with analytics and reporting.
camelAI is an AI-powered business intelligence platform that analyzes data in real-time and generates actionable insights. It features customizable dashboards, predictive analytics, automated reporting, and natural language query processing. Teams use it to identify trends, make data-driven decisions, and automate report generation.
AWS (Amazon Web Services) is a comprehensive cloud computing platform providing alternatives to traditional infrastructure through 200+ services. It offers scalable computing via EC2, managed databases through RDS, serverless functions with Lambda, and content delivery via CloudFront. Designed for developers, startups, and enterprises seeking flexible cloud infrastructure without managing physical servers.
Lightly automates data labeling and collaborative annotation for ML teams. Integrates with popular ML frameworks and provides real-time data insights. Used by machine learning engineers and data scientists to improve dataset quality and accelerate model training.
Scale AI provides data labeling, annotation, and curation services for machine learning teams. It automates data pipeline management, quality assurance, and validation at enterprise scale. Teams use it to prepare production-quality training datasets faster without manual annotation bottlenecks.
AskYourDatabase enables natural language queries across multiple database types, eliminating the need for SQL knowledge. It provides real-time data visualization and custom dashboards for analytics and reporting.
Aquarium Learning helps machine learning teams improve model performance by assessing and enhancing dataset quality. It offers tools for bias identification, automated augmentation, and version control, facilitating collaboration and reproducibility.
Sureform collects, annotates, and validates human data for robotics and embodied AI systems. It provides real-time data validation and integrates with ML frameworks for training models on human motion and manipulation tasks.
Collaborative data annotation platform for machine learning teams. Provides version control, customizable workflows, and integration with popular ML frameworks. Designed for data scientists and research teams to label training data efficiently.
DeepGrove provides AI-driven real-time data analytics and insights. It offers customizable dashboards, collaboration tools, cross-device compatibility, and predictive analytics for analyzing complex datasets.
Zeit AI lets business users query datasets using plain English instead of SQL, automatically generating visualizations and reports. It powers real-time dashboards and KPI monitoring for data teams and executives who need insights without writing queries.
Travo analyzes real estate market data with AI to provide property valuations, market trend insights, and investment analysis. It integrates with CRM systems to help agents and investors make data-backed decisions.
Encord provides data annotation tools for machine learning teams with collaboration features, dataset version control, and integration with ML frameworks. It manages data storage, supports model training workflows, and enables real-time data analysis.
We are disrupting the consulting industry with hirable AI analysts for strategy and corporate finance work, starting with Excel - where half the work happens.
Aluna provides curated biomedical datasets and AI-driven analysis tools for healthcare researchers and ML engineers. It supports machine learning workflow integration, real-time data updates, custom reporting, and collaborative research project management.
Chamber automates AI model deployment and production monitoring. It handles model scaling, performance tracking, and infrastructure management. The platform integrates with existing ML pipelines and provides a dashboard for observability.
Trainy manages GPU clusters for AI/ML workloads. It automates GPU resource allocation, provides real-time monitoring, and supports multi-cloud model deployment. The dashboard allows cluster management for AI/ML model training and workload orchestration.
Novaflow analyzes biological experiment data with real-time insights and automated reporting. The platform integrates with lab instruments, provides data visualization dashboards, and enables team collaboration on research findings.
Panels collects high-quality audio data for AI labs. It offers customizable datasets, real-time analysis, and a user-friendly interface for data management. API access integrates with existing workflows.
Kater.ai is a business intelligence tool. It processes natural language for data queries and provides real-time data visualization. Features include customizable dashboards, collaboration tools, automated reporting, and alerts. It generates business reports, analyzes market trends, and monitors financial performance.
Monarcha is a spatial data platform combining AI analysis with geospatial visualization. It processes satellite imagery, climate data, and location-based datasets to generate predictive insights. Customizable dashboards and APIs enable automated reporting for urban planning, environmental monitoring, and logistics optimization.
Lumetric uses AI and natural language processing to analyze spreadsheet data, automatically generating insights and detecting trends. It creates customizable dashboards for visualization and includes collaboration tools for team sharing. Automates financial reporting, identifies sales trends, and detects data anomalies.
Nao Labs is an open-source analytics agent that collects and analyzes real-time data from multiple sources. It provides customizable dashboards for tracking customer behavior, marketing campaign performance, and sales metrics with flexible integration options.
Ardis AI automates text analysis and extraction, converting unstructured data into searchable knowledge graphs. It integrates with existing data sources, provides dynamic visualization, and enables advanced natural language search. Teams use it to make text data accessible and queryable without manual processing.
Louiza Labs synthesizes clinical trial data, regulatory filings, and scientific literature to accelerate pharmaceutical research decisions. It provides market sizing, competitive intelligence, and risk assessment for drug candidates in weeks instead of months.
Findly is an AI co-pilot for business intelligence. It translates natural language questions into queries, generates reports, and builds dashboards automatically from your data.
Ecliptor provides NLP and customizable embedding models for analyzing large unstructured datasets in real-time. It connects to popular data sources via API, enabling analysis, visualization, and model building on text-heavy data.
Strand AI curates multimodal biological datasets (genomics, proteomics, imaging) optimized for AI model training. Researchers access integrated datasets, visualize complex biology, and collaborate on data annotation for drug discovery and biomarker studies.
Elevate automates data integration and roll-up processes using AI-driven analysis and real-time synchronization. The platform delivers customizable workflows and a consolidated dashboard, enabling data teams to transform raw data into actionable analytics without manual integration work.
CellChorus combines microscopy imaging with machine learning to analyze single-cell performance and interactions. Used by biotech and pharmaceutical researchers to accelerate drug discovery and understand cellular behavior.
Anomaly AI detects unusual patterns in large spreadsheet datasets and generates real-time alerts. Query data using natural language, get visual insights, and identify financial discrepancies or operational outliers automatically.
Data platform for preparing datasets and training machine learning models. Automates data preprocessing, feature engineering, and model training pipelines. Built for data teams managing end-to-end ML workflows.
Cinder automates data labeling and provides AI bias detection for machine learning models. Includes fairness testing, model evaluation tools, and collaboration workflows for data annotation.
Sieve combines AI algorithms and human review for data cleaning. It offers API access, an Excel plugin, and real-time validation. Use it to clean large datasets, ensure accuracy, and integrate data from multiple sources.
Sarus enables analytics and machine learning on personal data while preserving privacy through anonymization and data governance. It includes real-time analytics dashboards, ML model deployment, and API integration for secure data sharing.
Tableau is a business intelligence platform that converts raw data into interactive dashboards and reports. Sales, marketing, and operations teams use it to visualize metrics, identify trends, and monitor KPIs in real-time.
Mercator provides AI-assisted data analytics. It features real-time data processing, predictive analytics via machine learning, customizable dashboards, and collaboration tools. Data visualization is offered through interactive charts. Use cases include automating data analysis and visualization, improving decision-making, and enhancing data-driven reporting.
Instantly transform your data into actionable insights without coding. Dot analyzes complex datasets and generates clear, visual reports for business teams, enabling faster, data-driven decision-making across departments.
Voker provides real-time performance tracking and customizable dashboards for AI agents. It generates automated reports and integrates with popular AI frameworks. The tool is designed for monitoring AI agent performance and identifying areas for improvement.
Lotas provides AI tools for data science and 3D modeling. It offers automated data analysis, 3D visualization, and collaboration features. Customizable dashboards and integration with data storage solutions are included. Use cases include creating 3D models from data and automating analysis workflows.
Evidently AI monitors ML models in production, detecting data drift and model degradation before they impact performance. It visualizes metrics across ML frameworks and enforces fairness checks for regulatory compliance.
Centauri AI extracts and transforms financial data from multiple sources. It processes data in real-time, provides predictive insights via machine learning, and visualizes data through a dashboard. It integrates with existing financial systems.
Data orchestration platform with software-defined assets. Declarative approach to building, testing, and monitoring data pipelines with built-in lineage.
Redbird provides AI-driven analytics. It offers real-time data visualization, predictive analytics via machine learning, customizable dashboards/reports, and collaboration tools. Integrates with various data sources. Use cases include sales forecasting, customer segmentation, and market trend analysis.
Power BI is a data insights solution for microsoft business analytics. Key capabilities include interactive data visualization, real-time dashboard updates, and customizable reports and analytics.
Hightouch is a data insights solution for data activation platform. Key capabilities include reverse etl capabilities, real-time data syncing, and customizable data transformations.
MindsDB is an AI platform that automates machine learning and integrates with multiple data sources for real-time predictions. It provides a unified interface for connecting disparate data sources, training models, and generating AI-driven insights without manual ML engineering.
Segment is a customer data platform that collects, unifies, and routes customer data from multiple touchpoints to analytics tools, marketing platforms, and data warehouses. It enables businesses to create a single source of truth for customer information while ensuring data consistency across their entire technology stack.
Maven Bio analyzes life sciences data using AI. It provides real-time insights, reporting, and customizable dashboards. The platform includes collaboration tools and integrates with LIMS. Use cases include drug discovery, clinical trial analysis, and biomarker identification.
Looker is a data insights solution for business intelligence platform. Key capabilities include data exploration and visualization, customizable dashboards, and real-time data analytics.
Reworkd automates web data extraction for sales, marketing, and research teams. Extract prospect information, competitive intelligence, and market data at scale without technical expertise.
Chonkie is an open-source data ingestion tool for AI. It supports real-time data ingestion from multiple sources, offers customizable data transformation pipelines, and integrates with AI frameworks. A dashboard is included for monitoring.
Provides advanced rerankers and embeddings for semantic search and retrieval across documents, websites, and databases. Delivers human-level accuracy and speed for developers, enterprises, and platforms needing precise information retrieval at scale.
Evidence is a powerful B2B SaaS tool that enables users to build, version control, and publish data products using SQL, markdown, and AI, enhancing the data analytics experience.
PandasAI is an open-source data-integration platform. It processes natural language for data queries, integrates with data sources, and offers real-time data visualization. The tool provides customizable reporting and collaboration features.
IOMETE is a self-hosted data lakehouse designed for the AI era, ensuring data ownership, privacy, and cost efficiency while providing flexible deployment options.
At Invert, we’re uniting cutting-edge technology and AI with the science of bioprocessing to accelerate therapies and sustainable bioproducts worldwide. Join us on a mission that matters.
SID automates data retrieval using AI algorithms that connect to multiple data sources. It provides real-time analytics, customizable search parameters, and reporting tools designed for non-technical users analyzing large datasets.
Klarity is an AI analytics platform that generates insights from business data and provides personalized recommendations. Teams use dashboards and automated reports to make faster, data-driven decisions.
Data Driven Bioscience provides rapid cancer genomics profiling with DNA and RNA sequencing completed in 2 days. Reports integrate directly into EHRs, enabling clinicians to access genomic insights in their existing workflows. Used for diagnosis, treatment planning, and monitoring.
Firecrawl provides web scraping and browser automation tools to extract real-time data from websites. Enables data scientists and AI developers to collect, process, and integrate web data into machine learning models and business applications.
Roamaround uses AI to map and visualize data with real-time collaborative dashboards. Project teams track progress, customize views, and share insights through interactive visualizations integrated with productivity tools.
MinusX is an AI data science assistant that automates analysis in Jupyter notebooks and creates visualizations in Metabase. Teams clean data, build models, and communicate insights faster.
HyperGlue applies natural language processing to analyze text data from multiple sources and extract business insights. It provides sentiment analysis, customizable reporting, and real-time visualization dashboards for data-driven decision making.
HouseCanary provides AI-driven property valuation and market analysis across 130+ million properties. Uses 1,000+ data points per property to generate institutional-grade valuations and risk assessments for real estate investors.
Iambiq Technologies automates text-based data processing. It extracts and analyzes text data from documents using Natural Language Processing. Features include customizable workflows, real-time analytics, and integration with existing data management systems. It processes large volumes of text data efficiently.
Mundo AI provides curated multilingual training data for building and improving AI models. It offers customizable datasets, real-time updates, quality assurance, and API access for easy integration into model training workflows.
Crustdata provides real-time data APIs that enable AI agents to access fresh, structured, and verifiable data directly from sources of truth, replacing outdated data infrastructure built for human consumption.
Mozart Data integrates and transforms data from multiple sources into a centralized warehouse. Provides ETL automation, data modeling, and BI-ready dashboards for analytics teams.
Eventual is an AI data engine designed for processing data across any modality and scale. It enables building and managing customizable data pipelines, real-time analytics, and integrates with existing tools to handle large datasets efficiently.
Shaped is a real-time search and recommendation engine for feeds, searches, and AI agents. It indexes data from multiple sources with customizable ranking algorithms and fast query processing.
Artificial Societies simulates entire populations and their interactions using AI, enabling organizations to test policies, forecast outcomes, and understand complex social dynamics. Used for urban planning, policy evaluation, economic forecasting, and research.
David AI provides high-quality audio datasets and real-time audio processing for training and deploying audio AI models. It enables building speech recognition systems, voice activity detection, and acoustic analysis.
Livedocs is an AI-driven data analysis tool that enables businesses to quickly derive insights from their data, enhancing decision-making and operational efficiency.
Mito provides spreadsheet-like UI for Python data analysis with real-time collaboration. Automate data cleaning, build visualizations, and export code directly to Jupyter notebooks or Python scripts. Use for exploratory analysis, dashboards, and BI workflows without leaving your notebook.
Dartboard Energy provides real-time and predictive analytics for electricity markets. It offers customizable dashboards, reporting, and automated alerts for market fluctuations. The tool integrates with energy trading platforms.
The Synthesis Company uses AI to accelerate scientific evidence synthesis and literature reviews. It helps researchers analyze and summarize medical and academic research 100x faster.
Mica replaces humans in fixing bad data by deploying AI agents that resolve non-happy path errors in data pipelines. It connects to your tech stack, uses business context to handle exceptions, and ensures scalable, auditable, and cost-effective data operations.
Honeydew is a semantic layer that creates a single source of truth for data across BI and AI platforms. It standardizes data definitions, enforces governance policies, and enables self-service analytics. Used by analytics teams to ensure data consistency and reduce query errors.