Analytics platforms, data pipelines, BI tools, and data science frameworks with AI.
100 tools
Databricks is a unified analytics platform. Key capabilities include collaborative notebooks for data science and engineering, real-time data processing with Apache Spark, and machine learning model management and deployment.
Modern business intelligence platform built on AI-native architecture. Query data with natural language, build interactive dashboards, and share insights across teams.
Data research and preparation service for improving large language model performance. The LLM Data Company specializes in curating and validating post-training datasets to enhance model accuracy and reduce hallucinations.
HelixDB combines graph and vector capabilities in a single database, enabling both relationship queries and semantic search. Built for teams building AI applications, recommendation engines, and knowledge graphs at scale.
Snowflake is a cloud-native data warehouse that separates storage and compute for flexible scaling. Teams query structured and semi-structured data in real time, share datasets across organizations securely, and integrate machine learning models without moving data.
Data security platform that monitors third-party GenAI tools for unauthorized data exposure. Scans LLM usage for compliance violations, audits data transfers, and blocks high-risk AI tool usage in real time.
Neural network platform for brain research and clinical applications. Processes neuroimaging data (fMRI, EEG, MEG) with deep learning models to generate brain maps, predict treatment responses, and support clinical decision-making. Used by neuroscience labs, hospitals, and research institutions.
Turntable is an analytics OS that automates data visualization and real-time reporting. It connects to SQL databases and data warehouses to deliver collaborative dashboards without custom code. Teams use it to explore data, track KPIs, and share insights across departments.
Airtrain AI provides a no-code platform for data curation and annotation at scale. It automates dataset preparation, labeling, and quality evaluation for LLM fine-tuning. Teams can integrate with major LLM frameworks and track model performance with built-in evaluation metrics.
Basedash is a database client that generates BI dashboards from your existing SQL data without code. Query any PostgreSQL, MySQL, or MongoDB database to build custom reports, track metrics, and collaborate with non-technical stakeholders in minutes.
Automorphic is an AI framework for training domain-specific language models with minimal data. Embed specialized knowledge into models without large labeled datasets. Includes real-time collaboration and performance analytics.
BIOS Health decodes neural signals from implantable and wearable devices, converting real-time nerve data into actionable clinical insights. It uses machine learning to interpret neural patterns, integrates with EHRs, and provides a dashboard for clinicians to monitor patient status and adjust treatment.
LanceDB is an open-source vector database engineered for similarity search at scale. It provides serverless compute, sub-millisecond latency, and native integration with Python ML frameworks. AI engineers use it to store embeddings, power semantic search, and serve RAG and recommendation pipelines without managing infrastructure.
Data labeling and annotation for LLM training datasets. Provides crowdsourced and expert labeling, quality control, and integration with Hugging Face. Used by ML teams to prepare domain-specific training data for fine-tuning.
Fivetran is an automated data integration platform. Key capabilities include automated data connectors, real-time data replication, and schema migration.
dbt is a data transformation tool. Key capabilities include SQL-based data transformation, version control for data models, and automated testing of data transformations.
Great Expectations is an open-source data quality framework that validates data pipelines through testable expectations. Define checks as code, run them continuously against production data, and get documentation and data profiling reports. Used by data engineers to prevent bad data from reaching ML models and dashboards.
Sciloop is an AI co-scientist that automates machine learning experiment management and analysis. It tracks hyperparameters, metrics, and datasets across experiments, compares model performance side-by-side, and provides automated insights on what drives accuracy. Integrates with TensorFlow, PyTorch, and scikit-learn. Built for data scientists to eliminate manual tracking and speed up iteration.
High-resolution spatial data platform for robotics and machine learning. Collects 3D environmental data, processes it in real time, and provides customizable datasets via API. Powers perception models for autonomous systems and robotic applications.
Monte Carlo is a data observability platform that monitors data pipelines for quality issues, freshness delays, and schema changes. Detects anomalies in volume, distribution, and integrity, then alerts teams before broken data reaches downstream systems.
Prefect orchestrates data pipelines and workflows with Python-native syntax, automatic error handling, and observability. Execute ETL jobs, schedule dependencies, and monitor data processing with real-time insights.
camelAI is an AI-powered business intelligence platform that analyzes data in real time and generates actionable insights. It features customizable dashboards, predictive analytics, automated reporting, and natural language query processing. Teams use it to identify trends, make data-driven decisions, and automate report generation.
Unsiloed AI parses unstructured data (documents, images, audio, video) using multimodal AI APIs. Extract structured data from messy sources in real time. Integrates directly into your data pipeline with analytics and reporting.
AWS (Amazon Web Services) is a comprehensive cloud computing platform that offers 200+ services as an alternative to traditional infrastructure. It offers scalable computing via EC2, managed databases through RDS, serverless functions with Lambda, and content delivery via CloudFront. Designed for developers, startups, and enterprises seeking flexible cloud infrastructure without managing physical servers.
Tableau is a business intelligence platform that converts raw data into interactive dashboards and reports. Sales, marketing, and operations teams use it to visualize metrics, identify trends, and monitor KPIs in real time.
Ecliptor provides NLP and customizable embedding models for analyzing large unstructured datasets in real time. It connects to popular data sources via API, enabling analysis, visualization, and model building on text-heavy data.
Anomaly AI detects unusual patterns in large spreadsheet datasets and generates real-time alerts. Query data using natural language, get visual insights, and identify financial discrepancies or operational outliers automatically.
Aquarium Learning helps machine learning teams improve model performance by assessing and enhancing dataset quality. It offers tools for bias identification, automated augmentation, and version control, facilitating collaboration and reproducibility.
Strand AI curates multimodal biological datasets (genomics, proteomics, imaging) optimized for AI model training. Researchers access integrated datasets, visualize complex biology, and collaborate on data annotation for drug discovery and biomarker studies.
Zeit AI lets business users query datasets using plain English instead of SQL, automatically generating visualizations and reports. It powers real-time dashboards and KPI monitoring for data teams and executives who need insights without writing queries.
nao Labs is an open-source analytics agent. It collects and analyzes real-time data, integrates with multiple sources, and provides customizable dashboards. It supports analyzing customer behavior, tracking marketing campaigns, and visualizing sales data.
Travo analyzes real estate market data with AI to provide property valuations, market trend insights, and investment analysis. It integrates with CRM systems to help agents and investors make data-backed decisions.
Sureform collects, annotates, and validates human data for robotics and embodied AI systems. It provides real-time data validation and integrates with ML frameworks for training models on human motion and manipulation tasks.
DeepGrove provides AI-driven real-time data analytics and insights. It offers customizable dashboards, collaboration tools, cross-device compatibility, and predictive analytics for analyzing complex datasets.
Kater.ai is a business intelligence tool. It processes natural language for data queries and provides real-time data visualization. Features include customizable dashboards, collaboration tools, automated reporting, and alerts. It generates business reports, analyzes market trends, and monitors financial performance.
Data platform for preparing datasets and training machine learning models. Automates data preprocessing, feature engineering, and model training pipelines. Built for data teams managing end-to-end ML workflows.
Trainy manages GPU clusters for AI/ML workloads. It automates GPU resource allocation, provides real-time monitoring, and supports multi-cloud model deployment. The dashboard allows cluster management for AI/ML model training and workload orchestration.
Collaborative data annotation platform for machine learning teams. Provides version control, customizable workflows, and integration with popular ML frameworks. Designed for data scientists and research teams to label training data efficiently.
Novaflow analyzes biological experiment data with real-time insights and automated reporting. The platform integrates with lab instruments, provides data visualization dashboards, and enables team collaboration on research findings.
Findly is an AI co-pilot for business intelligence. It translates natural language questions into queries, generates reports, and builds dashboards automatically from your data.
Lightly automates data labeling and collaborative annotation for ML teams. Integrates with popular ML frameworks and provides real-time data insights. Used by machine learning engineers and data scientists to improve dataset quality and accelerate model training.
AskYourDatabase enables natural language queries across multiple database types, eliminating the need for SQL knowledge. It provides real-time data visualization and custom dashboards for analytics and reporting.
Ardis AI automates text analysis and extraction, converting unstructured data into searchable knowledge graphs. It integrates with existing data sources, provides dynamic visualization, and enables advanced natural language search. Teams use it to make text data accessible and queryable without manual processing.
Encord provides data annotation tools for machine learning teams with collaboration features, dataset version control, and integration with ML frameworks. It manages data storage, supports model training workflows, and enables real-time data analysis.
Aluna is a biomedical data platform designed for AI research. It provides comprehensive data sets, AI-driven analysis tools, and real-time updates. Researchers use it to analyze biomedical data efficiently and generate AI-driven insights. The platform also offers customizable reporting and integrates with machine learning frameworks.
Crunched is an AI-powered Excel analysis tool. It offers AI-driven data analysis, automated report generation, and advanced data visualization. Teams can collaborate and integrate with popular data sources. It streamlines data analysis and automates reporting tasks.
Monarcha is a spatial data platform combining AI analysis with geospatial visualization. It processes satellite imagery, climate data, and location-based datasets to generate predictive insights. Customizable dashboards and APIs enable automated reporting for urban planning, environmental monitoring, and logistics optimization.
Analyze single-cell performance and interactions with advanced imaging and machine learning. Designed for researchers and biologists studying cellular behavior, this platform combines microscopy data with AI algorithms to accelerate discovery and understand complex cell dynamics at scale.
Scale AI provides data labeling and annotation services. It offers automated data pipeline management and generates high-quality training data. The tool includes real-time data validation and quality assurance, with scalable infrastructure for AI model training.
Chamber automates AI model deployment and production monitoring. It handles model scaling, performance tracking, and infrastructure management. The platform integrates with existing ML pipelines and provides a dashboard for observability.
Sieve combines AI algorithms and human review for data cleaning. It offers API access, an Excel plugin, and real-time validation. Use it to clean large datasets, ensure accuracy, and integrate data from multiple sources.
Cinder automates data labeling and provides AI bias detection for machine learning models. Includes fairness testing, model evaluation tools, and collaboration workflows for data annotation.
Mercator provides AI-assisted data analytics. It features real-time data processing, predictive analytics via machine learning, customizable dashboards, and collaboration tools. Data visualization is offered through interactive charts. Use cases include automating data analysis and visualization, improving decision-making, and enhancing data-driven reporting.
Elevate automates data integration for roll-ups. It provides AI-driven data analysis, customizable workflows, real-time synchronization, and a user-friendly dashboard. Use it to automate data integration processes, create roll-ups for reporting, and transform data for analysis.
Panels collects high-quality audio data for AI labs. It offers customizable datasets, real-time analysis, and a user-friendly interface for data management. API access integrates with existing workflows.
Louiza Labs synthesizes clinical trial data, regulatory filings, and scientific literature to accelerate pharmaceutical research decisions. It provides market sizing, competitive intelligence, and risk assessment for drug candidates in weeks instead of months.
Sarus enables analytics and machine learning on personal data while preserving privacy through anonymization and data governance. It includes real-time analytics dashboards, ML model deployment, and API integration for secure data sharing.
Instantly transform your data into actionable insights without coding. Dot analyzes complex datasets and generates clear, visual reports for business teams, enabling faster, data-driven decision-making across departments.
Voker provides real-time performance tracking and customizable dashboards for AI agents. It generates automated reports and integrates with popular AI frameworks. The tool is designed for monitoring AI agent performance and identifying areas for improvement.
Lotas provides AI tools for data science and 3D modeling. It offers automated data analysis, 3D visualization, and collaboration features. Customizable dashboards and integration with data storage solutions are included. Use cases include creating 3D models from data and automating analysis workflows.
Evidently AI monitors ML models in production, detecting data drift and model degradation before they impact performance. It visualizes metrics across ML frameworks and enforces fairness checks for regulatory compliance.
Centauri AI extracts and transforms financial data from multiple sources. It processes data in real time, provides predictive insights via machine learning, and visualizes data through a dashboard. It integrates with existing financial systems.
Data orchestration platform with software-defined assets. Declarative approach to building, testing, and monitoring data pipelines with built-in lineage.
Redbird provides AI-driven analytics. It offers real-time data visualization, predictive analytics via machine learning, customizable dashboards/reports, and collaboration tools. Integrates with various data sources. Use cases include sales forecasting, customer segmentation, and market trend analysis.
Power BI is Microsoft's business analytics platform. Key capabilities include interactive data visualization, real-time dashboard updates, and customizable reports and analytics.
Hightouch is a data activation platform. Key capabilities include reverse ETL, real-time data syncing, and customizable data transformations.
MindsDB is an AI platform that automates machine learning and integrates with multiple data sources for real-time predictions. It provides a unified interface for connecting disparate data sources, training models, and generating AI-driven insights without manual ML engineering.
Segment is a customer data platform that collects, unifies, and routes customer data from multiple touchpoints to analytics tools, marketing platforms, and data warehouses. It enables businesses to create a single source of truth for customer information while ensuring data consistency across their entire technology stack.
Maven Bio analyzes life sciences data using AI. It provides real-time insights, reporting, and customizable dashboards. The platform includes collaboration tools and integrates with LIMS. Use cases include drug discovery, clinical trial analysis, and biomarker identification.
Looker is a business intelligence platform. Key capabilities include data exploration and visualization, customizable dashboards, and real-time data analytics.
Lumetric uses AI and natural language processing to analyze spreadsheet data, automatically generating insights and detecting trends. It creates customizable dashboards for visualization and includes collaboration tools for team sharing. Automates financial reporting, identifies sales trends, and detects data anomalies.
Reworkd automates web data extraction for sales, marketing, and research teams. Extract prospect information, competitive intelligence, and market data at scale without technical expertise.
Chonkie is an open-source data ingestion tool for AI. It supports real-time data ingestion from multiple sources, offers customizable data transformation pipelines, and integrates with AI frameworks. A dashboard is included for monitoring.
Provides advanced rerankers and embeddings for semantic search and retrieval across documents, websites, and databases. Delivers human-level accuracy and speed for developers, enterprises, and platforms needing precise information retrieval at scale.
Evidence is a B2B SaaS tool for building, version-controlling, and publishing data products using SQL, Markdown, and AI.
PandasAI is an open-source Python library for conversational data analysis. It processes natural language for data queries, integrates with data sources, and offers real-time data visualization. The tool provides customizable reporting and collaboration features.
IOMETE is a self-hosted data lakehouse designed for the AI era, ensuring data ownership, privacy, and cost efficiency while providing flexible deployment options.
Invert combines technology and AI with the science of bioprocessing to accelerate the development of therapies and sustainable bioproducts.
David AI provides high-quality audio datasets and real-time audio processing for training and deploying audio AI models. It enables building speech recognition systems, voice activity detection, and acoustic analysis.
Eventual is an AI data engine designed for processing data across any modality and scale. It enables building and managing customizable data pipelines and real-time analytics, and integrates with existing tools to handle large datasets efficiently.
Firecrawl provides web scraping and browser automation tools to extract real-time data from websites. Enables data scientists and AI developers to collect, process, and integrate web data into machine learning models and business applications.
Klarity is an AI analytics platform that generates insights from business data and provides personalized recommendations. Teams use dashboards and automated reports to make faster, data-driven decisions.
Crustdata provides real-time data APIs that enable AI agents to access fresh, structured, and verifiable data directly from sources of truth, replacing outdated data infrastructure built for human consumption.
Mito provides a spreadsheet-like UI for Python data analysis with real-time collaboration. Automate data cleaning, build visualizations, and export code directly to Jupyter notebooks or Python scripts. Use it for exploratory analysis, dashboards, and BI workflows without leaving your notebook.
Data Driven Bioscience provides rapid cancer genomics profiling with DNA and RNA sequencing completed in 2 days. Reports integrate directly into EHRs, enabling clinicians to access genomic insights in their existing workflows. Used for diagnosis, treatment planning, and monitoring.
The Synthesis Company uses AI to accelerate scientific evidence synthesis and literature reviews. It helps researchers analyze and summarize medical and academic research 100x faster.
Shaped is a search tool for real-time data retrieval. It offers customizable search algorithms and integrates with multiple data sources. Features include automated feed generation and agent-based query handling.
Mozart Data integrates and transforms data from multiple sources into a centralized warehouse. Provides ETL automation, data modeling, and BI-ready dashboards for analytics teams.
Roamaround uses AI to map and visualize data with real-time collaborative dashboards. Project teams track progress, customize views, and share insights through interactive visualizations integrated with productivity tools.
MinusX is an AI data science assistant that automates analysis in Jupyter notebooks and creates visualizations in Metabase. Teams clean data, build models, and communicate insights faster.
Mica AI automates data cleaning, validation, and enrichment using AI. It provides real-time data quality monitoring with a dashboard for insights, and integrates directly with existing data sources.
Livedocs is an AI-driven data analysis tool that enables businesses to quickly derive insights from their data, enhancing decision-making and operational efficiency.
Dartboard Energy provides real-time and predictive analytics for electricity markets. It offers customizable dashboards, reporting, and automated alerts for market fluctuations. The tool integrates with energy trading platforms.
PromptLoop automates B2B data collection from multiple sources. It offers customizable AI models for specific dataset needs, real-time data processing, and analysis. Manage datasets via a user-friendly interface and integrate with data visualization tools.
HyperGlue applies natural language processing to analyze text data from multiple sources and extract business insights. It provides sentiment analysis, customizable reporting, and real-time visualization dashboards for data-driven decision making.
HouseCanary provides AI-driven property valuation and market analysis across 130+ million properties. Uses 1,000+ data points per property to generate institutional-grade valuations and risk assessments for real estate investors.
PredictLeads provides structured company intelligence data accessible via APIs, flat files, MCP, and webhooks. It identifies high-growth companies and tracks company signals to power targeted prospecting, lead scoring, and personalized outreach.
Secoda is a data enablement platform that helps modern data teams centralize, document, and govern their data assets while enabling self-service access for business users.
Datafold automates data engineering workflows with AI-driven quality checks and anomaly detection. It integrates with data warehouses, visualizes data lineage for compliance, and includes collaboration tools to help data teams maintain data accuracy and trust.
Artificial Societies simulates entire populations and their interactions using AI, enabling organizations to test policies, forecast outcomes, and understand complex social dynamics. Used for urban planning, policy evaluation, economic forecasting, and research.