Commercial Products
Proprietary software that apache/spark can replace
Databricks
Databricks is a unified analytics platform built on Apache Spark for large-scale data processing, AI, and machine learning.
Snowflake
Snowflake is a cloud data platform providing scalable data warehousing, analytics, and processing capabilities.
Google BigQuery
Google BigQuery is a serverless, scalable data warehouse for running SQL queries on petabytes of data.
Amazon EMR
Amazon EMR is a managed cluster platform for processing large-scale data using frameworks like Spark and Hadoop.
Similar Open-Source Projects
tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
home-assistant/core
:house_with_garden: Open source home automation that puts local control and privacy first.
spring-projects/spring-boot
Spring Boot helps you to create Spring-powered, production-grade applications and services with absolute minimum fuss.
elastic/elasticsearch
Free and Open Source, Distributed, RESTful Search Engine
ApacheInfra/superset
Apache Superset is a Data Visualization and Data Exploration Platform
kdn251/interviews
Everything you need to know to get the job.
odoo/odoo
Odoo. Open Source Apps To Grow Your Business.
pandas-dev/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
ClickHouse
ClickHouse® is a real-time analytics database management system
pyenv/pyenv
Simple Python version management
aymericdamien/TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
getsentry/sentry
Developer-first error tracking and performance monitoring
faif/python-patterns
A collection of design patterns/idioms in Python
mitmproxy/mitmproxy
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
appsmith
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
Sunshine
Self-hosted game stream host for Moonlight.
SeleniumHQ/selenium
A browser automation framework and ecosystem.
donnemartin/interactive-coding-challenges
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
encode/django-rest-framework
Web APIs for Django. 🎸