https://github.com/apache/incubator-superset Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
https://github.com/apache/spark Mirror of Apache Spark
https://github.com/apache/incubator-predictionio PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
https://github.com/phanan/htaccess A collection of useful .htaccess snippets.
https://github.com/apache/kafka Mirror of Apache Kafka
https://github.com/allinurl/goaccess GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal on *nix systems or through your browser.
https://github.com/apache/incubator-weex Mirror of Apache Weex
https://github.com/apache/thrift Mirror of Apache Thrift
https://github.com/apache/cassandra Mirror of Apache Cassandra
https://github.com/apache/hadoop Mirror of Apache Hadoop
https://github.com/mesosphere/marathon Deploy and manage containers (including Docker) on top of Apache Mesos at scale.
https://github.com/apache/zookeeper Mirror of Apache ZooKeeper
https://github.com/scylladb/scylla NoSQL data store using the Seastar framework, compatible with Apache Cassandra
https://github.com/apache/mesos Mirror of Apache Mesos
https://github.com/apache/couchdb Apache CouchDB
https://github.com/orientechnologies/orientdb OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text, Geospatial and Key-Value models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries. OrientDB Community Edition is Open Source using a liberal Apache 2 license.
https://github.com/apache/zeppelin Mirror of Apache Zeppelin
https://github.com/yahoo/anthelion Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages
https://github.com/apache/flink Mirror of Apache Flink
https://github.com/Activiti/Activiti Activiti is a lightweight workflow and Business Process Management (BPM) platform targeted at business people, developers and system admins. Its core is a super-fast and rock-solid BPMN 2 process engine for Java. It is open source and distributed under the Apache license. Activiti runs in any Java application, on a server, on a cluster or in the cloud. It integrates perfectly with Spring, is extremely lightweight, and is based on simple concepts.
https://github.com/apache/rocketmq Mirror of Apache RocketMQ
https://github.com/lrsjng/h5ai HTTP web server index for Apache httpd, lighttpd, nginx and Cherokee.
https://github.com/JerryLead/SparkInternals Notes on the design and implementation of Apache Spark
https://github.com/apache/groovy Mirror of Apache Groovy
https://github.com/twitter/distributedlog A high-performance replicated log service. (Development has moved to the Apache Incubator.)