Aerospike

Aerospike is a distributed NoSQL database and key-value store architected to meet the performance needs of today’s web-scale applications—providing robustness and strong consistency with no downtime

  • Automation of operations and monitoring
  • Creation and management of clusters
  • Replication of data between clusters in different data centers using XDR (Cross Data Center Replication)
  • Kernel level optimizations for efficient operation of clusters
  • Creation of raw partitions to be used by Aerospike
  • Benchmarking of SSD's for performance using ACT benchmarking tool

Vertica

The Vertica Analytics Platform is purpose built from the very first line of code for Big Data analytics. It is designed for use in data warehouses and other big data workloads where speed, scalability, simplicity, and openness are crucial to the success of analytics. Vertica relies on a tested, reliable distributed architecture and columnar compression to deliver blazingly fast speed. A simplified license and the capability to deploy anywhere delivers on the promise of big data analytics like no other solution.

  • Automation of operations and monitoring
  • Creation and management of clusters
  • Database level optimizations for efficient OLAP processing

PostgreSQL

PostgreSQL is a powerful, open source object-relational database system. It has more than 15 years of active development and a proven architecture that has earned it a strong reputation for reliability, data integrity, and correctness.

  • Automation of operations and monitoring
  • Setup of database for OLTP and OLAP purposes
  • Automation of Streaming Replication (synchronous and asynchronous)
  • Development of PostgreSQL logs analyzer that gives (User, Database, Table, Query Type) access level reports in real time

MySQL

MySQL is the most popular Open Source Relational SQL Database Management System. MySQL is one of the best RDBMS being used for developing various web-based software applications.

  • Automation of operations and monitoring
  • Setup of database for OLTP purposes
  • Setup of various replication architectures (Master-Master, Master-Slave, Cascading, etc)

Druid

Druid is an open-source data store designed for sub-second queries on real-time and historical data. It is primarily used for business intelligence (OLAP) queries on event data. Druid provides low latency (real-time) data ingestion, flexible data exploration, and fast data aggregation. Existing Druid deployments have scaled to trillions of events and petabytes of data. Druid is most commonly used to power user-facing analytic applications.

  • Automation of operations and monitoring
  • Setup and management of clusters

Miscellaneous

  • Programming/Scripting Languages known: C, JAVA, Python, Bash
  • Operating Systems known: Linux, Windows, MacOS
  • Monitoring tools known: Diamond, Graphite, Nagios, Grafana
  • Databases comfortable with: Aerospike, Vertica, PostgreSQL, MySQL
  • Web development languages/frameworks known: Html, Css, Javascript, jQuery, Bootstrap, Django
  • Areas of Interest: Machine Learning, Data Science, Distributed Systems, Operating Systems, Networking, Cloud Computing, Network Security