category

DatabaseMachine learningKuberneteseCommerceCloudWeb Application
Airflow Data Pipeline: Migrating and Cleansing Bigtable Data to Snowflake

2023-12-16

A practical implementation of a Kubernetes-based Airflow 2.9 pipeline to transfer and clean data from Google Bigtable to Snowflake using scalable infrastructure and programmatic validation steps.

Delta Lake or Hive on AWS – Making an Informed Decision

2023-07-03

Comparing Delta Lake and Hive on AWS EMR for running analytics directly against data in Amazon S3.

Kafka for IoT: High-Throughput Streaming with React Clients and CouchDB Persistence

2023-06-27

Technical specification of Kafka’s producer–consumer model for IoT fleets, with a React Native producer example and CouchDB for persistence.

Apache Airflow 2.x on Kubernetes – Production-Ready Data Orchestration for Big Data

2023-03-08

Technical specification for deploying Apache Airflow 2.x with the Kubernetes Executor in production, optimized for big data pipelines and compared to Azure Data Factory.

Real-Time Graph Analytics with Memgraph: Use Cases and Deployment for Small to Midsize Projects

Explore how Memgraph enables real-time graph analytics for fraud detection, recommendation engines, and supply chain insights. Learn how to deploy it efficiently on a budget using Kubernetes, Docker.

Latest from Our Blog

Blog Illustration

Trending

Serverless Database Showdown: Oracle, Azure, Redshift, and AuroraOrchestrating Spark on AWS EMR from Apache Airflow — The Low-Ops WayCase Study: A Lightweight Intrusion Detection System with OpenFaaS and PyTorchBuilding Resilient Kubernetes Clusters with Portworx Community EditionIntegrating Shopify into a Next.js React Web App