How to build E-commerce Data Pipeline with Airbyte?
Create a seamless and efficient data pipeline for e-commerce analytics. Dive into the practical implementation of a data workflow using Airbyte, dbt, Dagster, and Google BigQuery.
Chat with your data using OpenAI, Pinecone, Airbyte and Langchain
Learn how to build a connector development support bot for Slack that knows all your APIs, open feature requests and previous Slack conversations by heart
Measure Customer Support Sentiment Analysis with GPT, Airbyte and MindsDB
Learn how to measure customer support sentiment analysis using GPT, Airbyte, and MindsDB. Set up sentiment analysis of Intercom chats, extract and analyze the data with GPT models, and visualize the results using Metabase.
Airbyte and LlamaIndex: ELT and Chat with your data warehouse without writing SQL
Learn how to chat with your data warehouse using Airbyte and LlamaIndex. Discover the power of querying databases with natural language, bypassing the need for SQL expertise and memorization of complex database schemas.
How to implement AI data pipeline: Langchain, Dagster & Airbyte
Learn how to set up a maintainable and scalable pipeline for integrating diverse data sources into large language models using Airbyte, Dagster, and LangChain.
How to write an Airbyte Python Destination: DuckDB
A guide on how to create a Python Destination (DuckDB). Code snippets linked to a single PR.
Airflow and Airbyte OSS - Better Together
Learn how to create an Airflow DAG (directed acyclic graph) that triggers Airbyte synchronizations.
Deploy a Self-service Business Intelligence Project With Whaly & Airbyte
Learn how to move your data to a data warehouse with Airbyte, model it, and build a self-service layer with Whaly’s BI platform.
Validate data replication pipelines with data-diff
Learn to replicate data from Postgres to Snowflake with Airbyte, and compare replicated data with data-diff.
Version control Airbyte configurations with Octavia CLI
Use Octavia CLI to import, edit, and apply Airbyte application configurations to replicate data from Postgres to BigQuery.
Explore Airbyte's Change Data Capture (CDC) replication
Learn how Airbyte’s Change Data Capture (CDC) synchronization replication works.
Export Postgres data to CSV, JSON, Parquet and Avro files in S3
Learn how to easily export Postgres data to CSV, JSON, Parquet, and Avro file formats stored in AWS S3.
Explore Airbyte's incremental refresh data synchronization
Learn how Airbyte’s incremental synchronization replication modes work.
Build an open data lakehouse with Dremio and Airbyte
Learn how to move all your data to a data lake and connect your data lake with the Dremio lakehouse platform.
Explore Airbyte's full refresh synchronization
Learn the inner workings of Airbyte’s full refresh overwrite and full refresh append synchronization modes.
MySQL CDC: Build an ELT pipeline from MySQL Database
Easily set up MySQL CDC using Airbyte, harnessing the power of a robust tool like Debezium to construct a near real-time ELT pipeline.
Build a connector to extract data from the Webflow API
Learn how to use Airbyte’s Python CDK to write a source connector that extracts data from the Webflow API.
How to Load Data Into Databricks Lakehouse
Learn how to load data to a Databricks Lakehouse and run simple analytics.
Identify data quality issues on data ingestion pipelines with dbt and re_data
Learn how to detect data quality issues on your Airbyte syncs with re_data.
Build an EL(T) from Postgres CDC (Change Data Capture)
Set up Postgres CDC (Change Data Capture) in minutes using Airbyte, leveraging Debezium to build a near real-time EL(T).
Create an open-source dbt package to analyze Github data
Learn how to create a dbt package to analyze Github data extracted with Airbyte.
Orchestrate data ingestion and transformation pipelines with Dagster
Learn how to ingest and transform Github and Slack data with SQL and Python-based transformations.
Orchestrate ELT pipelines with Prefect, Airbyte and dbt
Learn how to build an ELT pipeline to discover GitHub users that have contributed to the Prefect, Airbyte, and dbt repositories.
How to Build a Single Customer View in 3 Quick Steps
Learn how to use a data integration tool (Airbyte) and a data transformation tool (dbt) to create a single customer view on a data warehouse (BigQuery).
Set up a modern data stack with Docker
Learn how to quickly set up a modern data stack using Docker Compose with Airbyte, BigQuery, dbt, Airflow and Superset.
How to Scrape LinkedIn Profiles Using Airflow & BeautifulSoup
Learn how to easily automate your LinkedIn Scraping with Airflow and Beautiful Soup.
Visualize the time spent by your team in Zoom calls
Learn how to visualize how much time your team is spending in Zoom calls with the Airbyte Zoom connector and Tableau.
How to Build a Slack Analytics Dashboard Using Apache Superset
Build a Slack activity dashboard quickly using the Slack Airbyte connector and Apache Superset.
How to Perform PostgreSQL Replication in 4 Quick Steps
Experience swift Postgres replication, effortlessly transferring data between databases in just 10 minutes.
Build a GitHub Analytics Dashboard Using Metabase & Airbyte
Using the Airbyte GitHub connector and Metabase, we can create insightful dashboards for GitHub projects.
Search your entire Slack history on a free plan
Learn how to bypass Slack's message history restriction and access all of your messages, even if you aren't on a paid Slack plan.
Forecast purchase orders for your Shopify store with MindsDB
Use the integrated machine learning in MindsDB to forecast Shopify store metrics.
Building a Knowledge Management System with PyAirbyte and Vector Databases
Discover how to build efficient knowledge management systems using PyAirbyte and vector databases for streamlined data access.
Automating Customer Support Analytics: Zendesk + Airbyte + OpenAI Integration
Automate customer support analytics with Zendesk, Airbyte, and OpenAI integration. Unlock insights and enhance support efficiency.
Creating a GitHub Documentation Chatbot Using PyAirbyte and pgvector
Learn how to build a GitHub documentation chatbot with PyAirbyte and PG Vector for seamless data retrieval and enhanced user experience.
Healthcare Data Integration: FHIR API Connector with Airbyte's AI Assistant
Streamline healthcare data integration with Airbyte's AI Assistant and FHIR API connector. Simplify workflows and improve insights.
Financial Market Monitoring with Airbyte and Polygon.io Integration
Discover financial market monitoring using Airbyte and Polygon.io integration. Streamline data for actionable insights
Building a Social Media Sentiment Analyzer Using Airbyte and Twitter API
Build a social media sentiment analyzer using Airbyte and Twitter API. Simplify data integration and analyze trends effectively.
Full-Stack AI Task Prioritization Chatbot with Asana, Airbyte, Milvus, and Next.js
Build a quick full-stack AI application which arranges your Asana tasks for you in order of priority using MIlvus, Airbyte Cloud, and Next.js.
A Beginner's Guide to Qdrant: Installation, Setup, and Basic Operations
Learn how to install and set up Qdrant, a powerful vector database for AI applications. This beginner's guide walks you through basic operations to manage and query embeddings.
One-click Data Pipeline for Profitability in E-commerce
In a world where e-commerce business models are relatively uniform, lies a huge opportunity in analytics of building modular, reusable data transformation models. This tutorial is about open sourcing the full end to end pipeline around a critical use case for every e-commerce: profitability calculation!
Sentiment analysis using Google sheets and Snowflake Cortex
Learn how to load user review data from Google Sheets intoS nowflake Cortex based vector store, and perform sentiment analysis using Snowflake Cortex's sentiment function.
Extract crypto data to Snowflake using PyAirbyte
Learn how to use PyAirbyte to extract cryptocurrency data from CoinAPI.io, and load it to Snowflake, followed by a series of transformations and analyses to derive meaningful insights from this data.
Extract insights from GitHub using PyAirbyte
Learn how to use PyAirbyte to extract data from Github, followed by a series of transformations and analyses to derive meaningful insights from this data. In particular, we demonstrate PyAirbyte capabilities for extracting data incrementally.
Extract data from Postgres using PyAirbyte
Learn how to leverage PyAirbyte and use Postgres as a Cache, while running in a Google Colab only. It installs packages on the system and requires sudo access.
Extract insights from Shopify using PyAirbyte
Learn how to use PyAirbyte to extract product-related data from Shopify, followed by a series of transformations and analyses to derive meaningful insights from this data.
Extract insights from Google Analytics 4 using PyAirbyte
Learn how to use PyAirbyte to extract data from Google Analytics 4, followed by a series of transformations and analyses to derive meaningful insights from this data.
Extract crypto data from CoinAPI using PyAirbyte
Learn how to use PyAirbyte to extract cryptocurrency data from CoinAPI.io, followed by a series of transformations and analyses to derive meaningful insights from this data.
End-to-end RAG using GitHub, PyAirbyte, and Langchain
Learn how to use the PyAirbyte library to read records from Github, converts those records to documents, which can then be passed to LangChain for RAG.
Leverage PyAirbyte with this demo
This is a demo of how you can leverage PyAirbyte to load the source data and read it from PyAirbyte cache, read its progress, create graphs and more.
Perform RAG with Vectara
Learn how to use data stored in Airbyte's Vectara destination to perform RAG.
Build an AI chatbot with Snowflake Cortex
Lean how to use data stored in Airbyte's Snowflake Cortex destination to perform RAG by building a Product Assistant—an AI chatbot capable of answering product-related questions using data from multiple Airbyte-related sources.
Low-Latency Data Availability Stack
Build a Low-Latency Data Availability solution that syncs data from an existing Postgres database to a BigQuery dataset using Airbyte, using Change Data Capture (CDC) and Postgres Write Ahead Log (WAL).
Weather Data Stack with dbt, Dagster and BigQuery
Easily set up a data stack using Airbyte, dbt, BigQuery, and Dagster to pull weather data from WeatherStack API, put it into BigQuery, and play around with it using dbt and Dagster.
Database snapshot to S3 then to warehouse
Build a full data stack that creates a table snapshot from a database and stores it in an Amazon S3 bucket as a JSONL file using Airbyte and then loads the snapshot file to a preferred data warehouse.
Optimizing error resolution with Sentry, dbt, Dagster and Snowflake
Configure an error analysis stack utilizing Sentry, Airbyte, Snowflake, dbt, and Dagster.
ELT simplified Stack With Github, Airbyte, dbt, Prefect and BigQuery
Build an "ELT simplified Stack" repository to pull Github data, put it into BigQuery, and play around with it using dbt and Prefect.
End-to-end RAG using Airbyte's Terraform, dbt, Notion, BigQuery and Pinecone
Learn how to build a full data stack using Airbyte Cloud, Terraform, and dbt to move data from Notion -> BigQuery -> Pinecone for interacting with fetched data through an LLM and form a full fledged RAG.
End-to-end RAG using Airbyte Cloud, S3, BigQuery and Pinecone
Learn how to build a full data stack using Airbyte Cloud, Terraform, and dbt to move data from S3 -> BigQuery -> Pinecone for interacting with fetched data through an LLM and form a full fledged RAG.
Customer Segmentation Analytics Stack With Shopify, Airbyte, dbt, Dagster and BigQuery
Learn how to easily set up a data stack using Shopify, Airbyte, dbt, BigQuery, and Dagster. Pull Shopify data, put it into BigQuery, and play around with it using dbt and Dagster.
End-to-end RAG using a file source, PyAirbyte, Pinecone, and Langchain
Learn how to build a RAG pipeline, extracting data from a file source using PyAirbyte, storing it in a Pinecone vector store, and then using LangChain to perform RAG on the stored data.
End-to-end RAG using Airbyte Cloud, S3 and Vectara
Learn how to build an end-to-end RAG pipeline, extracting data from S3 using Airbyte Cloud to load it on Vectara and set up a RAG there.
Using PyAirbyte as a data orchestrator for hosted Airbyte connections
Learn how to automate and monitor Airbyte Cloud sync jobs using PyAirbyte. It includes setting up job executions, handling dependencies, sending real-time status updates, and visually representing job details and outcomes on a timeline.
End-to-end RAG using Salesforce, Airbyte Cloud and Weaviate
Learn how to build an end-to-end RAG pipeline, extracting data from Salesforce using Airbyte Cloud to load it on Weaviate and set up a RAG there.
End-to-end RAG with Airbyte Cloud, Microsoft Sharepoint, and Milvus (Zilliz)
Learn how to build an end-to-end RAG pipeline, extracting data from Microsoft Sharepoint using Airbyte Cloud, loading it on Milvus (Zilliz), and then using LangChain to perform RAG on the stored data.
Storing GitHub vector data into Snowflake Cortex using PyAirbyte
Learn how to load data from GitHub airbyte-source into Snowflake using PyAirbyte, and afterwards convert the stream data into vector.
End-to-end RAG with Airbyte Cloud, Google Drive, and Snowflake Cortex
Learn how to build an end-to-end Retrieval-Augmented Generation (RAG) pipeline. We will extract data from Google Drive using Airbyte Cloud to load it on Snowflake Cortex.
PyAirbyte Custom Snowflake Cache Demo
Learn how to use PyAirbyte to ingest cryptocurrency data from CoinAPI.io into Snowflake.
End-to-end RAG using Jira, PyAirbyte, Pinecone, and LangChain
Learn how to build a RAG pipeline, extracting data from Jira using PyAirbyte, storing it in a Pinecone vector store, and then using LangChain to perform RAG on the stored data.
End-to-end RAG using Milvus Lite and PyAirbyte - fully in Python
Learn how to build a simple RAG (Retrieval-Augmented Generation) pipeline with Milvus Lite and PyAirbyte, for a fully local development in Python.
End-to-end RAG using Github, PyAirbyte and Chroma Vector DB
Learn how to set up a RAG pipeline from GitHub, using PyAirbyte, storing the data in Chroma, using LangChain to perform RAG on the stored data.
Illustrating the Usage of langchain _airbyte Package
The langchain-airbyte package integrates LangChain with Airbyte. It has a very powerful function AirbyteLoader which can be used to load data as document into langchain from any Airbyte source.
How to add custom source to PyAirbyte using the no-code builder
Learn how to add custom sources built from the Connector Builder to PyAirbyte, Airbyte's open-source Python library.
End-to-end RAG using Github, PyAirbyte and Weaviate
Learn how to load data from Github into Weaviate using PyAirbyte, then to use source-github and its stream 'issues'.
End-to-end RAG using Airbyte Cloud, S3 and Pinecone
Learn how to build a full data stack using Airbyte Cloud, Terraform, and dbt to move data from S3 -> BigQuery -> Pinecone for interacting with fetched data through an LLM and form a full fledged RAG.
End-to-end RAG using Facebook Marketing, PyAirbyte, Milvus (Zilliz), and Langchain
Learn how to use PyAirbyte to load data from Facebook marketing, store the data in Milvus (Zilliz) vector store and perform a short RAG demo (using OpenAI/LangChain).
Quickstart for End-to-end RAG using Gitlab, PyAirbyte, and Qdrant
Learn how to build an end-to-end RAG pipeline, extracting data from Gitlab using PyAirbyte, storing it in Qdrant, and then using LangChain to perform RAG on the stored data.
End-to-end RAG using Google Drive, PyAirbyte, Pinecone, and LangChain
Learn how to build an end-to-end RAG pipeline, extracting data from Google Drive using PyAirbyte, storing it in Pinecone, and then using LangChain to perform RAG on the stored data.
RAG based recommendation system on Shopify, using PyAirbyte, Langchain and Pinecone
Learn how to build an end-to-end RAG pipeline, extracting data from Shopify using PyAirbyte, storing it on Pinecone, and then use LangChain to perform RAG on the stored data.
Building data chat agent with PyAirbyte Polygon.io source & Langchain
Learn how to use polygon.io as a data source and use the Langchain experimental agent.
Streamlining Amazon Product Review Analysis with Apify and Snowflake Cortex
Learn how to scrape customer reviews from an Amazon product page, loading the data into Snowflake Cortex, and performing summarization.
End-to-end RAG using S3, PyAirbyte, Pinecone, and Langchain
Learn how to build an end-to-end RAG pipeline, extracting data from an S3 bucket using PyAirbyte, storing it in a Pinecone vector store, and then use LangChain to perform RAG on the stored data.
Scraping web data from Apify source into Airbyte for Langchain
Learn how to scrape data from a website and load it in a database using PyAirbyte and LangChain. Integrating web data into LLMs can enhance their performance by providing up-to-date and relevant information.
End-to-end RAG using Airbyte Cloud, S3 and Snowflake Cortex
Learn how to set up an end-to-end RAG pipeline using Airbyte Cloud, Amazon S3, and Snowflake Cortex.
Airbyte, dbt and Prefect (PAD) Stack with BigQuery
Explore the synergy of Airbyte, Prefect, dbt, and BigQuery with this quickstart template! Seamlessly extract data from Postgres, transform it using dbt, and manage workflows effortlessly with Prefect.
Migrating Data from MongoDB to Postgres with Airbyte
Seamlessly synchronize NoSQL MongoDB data to MySQL databases using the Airbyte terraform provider, packed with flexibility for diverse integrations.
Postgres Snowflake Data Integration Stack
Navigate Postgres to Snowflake integrations effortlessly with Airbyte and terraform. Launch into a world of data connectivity with our streamlined template.
Postgres to MySQL Database Migration Stack
Learn how to migrate tables and data between databases with Airbyte, leveraging the strengths of Change Data Capture and Postgres Write Ahead Log.
API to Warehouse Basic Stack with Airbyte
Seamlessly extract data from diverse APIs using Airbyte and store it in popular data warehouses. This post spotlights the integration of Github API with Snowflake.
Airbyte, dbt and Prefect (PAD) Stack with Snowflake
Unlock the power of Airbyte, Prefect, dbt, and Snowflake with this comprehensive quickstart template. Extract data smoothly from Postgres, transform it with dbt, and orchestrate workflows with Prefect.
Airbyte, dbt and Airflow (ADA) Stack with Snowflake
Create a robust data stack with Airbyte & Airflow; move data from Postgres to Snowflake and transform with dbt. Your quickstart to seamless data integration and transformation.
AI Stack With Airbyte, LangChain and Dagster
Unlock the full potential of Large Language Models (LLMs) like ChatGPT with optimized data pipelines. Learn how to integrate and manage data from various sources using Airbyte, ensure scalability with Dagster, and streamline access with LangChain.
Airbyte, dbt and Dagster (DAD) Stack with Snowflake
This guide showcases a cohesive data stack template, from data extraction with Airbyte to transformation via dbt, orchestrated effortlessly with Dagster.
Outdoor Activity Analytics Stack with Airbyte, dbt, Dagster and BigQuery
Uncover trends in outdoor activities with a streamlined data stack setup. Leverage Recreation API data through Airbyte into BigQuery, refined by dbt and Dagster.
Building Data Pipeline Orchestration With Dagster, dbt, and Airbyte (DAD)
The ultimate starter for building a full data stack using Airbyte, Dagster, dbt, and BigQuery.
MySQL to PostgreSQL Incremental Data Stack
Migrate from MySQL to Postgres with minimal fuss. This quickstart template uses CDC and makes incremental data syncing a breeze. Discover how simple database migration can be with Airbyte and Terraform.
Shopping Cart Analytics Stack With Shopify, Airbyte, dbt, Dagster and BigQuery
Optimize your e-commerce strategy with the Shopping Cart Analytics quickstart. This example stack uses Shopify, Airbyte, dbt, BigQuery, and Dagster. Extract, analyze, and leverage Shopify data to gain actionable insights with ease.
MongoDB to MySQL Data Stack
Efficiently move data from MongoDB to MySQL through Airbyte's terraform provider. Start your NoSQL to SQL data synchronization with our quickstart template.
Aggregating Data from multiple sources using Airbyte's Terraform Provider, dbt, Dagster
Experience the synergy of Airbyte, dbt, and Dagster as we extract from Postgres and MySQL, transform, and load into BigQuery with ease.
Airbyte, dbt, Snowflake and Looker (ADSL) Stack
Experience a full data stack with Airbyte, dbt, Snowflake, and Looker. Effortlessly move from data extraction to insightful analytics, all within one cohesive template.