Extract insights from GitHub using PyAirbyte
Learn how to use PyAirbyte to extract data from GitHub, followed by a series of transformations and analyses to derive meaningful insights from this data. In particular, we demonstrate PyAirbyte's capabilities for extracting data incrementally.
Extract data from Postgres using PyAirbyte
Learn how to use PyAirbyte with Postgres as a cache. This tutorial is meant to run in Google Colab only: it installs packages on the system and requires sudo access.
Extract crypto data to Snowflake using PyAirbyte
Learn how to use PyAirbyte to extract cryptocurrency data from CoinAPI.io and load it into Snowflake, followed by a series of transformations and analyses to derive meaningful insights from this data.
End-to-end RAG using a file source, PyAirbyte, Pinecone, and LangChain
Learn how to build a RAG pipeline, extracting data from a file source using PyAirbyte, storing it in a Pinecone vector store, and then using LangChain to perform RAG on the stored data.
End-to-end RAG using Airbyte Cloud, S3, BigQuery and Pinecone
Learn how to build a full data stack using Airbyte Cloud, Terraform, and dbt to move data from S3 -> BigQuery -> Pinecone, so you can interact with the fetched data through an LLM and form a full-fledged RAG pipeline.
Storing GitHub vector data into Snowflake Cortex using PyAirbyte
Learn how to load data from the Airbyte GitHub source into Snowflake using PyAirbyte, and then convert the stream data into vectors.
One-click Data Pipeline for Profitability in E-commerce
Because e-commerce business models are relatively uniform, there is a huge opportunity to build modular, reusable data transformation models for analytics. This tutorial open-sources the full end-to-end pipeline for a use case critical to every e-commerce business: profitability calculation.
Sentiment analysis using Google Sheets and Snowflake Cortex
Learn how to load user review data from Google Sheets into a Snowflake Cortex-based vector store, and perform sentiment analysis using Snowflake Cortex's sentiment function.
Using PyAirbyte as a data orchestrator for hosted Airbyte connections
Learn how to automate and monitor Airbyte Cloud sync jobs using PyAirbyte. The tutorial covers setting up job executions, handling dependencies, sending real-time status updates, and visually representing job details and outcomes on a timeline.
PyAirbyte Custom Snowflake Cache Demo
Learn how to use PyAirbyte to ingest cryptocurrency data from CoinAPI.io into Snowflake.
End-to-end RAG using GitHub, PyAirbyte and Chroma Vector DB
Learn how to set up a RAG pipeline from GitHub using PyAirbyte, storing the data in Chroma, and then using LangChain to perform RAG on the stored data.
End-to-end RAG using Milvus Lite and PyAirbyte - fully in Python
Learn how to build a simple RAG (Retrieval-Augmented Generation) pipeline with Milvus Lite and PyAirbyte, for fully local development in Python.
End-to-end RAG using Jira, PyAirbyte, Pinecone, and LangChain
Learn how to build a RAG pipeline, extracting data from Jira using PyAirbyte, storing it in a Pinecone vector store, and then using LangChain to perform RAG on the stored data.
Illustrating the Usage of the langchain-airbyte Package
The langchain-airbyte package integrates LangChain with Airbyte. Its AirbyteLoader can be used to load data from any Airbyte source into LangChain as documents.
End-to-end RAG using Facebook Marketing, PyAirbyte, Milvus (Zilliz), and LangChain
Learn how to use PyAirbyte to load data from Facebook Marketing, store the data in a Milvus (Zilliz) vector store, and perform a short RAG demo (using OpenAI/LangChain).
End-to-end RAG using GitHub, PyAirbyte and Weaviate
Learn how to load data from GitHub into Weaviate using PyAirbyte, with source-github and its 'issues' stream.
End-to-end RAG using Airbyte Cloud, S3 and Pinecone
Learn how to build a full data stack using Airbyte Cloud, Terraform, and dbt to move data from S3 -> BigQuery -> Pinecone, so you can interact with the fetched data through an LLM and form a full-fledged RAG pipeline.
End-to-end RAG using Google Drive, PyAirbyte, Pinecone, and LangChain
Learn how to build an end-to-end RAG pipeline, extracting data from Google Drive using PyAirbyte, storing it in Pinecone, and then using LangChain to perform RAG on the stored data.
Building a data chat agent with the PyAirbyte Polygon.io source & LangChain
Learn how to use Polygon.io as a data source together with the LangChain experimental agent.
Quickstart for End-to-end RAG using GitLab, PyAirbyte, and Qdrant
Learn how to build an end-to-end RAG pipeline, extracting data from GitLab using PyAirbyte, storing it in Qdrant, and then using LangChain to perform RAG on the stored data.
RAG-based recommendation system on Shopify, using PyAirbyte, LangChain and Pinecone
Learn how to build an end-to-end RAG pipeline, extracting data from Shopify using PyAirbyte, storing it in Pinecone, and then using LangChain to perform RAG on the stored data.
Streamlining Amazon Product Review Analysis with Apify and Snowflake Cortex
Learn how to scrape customer reviews from an Amazon product page, load the data into Snowflake Cortex, and perform summarization.
End-to-end RAG using S3, PyAirbyte, Pinecone, and LangChain
Learn how to build an end-to-end RAG pipeline, extracting data from an S3 bucket using PyAirbyte, storing it in a Pinecone vector store, and then using LangChain to perform RAG on the stored data.
Scraping web data from the Apify source into Airbyte for LangChain
Learn how to scrape data from a website and load it into a database using PyAirbyte and LangChain. Integrating web data into LLMs can enhance their performance by providing up-to-date and relevant information.
End-to-end RAG using Airbyte Cloud, S3 and Snowflake Cortex
Learn how to set up an end-to-end RAG pipeline using Airbyte Cloud, Amazon S3, and Snowflake Cortex.
Oracle Database Replication: Step by Step Guide + Tools
Learn how to set up Oracle database replication with our comprehensive step-by-step guide. Additionally, discover the best tools for efficient replication.
SQL Developer Export to Excel & CSV: Three Easy Ways
This guide covers popular, straightforward methods for exporting data from Oracle SQL Developer to Excel and CSV, along with some best practices.
How to do Salesforce Data Integration: The Use Cases
This article helps you understand Salesforce data integration, Salesforce data integration tools, and their common use cases.
A Comprehensive Guide to BigQuery Data Types
A brief guide to every BigQuery data type.
dbt Core vs. dbt Cloud: Know the Differences
Take a look at the dbt Core vs. dbt Cloud comparison to gauge which tool is better suited for your business requirements.