Pandas read_csv from url

python

pandas

dataframe

csv

byNikita Barsukov·Mar 2, 2025

Here's how to fetch a CSV file directly from the web into the comfort of your pandas DataFrame:

import pandas as pd

# Replace "your_url" with your actual URL
df = pd.read_csv('your_url')

Do make sure your URL is a direct path to a CSV file.

Adjusting for different delimiters

Not all CSV files are created equal — some aren't even separated by commas! Fear not, pandas has a sep parameter for that:

import pandas as pd

# For instance, dealing with a rebel semicolon-separated file
df = pd.read_csv('your_url', sep=';')

See, pandas does have an eye for rebels 🕶. For secured URLs requiring authentication, consider enlisting requests to the rescue.

Pandas and Python versions

Make sure you have Pandas v0.19.2 or higher and Python 3.x within reach. These versions offer built-in URL support. If by any chance you're with older versions, use this requests.get and io.StringIO hack:

import requests
import pandas as pd
from io import StringIO

# RSS (Really, Slight Suffering) 101 
url = "your_url"
s = requests.get(url).content
df = pd.read_csv(StringIO(s.decode('utf-8')))

Advanced DataFrame loading

Tailor your DataFrame to your needs by specifying which columns to load using the usecols parameter, or prescribe column data types with dtype. A glimpse at your data is just a df.head() away:

# When you only wanna see the columns you like
df = pd.read_csv('your_url', usecols=['Column1', 'Column2'], dtype={'Column1': float})
print(df.head()) # Peek-a-boo! 🙈

Navigating URL pitfalls

Stumbling blocks might emerge if your URL leads to a webpage or a redirection and not the actual CSV file itself. If your CSV file is a GitHub resident, ensure your URL indicates its raw version:

# GitHub is raw-some!
df = pd.read_csv('https://raw.github.com/user/project/branch/folder/data.csv')

Compatibility: For Python 2.x users, switch io.StringIO to StringIO.StringIO.

Managing diverse data sources

Fear not non-HTTP URL protocols like FTP, S3, and file systems as pd.read_csv vouches for their legitimacy:

# PANDAS: Protocols Are No Darn Annoying Stuff
df = pd.read_csv('s3://bucket-name/data.csv') # an Amazon S3 example

Ensure to handle required libraries or permissions. No jumping through hoops, hopefully 🙏.

Verifying your URL

Before your DataFrame can feast on the CSV file, ascertain that the URL is valid and accessible. HTTP status checks could be your saving grace from a blindfolded fetch attempt.

explain-codes / Python / Pandas read_csv from url

Linked

Create Pandas DataFrame from a string



How to skip the headers when processing a csv file using Python?



Python import csv to list



How to add header row to a pandas DataFrame



How to send a "multipart/form-data" with requests in Python?

