Python Language => Urllib

HTTP GET

Python 2.x 2.7

Python 2

import urllib
response = urllib.urlopen('http://stackoverflow.com/documentation/')

Bei Verwendung von urllib.urlopen() wird ein urllib.urlopen() , das ähnlich wie eine Datei behandelt werden kann.

print response.code
# Prints: 200

Der response.code repräsentiert den http-Rückgabewert. 200 ist in Ordnung, 404 ist nicht gefunden usw.

print response.read()
'<!DOCTYPE html>\r\n<html>\r\n<head>\r\n\r\n<title>Documentation - Stack. etc'

response.read() und response.readlines() können verwendet werden, um die aus der Anforderung zurückgegebene tatsächliche HTML-Datei zu lesen. Diese Methoden funktionieren ähnlich wie file.read*

Python 3.x 3.0

Python 3

import urllib.request

print(urllib.request.urlopen("http://stackoverflow.com/documentation/"))
# Prints: <http.client.HTTPResponse at 0x7f37a97e3b00>

response = urllib.request.urlopen("http://stackoverflow.com/documentation/")

print(response.code)
# Prints: 200
print(response.read())
# Prints: b'<!DOCTYPE html>\r\n<html>\r\n<head>\r\n\r\n<title>Documentation - Stack Overflow</title>

Das Modul wurde für Python 3.x aktualisiert, die Anwendungsfälle bleiben jedoch im Wesentlichen gleich. urllib.request.urlopen gibt ein ähnliches dateiähnliches Objekt zurück.

HTTP POST

Um POST-Daten zu übergeben, übergeben Sie die codierten Abfrageargumente als Daten an urlopen ()

Python 2.x 2.7

Python 2

import urllib
query_parms = {'username':'stackoverflow', 'password':'me.me'}
encoded_parms = urllib.urlencode(query_parms)
response = urllib.urlopen("https://stackoverflow.com/users/login", encoded_parms)
response.code
# Output: 200
response.read()
# Output: '<!DOCTYPE html>\r\n<html>\r\n<head>\r\n\r\n<title>Log In - Stack Overflow'

Python 3.x 3.0

Python 3

import urllib
query_parms = {'username':'stackoverflow', 'password':'me.me'}
encoded_parms = urllib.parse.urlencode(query_parms).encode('utf-8')
response = urllib.request.urlopen("https://stackoverflow.com/users/login", encoded_parms)
response.code
# Output: 200
response.read()
# Output: b'<!DOCTYPE html>\r\n<html>....etc'

Empfangene Bytes nach Inhaltstypen codieren

Die empfangenen Bytes müssen mit der korrekten Zeichencodierung dekodiert werden, um als Text interpretiert zu werden:

Python 3.x 3.0

import urllib.request

response = urllib.request.urlopen("http://stackoverflow.com/")
data = response.read()

encoding = response.info().get_content_charset()
html = data.decode(encoding)

Python 2.x 2.7

import urllib2
response = urllib2.urlopen("http://stackoverflow.com/")
data = response.read()

encoding = response.info().getencoding()
html = data.decode(encoding)

Modified text is an extract of the original Stack Overflow Documentation

Lizenziert unter CC BY-SA 3.0

Nicht angeschlossen an Stack Overflow

Python Language
Urllib

Suche…

HTTP GET

Python 2

Python 3

HTTP POST

Python 2

Python 3

Empfangene Bytes nach Inhaltstypen codieren