web scraping - how to use python requests to login to website -

- March 15, 2013

im trying login , scrape job site , send me notification when ever key words found.i think have correctly traced xpath value of feild "login[iovation]" cannot extract value, here have done far login

import requests lxml import html header = {"user-agent":"mozilla/4.0 (compatible; msie 5.5;windows nt)"} login_url = 'https://www.upwork.com/ab/account-security/login' session_requests = requests.session() #get csrf result = session_requests.get(login_url) tree=html.fromstring(result.text) auth_token = list(set(tree.xpath('//*[@name="login[_token]"]/@value'))) auth_iovat = list(set(tree.xpath('//*[@name="login[iovation]"]/@value'))) # create payload payload = {     "login[username]": "myemail@gmail.com",      "login[password]": "pa$$w0rd",      "login[_token]": auth_token,         "login[iovation]": auth_iovation,          "login[redir]": "/home"  }  #perform login scrapeurl='https://www.upwork.com/ab/find-work/' result=session_requests.post(login_url, data = payload, headers = dict(referer = login_url)) #test result print result.text

this screen shot of form data when login

this because upworks uses called iovation (https://www.iovation.com/) reduce fraud. iovation uses digital fingerprint of device/browser, sent via login[iovation] parameter.

if @ javascripts loaded on site, find 2 javascript being loaded iesnare.com domain. domain , many others owned iovaiton drop third party javascript identify device/browser.

i think if copy string successful login , send on along http headers including browser agent in python code, should okie.

Search This Blog

Swift

web scraping - how to use python requests to login to website -

Comments

Post a Comment

Popular posts from this blog

asp.net - How to correctly use QUERY_STRING in ISAPI rewrite? -

jsf - "PropertyNotWritableException: Illegal Syntax for Set Operation" error when setting value in bean -

arrays - Algorithm to find ideal starting spot in a circle -