Advertisement

Python学习(爬取信息1)

阅读量:

0. 前言

本部分为爬虫自学教程

1. 代码部分

1、爬取信息(学习)

复制代码
    import requests(模块)
    import bs4(未安装模块)
    res=requests.get("https://movie.douban.com/top250")
    soup=bs4.BeautifulSoup(res.text,"html.parser")
    targets=soup.find_all("div",class_="hd")
    for each in targets:
    print(each.a.span.text)
    
    
      
      
      
      
      
      
      
    
复制代码
    结果未出来
    
    
      
    

2、参考网站 https://ilovefishc.com/dvd/

3、学习爬取豆瓣电影信息

复制代码
    import requests
    from bs4 import BeautifulSoup
    
    for i in range (0,10):
    url = "https://movie.douban.com/top250?start="+(str(i*25))
    #获取网页
    response = requests.get(url)
    #解析网页
    soup = BeautifulSoup(response.text,"html.parser")
    movie_list = soup.find_all(name='div',attrs={'class':'info'})
    #print(movie_list)
    print("\n"+str(i+1)+" 页:\n")
    #遍历网页信息
    for movie_information in movie_list:
        m_name = movie_information.find(name = 'span',class_ = 'title').text
        m_score = movie_information.find(name = 'span',class_='rating_num').text
        print(m_name+"            "+m_score)
    
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    

全部评论 (0)

还没有任何评论哟~