```python
"""
@Date 2021/3/29
"""
import time

from lxml import etree
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.support.ui import WebDriverWait


class LagouSpider(object):
    def __init__(self):
        self.url = "https://www.lagou.com/jobs/list_python?labelWords=&fromSearch=true&suginput="
        self.driver = webdriver.Chrome()

    def run(self):
        self.driver.get(self.url)
        # Dismiss the pop-up dialog shown when the page loads
        self.driver.find_element(By.CLASS_NAME, 'body-btn').click()
        while True:
            # Get the source of the current list page
            source = self.driver.page_source
            self.parse_list_page(source)
            time.sleep(1)
            # Fuzzy match with contains(): on the last page the class
            # becomes "pager_next pager_next_disabled"
            next_btn = WebDriverWait(driver=self.driver, timeout=10).until(
                EC.presence_of_element_located((By.XPATH, "//span[contains(@class, 'pager_next')]"))
            )
            # Stop on the last page; otherwise click through to the next page
            if "pager_next pager_next_disabled" in next_btn.get_attribute('class'):
                break
            else:
                next_btn.click()

    def parse_list_page(self, source):
        html = etree.HTML(source)
        links = html.xpath("//a[@class='position_link']/@href")
        for link in links:
            self.requests_detail_page(link)
            time.sleep(1)

    def requests_detail_page(self, link):
        # Load the detail page in a new tab so the list page keeps its
        # pagination state and the "next page" button can still be found
        self.driver.execute_script("window.open()")
        self.driver.switch_to.window(self.driver.window_handles[-1])
        self.driver.get(link)
        source = self.driver.page_source
        self.parse_detail_page(source)
        # Close the detail tab and switch back to the list page
        self.driver.close()
        self.driver.switch_to.window(self.driver.window_handles[0])

    def parse_detail_page(self, source):
        html = etree.HTML(source)
        # data = []
        # job = {}
        names = html.xpath("//div[@class='job-name']/h1/text()")
        # Guard against pages without a job title (e.g. a login wall)
        job_name = names[0] if names else None
        job_detail = html.xpath("//div[@class='job-detail']/p/text()")
        print(job_name, job_detail)
        # [{python,xxxx}]
        # job['job_name'] = job_name


if __name__ == '__main__':
    lg = LagouSpider()
    lg.run()
```
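
The commented-out `data` and `job` lines in `parse_detail_page` hint at collecting each posting as a dict in a list instead of printing it. A minimal sketch of that idea follows. The helper name `build_job_record`, the `job_detail` key, and the sample strings are assumptions for illustration, not part of the original spider.

```python
def build_job_record(job_name, job_detail):
    """Turn the raw XPath results for one posting into a dict (hypothetical helper)."""
    # job_detail is a list of text fragments; drop blank ones and join the rest
    lines = [text.strip() for text in job_detail if text.strip()]
    return {
        "job_name": job_name.strip(),
        "job_detail": "\n".join(lines),  # key name is an assumption
    }


data = []  # one dict per job posting
# Placeholder values standing in for what the XPath queries would return
data.append(build_job_record(" Python Developer ", ["  Build crawlers ", "\n", "Maintain pipelines"]))
print(data)
```

Inside the spider, `parse_detail_page` could append the result of such a helper to a list kept on `self`, which would make it easy to write everything to a file once `run` finishes.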