Python/web_programming/fetch_jobs.py

"""
Scraping jobs given job title and location from indeed website
"""

from __future__ import annotations

from collections.abc import Generator

import requests
from bs4 import BeautifulSoup

url = "https://www.indeed.co.in/jobs?q=mobile+app+development&l="


def fetch_jobs(location: str = "mumbai") -> Generator[tuple[str, str], None, None]:
    soup = BeautifulSoup(
        requests.get(url + location, timeout=10).content, "html.parser"
    )
    # This attribute finds out all the specifics listed in a job
    for job in soup.find_all("div", attrs={"data-tn-component": "organicJob"}):
        job_title = job.find("a", attrs={"data-tn-element": "jobTitle"}).text.strip()
        company_name = job.find("span", {"class": "company"}).text.strip()
        yield job_title, company_name


if __name__ == "__main__":
    for i, job in enumerate(fetch_jobs("Bangalore"), 1):
        print(f"Job {i:>2} is {job[0]} at {job[1]}")
Job fetching (#2219) * Adding job scarping algorithm to web programming * Delete fetch_jobs.py * Adding Jobs Scraping to web programming * Add Python type hints Co-authored-by: Christian Clauss <cclauss@me.com> 2020-08-21 21:58:26 +00:00			`"""`
			`Scraping jobs given job title and location from indeed website`
			`"""`
[pre-commit.ci] pre-commit autoupdate (#11322) * [pre-commit.ci] pre-commit autoupdate updates: - [github.com/astral-sh/ruff-pre-commit: v0.2.2 → v0.3.2](https://github.com/astral-sh/ruff-pre-commit/compare/v0.2.2...v0.3.2) - [github.com/pre-commit/mirrors-mypy: v1.8.0 → v1.9.0](https://github.com/pre-commit/mirrors-mypy/compare/v1.8.0...v1.9.0) * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> 2024-03-13 06:52:41 +00:00
from __future__ import annotations (#2464) * from __future__ import annotations * fixup! from __future__ import annotations * fixup! from __future__ import annotations * fixup! Format Python code with psf/black push Co-authored-by: github-actions <${GITHUB_ACTOR}@users.noreply.github.com> 2020-09-23 11:30:13 +00:00			`from __future__ import annotations`

pre-commit autoupdate: pyupgrade v2.34.0 -> v2.37.0 (#6245) * pre-commit autoupdate: pyupgrade v2.34.0 -> v2.37.0 * pre-commit run --all-files 2022-07-11 08:19:52 +00:00			`from collections.abc import Generator`
Job fetching (#2219) * Adding job scarping algorithm to web programming * Delete fetch_jobs.py * Adding Jobs Scraping to web programming * Add Python type hints Co-authored-by: Christian Clauss <cclauss@me.com> 2020-08-21 21:58:26 +00:00
			`import requests`
			`from bs4 import BeautifulSoup`

			`url = "https://www.indeed.co.in/jobs?q=mobile+app+development&l="`


from __future__ import annotations (#2464) * from __future__ import annotations * fixup! from __future__ import annotations * fixup! from __future__ import annotations * fixup! Format Python code with psf/black push Co-authored-by: github-actions <${GITHUB_ACTOR}@users.noreply.github.com> 2020-09-23 11:30:13 +00:00			`def fetch_jobs(location: str = "mumbai") -> Generator[tuple[str, str], None, None]:`
Enable ruff S113 rule (#11375) * Enable ruff S113 rule * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> 2024-04-21 17:34:18 +00:00			`soup = BeautifulSoup(`
			`requests.get(url + location, timeout=10).content, "html.parser"`
			`)`
Job fetching (#2219) * Adding job scarping algorithm to web programming * Delete fetch_jobs.py * Adding Jobs Scraping to web programming * Add Python type hints Co-authored-by: Christian Clauss <cclauss@me.com> 2020-08-21 21:58:26 +00:00			`# This attribute finds out all the specifics listed in a job`
			`for job in soup.find_all("div", attrs={"data-tn-component": "organicJob"}):`
			`job_title = job.find("a", attrs={"data-tn-element": "jobTitle"}).text.strip()`
			`company_name = job.find("span", {"class": "company"}).text.strip()`
			`yield job_title, company_name`


			`if __name__ == "__main__":`
			`for i, job in enumerate(fetch_jobs("Bangalore"), 1):`
			`print(f"Job {i:>2} is {job[0]} at {job[1]}")`