Mod Archive Forums Mod Archive Forums
Advanced search  

News:

Please note: Your main modarchive.org account will not work here, you must create a forum account to post on the forums.

Pages: [1]   Go Down

Author Topic: python script for scrapping mod files from site to m3u  (Read 1985 times)

0 Members and 1 Guest are viewing this topic.

olexander

  • New User
  • Offline Offline
  • Posts: 6
    • View Profile
python script for scrapping mod files from site to m3u
« on: August 07, 2024, 22:32:53 »

Hello, apologize if this is incorrect action here :)
I am with help of google search and google gemini try make python script for scrapping mod files from site to m3u(direct url links for mod files) to listen directly at media players.
Here example for "Counrty" genre
Code: [Select]
base_url = 'https://modarchive.org/index.php?query=18&request=search&search_type=genre&'Here two pages - so
Code: [Select]
num_pages = 2make changes and launch(i am at linux)
Code: [Select]
./<namescript>.py > country.m3uwait, ok, add country.m3u to your madia player, bit wait for loading, try listen :)

Code: [Select]
#!/usr/bin/env python3

import requests
from bs4 import BeautifulSoup



def scrape_urls(base_url, num_pages):
  all_urls = []
  for page in range(1, num_pages + 1):
    url = f"{base_url}page={page}#mods"  # Adjust URL pattern as needed
    response = requests.get(url)
    soup = BeautifulSoup(response.text, 'html.parser')

    links = soup.find_all('a')
    for link in links:
      href = link.get('href')
      if href and href.startswith('https:'):

        all_urls.append(href)

  return all_urls

# Example usage
# chillout
# base_url = 'https://modarchive.org/index.php?query=106&request=search&search_type=genre&'
# minimal
# base_url = 'https://modarchive.org/index.php?query=101&request=search&search_type=genre&'
# country
base_url = 'https://modarchive.org/index.php?query=18&request=search&search_type=genre&'
num_pages = 2
all_urls = scrape_urls(base_url, num_pages)
for url in all_urls:
    print(url)
Logged
Pages: [1]   Go Up