fix: add .venv to .gitignore to exclude virtual environment files

feat: enhance README with support for Last.fm and MusicBrainz, add unrated-only mode and clarify provider scoring
feat: add support for multiple popularity providers and unrated-only mode in track processing
2026-06-07 12:24:26 -04:00 · 2026-05-30 17:43:52 -04:00 · 2026-05-30 17:43:52 -04:00 · 2026-05-30 17:43:51 -04:00 · 2026-05-30 17:43:51 -04:00 · 2025-06-05 09:04:56 +12:00
9 changed files with 764 additions and 319 deletions
@@ -3,5 +3,4 @@ NAV_USER=your_navidrome_username
 NAV_PASS=your_navidrome_password
 SPOTIFY_CLIENT_ID=your_spotify_client_id
 SPOTIFY_CLIENT_SECRET=your_spotify_client_secret
-WEB_API_KEY=changeme
+LASTFM_API_KEY=your_lastfm_api_key
 ENABLE_WEB_API_KEY=True
@@ -1,4 +1,5 @@
-data/
+logs/
 __pycache__/
 .venv/
 .env
 docker-compose.yml
 __pycache__
@@ -10,8 +10,5 @@ COPY . .
 # Install any needed packages specified in requirements.txt
 RUN pip install --no-cache-dir -r requirements.txt
-# Expose port 5000 for the Flask app
+# Use an entrypoint script
-EXPOSE 5000
+ENTRYPOINT ["python", "./sptnr.py"]
 # Set the entrypoint to Python
 ENTRYPOINT ["python", "sptnr.py"]
@@ -15,7 +15,7 @@ This script was developed as a solution to repurpose the star ratings in Navidro
 7. [Examples](#examples)
 8. [Resuming Interrupted Sessions](#resuming-interrupted-sessions)
 9. [Managing Docker Containers](#managing-docker-containers)
-10. [Mapping Spotify Popularity to Navidrome Ratings](#mapping-spotify-popularity-to-navidrome-ratings)
+10. [Mapping Spotify/Lastfm/MusicBrainz Popularity to Navidrome Ratings](#mapping-provider-scores-to-navidrome-ratings)
 11. [Estimated Processing Times](#estimated-processing-times)
 12. [Importance of Accurate Metadata for Track Lookup](#importance-of-accurate-metadata-for-track-lookup)
 13. [Logs](#logs)
@@ -23,8 +23,11 @@ This script was developed as a solution to repurpose the star ratings in Navidro
 ## Features
 - **Spotify Integration**: Connects to Spotify's API to fetch popularity data for tracks.
- **Navidrome Integration**: Updates track ratings in Navidrome based on Spotify popularity.
+- **Last.fm Support**: Can use Last.fm artist top-track position, listeners, and playcounts as an alternate popularity source. (This is a great option from spotify, will pull ratings on most songs.)
 - **MusicBrainz Support**: Can use MusicBrainz community ratings as a third rating source. (Currently not a lot of ratings available, but worth having as an option for the future.)
 - **Navidrome Integration**: Updates track ratings in Navidrome based on provider scores.
 - **Flexible Processing**: Process specific artists, albums, or a range of artists or albums.
 - **Unrated-Only Mode**: Skip tracks that already have a rating and only update unrated songs.
 - **Preview Mode**: Run the script in preview mode to see changes without making any actual updates.
 - **Logging**: Detailed logging of the process, both in the console and to a file.
 - **Docker Support**: Run the script in a Docker container for consistent environments and ease of use.
@@ -32,8 +35,9 @@ This script was developed as a solution to repurpose the star ratings in Navidro
 ## Requirements
 - Python 3.x or Docker
 - A Spotify Developer account with an API key ([Get your Spotify API key here](https://developer.spotify.com/dashboard/create))
 - Access to a Navidrome server
 - Optional: A Spotify Developer account with an API key ([Get your Spotify API key here](https://developer.spotify.com/dashboard/create))
 - Optional: a Last.fm API key if you want to use Last.fm as the popularity provider
 **Compatibility Note**: While this script was built with Navidrome in mind, it should theoretically work on any Subsonic server. If you successfully use it with other Subsonic servers, please open an issue to let me know, so I can document it and assist others.
@@ -51,6 +55,8 @@ docker run -t \
  krestaino/sptnr:latest
 ```
 If you want to use Last.fm instead of Spotify, add `-e LASTFM_API_KEY=your_lastfm_api_key` and run with `--provider lastfm`.
 **Note**: The `-t` flag is used to allocate a pseudo-terminal which assists in displaying colored and bold text in the terminal output, which this script uses.
 ### Using Docker Compose
@@ -138,6 +144,9 @@ The script supports various options for flexible usage. Below are examples of ho
 - `-b, --album ALBUM_ID`: Process a specific album. Multiple albums can be specified.
 - `-s, --start START_INDEX`: Start processing from the artist at the specified index (0-based).
 - `-l, --limit LIMIT`: Limit the processing to a specific number of artists from the start index.
 - `-d, --lock-duration DURATION`: Number of days to lock song updates (0 to force update every time).
 - `--provider spotify|lastfm|musicbrainz`: Choose the source used to derive the rating. Spotify is the default.
 - `--unrated-only`: Only update songs that do not already have a Navidrome rating.
 ### Command Formats
@@ -187,6 +196,32 @@ The script supports various options for flexible usage. Below are examples of ho
  - Docker Compose: `docker-compose run sptnr -s 10 -l 5`
  - Docker Run: `docker run -t [env vars] krestaino/sptnr:latest -s 10 -l 5`
 - **Only Update Unrated Songs**:
   Skip tracks that already have a rating in Navidrome.
   - Python: `python sptnr.py --unrated-only`
   - Docker Compose: `docker-compose run sptnr --unrated-only`
   - Docker Run: `docker run -t [env vars] krestaino/sptnr:latest --unrated-only`
 - **Use Last.fm as the Provider**:
   Use Last.fm artist top-track position, listener, and playcount data instead of Spotify popularity.
   - Python: `LASTFM_API_KEY=... python sptnr.py --provider lastfm`
   - Docker Compose: `docker-compose run -e LASTFM_API_KEY=... sptnr --provider lastfm`
   - Docker Run: `docker run -t -e LASTFM_API_KEY=... [env vars] krestaino/sptnr:latest --provider lastfm`
   Last.fm does not provide a Spotify-style popularity number, so the script derives one from three signals:
   - **Artist top-track position**: where the track appears in Last.fm's `artist.getTopTracks` results for that artist. This is the largest part of the score.
   - **Listener reach**: the track's unique Last.fm listener count, scaled logarithmically.
   - **Replay engagement**: plays per listener, used as a small capped bonus.
   This keeps ordinary catalog tracks mostly in the 2-3 star range while letting occasional artist standouts reach 4-5 stars.
 - **Use MusicBrainz as the Provider**:
   Use MusicBrainz ratings to derive the same 0-5 Navidrome rating.
   - Python: `python sptnr.py --provider musicbrainz`
   - Docker Compose: `docker-compose run sptnr --provider musicbrainz`
   - Docker Run: `docker run -t [env vars] krestaino/sptnr:latest --provider musicbrainz`
 ## Resuming Interrupted Sessions
 In cases where your session gets interrupted - for instance, if your machine goes to sleep, you encounter rate limits from Spotify, or for any other reason that causes the script to not complete - you have the option to resume from where you left off.
@@ -213,9 +248,18 @@ In this project, `docker-compose run` is used instead of `docker-compose up`. Th
 docker container prune
 ```
-## Mapping Spotify Popularity to Navidrome Ratings
+## Mapping Provider Scores to Navidrome Ratings
-The script translates Spotify's popularity metric, which ranges from 0 to 100, into Navidrome's 5-star rating system. This conversion allows you to quickly gauge a track's popularity on Spotify directly within Navidrome. The mapping is as follows:
+The script translates provider scores into Navidrome's 5-star rating system. Each provider gets its score a little differently:
 - **Spotify**: Uses Spotify's native track `popularity` value, which is already a 0 to 100 score.
 - **Last.fm**: Derives a 0 to 100 score from three signals:
  - artist top-track position from Last.fm's `artist.getTopTracks` results
  - listener reach from the track's unique listener count
  - replay engagement from plays per listener
 - **MusicBrainz**: Reads the community recording rating from MusicBrainz's recording lookup endpoint. MusicBrainz ratings are already on a 0 to 5 scale, so they are rounded directly to Navidrome's 0 to 5 rating range instead of using the 0 to 100 mapping below.
 For Spotify and Last.fm, the resulting 0 to 100 score is mapped as follows:
 - **0 to 16**: Mapped to 0 stars in Navidrome (Not popular)
 - **17 to 33**: Mapped to 1 star (Low popularity)
@@ -240,16 +284,16 @@ These estimates are based on the script's performance with my library of 6,481 t
 ## Importance of Accurate Metadata for Track Lookup
-The effectiveness of this script heavily relies on the accuracy of the artist, album, and track titles in your music library. For Spotify to successfully recognize and match songs, these metadata details need to be precise.
+The effectiveness of this script heavily relies on the accuracy of the artist, album, and track titles in your music library. For the supported providers to successfully recognize and match songs, these metadata details need to be precise.
-I personally recommend using **MusicBrainz** to tag your music library. MusicBrainz is a comprehensive music database that provides reliable and standardized music metadata, which significantly enhances the accuracy of track matching with Spotify.
+I personally recommend using **MusicBrainz** to tag your music library. MusicBrainz is a comprehensive music database that provides reliable and standardized music metadata, which significantly enhances the accuracy of track matching with any of the supported providers.
 However, it's important to acknowledge that even with a perfectly tagged MusicBrainz library, discrepancies can still occur between Spotify and MusicBrainz data. This may result in the script missing some songs during the matching process.
 To give you an idea of the matching accuracy you can expect, here are some statistics from my own library, which is tagged using MusicBrainz:
 - **Total Tracks**: 6,481
- **Tracks Found on Spotify**: 6,390
+- **Tracks Matched**: 6,390
 - **Tracks Not Found**: 91
 - **Match Percentage**: 98.6%
@@ -261,9 +305,9 @@ Logs are stored in the `logs` directory, and each script execution creates a new
 The script logs its actions in a straightforward format, using `p:49 → r:2` to summarize operations:
- `p:49` indicates the Spotify popularity score, where `49` is the specific score.
+- `p:49` indicates the provider score, where `49` is the specific score.
 - `→` symbolizes the mapping performed by the script.
- `r:2` shows the Navidrome rating assigned based on the Spotify score.
+- `r:2` shows the Navidrome rating assigned based on the provider score.
 ### Terminal Output Colors
@@ -1 +1 @@
-2.0.0-alpha
+1.3.0
@@ -9,21 +9,11 @@ services:
    # Uncomment the next line to build the Docker image locally
    # build: .
    # Uncomment the next line to run the script
    # entrypoint: ["python", "sptnr.py"]
    # Uncomment the next line to start the web server
    # entrypoint: ["gunicorn", "-b", "0.0.0.0:5000", "server:sptnr"]
    environment:
      - NAV_BASE_URL=your_navidrome_server_url
      - NAV_USER=your_navidrome_username
      - NAV_PASS=your_navidrome_password
      - SPOTIFY_CLIENT_ID=your_spotify_client_id
      - SPOTIFY_CLIENT_SECRET=your_spotify_client_secret
      - WEB_API_KEY=changeme
      - ENABLE_WEB_API_KEY=False
    volumes:
-      - ./data:/usr/src/app/data
+      - ./logs:/usr/src/app/logs
    ports:
      - "3333:5000"
@@ -1,7 +1,4 @@
 requests==2.31.0
 python-dotenv==1.0.0
 colorama==0.4.6
-tqdm==4.66.1
+tqdm==4.66.1
 gunicorn==20.1.0
 Flask==2.1.2
 Werkzeug==2.0.3
@@ -1,133 +0,0 @@
 from flask import Flask, request, jsonify
 import subprocess
 import threading
 import os
 from dotenv import load_dotenv
 import functools
 import re
 import datetime
 load_dotenv()
 sptnr = Flask(__name__)
 WEB_API_KEY = os.getenv("WEB_API_KEY")
 ENABLE_WEB_API_KEY = os.getenv("ENABLE_WEB_API_KEY", "True") == "True"
 LOG_DIR = "data/logs"
 def api_key_required(f):
    @functools.wraps(f)
    def decorated_function(*args, **kwargs):
        if ENABLE_WEB_API_KEY and request.args.get("api_key") != WEB_API_KEY:
            return jsonify({"error": "Unauthorized"}), 401
        return f(*args, **kwargs)
    return decorated_function
 def run_script(cmd):
    subprocess.run(cmd)
@sptnr.route("/process", methods=["GET", "POST"])
@api_key_required
 def process_request():
    cmd = ["python3", "sptnr.py"]
    if request.args.get("preview", "") == "true":
        cmd.append("--preview")
    if request.args.get("force", "") == "true":
        cmd.append("--force")
    artist_ids = request.args.getlist("artist")
    for artist_id in artist_ids:
        cmd.extend(["--artist", artist_id])
    album_ids = request.args.getlist("album")
    for album_id in album_ids:
        cmd.extend(["--album", album_id])
    start = request.args.get("start")
    if start:
        cmd.extend(["--start", start])
    limit = request.args.get("limit")
    if limit:
        cmd.extend(["--limit", limit])
    thread = threading.Thread(target=run_script, args=(cmd,))
    thread.start()
    return jsonify({"message": "Processing started"})
@sptnr.route("/logs")
@api_key_required
 def list_logs():
    try:
        logs = os.listdir(LOG_DIR)
        logs = [log for log in logs if log.endswith(".log")]  # Filter log files
        full_paths = [os.path.join(LOG_DIR, log) for log in logs]
        logs_sorted = sorted(full_paths, key=os.path.getmtime, reverse=True)
        table_rows = []
        for log_path in logs_sorted:
            with open(log_path, "r") as file:
                last_line = file.readlines()[-1]
                tracks, found, not_found, match, time = parse_log_data(last_line)
            log_name = os.path.basename(log_path)
            timestamp = int(log_name.split("_")[1].split(".")[0])
            log_datetime = datetime.datetime.fromtimestamp(timestamp)
            formatted_datetime = log_datetime.strftime("%Y-%m-%d %H:%M:%S")
            table_rows.append(
                f"<tr><td><a href='/logs/{log_name}'>{log_name}</a></td><td>{formatted_datetime}</td><td>{tracks}</td><td>{found}</td><td>{not_found}</td><td>{match}</td><td>{time}</td></tr>"
            )
        table_html = f"<table border='1'><tr><th>Log File</th><th>Date & Time</th><th>Tracks</th><th>Found</th><th>Not Found</th><th>Match</th><th>Time</th></tr>{''.join(table_rows)}</table>"
        return table_html
    except Exception as e:
        return f"An error occurred: {e}", 500
 def parse_log_data(line):
    # Regular expressions to extract the required data
    tracks_pattern = r"Tracks: (\d+)"
    found_pattern = r"Found: (\d+)"
    not_found_pattern = r"Not Found: (\d+)"
    match_pattern = r"Match: ([\d.]+%)"
    time_pattern = r"Time: ([\ds]+)"
    # Extracting data using regular expressions
    tracks = re.search(tracks_pattern, line)
    found = re.search(found_pattern, line)
    not_found = re.search(not_found_pattern, line)
    match = re.search(match_pattern, line)
    time = re.search(time_pattern, line)
    # Getting the values or "N/A" if not found
    tracks = tracks.group(1) if tracks else "N/A"
    found = found.group(1) if found else "N/A"
    not_found = not_found.group(1) if not_found else "N/A"
    match = match.group(1) if match else "N/A"
    time = time.group(1) if time else "N/A"
    return tracks, found, not_found, match, time
@sptnr.route("/logs/<filename>")
@api_key_required
 def view_log(filename):
    try:
        full_path = os.path.join(LOG_DIR, filename)
        if not os.path.exists(full_path) or not os.path.isfile(full_path):
            return "File not found", 404
        with open(full_path, "r") as file:
            content = file.read()
        # Convert content to HTML-friendly format
        content = content.replace("\n", "<br>")
        return f"<pre>{content}</pre>"
    except Exception as e:
        return f"An error occurred: {e}", 500
 if __name__ == "__main__":
    sptnr.run(debug=False, host="0.0.0.0", port=3333)
Author	SHA1	Message	Date
sickprodigy	63ea596a9f	fix: add .venv to .gitignore to exclude virtual environment files	2026-06-07 12:24:26 -04:00
sickprodigy	b6fd70b05a	feat: enhance README with support for Last.fm and MusicBrainz, add unrated-only mode and clarify provider scoring	2026-05-30 17:43:52 -04:00
sickprodigy	4581c5ac20	feat: add support for multiple popularity providers and unrated-only mode in track processing	2026-05-30 17:43:52 -04:00
sickprodigy	fe0cc31b0d	fix: update .gitignore to include __pycache__ and logs directory	2026-05-30 17:43:51 -04:00
sickprodigy	23ba6bb4ee	fix: add LASTFM_API_KEY to environment example	2026-05-30 17:43:51 -04:00
EffakT	e3997167e3	fix: ZeroDivisionError when Total_Tracks is 0	2025-06-05 09:04:56 +12:00
EffakT	c1c20adbfb	Move lock file to logs dir to fix docker support	2025-06-02 10:47:04 +12:00
EffakT	26e22cd385	Switch lock file to atomic, to prevent corruption, More ratelimit delay to run on album instead of song. Don't remove song variations form parentheses (remix, instrumental, etc)	2025-06-01 09:03:27 +12:00
EffakT	c52a6eeb9e	handle corrupted lock files	2025-05-28 08:58:22 +12:00
EffakT	76e2bca65d	refactor: should be called lock, not cache, Adding lock jitter to prevent a lock stampede, unless duration is set to 0. Lock even if track is not found.	2025-05-28 08:48:07 +12:00
EffakT	173f893410	Add documentation for --cache-duration argument	2025-05-27 12:41:01 +12:00
EffakT	1896c500df	Add per-song update cache with configurable duration, and force-update option.	2025-05-27 09:15:17 +12:00
EffakT	ba0c066bf8	fix: output index is incorrect when start arg is supplied.	2025-05-27 09:00:34 +12:00
EffakT	4599a8c392	Implement token refreshing	2025-05-27 08:55:45 +12:00
EffakT	8aec18a580	Implement retry functionality, with a sleep to help with ratelimiting	2025-05-27 08:42:48 +12:00