False positives in alembic revisions #1237

diego-pm · 2025-02-19T16:13:48Z

The linter gives a false positive in version numbers from alembic revision.

For example, the filename daa52d289898_description.py is corrected to data52d289898_description.py
and the revision numbers included in the file also give a false positive:

"""Description...

Revision ID: daa52d289898  # `daa` should be `data`
Revises: d1a839b16d12
Create Date: 2025-02-18 12:10:22.253105
"""
...
# revision identifiers, used by Alembic.
revision = "daa52d289898"  # `daa` should be `data`
down_revision = "d1a839b16d12"
branch_labels = None
depends_on = None
...

The text was updated successfully, but these errors were encountered:

epage · 2025-02-19T16:25:58Z

Note that the template you filled out was specifically geared towards english words and does not make sense with other forms of identifiers.

It would also be good to define what an alembic revision as its not something I've ever come across.

If this is just another form of sha, we have #415 and #484.

diego-pm · 2025-02-19T16:36:48Z

It seems that the issues you mentioned could also solve this, but this case has more information, so it might be easier to identify that the revision is a false positive, as the migration scripts follow a certain structure.

Alembic is a tool, used along sqlalchemy, to manage database schema migrations. Basically, an alembic revision is a python script that handles the upgrade/downgrade operations of a database schema migration.

The revision number or revision id is like a unique identifier of the database schema version. I think it is like an UUID or maybe some hash, not sure at all.

epage · 2025-02-19T17:06:24Z

this case has more information,

Could you define that format?

Alembic is a tool, used along sqlalchemy, to manage database schema migrations. Basically, an alembic revision is a python script that handles the upgrade/downgrade operations of a database schema migration.

The question I then have is whether this is something general enough to include built-in support for or if this should be left to user config via extend-ignore-identifiers-re or extend-ignore-re

diego-pm · 2025-02-20T08:47:24Z

The example code I show is the structure of alembic migration scripts. Also, the scripts always have 2 functions: upgrade and upgrade. And the filename always follows the 12 digit identifiers with some description.

Here is an example provided by the documentation https://alembic.sqlalchemy.org/en/latest/tutorial.html

"""Add a column

Revision ID: ae1027a6acf
Revises: 1975ea83b712
Create Date: 2011-11-08 12:37:36.714947

"""

# revision identifiers, used by Alembic.
revision = 'ae1027a6acf'
down_revision = '1975ea83b712'

from alembic import op
import sqlalchemy as sa

def upgrade():
    op.add_column('account', sa.Column('last_transaction_date', sa.DateTime))

def downgrade():
    op.drop_column('account', 'last_transaction_date')

But it is true that maybe this is too specific, and should be left to the user to exclude these patterns. The PR to detect hashes and UUIDs might be more general.

epage · 2025-02-20T14:26:00Z

The example code I show is the structure of alembic migration scripts. Also, the scripts always have 2 functions: upgrade and upgrade. And the filename always follows the 12 digit identifiers with some description.

I was less asking about the file format as we won't deal with that but the format of each of the non-words being treated as words.

For example, you mentioned the filename has a 12 digit identifier. Is it always data<hash>_*.py?

The PR to detect hashes and UUIDs might be more general.

We do have those already but we have at least #415. Our word splitting would not catch data<hash>. #484 could help.

diego-pm · 2025-02-20T14:39:14Z

The format of the non-words is the base 16 number (hash), there is no data (in general) in the non-word. The file name is generally <hash>_*.py.

epage · 2025-02-20T14:49:25Z

Sounds like this is then a duplicate of #415 and closing in favor of that.

diego-pm added A-dict S-triage Status: New; needs maintainer attention. labels Feb 19, 2025

epage added A-exclude Area: automatic and user-controlled exclusions and removed A-dict labels Feb 19, 2025

epage added the C-bug Category: bug label Feb 19, 2025

epage mentioned this issue Feb 19, 2025

Feature required: ignore Jupyter notebook cell output #1234

Open

epage closed this as completed Feb 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

False positives in alembic revisions #1237

False positives in alembic revisions #1237

diego-pm commented Feb 19, 2025 •

edited by epage

Loading

epage commented Feb 19, 2025

diego-pm commented Feb 19, 2025

epage commented Feb 19, 2025

diego-pm commented Feb 20, 2025 •

edited

Loading

epage commented Feb 20, 2025

diego-pm commented Feb 20, 2025 •

edited

Loading

epage commented Feb 20, 2025

False positives in alembic revisions #1237

False positives in alembic revisions #1237

Comments

diego-pm commented Feb 19, 2025 • edited by epage Loading

epage commented Feb 19, 2025

diego-pm commented Feb 19, 2025

epage commented Feb 19, 2025

diego-pm commented Feb 20, 2025 • edited Loading

epage commented Feb 20, 2025

diego-pm commented Feb 20, 2025 • edited Loading

epage commented Feb 20, 2025

diego-pm commented Feb 19, 2025 •

edited by epage

Loading

diego-pm commented Feb 20, 2025 •

edited

Loading

diego-pm commented Feb 20, 2025 •

edited

Loading