Opened 16 years ago
Closed 16 years ago
#11107 closed (fixed)
ManyToManyFields defined with keyword 'through' should not be manipulated as auto-generated m2m tables.
Reported by: | Owned by: | Cliff Dyer | |
---|---|---|---|
Component: | Database layer (models, ORM) | Version: | dev |
Severity: | Keywords: | ||
Cc: | gabor@… | Triage Stage: | Ready for checkin |
Has patch: | yes | Needs documentation: | no |
Needs tests: | no | Patch needs improvement: | no |
Easy pickings: | no | UI/UX: | no |
Description
This ticket is a generalization of #10881. The problem is that ManyToManyField
s defined using the through
keyword are being handled twice by some of the sql generating subcommands of django-admin.py, once for the Model table that is passed to the through
keyword, and again when sql is generated for all many_to_manys on each field. The second pass is incorrect, as ManyToManyField(through=x)
relationships don't auto-create an intermediate table. If the intermediate table's primary key is manually specified, it can lead to errors that cause ./manage.py flush and ./manage.py syncdb to fail. See comments on #10881, for examples.
Attachments (6)
Change History (24)
by , 16 years ago
Attachment: | many_to_many.rel.through.diff added |
---|
comment:1 by , 16 years ago
Has patch: | set |
---|---|
Needs tests: | set |
comment:2 by , 16 years ago
I fail to verify the issue. I used the following models:
from django.db import models class Member(models.Model): name = models.CharField(max_length=30) class Venue(models.Model): # [... snip fields ...] member_set = models.ManyToManyField('Member', through='VenueMember', blank=True) class VenueMember(models.Model): id = models.AutoField(db_column='venue_member_id', primary_key=True) venue = models.ForeignKey(Venue) member = models.ForeignKey(Member)
when I do manage.py sql testapp, I get the following result for postgresql 8.3, psycopg2:
BEGIN; CREATE TABLE "testapp_member" ( "id" serial NOT NULL PRIMARY KEY, "name" varchar(30) NOT NULL ) ; CREATE TABLE "testapp_venue" ( "id" serial NOT NULL PRIMARY KEY ) ; CREATE TABLE "testapp_venuemember" ( "venue_member_id" serial NOT NULL PRIMARY KEY, "venue_id" integer NOT NULL REFERENCES "testapp_venue" ("id") DEFERRABLE INITIALLY DEFERRED, "member_id" integer NOT NULL REFERENCES "testapp_member" ("id") DEFERRABLE INITIALLY DEFERRED ) ; COMMIT;
I'm using current trunk (rev. 10756) with python 2.4.
So everything looks fine to me. Can you give me a step-by-step instruction on how to reproduce the bug?
comment:3 by , 16 years ago
Yes. The tables get created properly, the errors show up when running loaddata (which gets run by ./manage.py flush or ./manage.py syncdb, if you have an initial_data fixture). I'm attaching a minimal fixture which goes with the models you are using. To reproduce:
1) Install this fixture as
yourapp/fixtures/initial_data.json
then do one of the following
2) Run./manage.py loaddata initial_data
or one of:
2a) Run
./manage.py syncdb yourapp
(Calls the same code as 2a)
2b) Run./manage.py flush yourapp
(Calls the same code as 2a)
This (No. 2 above) yields the following output:
Installing json fixture 'initial_data' from '/cgi/django/apps/cdla_apps/venues_minimal/fixtures'. Traceback (most recent call last): File "./manage.py", line 11, in <module> execute_manager(settings) File "/usr/home/jcdyer/src/django-trunk/trunk/django/core/management/__init__.py", line 362, in execute_manager utility.execute() File "/usr/home/jcdyer/src/django-trunk/trunk/django/core/management/__init__.py", line 303, in execute self.fetch_command(subcommand).run_from_argv(self.argv) File "/usr/home/jcdyer/src/django-trunk/trunk/django/core/management/base.py", line 195, in run_from_argv self.execute(*args, **options.__dict__) File "/usr/home/jcdyer/src/django-trunk/trunk/django/core/management/base.py", line 222, in execute output = self.handle(*args, **options) File "/usr/home/jcdyer/src/django-trunk/trunk/django/core/management/commands/loaddata.py", line 196, in handle cursor.execute(line) File "/usr/home/jcdyer/src/django-trunk/trunk/django/db/backends/util.py", line 19, in execute return self.cursor.execute(sql, params) psycopg2.ProgrammingError: column "id" does not exist LINE 1: ...venues_minimal_venuemember_id_seq"', coalesce(max("id"), 1),...
On closer inspection, the other hunk of the patch (in db.backends.init) is not as relevant, as it is just adding the table names to a set, so when it tries to re-add an m2m table, it's already in the set. I'll resubmit without that half of the patch.
by , 16 years ago
Attachment: | many_to_many.rel.through.2.diff added |
---|
Smaller patch, without ineffectual hunk
comment:4 by , 16 years ago
I could be totally off mark, but I think ticket #11114 is at least slightly related to this one.
comment:5 by , 16 years ago
BEGIN;
SELECT setval('"venues_minimal_member_id_seq"', coalesce(max("id"), 1), max("id") IS NOT null) FROM "venues_minimal_member";
SELECT setval('"venues_minimal_venue_id_seq"', coalesce(max("id"), 1), max("id") IS NOT null) FROM "venues_minimal_venue";
SELECT setval('"venues_minimal_venuemember_id_seq"', coalesce(max("id"), 1), max("id") IS NOT null) FROM "venues_minimal_venuemember";
SELECT setval('"venues_minimal_venuemember_venue_member_id_seq"', coalesce(max("venue_member_id"), 1), max("venue_member_id") IS NOT null) FROM "venues_minimal_venuemember";
COMMIT;
}}}
Note that the third select tries to reset a sequence (venues_minimal_venuemember_id_seq
) that doesn't exist using a primary key (venues_minimal_venuemember.id
) that doesn't exist. The fourth line is the proper sequence reset statement for that particular table.
comment:6 by , 16 years ago
Sorry about that. Another way to reproduce this (without fixtures) is to run ./manage.py sqlsequencereset <appname>
Output:
BEGIN; SELECT setval('"venues_minimal_member_id_seq"', coalesce(max("id"), 1), max("id") IS NOT null) FROM "venues_minimal_member"; SELECT setval('"venues_minimal_venue_id_seq"', coalesce(max("id"), 1), max("id") IS NOT null) FROM "venues_minimal_venue"; SELECT setval('"venues_minimal_venuemember_id_seq"', coalesce(max("id"), 1), max("id") IS NOT null) FROM "venues_minimal_venuemember"; SELECT setval('"venues_minimal_venuemember_venue_member_id_seq"', coalesce(max("venue_member_id"), 1), max("venue_member_id") IS NOT null) FROM "venues_minimal_venuemember"; COMMIT;
Note that the third select tries to reset a sequence (venues_minimal_venuemember_id_seq
) that doesn't exist using a primary key (venues_minimal_venuemember.id
) that doesn't exist. The fourth line is the proper sequence reset statement for that particular table.
And yes #11114 is the same problem, but it's not quite accurate to say that the metadata is getting ignored, because the through table is being handled as a model in its own right. The problem is that it's *also* being handled as an auto-generated many_to_many intermediary table.
by , 16 years ago
Attachment: | many_to_many.rel.through.3.diff added |
---|
New patch includes fix for oracle
by , 16 years ago
Attachment: | many_to_many.rel.through.4.diff added |
---|
Patch now includes test. Test passes against postgresql and postgresql_psycopg2, needs to be run against oracle.
comment:8 by , 16 years ago
Needs tests: | unset |
---|---|
Owner: | changed from | to
Status: | new → assigned |
follow-up: 11 comment:9 by , 16 years ago
I've confirmed that the patch fixes the bug against Oracle.
However, the test case fails miserably because it's looking for sql specific to the postgresql backend, and the sql generated for the oracle backend is completely different.
comment:10 by , 16 years ago
Patch needs improvement: | set |
---|
comment:11 by , 16 years ago
Replying to ikelly:
I've confirmed that the patch fixes the bug against Oracle.
However, the test case fails miserably because it's looking for sql specific to the postgresql backend, and the sql generated for the oracle backend is completely different.
Duh. That makes sense. Can you send me the output Oracle gives you (with the patch applied), so I can add it to the tests?
I don't see an easy way to handle this without testing the backends conditionally using settings.DATABASE_ENGINE.
comment:12 by , 16 years ago
Testing the raw SQL output is the wrong way to do this. Using the introspection mechanisms makes significantly more sense.
comment:13 by , 16 years ago
I agree, inspecting the sql is too brittle. A better approach would be to actually perform the sequence reset and make sure that it works as expected.
comment:14 by , 16 years ago
New patch. This one has a test that runs loaddata with a fixture that includes objects in the models necessary to trigger the appropriate sequence resets to test the bug. It also patches relevant documentation on sqlsequencereset fixing a broken link to Simon Willison's blog.
by , 16 years ago
Attachment: | many_to_many.rel.through.5.diff added |
---|
Patch with test that doesn't rely on exact sql syntax of particular backends.
comment:15 by , 16 years ago
Patch needs improvement: | unset |
---|---|
Triage Stage: | Unreviewed → Ready for checkin |
Test passes in PostgreSQL and Oracle, and fails without the rest of the patch.
comment:17 by , 16 years ago
Cc: | added |
---|
comment:18 by , 16 years ago
Resolution: | → fixed |
---|---|
Status: | assigned → closed |
Fixes handling of sql generation for
ManyToManyField(through=MyModel)
relations in postgres