Context Navigation

← Previous Ticket
Next Ticket →

#34639 closed Bug (fixed)

MySQL 8.0 hangs indefinitely when using the admin search with a Foreign Key and Annotate

Reported by:	Nicolas Lupien	Owned by:	Simon Charette
Component:	contrib.admin	Version:	4.2
Severity:	Normal	Keywords:	mysql
Cc:	Simon Charette, Mariusz Felisiak, Natalia Bidart	Triage Stage:	Ready for checkin
Has patch:	yes	Needs documentation:	no
Needs tests:	no	Patch needs improvement:	no
Easy pickings:	no	UI/UX:	no

Description (last modified by Nicolas Lupien)

We've moved to MySQL 8.0 in order to use Django 4.2 but our production system went down and we reverted to using MySQL 5.7 with Django 4.1. We've currently found a workaround that I'll add at the end of the bug report.

If we use the search function of the admin on model with a foreign key and we override ModelAdmin.get_queryset with annotate, the search freezes our database. It had the same effect on Google Cloud SQL and on a local docker image of MySQL 8.0 and it works fine on both environment when using MySQL 5.7.

The code:

models.py

class Organization(models.Model):
    name = models.CharField(max_length=255)

class Member(models.Model):
    name = models.CharField(max_length=255)
    organization = models.ForeignKey(Organization, on_delete=models.CASCADE, null=True)

admin.py

class OrganizationAdmin(admin.ModelAdmin):
    search_fields = ["name", "member__name"]
    list_display = ["name", "member_count"]    
    
    class Meta:
        model = models.Organization
        
    def get_queryset(self, request):
        return super().get_queryset(request).annotate(Count("member"))
    
    def member_count(self, instance):
        return instance.member__count

I found that the ChangeList applies the override to get_queryset containing the annotate multiple times making the query extremely expensive. Give only 500 members it goes through 125,000,000 (500 * 500 * 500) rows.

The workaround: If we override the ChangeList queryset, the call to annotate happens only once and the query is fine.

class CustomChangeList(ChangeList):
    def get_queryset(self, request):
        return super().get_queryset(request).annotate(Count("member"))
        

class OrganizationAdmin(admin.ModelAdmin):
    search_fields = ["name", "member__name"]
    list_display = ["name", "member_count"]    
    
    class Meta:
        model = models.Organization
    
    def member_count(self, instance):
        return instance.member__count
    
    def get_changelist(self, request, **kwargs):
        return CustomChangeList

I created a repo with more details and the complete steps to reproduce the issue: https://github.com/betaflag/django-sqlbugdemo

Change History (15)

comment:1 by Nicolas Lupien, 21 months ago

Summary:	MySQL 8.0 hand indefinitely when using the admin search with a Foreign Key and Annotate → MySQL 8.0 hangs indefinitely when using the admin search with a Foreign Key and Annotate

comment:2 by Nicolas Lupien, 21 months ago

Description:	modified (diff)

comment:3 by Nicolas Lupien, 21 months ago

Description:	modified (diff)

comment:4 by Mariusz Felisiak, 21 months ago

Resolution:	→ invalid
Status:	new → closed

Thanks for the report, however, as it stands, it's a support question. You're using very expensive query for every row, so it takes time. Please reopen the ticket if you can debug your issue and provide details about why and where Django is at fault. As far as I'm aware, if the generated query is the same, then it's not an issue in Django itself. I'd start from checking database indexes.

comment:5 by Simon Charette, 21 months ago

It looks like this relates to #32682 (187118203197801c6cb72dc8b06b714b23b6dd3d).

The fact self.root_queryset is used without stripping unused annotations (e.g. the Count one) explains why the reporter is able to work around the issue by annotating their Count after the Exists filter is added which ensures it's not part of the subquery (which MySQL is bad at optimizing sometimes).

Adjusting SQLQuery.exists to strip used annotation (like we did with SQLQuery.get_aggreation in #28477 (59bea9efd2768102fc9d3aedda469502c218e9b7)) could be a possible way to generically optimize this problem away.

Last edited 21 months ago by Simon Charette (previous) (diff)

comment:6 by Nicolas Lupien, 21 months ago

Resolution:	invalid
Status:	closed → new

It's definitely a Django issue. The query before #32682 is small instantaneous, and now, it crashes my db because it's astronomically costly, with only 500 rows. On production with 50,000 row, it costs 1.25e+15. This is, like @simon said, only because Django is not stripping unused annotations when using the root_queryset. The resulting query does a lot of unnecessary calculation. You can see it with all the debug information in my repository that I joined previously https://github.com/betaflag/django-sqlbugdemo

comment:7 by Natalia Bidart, 21 months ago

Triage Stage:	Unreviewed → Accepted

Accepting based on the comment from Simon. Not marking it as release blocker since the potential regression was added in 3.2.1 (and afaik, 3.2 only receives security updates at this point, right?).

comment:8 by Simon Charette, 21 months ago

Looking at the generated SQL in more details it seems like the annotation is properly stripped but not the now unnecessary join that was required for it

SELECT
    `myapp_organization`.`id`,
    `myapp_organization`.`name`,
    COUNT(`myapp_member`.`id`) AS `member__count`
FROM
    `myapp_organization`
    LEFT OUTER JOIN `myapp_member` ON (
        `myapp_organization`.`id` = `myapp_member`.`organization_id`
    )
WHERE
    EXISTS(
        SELECT
            1 AS `a`
        FROM
            `myapp_organization` U0
            -- U1 is a completely unused alias now that COUNT(`U1`.`id`) is stripped!
            LEFT OUTER JOIN `myapp_member` U1 ON (U0.`id` = U1.`organization_id`)
            LEFT OUTER JOIN `myapp_member` U2 ON (U0.`id` = U2.`organization_id`)
        WHERE
            (
                (
                    U0.`name` LIKE '%a%'
                    OR U2.`name` LIKE '%a%'
                )
                AND U0.`id` = (`myapp_organization`.`id`)
            )
        -- Ditto with GROUP BY since aggregation is discarded
        GROUP BY
            U0.`id`
        ORDER BY
            NULL
        LIMIT
            1
    )
GROUP BY
    `myapp_organization`.`id`
ORDER BY
    `myapp_organization`.`name` ASC,
    `myapp_organization`.`id` DESC;

That's something the ORM is not good at.

Since annotate, filter and friends are additive only (there's no API to discard of them) the ORM was never optimized to have an operation that strips unused JOINs when the select mask changes or filters are elided (e.g. they end up raising EmptyQuerySet). This was identified when working on stripping unused annotations on aggregations in #28477 (59bea9efd2768102fc9d3aedda469502c218e9b7) and is relatively hard to solve.

What's interesting here is that prior to 187118203197801c6cb72dc8b06b714b23b6dd3d the reporter was likely not even getting the right results when searching because they were running into #2361 (notice the double left join against the multi-valued myapp_member table) which means their member__count returned the wrong values.

comment:9 by Simon Charette, 21 months ago

Cc:	Simon Charette Mariusz Felisiak Natalia Bidart added

Not sure how actionable the ticket is in its current form so I wonder how we should proceed here.

I can see a few paths forward:

Create an optimization / ORM ticket to have SQLQuery.exists and .aggregate prune unnecessary joins solely used by pruned annotations. It doesn't look like we have one already and this is desirable.

Adjust #32433 (6307c3f1a123f5975c73b231e8ac4f115fd72c0d) so the TypeError is only raised when distinct(fields) is used (which was the actually reported issue) so #32682 (187118203197801c6cb72dc8b06b714b23b6dd3d) can be reverted.

Don't attempt to remove duplicates when the queryset is meant to be used for deletion purposes as there is no value in doing so but that's easier said than done due with how things are currently structured. Here's how it could be achieved.

I think that 1. is desirable no matter what, and that we should choose a solution between 2. and 3. I have a slight preference for 2 given it relaxes an overly aggressive assertion instead of introducing a feature for the sole purpose of working around it.

comment:10 by Natalia Bidart, 20 months ago

Hello Simon,

I've been reading this ticket and the related tickets to try to provide a sensible advice. The ORM in general is something I'm not familiar with.

Regarding doing (1), and following your advice in terms of that being desirable and feasible, that sounds like a change we should pursue.

Then, I don't understand exactly what (2) involves: is it to remove the self.query.distinct or from the added if guard in the commit? I fail to see how that is safe (in terms of the original report), since in the PR Mariusz commented Using distinct() with delete() sounds like a bad idea to me.. Are we confident that allowing distinct().delete() is safe?

Thanks!

comment:11 by Simon Charette, 20 months ago

Owner:	changed from nobody to Simon Charette
Status:	new → assigned

Hello Natalia,

In the case of (2), and as originally reported in #32433, calling delete() after distinct() is only potentially ambiguous when DISTINCT ON is used (distinct(*fields)) or when DISTINCT is against a subset of fields that doesn't not include the primary key. That is the case because if the subset of fields the distinct grouping is applied on includes the primary key then it's not possible to reduce the set of included rows to a subset (the primary key is distinct for each rows by definition).

The delete method already prevents its usage after a values() call (source) so doing values(*subset_not_including_pk).distinct().delete() is not allowed. This only leaves the distinct(*fields).delete() case as potentially problematic as reported in #32433.

The patch for #32433 was overly strict though and disallowed the distinct().delete() case which then forced the logic introduced by 187118203197801c6cb72dc8b06b714b23b6dd3d to address #32682.

I'll work on PR to demonstrate what I mean by the above to make it crystal clear.

comment:12 by Simon Charette, 20 months ago

Has patch:	set

comment:13 by Mariusz Felisiak, 20 months ago

Triage Stage:	Accepted → Ready for checkin

comment:14 by Mariusz Felisiak <felisiak.mariusz@…>, 20 months ago

In 28e20771:

Refs #32433 -- Reallowed calling QuerySet.delete() after distinct().

While values(*field_excluding_pk).distinct() and
distinct(*field_excluding_pk) can reduce the number of resulting rows
in a way that makes subsequent delete() calls ambiguous standalone
.distinct() calls cannot.

Since delete() already disallows chain usages with values() the only
case that needs to be handled, as originally reported, is when
DISTINCT ON is used via distinct(*fields).

Refs #32682 which had to resort to subqueries to prevent duplicates in
the admin and caused significant performance regressions on MySQL
(refs #34639).

This partly reverts 6307c3f1a123f5975c73b231e8ac4f115fd72c0d.

comment:15 by Mariusz Felisiak <felisiak.mariusz@…>, 20 months ago

Resolution:	→ fixed
Status:	assigned → closed

In d569c1dc:

Fixed #34639 -- Reverted "Fixed #32682 -- Made admin changelist use Exists() instead of distinct() for preventing duplicates."

This reverts commit 187118203197801c6cb72dc8b06b714b23b6dd3d which
moved to using Exists() instead due to an overly strict
distinct().delete() check added in #32433.

Note: See TracTickets for help on using tickets.

Download in other formats:

Issues