#29692 closed Bug (fixed)
Incorrect removal of order_by clause created as multiline RawSQL
Reported by: | Marcin Nowak | Owned by: | Can Sarıgöl |
---|---|---|---|
Component: | Database layer (models, ORM) | Version: | dev |
Severity: | Normal | Keywords: | |
Cc: | Can Sarıgöl | Triage Stage: | Accepted |
Has patch: | yes | Needs documentation: | no |
Needs tests: | no | Patch needs improvement: | no |
Easy pickings: | no | UI/UX: | no |
Description
Hi.
The SQLCompiler
is ripping off one of my "order by" clause, because he "thinks" the clause was already "seen" (in SQLCompiler.get_order_by()
). I'm using expressions written as multiline RawSQL
s, which are similar but not the same.
The bug is located in SQLCompiler.get_order_by()
, somewhere around line computing part of SQL query without ordering:
without_ordering = self.ordering_parts.search(sql).group(1)
The sql
variable contains multiline sql. As a result, the self.ordering_parts
regular expression is returning just a line containing ASC or DESC words. This line is added to seen
set, and because my raw queries have identical last lines, only the first clasue is returing from SQLCompiler.get_order_by()
.
As a quick/temporal fix I can suggest making sql
variable clean of newline characters, like this:
sql_oneline = ' '.join(sql.split('\n')) without_ordering = self.ordering_parts.search(sql_oneline).group(1)
Note: beware of unicode (Py2.x u''
) and EOL dragons (\r
).
Example of my query:
return MyModel.objects.all().order_by( RawSQL(''' case when status in ('accepted', 'verification') then 2 else 1 end''', []).desc(), RawSQL(''' case when status in ('accepted', 'verification') then (accepted_datetime, preferred_datetime) else null end''', []).asc(), RawSQL(''' case when status not in ('accepted', 'verification') then (accepted_datetime, preferred_datetime, created_at) else null end''', []).desc())
The ordering_parts.search
is returing accordingly:
' then 2 else 1 end)'
' else null end'
' else null end'
Second RawSQL with a else null end
part is removed from query.
The fun thing is that the issue can be solved by workaround by adding a space or any other char to the last line.
So in case of RawSQL I can just say, that current implementation of avoiding duplicates in order by clause works only for special/rare cases (or does not work in all cases).
The bug filed here is about wrong identification of duplicates (because it compares only last line of SQL passed to order by clause).
Hope my notes will help you fixing the issue. Sorry for my english.
Change History (14)
comment:1 by , 6 years ago
Type: | Uncategorized → Bug |
---|
comment:2 by , 6 years ago
Is there a reason you can't use conditional expressions
No, but I didn't knew about the issue, and writing raw sqls is sometimes faster (not in this case ;)
I'm really happy having possibility to mix raw sqls with object queries. Next time I'll use expressions, for sure.
Allowing the ordering optimization stuff to handle arbitrary RawSQL may be difficult.
Personally I'd like to skip RawSQL clauses in the block which is responsible for finding duplicates. If someone is using raw sqls, he knows the best what he is doing, IMO. And it is quite strange if Django removes silently part of your SQL. This is very confusing. And please note that printing a Query
instance was generating incomplete sql, but while checking Query.order_by
manually, the return value was containing all clauses. I thought that just printing was affected, but our QA dept told me the truth ;)
I know there is no effective way to compare similarity of two raw clauses. This may be hard for expression objects, too, but you have a possibility to implement some __eq__
magic (instead of comparation of generated sqls). Unfortunately I don't know why duplicates detection was implemented, so it's hard to tell how to improve this part.
comment:7 by , 6 years ago
Needs tests: | unset |
---|---|
Patch needs improvement: | set |
comment:8 by , 6 years ago
Patch needs improvement: | unset |
---|
comment:10 by , 6 years ago
Patch needs improvement: | unset |
---|
comment:11 by , 6 years ago
Owner: | changed from | to
---|---|
Status: | new → assigned |
comment:12 by , 6 years ago
Patch needs improvement: | set |
---|---|
Version: | 1.11 → master |
comment:14 by , 6 years ago
Patch needs improvement: | unset |
---|
Is there a reason you can't use conditional expressions, e.g. something like:
I'm thinking that would avoid fiddly
ordering_parts
regular expression.If there's some shortcoming to that approach, it might be easier to address that. Allowing the ordering optimization stuff to handle arbitrary RawSQL may be difficult.