[Maria-developers] Review input for MDEV-21092, 21095, 29997
Hi Rex, Please find below review input for the collection of patches for these MDEVs. First, please try to have each piece of functionality in its own commit. You can use "git rebase -i" to make your last commits be one commit, as well as re-order them. Then, you can do a carefully considered "git push --force" to overwrite the contents of your branch with the new set of commits. in prune_partitions(): + String parts; + String_list parts_list; + + make_used_partitions_str(thd->mem_root, prune_param.part_info, &parts, parts_list ); + trace_partition_pruning.add("partition_prune", parts.ptr()); This creates a list of used partitions regardless whether we need it or not (and typically trace=off, so we don't'). Please put this code into if (unlikely(thd->trace_started())) { ... } Consider this testcase: create table t2 (a int, b int) partition by hash(a) partitions 10; create table t3 (a int, b int) partition by hash(a) partitions 10; set optimizer_trace=1; explain format=json select * from t2,t3 where t2.a in (2,3,4) and t3.a in (4,5); select * from information_schema.optimizer_trace\G This gives: ... { "partition_prune": "p2,p3,p4" }, { "partition_prune": "p4,p5" }, ... which makes it impossible to see which table is pruning is done for. Let's make the trace be: { "prune_partitions" : { "table": "t2", "used_partitions": "p2,p3,p4" } } { "prune_partitions" : { "table": "t3", "used_partitions": "p4,p5" } } . in item_subselect.cc: + { + OPT_TRACE_TRANSFORM(thd, trace_wrapper, trace_transform, + in_subs->get_select_lex()->select_number, + "EXISTS (SELECT)", "IN (SELECT)"); + trace_transform.add( "upper_not", ( upper_not?true:false ) ); + } coding style: everything inside { } should be indented one level (=2 spaces) right. in sql_select.cc: "attached_conditions_summary": [ { "table": "t1", "attached": "t1.c < 30", "index": "t1.a < 10 and t1.b < 20" } Please change "index" to "index_condition". and "attached" to "attached_condition". Let's not print index_condition if it is null. (Yes, this is somewhat inconsistent with printing "attached":null) in item.cc: + #include "my_json_writer.h" + const char *dbug_print_optrace( ) dbug_print_optrace() is fine. I think it needs a comment like this: /* Debug printout: print the trace trace we're currently writing if any, and return it. */ (Feel free to fix poor English if necessary) Please fix the coding style in the function body as discussed on Slack. Please move #include to the other #includes at the top of the file. Can add a comment, "// For dbug_print_optrace" Would dbug_print_opt_trace() be a better name? + const char *dbug_print_items(Item *item) + { + *big_buf= 0; + + dbug_add_print_items( item ); + return big_buf; + } This function doesn't seem to be needed? For the rest of the functions: as discussed on Slack, we can add them in a separate commit if we decide that they're useful. Could you provide a very short text describing: The entry point (which function is to be called from gdb?). The use case. Is it "to print List<Item>" ? I don't recall seeing this data structure in the server... Is there a lot of need to print Item_cond and then print its children? This looks self-repetitive... I'd like to see a log of an example gdb session...
Hi Rex, Nice to see progress with the patch, but there is further input. In ./mysql-test/main/opt_trace.test Do you see this comment at the end:
--echo # End of 10.6 tests
Please move the newly added testcases to be AFTER that line. This is important for those who do merges between version. sql_select.cc, coding style comments: instead of trace_attached_conditions( thd, join ); use trace_attached_conditions(thd, join); In opt_range.cc: instead of make_used_partitions_str(thd->mem_root, prune_param.part_info, &parts, parts_list ); Use make_used_partitions_str(thd->mem_root, prune_param.part_info, &parts, parts_list); No spaces, observe line length limit. IMPORTANT: I'm looking at the change in opt_trace_security.cc, and in the new file I see: { "attaching_conditions_to_tables": { "attached_conditions_computation": [], "attached_conditions_summary": [ { "table": "t1", "attached": null } ] } } ] } }, { "attaching_conditions_to_tables": { "attached_conditions_computation": [], "attached_conditions_summary": [ { "table": "t1", "attached_condition": null } ] } }, (One can see similar picture in other examples, too). This looks as if "attaching_conditions_to_tables" step was performed twice. I think we should then get back to the principle of "Optimizer Trace is a log of what optimizer does", which means things are logged as soon as they're done. This means: Please put back the tracing into make_join_select() as it originally was. With the exception that it should print "attached_condition" instead of "attached". Index Condition Pushdown is done in make_join_readinfo(). So, please add tracing like so: { "make_join_readinfo" : [ { "table": "t1", "index_condition": .... } // further array elements follow ] } The array elements should only be printed if the table has a pushed index condition. This will occur naturally if the code that prints the JSON object is push_index_cond(). It is ok if the code prints an empty "make_join_readinfo": [] for cases when there weren't any pushed index conditions. BR Sergei -- Sergei Petrunia, Software Developer MariaDB Corporation | Skype: sergefp | Blog: http://petrunia.net
Hi Rex, I am looking at the latest commit: commit 9eba044b9710bbdd8195e887cfdd4d62233942ca (HEAD -> bb-MDEV-21092-21095-29997-optimizer-trace-updates, Author: Rex <rex.johnston@mariadb.com> Date: Fri Dec 2 16:46:54 2022 +1200 MDEV-21092,MDEV-21095,MDEV-29997: Optimizer Trace for index condition pushdown, partition pruning, exists-to-in Add Optimizer Tracing for: - Index Condition Pushdown - Partition Pruning - Exists-to-IN optimization and trying this testcase: == testcase == --source include/have_sequence.inc create table t1 ( a int, b int, key(a) ); insert into t1 select seq, seq from seq_1_to_10000; --optimizer_trace explain format=json select * from t1 where a < 20 and (a+1<a+2) and b=3333; drop table t1; == Testcase ends == In the output I see: == Output == { "make_join_readinfo": [ { "table": "t1", "index_condition": "t1.b = 3333" // WRONG-1 } ] } ] } }, { "attaching_conditions_to_tables": { "attached_conditions_computation": [], "attached_conditions_summary": [ { "table": "t1", "attached_condition": "t1.b = 3333", "index_condition": "t1.a < 20 and t1.a + 1 < t1.a + 2" } ] } }, == Output ends == note the line marked "WRONG-1". I also think some of the recent changes in the patch go in the wrong direction. Let me take over, I think I'll make a patch to fix both. BR Sergei -- Sergei Petrunia, Software Developer MariaDB Corporation | Skype: sergefp | Blog: http://petrunia.net
participants (2)
-
Sergei Petrunia
-
Sergey Petrunia