How to obtain the "LES" (Last Executed Statement) from an Optimized Core Dump?

Ever ran into a situation where you saw “some important variable you really needed to know about=<optimized out>” while debugging? Let’s look at an example:

[Roel@qaserver master-data]$ gdb /percona-server/Percona-Server-5.5.25a-rel27.1-285.Linux.x86_64/bin/mysqld ./core.3200 
[...]
(gdb) bt
[...]
#20 handle_select (thd=0x33acd30, lex=0x33ae900, result=0x7f3e840058d0, setup_tables_done_option=1073741824)
    at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_select.cc:312
#21 mysql_execute_command (thd=thd@entry=0x33acd30)
    at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_parse.cc:3138
#22 mysql_parse (thd=thd@entry=0x33acd30, <strong>rawbuf=<optimized out>,</strong> length=72, parser_state=parser_state@entry=0x7f3ed013f810)
    at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_parse.cc:5809
#23 dispatch_command (command=COM_QUERY, thd=0x33acd30, <strong>packet=<optimized out></strong>, packet_length=72)
    at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_parse.cc:1060
[...]

[Roel@qaserver master-data]$ gdb /percona-server/Percona-Server-5.5.25a-rel27.1-285.Linux.x86_64/bin/mysqld ./core.3200

[...]

(gdb) bt

[...]

#20 handle_select (thd=0x33acd30, lex=0x33ae900, result=0x7f3e840058d0, setup_tables_done_option=1073741824)

at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_select.cc:312

#21 mysql_execute_command (thd=thd@entry=0x33acd30)

at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_parse.cc:3138

#22 mysql_parse (thd=thd@entry=0x33acd30, rawbuf=<optimized out>, length=72, parser_state=parser_state@entry=0x7f3ed013f810)

at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_parse.cc:5809

#23 dispatch_command (command=COM_QUERY, thd=0x33acd30, packet=<optimized out>, packet_length=72)

at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_parse.cc:1060

[...]

It happens to all of us. This “issue” is seen when using optimized (release) binaries: debug symbols have been stripped out. In non-optimized binaries, the query would (in most cases) have shown directly in the backtrace. Sidenote: “in most cases”: sometimes you have to swap threads (or use: ‘thread apply all bt’ or ‘thread <nr>’ then ‘bt’) before you can view the actual crashing statement: gdb may have incorrectly analyzed which thread caused the crash. Inspecting the error log (which also contains a stack trace) may also help in such cases.

So… maybe you were testing a C program with three threads executing highly concurrent DML statements (where each thread executes one particular type of statement) and you are stuck as to which statement is causing your program to crash.. Or you just ran your latest and greatest RQG grammar and found this nifty crashing bug, only to find out you’ll be stuck with hours or even days of “grammar simplification”..

Not so! Read on…

If we did know the “LES” – or Last Executed Statement – in our RQG case for example, we could bring up the server with a copy of the test run’s datadir and try to re-execute the crashing statement to see if it crashes again – a plausible thing to happen. So, how do we make this happen if the query is not directly visible from the stack trace?

Let’s fire up gdb and see what we find. (We will assume the source code used for compiling is available at the same location as where it was at compilation time)

First we inspect the mysql_execute_command frame:

(gdb) frame 21 
#21 0x00000000005a30af in mysql_execute_command (thd=thd@entry=0x33acd30)
    at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_parse.cc:3138 
3138    res= handle_select(thd, lex, sel_result, OPTION_SETUP_TABLES_DONE);
(gdb) list
3133                &lex->update_list, 
3134                &lex->value_list, 
3135                lex->duplicates, 
3136                lex->ignore))) 
3137       { 
3138  res= handle_select(thd, lex, sel_result, OPTION_SETUP_TABLES_DONE); 
3139         /* 
3140           Invalidate the table in the query cache if something changed 
3141           after unlocking when changes become visible. 
3142           TODO: this is workaround. right way will be move invalidating in

(gdb) frame 21

#21 0x00000000005a30af in mysql_execute_command (thd=thd@entry=0x33acd30)

at /percona-server/5.5/Percona-Server-5.5.25a-rel27.1/sql/sql_parse.cc:3138

3138 res= handle_select(thd, lex, sel_result, OPTION_SETUP_TABLES_DONE);

(gdb) list

3133 &lex->update_list,

3134 &lex->value_list,

3135 lex->duplicates,

3136 lex->ignore)))

3137 {

3138 res= handle_select(thd, lex, sel_result, OPTION_SETUP_TABLES_DONE);

3139 /*

3140 Invalidate the table in the query cache if something changed

3141 after unlocking when changes become visible.

3142 TODO: this is workaround. right way will be move invalidating in

Looking at line 3138 (or by inspecting the stack trace above), we may expect the query to be present in the thd variable. Looking further, we can see that thd is being referenced in all frames, and that it is passed to handle_select. Let’s see what it contains:

(gdb) p thd 
$1 = (THD *) 0x33acd30

1 2	(gdb) p thd $1 = (THD *) 0x33acd30

Ok, so it’s a pointer to an array’s memory address. Using * before the address allows us to see the contents of the thd array:

(gdb) p *(THD *) 0x33acd30
$5 = {<Statement> = {<ilink> = {_vptr.ilink = 0xf4a670, prev = 0x7f3e9800d660, next = 0x33a5c10}, 
<Query_arena> = {_vptr.Query_arena = 0xf4a6a8, free_list = 0x7f3e84005f10, mem_root = 0x33afac0, 
state = Query_arena::STMT_CONVENTIONAL_EXECUTION}, id = 0, query_strip_comments = {buffer = 0x0, 
length = 0, buffer_length = 0}, mark_used_columns = MARK_COLUMNS_READ, name = {str = 0x0, 
length = 0}, lex = 0x33ae900, query_string = {string = { str = 0x7f3e84004ba0 
"<strong>INSERT INTO testdb_N . t1_temp1_N SELECT * FROM test.table10_int_autoinc</strong>", length = 72}, 
cs = 0x101e980}, db = 0x7f3e84004b70 "test", db_length = 4}, <Open_tables_state> = { 
m_reprepare_observer = 0x0, open_tables = 0x7f3eac0085d0, temporary_tables = 0x7f3e84034a50, 
LOCK_temporary_tables = {m_mutex = { mutex = [...]

(gdb) p *(THD *) 0x33acd30

$5 = {<Statement> = {<ilink> = {_vptr.ilink = 0xf4a670, prev = 0x7f3e9800d660, next = 0x33a5c10},

<Query_arena> = {_vptr.Query_arena = 0xf4a6a8, free_list = 0x7f3e84005f10, mem_root = 0x33afac0,

state = Query_arena::STMT_CONVENTIONAL_EXECUTION}, id = 0, query_strip_comments = {buffer = 0x0,

length = 0, buffer_length = 0}, mark_used_columns = MARK_COLUMNS_READ, name = {str = 0x0,

length = 0}, lex = 0x33ae900, query_string = {string = { str = 0x7f3e84004ba0

"INSERT INTO testdb_N . t1_temp1_N SELECT * FROM test.table10_int_autoinc", length = 72},

cs = 0x101e980}, db = 0x7f3e84004b70 "test", db_length = 4}, <Open_tables_state> = {

m_reprepare_observer = 0x0, open_tables = 0x7f3eac0085d0, temporary_tables = 0x7f3e84034a50,

LOCK_temporary_tables = {m_mutex = { mutex = [...]

Great! The query is showing in there: “INSERT INTO testdb_N . t1_temp1_N SELECT * FROM test.table10_int_autoinc”.

Instead of using a numeric memory address, we could also use:

(gdb) p *thd

1	(gdb) p *thd

Gdb knows the variable type, so we don’t need the typecast to “(THD *)”.

Or, better yet, now that we know where the query string is hiding (query_string > string > str), we can use:

(gdb) p thd->query_string.string.str

1	(gdb) p thd->query_string.string.str

In the future.

Conclusion: the Last Executed Statement was found in an optimized core dump by checking the contents of the thd array in the mysql_execute_command frame. By simply examining “surrounding” variables we’ve been able to find the information we needed. You can try this technique yourself the next time you need to find out exactly which query is causing you trouble!

10 Comments

Oldest

Newest Most Voted

Inline Feedbacks

View all comments

Peter Zaitsev

Admin

11 years ago

Last Executed statement is helpful for many crashes troubleshooting. I would point out though it will not always be enough to repeat the crash. Crashes which are repeatable by running statement alone are “simple” and often caught easily so you do not run into them very often. Many problems in production are either context dependent – depend on what previous queries were executed or concurrency dependent – depend on what is being executed in parallel – it could be heap corruption, wrong mutex usage etc.

Mark Callaghan

11 years ago

Can you expect to get core dumps in production when the InnoDB buffer pool is huge? That will lead to huge core dump files. Perhaps it is time to figure out how to get core dumps that exclude the buffer pool.

Vadim Tkachenko

Admin

11 years ago

Mark,

Did you look into “Google Breakpad crash reporting system” from
http://darnaut.blogspot.com/2012/06/changes-in-twitter-mysql-5523t6.html ?
Is this something of interest?

Roel Van de Paar

11 years ago

@ Peter – Agreed, the last executed statement will often not be enough to repeat the crash. It is a good first test though, and it is simple to execute. If it works, one can simply dump or copy the db and use the crashing statement for a bug report. It also may provide developers with a good indication on where to start looking for the root cause. I found that in general many crashes can be reduced to a single statement + corresponding DDL & data setup. If the crash is however caused in production as the result of multiple threads interacting, or is caused by multiple successive queries, then indeed finding the root cause is likely not going to be straightforward. Besides a developer studying the coredump in those cases, it may also pay off to see if any application logs are available.

@ Mark – Good point (and thanks to Vadim for the link). One idea, at least if one is after the Last Executed Statement, is to also check the error log. Often times it will contain at least part of the crashing query.

Peter Zaitsev

Admin

11 years ago

Roel,

Another question by the way… when MySQL crashes it often will print query into error log even when core file is not enabled. Are there any cases when that information would not be available while core file will contain the query ?

Mark Callaghan

11 years ago

Vadim – thanks for pointing that out to me.

Roel Van de Paar

11 years ago

@ Peter – I believe so. You may see things like “Query (0x7f90ab293dc5): is an invalid pointer” or “Query (0): is an invalid pointer” in the error log when MySQL was unable to figure out which query caused the crash. However, it would be fair enough to assume that indiviudal thread variables recorded in the coredump still contain the queries which were being executed. I usually tend to check one or the other, depending on whatever comes to mind first. I have also found that in some cases MySQL and gdb disagree on what caused the crash (different main backtrace).

sbester

11 years ago

Peter, we still have cases where the query is not printed in error log. I haven’t investigated why (always works on my machines),. And this is after the fix for:

http://bugs.mysql.com/bug.php?id=51817
(incorrect assumption: thd->query at 0x2ab2a8360360 is an invalid pointer)

Roel Van de Paar

Author

11 years ago

For a more advanced version, see http://mysqlbugs.blogspot.com.au/2012/09/how-to-obtain-all-executing-queries.html

Roel Van de Paar

Author

10 years ago

For the latest on this, see http://www.mysqlperformanceblog.com/2013/11/11/how-to-extract-all-running-queries-including-the-last-executed-statement-from-a-core-file/

MySQL 5.7
End of Life

Compare Percona to Leading Database Solutions

Software
Downloads

Product
Documentation

Resource Hub

Financial Services

Driving Database Success

Percona Blog

Percona Community Hub

Percona Events Hub

About Percona

Percona in the News

Our Customers

Our Partners

Careers

Contact Us

How to obtain the “LES” (Last Executed Statement) from an Optimized Core Dump?

Related

Related Blog Articles

RECOMMENDED ARTICLES

Valkey/Redis: Not-So-Good Practices

Choosing the Right Database: Comparing MariaDB vs. MySQL, PostgreSQL, and MongoDB

Valkey/Redis: Configuration Best Practices

MOST POPULAR ARTICLES

Auditing login attempts in MySQL

Deploy Django on Kubernetes With Percona Operator for PostgreSQL

MySQL “Got an error reading communication packet”

MySQL 5.7 End of Life

Compare Percona to Leading Database Solutions

Software Downloads

Product Documentation

Resource Hub

Financial Services

Driving Database Success

Percona Blog

Percona Community Hub

Percona Events Hub

About Percona

Percona in the News

Our Customers

Our Partners

Careers

Contact Us

How to obtain the “LES” (Last Executed Statement) from an Optimized Core Dump?

Related

Share This Post!

Want to get weekly updates listing the latest blog posts?

Related Blog Articles

RECOMMENDED ARTICLES

Valkey/Redis: Not-So-Good Practices

Choosing the Right Database: Comparing MariaDB vs. MySQL, PostgreSQL, and MongoDB

Valkey/Redis: Configuration Best Practices

MOST POPULAR ARTICLES

Auditing login attempts in MySQL

Deploy Django on Kubernetes With Percona Operator for PostgreSQL

MySQL “Got an error reading communication packet”

MySQL 5.7
End of Life

Software
Downloads

Product
Documentation