Measuring max MySQL replication throughput on PXC with wsrep

Checking throughput with async MySQL replication
Replication throughput is the measure of just how fast the slaves can apply replication (at least by my definition). In MySQL async replication this is important to know because the single-threaded apply nature of async replication can be a write performance bottleneck. In a production system, we can tell how fast the slave is currently running (applying writes), and we might have historical data to check for the most throughput ever seen, but that doesn’t give us a solid way of determining where we stand right NOW().
An old consulting trick to answer this question is to simply stop replicating on your slave for a minute, (usually just the SQL_THREAD), restart it and watch how long it takes to catch up. We can also watch the slave thread apply rate during this interval to get a sense of just how many writes per second we can do and compare that with the normal rate (during peak hours, for example). This can be a handy way of quickly assessing how close you are to our maximum theoretical throughput.
But what about with PXC and Galera? This is easy on async because the master doesn’t care, but to be able to do this on PXC we need a way to intentionally lag a node without hanging or causing flow control on the rest of the cluster. And as it turns out, as of version 5.5.33, there’s a pretty easy way.

Measuring an average apply rate on PXC

First we need to pick a node that is not taking reads or writes (or shift some traffic away from one that is). We’re assuming reads and writes are happening on the rest of the cluster normally, and probably also that the node we chose has pretty similar hardware to every other node. Once we have this, we can use myq_status to see replication coming into the node and being applied:

mycluster / ip-10-142-147-72 / Galera 2.7(r157)
Wsrep    Cluster  Node     Queue   Ops     Bytes     Flow    Conflct PApply        Commit
    time P cnf  #  cmt sta  Up  Dn  Up  Dn   Up   Dn pau snt lcf bfa dst oooe oool wind
20:18:04 P   3  3 Sync T/T   0   0   0  5k    0 2.7M 0.0   0   0   0 703   82    4    3
20:18:05 P   3  3 Sync T/T   0  15   0  6k    0 3.1M 0.1   5   0   0 804   78    4    3
20:18:06 P   3  3 Sync T/T   0   0   0  6k    0 3.2M 0.1   5   0   0 701   80    4    3
20:18:07 P   3  3 Sync T/T   0  10   0  6k    0 3.1M 0.0   5   0   0 820   81    5    3
20:18:08 P   3  3 Sync T/T   0   0   0  6k    0 3.1M 0.1   3   0   0  1k   77    4    3
20:18:09 P   3  3 Sync T/T   0  57   0  3k    0 1.3M 0.6   2   0   0 758   64    2    2

mycluster / ip-10-142-147-72 / Galera 2.7(r157)

Wsrep Cluster Node Queue Ops Bytes Flow Conflct PApply Commit

time P cnf # cmt sta Up Dn Up Dn Up Dn pau snt lcf bfa dst oooe oool wind

20:18:04 P 3 3 Sync T/T 0 0 0 5k 0 2.7M 0.0 0 0 0 703 82 4 3

20:18:05 P 3 3 Sync T/T 0 15 0 6k 0 3.1M 0.1 5 0 0 804 78 4 3

20:18:06 P 3 3 Sync T/T 0 0 0 6k 0 3.2M 0.1 5 0 0 701 80 4 3

20:18:07 P 3 3 Sync T/T 0 10 0 6k 0 3.1M 0.0 5 0 0 820 81 5 3

20:18:08 P 3 3 Sync T/T 0 0 0 6k 0 3.1M 0.1 3 0 0 1k 77 4 3

20:18:09 P 3 3 Sync T/T 0 57 0 3k 0 1.3M 0.6 2 0 0 758 64 2 2

If we check the rate of growth of wsrep_last_committed over a full minute we can see:

ip-10-142-147-72 mysql> show global status like 'wsrep_last_committed'; select sleep(60); show global status like 'wsrep_last_committed';
+----------------------+---------+
| Variable_name        | Value   |
+----------------------+---------+
| wsrep_last_committed | 3136415 |
+----------------------+---------+
1 row in set (0.01 sec)

+-----------+
| sleep(60) |
+-----------+
|         0 |
+-----------+
1 row in set (59.99 sec)

+----------------------+---------+
| Variable_name        | Value   |
+----------------------+---------+
| wsrep_last_committed | 3443992 |
+----------------------+---------+
1 row in set (0.00 sec)

ip-10-142-147-72 mysql> show global status like 'wsrep_last_committed'; select sleep(60); show global status like 'wsrep_last_committed';

+----------------------+---------+

| Variable_name | Value |

+----------------------+---------+

| wsrep_last_committed | 3136415 |

+----------------------+---------+

1 row in set (0.01 sec)

+-----------+

| sleep(60) |

+-----------+

| 0 |

+-----------+

1 row in set (59.99 sec)

+----------------------+---------+

| Variable_name | Value |

+----------------------+---------+

| wsrep_last_committed | 3443992 |

+----------------------+---------+

1 row in set (0.00 sec)

So we’re averaging 5.1k TPS applying on this node (and across the whole cluster). But how much can we handle at peak?

Measuring Max Replication throughput on PXC

In another window on that same node, we execute this SQL (all at once):

mysql> set global wsrep_desync=ON; flush tables with read lock; show global status like 'wsrep_last_committed'; select sleep( 60 ); unlock tables;
+----------------------+--------+
| Variable_name        | Value  |
+----------------------+--------+
| wsrep_last_committed | 665368 |
+----------------------+--------+
1 row in set (0.00 sec)

mysql> set global wsrep_desync=ON; flush tables with read lock; show global status like 'wsrep_last_committed'; select sleep( 60 ); unlock tables;

+----------------------+--------+

| Variable_name | Value |

+----------------------+--------+

| wsrep_last_committed | 665368 |

+----------------------+--------+

1 row in set (0.00 sec)

We’ve desynced the node, locked writes to all tables, and checked the last seqno we committed on this node (665368). The wsrep_desync state tells this node to enter the Donor/Desynced state, which means it will not send Flow Control to the rest of the cluster if its queue gets backlogged.

Then we proceed to take a read lock on all tables, pausing the Galera applier. Then sleep 60 seconds, unlock the tables and wait to see how long it recovers.

Once the initial FTWRL happens, we can immediately see the node drop to the Donor/Desynced state and watch replication start to queue up on the node:

20:18:10 P   3  3 Sync T/T   0  55   0  4k    0 2.0M 0.4   3   0   0 768   79    4    4
20:18:11 P   3  3 Dono T/T   0  6k   0 577    0 306K 0.0   3   0   0 791   79    0    4
20:18:12 P   3  3 Dono T/T   0 12k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:18:13 P   3  3 Dono T/T   0 18k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:18:14 P   3  3 Dono T/T   0 24k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:18:15 P   3  3 Dono T/T   0 31k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:18:16 P   3  3 Dono T/T   0 37k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:18:17 P   3  3 Dono T/T   0 43k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:18:18 P   3  3 Dono T/T   0 49k   0   0    0    0 0.0   0   0   0 791    0    0    0

20:18:10 P 3 3 Sync T/T 0 55 0 4k 0 2.0M 0.4 3 0 0 768 79 4 4

20:18:11 P 3 3 Dono T/T 0 6k 0 577 0 306K 0.0 3 0 0 791 79 0 4

20:18:12 P 3 3 Dono T/T 0 12k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:18:13 P 3 3 Dono T/T 0 18k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:18:14 P 3 3 Dono T/T 0 24k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:18:15 P 3 3 Dono T/T 0 31k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:18:16 P 3 3 Dono T/T 0 37k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:18:17 P 3 3 Dono T/T 0 43k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:18:18 P 3 3 Dono T/T 0 49k 0 0 0 0 0.0 0 0 0 791 0 0 0

The rest of our cluster is operating normally here.

A minute later the queue is backlogged to almost 350k transactions. Then the lock is released, and Galera starts to apply that queue as quickly as possible:

mycluster / ip-10-142-147-72 / Galera 2.7(r157)
Wsrep    Cluster  Node     Queue   Ops     Bytes     Flow    Conflct PApply        Commit
    time P cnf  #  cmt sta  Up  Dn  Up  Dn   Up   Dn pau snt lcf bfa dst oooe oool wind
20:19:06 P   3  3 Dono T/T   0 326k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:19:07 P   3  3 Dono T/T   0 332k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:19:09 P   3  3 Dono T/T   0 339k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:19:10 P   3  3 Dono T/T   0 345k   0   0    0    0 0.0   0   0   0 791    0    0    0
20:19:11 P   3  3 Dono T/T   0 342k   0  9k    0 4.6M 0.0   0   0   0  5k  100    1   14
20:19:12 P   3  3 Dono T/T   0 336k   0 13k    0 6.7M 0.0   0   0   0 12k   96    2   10
20:19:13 P   3  3 Dono T/T   0 329k   0 12k    0 6.4M 0.0   0   0   0 15k   96    2   10
20:19:14 P   3  3 Dono T/T   0 322k   0 13k    0 6.8M 0.0   0   0   0 15k   92    2    6
20:19:15 P   3  3 Dono T/T   0 313k   0 13k    0 6.9M 0.3   0   0   0 15k   94    1    8
20:19:16 P   3  3 Dono T/T   0 304k   0 13k    0 6.7M 0.4   0   0   0 15k   91    1    7
20:19:17 P   3  3 Dono T/T   0 298k   0 13k    0 6.7M 0.0   0   0   0 15k   94    1    9

mycluster / ip-10-142-147-72 / Galera 2.7(r157)

Wsrep Cluster Node Queue Ops Bytes Flow Conflct PApply Commit

time P cnf # cmt sta Up Dn Up Dn Up Dn pau snt lcf bfa dst oooe oool wind

20:19:06 P 3 3 Dono T/T 0 326k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:19:07 P 3 3 Dono T/T 0 332k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:19:09 P 3 3 Dono T/T 0 339k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:19:10 P 3 3 Dono T/T 0 345k 0 0 0 0 0.0 0 0 0 791 0 0 0

20:19:11 P 3 3 Dono T/T 0 342k 0 9k 0 4.6M 0.0 0 0 0 5k 100 1 14

20:19:12 P 3 3 Dono T/T 0 336k 0 13k 0 6.7M 0.0 0 0 0 12k 96 2 10

20:19:13 P 3 3 Dono T/T 0 329k 0 12k 0 6.4M 0.0 0 0 0 15k 96 2 10

20:19:14 P 3 3 Dono T/T 0 322k 0 13k 0 6.8M 0.0 0 0 0 15k 92 2 6

20:19:15 P 3 3 Dono T/T 0 313k 0 13k 0 6.9M 0.3 0 0 0 15k 94 1 8

20:19:16 P 3 3 Dono T/T 0 304k 0 13k 0 6.7M 0.4 0 0 0 15k 91 1 7

20:19:17 P 3 3 Dono T/T 0 298k 0 13k 0 6.7M 0.0 0 0 0 15k 94 1 9

We can see right away that our ‘Ops Dn’ is much higher: peaking at 13k, but how can we get a good average? Let’s watch it catch all the way up:

mycluster / ip-10-142-147-72 / Galera 2.7(r157)
Wsrep    Cluster  Node     Queue   Ops     Bytes     Flow    Conflct PApply        Commit
    time P cnf  #  cmt sta  Up  Dn  Up  Dn   Up   Dn pau snt lcf bfa dst oooe oool wind
20:19:54 P   3  3 Dono T/T   0 70k   0 13k    0 6.7M 0.0   0   0   0 15k   95    2    9
20:19:55 P   3  3 Dono T/T   0 63k   0 13k    0 6.8M 0.0   0   0   0 15k   94    2    8
20:19:56 P   3  3 Dono T/T   0 56k   0 13k    0 6.7M 0.0   0   0   0 15k   92    2    7
20:19:57 P   3  3 Dono T/T   0 46k   0 14k    0 7.1M 0.0   0   0   0 15k   97    2   11
20:19:59 P   3  3 Dono T/T   0 39k   0 13k    0 6.9M 0.0   0   0   0 15k   93    2    8
20:20:00 P   3  3 Dono T/T   0 31k   0 13k    0 6.9M 0.1   0   0   0 15k   95    2   10
20:20:01 P   3  3 Dono T/T   0 25k   0 13k    0 6.8M 0.0   0   0   0 15k   92    2    7
20:20:02 P   3  3 Dono T/T   0 18k   0 12k    0 6.3M 0.0   0   0   0 15k   95    2    8
20:20:03 P   3  3 Dono T/T   0  5k   0 14k    0 7.3M 0.0   0   0   0 15k   90    2    5
20:20:04 P   3  3 Dono T/T   0  23   0 11k    0 5.8M 0.0   0   0   0 887   95    3    9
20:20:05 P   3  3 Dono T/T   0   1   0  7k    0 3.4M 0.0   0   0   0 883   67    4    2
20:20:06 P   3  3 Dono T/T   0   0   0  7k    0 3.4M 0.0   0   0   0 920   68    3    2
20:20:07 P   3  3 Dono T/T   0   0   0  5k    0 2.7M 0.0   0   0   0 852   73    4    3

mycluster / ip-10-142-147-72 / Galera 2.7(r157)

Wsrep Cluster Node Queue Ops Bytes Flow Conflct PApply Commit

time P cnf # cmt sta Up Dn Up Dn Up Dn pau snt lcf bfa dst oooe oool wind

20:19:54 P 3 3 Dono T/T 0 70k 0 13k 0 6.7M 0.0 0 0 0 15k 95 2 9

20:19:55 P 3 3 Dono T/T 0 63k 0 13k 0 6.8M 0.0 0 0 0 15k 94 2 8

20:19:56 P 3 3 Dono T/T 0 56k 0 13k 0 6.7M 0.0 0 0 0 15k 92 2 7

20:19:57 P 3 3 Dono T/T 0 46k 0 14k 0 7.1M 0.0 0 0 0 15k 97 2 11

20:19:59 P 3 3 Dono T/T 0 39k 0 13k 0 6.9M 0.0 0 0 0 15k 93 2 8

20:20:00 P 3 3 Dono T/T 0 31k 0 13k 0 6.9M 0.1 0 0 0 15k 95 2 10

20:20:01 P 3 3 Dono T/T 0 25k 0 13k 0 6.8M 0.0 0 0 0 15k 92 2 7

20:20:02 P 3 3 Dono T/T 0 18k 0 12k 0 6.3M 0.0 0 0 0 15k 95 2 8

20:20:03 P 3 3 Dono T/T 0 5k 0 14k 0 7.3M 0.0 0 0 0 15k 90 2 5

20:20:04 P 3 3 Dono T/T 0 23 0 11k 0 5.8M 0.0 0 0 0 887 95 3 9

20:20:05 P 3 3 Dono T/T 0 1 0 7k 0 3.4M 0.0 0 0 0 883 67 4 2

20:20:06 P 3 3 Dono T/T 0 0 0 7k 0 3.4M 0.0 0 0 0 920 68 3 2

20:20:07 P 3 3 Dono T/T 0 0 0 5k 0 2.7M 0.0 0 0 0 852 73 4 3

So, it took this node 50 seconds to catch up again. Right at 20:20:05 when the queue zeroed out, I checked wsrep_last_committed again:

ip-10-142-147-72 mysql> show global status like 'wsrep_last_committed';
+----------------------+---------+
| Variable_name        | Value   |
+----------------------+---------+
| wsrep_last_committed | 1332551 |
+----------------------+---------+
1 row in set (0.00 sec)

ip-10-142-147-72 mysql> show global status like 'wsrep_last_committed';

+----------------------+---------+

| Variable_name | Value |

+----------------------+---------+

| wsrep_last_committed | 1332551 |

+----------------------+---------+

1 row in set (0.00 sec)

Be sure to turn off wsrep_desync when we are done and caught up! Note you can turn off wsrep_desync right away, but that puts the node into the JOINED state which does limited flow control to help the node catch up. We want our sample to be unbiased by flow control (at least from this node).

ip-10-142-147-72 mysql> set global wsrep_desync=OFF;
Query OK, 0 rows affected, 1 warning (0.00 sec)

1 2	ip-10-142-147-72 mysql> set global wsrep_desync=OFF; Query OK, 0 rows affected, 1 warning (0.00 sec)

So the node drops back into the ‘Synced’ state and FC applies again:

mycluster / ip-10-142-147-72 / Galera 2.7(r157)
Wsrep    Cluster  Node     Queue   Ops     Bytes     Flow    Conflct PApply        Commit
    time P cnf  #  cmt sta  Up  Dn  Up  Dn   Up   Dn pau snt lcf bfa dst oooe oool wind
20:20:10 P   3  3 Dono T/T   0   0   0  6k    0 3.2M 0.1   0   0   0 901   76    4    3
20:20:11 P   3  3 Dono T/T   0   1   0  6k    0 3.3M 0.0   0   0   0 877   82    4    4
20:20:12 P   3  3 Dono T/T   0  24   0  6k    0 3.3M 0.0   0   0   0 877   88    3    7
20:20:13 P   3  3 Dono T/T   0   0   0  7k    0 3.5M 0.0   0   0   0 886   81    4    5
20:20:14 P   3  3 Sync T/T   0   0   0  6k    0 3.2M 0.0   2   0   0 879   69    3    3
20:20:15 P   3  3 Sync T/T   0   0   0  7k    0 3.4M 0.0   1   0   0 873   79    4    3
20:20:17 P   3  3 Sync T/T   0   0   0  6k    0 3.3M 0.1   7   0   0 971   80    4    3

mycluster / ip-10-142-147-72 / Galera 2.7(r157)

Wsrep Cluster Node Queue Ops Bytes Flow Conflct PApply Commit

time P cnf # cmt sta Up Dn Up Dn Up Dn pau snt lcf bfa dst oooe oool wind

20:20:10 P 3 3 Dono T/T 0 0 0 6k 0 3.2M 0.1 0 0 0 901 76 4 3

20:20:11 P 3 3 Dono T/T 0 1 0 6k 0 3.3M 0.0 0 0 0 877 82 4 4

20:20:12 P 3 3 Dono T/T 0 24 0 6k 0 3.3M 0.0 0 0 0 877 88 3 7

20:20:13 P 3 3 Dono T/T 0 0 0 7k 0 3.5M 0.0 0 0 0 886 81 4 5

20:20:14 P 3 3 Sync T/T 0 0 0 6k 0 3.2M 0.0 2 0 0 879 69 3 3

20:20:15 P 3 3 Sync T/T 0 0 0 7k 0 3.4M 0.0 1 0 0 873 79 4 3

20:20:17 P 3 3 Sync T/T 0 0 0 6k 0 3.3M 0.1 7 0 0 971 80 4 3

Conclusion

So in 50 seconds, the node was able to apply 667183 transactions (difference between the two wsrep_last_seqno) which comes out to 13.3k tps apply capacity (at least sustained for 1 minute). This tells us we’re around 38% capacity for write throughput. Is that a perfect number? Maybe not, but it at least gives you a rough idea.

However, the point is that thanks to wsrep_desync we can measure this safely within a synchronous replication environment that may normally not allow this type of operation.

1 Comment

Oldest

Newest Most Voted

Inline Feedbacks

View all comments

Alex

10 years ago

Jay, another amazing insight into Galera internals and application of wsrep_desync, which, I bet, none of us would have thought of! (y)

MySQL 5.7
End of Life

Compare Percona to Leading Database Solutions

Software
Downloads

Product
Documentation

Resource Hub

Financial Services

Driving Database Success

Percona Blog

Percona Community Hub

Percona Events Hub

About Percona

Percona in the News

Our Customers

Our Partners

Careers

Contact Us

Measuring Max Replication Throughput on Percona XtraDB Cluster with wsrep_desync

Measuring an average apply rate on PXC

Measuring Max Replication throughput on PXC

Conclusion

Related

Related Blog Articles

RECOMMENDED ARTICLES

Choosing the Right Database: Comparing MariaDB vs. MySQL, PostgreSQL, and MongoDB

Seamless Table Modifications: Leveraging pt-online-schema-change for Online Alterations

Securing Your MySQL Database: Essential Best Practices

MOST POPULAR ARTICLES

Auditing login attempts in MySQL

Deploy Django on Kubernetes With Percona Operator for PostgreSQL

MySQL “Got an error reading communication packet”

MySQL 5.7 End of Life

Compare Percona to Leading Database Solutions

Software Downloads

Product Documentation

Resource Hub

Financial Services

Driving Database Success

Percona Blog

Percona Community Hub

Percona Events Hub

About Percona

Percona in the News

Our Customers

Our Partners

Careers

Contact Us

Measuring Max Replication Throughput on Percona XtraDB Cluster with wsrep_desync

Measuring an average apply rate on PXC

Measuring Max Replication throughput on PXC

Conclusion

Related

Share This Post!

Want to get weekly updates listing the latest blog posts?

Related Blog Articles

RECOMMENDED ARTICLES

Choosing the Right Database: Comparing MariaDB vs. MySQL, PostgreSQL, and MongoDB

Seamless Table Modifications: Leveraging pt-online-schema-change for Online Alterations

Securing Your MySQL Database: Essential Best Practices

MOST POPULAR ARTICLES

Auditing login attempts in MySQL

Deploy Django on Kubernetes With Percona Operator for PostgreSQL

MySQL “Got an error reading communication packet”

MySQL 5.7
End of Life

Software
Downloads

Product
Documentation