If you only learn one thing about database transactions, it should be thisSun, Aug 27, 2006 in Databases
I’ve been writing a lot of articles about locks, deadlocks, and transactions recently, and it occurs to me I’ve neglected to mention the single most important thing to know. If you only learn one thing about transactions in database systems, you should learn this, and learn it thoroughly – burn it into your brain permanently, if possible.
The single most important thing you can do is keep your transactions as small as possible.
That simple practice will improve performance, increase concurrency, reduce deadlocks, and generally create world peace. Plus it’ll make you think hard about your queries, which will probably make them higher-quality and better to maintain.
Now, how can you do it? Ah, that’s the trick. I offer you six simple suggestions.
1. Have the right attitude
The way to think about transactions is as an urgent mission. The instant you say
START TRANSACTION, the race is on.
It’s like when you’re waiting in the car for the pouring rain to stop, before you make a mad dash for the house. You prepare yourself, unlock the door, unbuckle the seat belt… take a deep breath… START TRANSACTION go go go go COMMIT!
2. Do as few statements as possible
Do only the statements you need to inside the transaction. Don’t make updates to
big_huge_table and inserts to
giant_table, then poke around inside other tables looking at little things of no consequence before finishing your work. Remember, a transaction is a set of statements that must all succeed together as a unit, or must all fail together as a unit. Include in your transaction only the statements that belong to that unit of work.
If you can do it all in a single query, you don’t even need a transaction. By definition, a single query is a one-statement transaction.
3. Prepare as much ahead of time as possible
To help include as few statements as possible in your transaction, look at the order of the queries. Can some of them be moved earlier in the sequence, before the
START TRANSACTION statement? If so, good. Do as much preparation as possible before you start the work. Especially think about whether you can check to make sure there’s even work to be done, or whether you’re likely to be able to finish the work – if you can find that out ahead of time, you might be able to avoid even doing anything.
4. Touch the smallest amount of data possible
If possible, avoid changing data that doesn’t need to be changed. For example, if you’re updating a summary table and you know what was just changed in the table from which the summary is calculated, you may not need to update the entire summary – maybe just part of it. Use indexes wisely to constrain your work to just part of a table instead of doing the entire table. Use every bit of information at your disposal to avoid working with more data than you need to.
5. Don’t wait around before committing
The goal is to lock as few resources as possible, for the shortest time possible. To that end, look at whether you can re-order the statements within your transaction. Is it possible to make the big changes later in the transaction? Can you delay getting locks on the really important table, which everyone else is accessing at the same time, until near the end? If so, you might cut down the number of locks and the duration they’re held. And definitely commit the transaction as soon as you’re done.
6. Don’t sacrifice consistency
Transactions have a purpose, and you should not be so afraid of holding locks that you commit when only half the work is done. Use transactions deliberately and carefully to group a set of work together into a logical unit. By all means examine whether the unit is bigger than it needs to be, but don’t shoot yourself in the foot by committing before the work is all done, out of fear of deadlocks.
Keeping transactions as small as possible is the most important thing to do, but it may not be obvious, especially if you’re less experienced with databases. A few easy practices, combined with simple awareness, can go a very long way. But remember – don’t defeat the purpose by cheating yourself out of the very benefits transactions give you.
I'm Baron Schwartz, the founder and CEO of VividCortex. I am the author of High Performance MySQL and lots of open-source software for performance analysis, monitoring, and system administration. I contribute to various database communities such as Oracle, PostgreSQL, Redis and MongoDB. More about me.