I just finished reading this excellent interview with Bjarne Stroutrup. The guy just blows my mind with how much he understands and takes in, and it just made me start thinking about some thing. He uses an amazing analogy in the interview where he points out that the amount of energy required to move a boulder across a town is far greater than to move a pebble across that town, but in the end everyone cares where the boulder is and only if the pebble ends up in someone's shoe does he whine about it.
It made me realize that for the majority of my life I have viewed "genius" and "skill" as getting something done quickly or in some clever manner, but in reality, true genius is in the slow and steady work that moves that boulder. Even in life, very little of any consequence is done in a minute, an hour, or even a day, but the things that truly matter can take a lifetime. It was just an interesting thought that helped me remember that the old adage, "slow and steady wins the race," probably has more truth than even an initial inspection lets on.
...or maybe I'm just getting old.
The place that documents all of the crazy ideas I have and all of the wacky projects that come from them.
Friday, August 14, 2009
Wednesday, July 15, 2009
Queries for Nested Set Model
So this is just a follow up post to my Nested Set Model post to document the scheme/queries that I'm using to define my tree. I created a node with id=0 called ROOT (since it's basically just a place holder that I use to do the dirty work and won't ever show the user) and create a UNIQUE constraint on (lft, rgt) to satisfy my obsession with data integrity (and I imagine that it should help speed up the queries as well).
So here's the queries I use to grab the information in useful manners.
Grab the Whole Tree:
So here's the queries I use to grab the information in useful manners.
Grab the Whole Tree:
SELECT *Grabs the Ancestors of a Node:
FROM bills
ORDER BY lft;
SELECT parent.*Grabs the Descendents of a Node:
FROM bills AS node, bills AS parent
WHERE node.lft BETWEEN parent.lft AND parent.rgt
AND node.id = ? AND parent.id <> ? (NOTE: Both ?'s are the same value)
ORDER BY node.lft;
SELECT node.*Grab the Parent of a Node:
FROM bills AS node, bills AS parent
WHERE node.lft > parent.lft AND node.rgt < id =" ?">
SELECT parent.*Obviously, the '?' is the ID of the node that you're interested in and you can cater the selected columns to your needs.
FROM bills AS node, bills AS parent
WHERE node.lft BETWEEN parent.lft AND parent.rgt
AND node.id = ?
ORDER BY parent.lft DESC LIMIT 1,1
Tuesday, July 14, 2009
So maybe databases aren't half bad afterall
So I've been working on a budgeting program for the last few years (none of the available ones had the features that I needed/wanted) and when I first started I put all of the data in XML. It was easy/quick to get it up and running and handled the hierarchical nature of the data very well, but as is the case with XML is was very space heavy and took a while to save, load, and query the information (I later changed the program to just load the XML to mirrored data structures in memory to speed things up, but querying transactions by date or bill assignment still meant that I had to touch every single transaction in memory).
So I eventually made the switch to SQLite and since the hierarchy was fixed and I didn't have any trees in my data structures, the weaknesses of hierarchical data in databases weren't a concern. Currently there's only 2 levels in my bills structure (categories and then bills), but a few weeks ago I decided that I wanted to get rid of categories and change the way I did bills to allow infinite depth/sub-bills to my bill organization.
This then had me wondering if I should go back to a more hierarchical friendly data format. After the compactness and speed of SQLite, I didn't want to go back to XML and a little searching turned up Protocol Buffers. They are actually really cool and I was amazed by them that I was about to redo my whole program to make use of them (I'm a sucker for strict typing), but then I started thinking about the whole querying thing again. Right now, I can query in so many ways in a very easy and efficient manner that it's almost impossible to match with my own data structures. I thought about reorganizing the data so that querying was more efficient, but then I was just recreating a database by hand in my code and I just couldn't justify all of the programming time that that would require just to get some potential speed improvements (and then all of the data would have to be resident in memory all the time (or I'd have to write my own read/write/cache system) and the cons definitely started outweighing the potential pros).
So I was then left with getting the hierarchical data working in SQLite. The standard parent-child tree structure was the simple approach that popped into my head first and apparently some new SQL to handle such a thing in some DBs (but not SQLite). I figured that someone smarter than myself must have already solved this problem, and some digging revealed that I was right about that. I stumbled on the nested set model, and it's one of those beautifully elegant solutions that I love so much about programming (but that's another post). It's pretty cool, because you actually encode all of the links in a separate table (or you can do the same table if you like) and you do it by doing a pre-order tree traversal and numbering as you go. So insert, moving, and deleting nodes is just a matter of updating the numbering in linkage. The one disadvantage is that when you add/delete/change a node in the tree, you are going to have to touch half the entries in the table on average, but in my program adding/moving bills is infrequent and there are FAR fewer bills than transactions, so paying that small hit in performance on the bills is more than worth it for the big gains with transactions.
So I eventually made the switch to SQLite and since the hierarchy was fixed and I didn't have any trees in my data structures, the weaknesses of hierarchical data in databases weren't a concern. Currently there's only 2 levels in my bills structure (categories and then bills), but a few weeks ago I decided that I wanted to get rid of categories and change the way I did bills to allow infinite depth/sub-bills to my bill organization.
This then had me wondering if I should go back to a more hierarchical friendly data format. After the compactness and speed of SQLite, I didn't want to go back to XML and a little searching turned up Protocol Buffers. They are actually really cool and I was amazed by them that I was about to redo my whole program to make use of them (I'm a sucker for strict typing), but then I started thinking about the whole querying thing again. Right now, I can query in so many ways in a very easy and efficient manner that it's almost impossible to match with my own data structures. I thought about reorganizing the data so that querying was more efficient, but then I was just recreating a database by hand in my code and I just couldn't justify all of the programming time that that would require just to get some potential speed improvements (and then all of the data would have to be resident in memory all the time (or I'd have to write my own read/write/cache system) and the cons definitely started outweighing the potential pros).
So I was then left with getting the hierarchical data working in SQLite. The standard parent-child tree structure was the simple approach that popped into my head first and apparently some new SQL to handle such a thing in some DBs (but not SQLite). I figured that someone smarter than myself must have already solved this problem, and some digging revealed that I was right about that. I stumbled on the nested set model, and it's one of those beautifully elegant solutions that I love so much about programming (but that's another post). It's pretty cool, because you actually encode all of the links in a separate table (or you can do the same table if you like) and you do it by doing a pre-order tree traversal and numbering as you go. So insert, moving, and deleting nodes is just a matter of updating the numbering in linkage. The one disadvantage is that when you add/delete/change a node in the tree, you are going to have to touch half the entries in the table on average, but in my program adding/moving bills is infrequent and there are FAR fewer bills than transactions, so paying that small hit in performance on the bills is more than worth it for the big gains with transactions.
Saturday, April 25, 2009
Disabling Trackpad while typing in Ubuntu 9.04 (Jaunty)
So I just upgraded my laptop to the official release of Ubuntu 9.04 and I was annoyed that the trackpad wasn't disabled automatically when typing (why this isn't just done in the hardware I don't know), but after piecing together some posts from these instructions, I was able to get it working. Here's what I did:
1) Enabled SHMConfig by following these instructions, which is to basically create the file /etc/hal/fdi/policy/shmconfig.fdi and add this to it:
2) Add the following command "syndaemon -i 1 -d -t -S" to System->Preferences->Startup Applications
3) Restart
It worked like a charm and now I don't have to worry about the cursor jumping around like crazy when I'm typing.
1) Enabled SHMConfig by following these instructions, which is to basically create the file /etc/hal/fdi/policy/shmconfig.fdi and add this to it:
<deviceinfo version="0.2">
<device>
<match key="input.x11_driver" string="synaptics">
<merge key="input.x11_options.SHMConfig" type="string">True</merge>
</match>
</device>
</deviceinfo>
2) Add the following command "syndaemon -i 1 -d -t -S" to System->Preferences->Startup Applications
3) Restart
It worked like a charm and now I don't have to worry about the cursor jumping around like crazy when I'm typing.
Subscribe to:
Posts (Atom)