One thing that has been interesting about early to bed, early to rise is the psychological effect of having more time between when I get up and when my day starts. Nobody else schedules meetings at 6am, and even though I could start working as soon as I wake up, I normally start an hour or two after that.
But when you get up late, you sometimes don't get any time before things happen. The day has started without you, you've woken up 5 minutes before your meeting, something important is happening and you need to deal with it right now, quick, get going! These days are the worst, because you start behind and stay behind, and spend the rest of the day struggling to get back to zero.
Whereas when you start ahead, and you know you've got the jump on the day, you can relax a bit. There's plenty of time before the day starts, so what should that day contain? What would be a good use of that time? Can you prepare for anything you know is going to happen? Are there any problems you think will come up and is there anything you can do about them now?
Of course, this isn't so much about waking up as it is about preparation. Any time you go into a situation without having time to think about that situation, you're at a disadvantage. But what's interesting is that a day is a situation like any other, and you can't really prepare for it unless you have time to think before it starts.
I've been doing a bit more hardware-related tomfoolery, and one thing that can be hard to get your head around is the sheer quantity and variety of nearly-identical hardware. This amazing documentary on Shenzhen goes into a lot of the mindset behind it, but basically everyone copies everything else, and that goes all the way from individual components up to entire products.
It can be very hard to keep everything straight, especially when a lot of things don't have model numbers, or use different model numbers for the same thing. Like in these results for breadboard power supplies, you end up with names like "red wings", "ywrobot v2", and "that white one with three capacitors". And the problem is, even if you have a sensible model number, everyone else will copy the whole thing anyway.
I think a nice way to approach this would be a pattern that turns up a lot in data management: content addressing. Instead of giving your hardware a name, the name is derived from what the hardware is. So if your hardware is a couple of capacitors and transistors rigged up in a certain configuration, its name would be determined by the transistors and capacitors used and the arrangement that they're in. Some components are already named by their properties, so it seems like a logical continuation.
Of course, a single product could contain hundreds of components, so these names could be ridiculously long. But there's a nice answer to that too: content-addressed names can refer to other names. So a particular transistor has a consistent part number, and a configuration of those transistors (into, say, an op amp), can have a long-form name based on what it contains, but also a short-form one based on what it does. That way, you could name a breadboard power supply either as the parts it contains, or as its characteristics (input/output voltages, regulator type etc).
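To make that concrete, here's a minimal sketch of the idea in Python: hash a canonical description of a part to get its name, and let assemblies refer to sub-parts by their names. The property names here are invented for illustration, not any real part taxonomy.

```python
import hashlib
import json

def content_address(description):
    """Derive a short name by hashing a canonical (sorted-key) JSON
    serialisation of a part's description."""
    canonical = json.dumps(description, sort_keys=True)
    return hashlib.sha256(canonical.encode()).hexdigest()[:12]

# A leaf component is named by its electrical properties...
transistor = content_address({"type": "npn", "hfe": 100, "vce_max": 40})

# ...and an assembly is named by the parts it contains and how
# they're arranged, referring to sub-parts by their own names.
op_amp = content_address({
    "parts": [transistor, transistor],
    "topology": "differential-pair",
})
```

The nice property is that the same description always yields the same name, no matter who computes it, so two manufacturers copying the same design end up agreeing on what to call it.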
The goal is that, one way or another, you can get away from the idea of manufacturer-specific model numbers and just think in terms of descriptions of the parts themselves. I think that would be more in line with the open hardware mindset and hopefully lead to faster iteration and less confusion as a result.
Last time I was doing some web experiments and some stuff in a new language. This set of prototypes was a bit more practical, but still included some new language tomfoolery.
I've had an Arduino ECG shield kicking around for ages that I've been meaning to try out, so I finally gave it a go. The example sketch worked great, and it just dumps data via serial, so I figured I'd write some Rust to read the serial data and print it. I spent a while trying to figure out the right way to do packet parsing in Rust, before eventually giving up and just printing the raw data. The data looked cool, though Rust's immature library ecosystem probably makes it the wrong choice of tool for this sort of problem compared with, say, Python.
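For what it's worth, here's the shape of the packet parsing I was after, sketched in Python instead. The sync-byte framing below is a hypothetical stand-in, not the shield's actual protocol, which just streams values.

```python
# Hypothetical frame: a 0xAA sync byte followed by a 16-bit
# little-endian ECG sample. The real shield's framing differs.
SYNC = 0xAA

def parse_samples(buf):
    """Scan a byte buffer for framed samples, returning the decoded
    values and any unconsumed trailing bytes."""
    samples = []
    i = 0
    while i + 2 < len(buf):
        if buf[i] != SYNC:
            i += 1  # resync on garbage bytes
            continue
        samples.append(buf[i + 1] | (buf[i + 2] << 8))
        i += 3
    return samples, buf[i:]

# With pyserial, you'd feed it straight from the port:
#   import serial
#   port = serial.Serial("/dev/ttyACM0", 115200)
#   samples, leftover = parse_samples(port.read(64))
```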
I wanted to try out some Elm again, after my previous attempts turned out fairly unimpressively. Elm has since ditched its effect model for a new system so I figured that would be fun. I actually felt like I was fighting things less this time, so it's nice to see the language mature. This was just a simple toy to generate random numbers (impure!) and send them over websockets (even more impure!). Worked well though, and it was a much better experience than before. Elm still seems to really like starting you off with a simple mode with fewer options, which is just confusing because you outgrow it so quickly and then have to rewrite everything.
I recently got some milight smart bulbs and was hoping to control them using one of the many embedded computers I have lying around. Unfortunately, the protocol has some scrambling that's just complex enough to be annoying to hobbyists, while basic enough not to add any actual security. Oh, hardware manufacturers, never change. So I figured I'd put on my reverse-engineering hat and have a crack at it. First step was to pull together the various bits of information about the new bulbs and how to communicate with them. This culminated in a patch to the openmili library, which let me dump out the data.
Next is the unscrambling. I've never really done a lot of this, but I figured I'd just look for patterns. Each message is 9 bytes. Bytes 1-6 and 8 seem to be random, but if one is the same then they're all the same, while bytes 7 and 9 are only the same when you reset the remote, making 7 likely a counter and 9 likely a checksum. I collected enough data to find a way to predict byte 2 given byte 1. I think that technique should allow for eventually descrambling the whole thing, but I have to figure out how the counter byte changes. For now, this prototype is just the code for predicting that second byte.
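The pattern-hunting itself can be sketched pretty simply. Suppose (hypothetically) the simplest possible relationship held, a constant XOR between bytes 1 and 2; the real scrambling is more involved than this, but the shape of the search is the same: collect pairs from dumped messages, then check whether one candidate transformation explains all of them.

```python
def find_xor_key(pairs):
    """Check whether byte 2 is byte 1 XOR a constant across all
    captured (byte1, byte2) pairs; return the constant if so."""
    keys = {b1 ^ b2 for b1, b2 in pairs}
    return keys.pop() if len(keys) == 1 else None

# Pairs dumped from captured messages (made-up values for illustration)
captures = [(0x12, 0x5F), (0x34, 0x79), (0xA0, 0xED)]

key = find_xor_key(captures)
if key is not None:
    predict_byte2 = lambda b1: b1 ^ key
```

If no single constant fits, you move on to the next candidate (XOR with a rolling value, byte swaps, lookup tables) until one survives all the captures.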
I hear a lot lately about Big Data, the process of collecting lots and lots of information and then figuring out how to find meaning in all of it. This is an approach that has become popular in (especially tech) businesses, in science, and even in personal management. When it comes to data, bigger is better. Why have megabytes when you can have gigabytes, and why gigabytes when you can have terabytes? Just think of all the great science you can do with all that data!
Only, I'm not sure it really is science. I mean, one of the fundamental tenets of science is that you test theories with evidence. That means the theory has to come before the evidence does. If you do the experiment first and then figure out the theory afterwards, you fairly quickly fall into the famed phlogiston school of "oh what a coincidence, my theory also explains this evidence". In other words, you're not using the evidence to test the theory, you're using theory to describe the evidence.
Even when they work, there is something fundamentally missing from data-driven approaches; no matter how sophisticated, prediction isn't understanding. Let's say you have a powerful weather prediction model, trained on the biggest Big Data your big Big Data data warehouse can hold. It can tell you with 99.9% accuracy the weather tomorrow based on a thousand different dimensions of input. But do you actually know how the weather works? Have you learned anything about fluid dynamics? Can you turn your predictions into understanding?
I think the essential conflict is that understanding means less data. A terabyte of random-looking numbers can be replaced with a one-line formula if you know the rule that generated them. An enormously sophisticated and complex model can be replaced with a simple one if you figure out what the underlying mechanics are. The standard model of particle physics can fit on a t-shirt. If your model doesn't, either it's more complex than particle physics or you just don't understand it very well.
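As a toy illustration of that point in Python: a million random-looking bytes collapse to two facts, the generator and its seed, once you know the rule that produced them.

```python
import random

# A million "random-looking" numbers...
random.seed(42)
data = [random.randint(0, 255) for _ in range(1_000_000)]

# ...fully described by two facts: the generator and the seed.
# Anyone who knows the rule can throw the data away and rebuild it.
random.seed(42)
regenerated = [random.randint(0, 255) for _ in range(1_000_000)]

assert regenerated == data
```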
Now, you might say particle physics is a bad example. The Large Hadron Collider records terabytes of data for every particle collision, surely that's Big Data? That actually gets to the core of the issue; it's not that analysing lots of data can't be useful, but it's a means, not an end. Nobody at CERN started with "hey let's just smash a bunch of particles together and record it all, maybe we'll find some science in there". Theory came first, and the LHC is the experiment that comes after. If there was an easier experiment, some way that didn't require all that data and all that expense, you think anyone would bother with particle colliders?
The most wonderful skill in all of science is to take a complex question and turn it into a simple question, and then use a simple answer to solve both. Big Data is too often a way of answering complex questions without making them simple. What you get is a complex answer, when what you really wanted was a simple one. Sometimes you might need a lot of data to answer a simple question, but often you only need a little, and I think it'd be good to see more hype for little data.
Today I'm happy to release Automata by Example, a new project where you can build your own cellular automata by just clicking around on a grid. Each click creates new cellular automata rules that lead to large scale changes across the entire grid.
I think of this project as being vaguely a successor to The Sound of Life and Scrawl, two previous projects concerning cellular automata and iterative line construction respectively. The difference here is that instead of the system being controlled directly, it's controlled by rules that you generate.
The technique, which I think of as rule generation, is something a little like direct manipulation, but with an additional generalisation step. You click the pixel you want, but instead of just setting the pixel directly, the system figures out a rule that would set that pixel and applies that rule globally. In other words, you determine the rule from the action, then use the rule to apply more actions.
It seems to work really well, provided your system is simple enough to turn rules into actions easily. The 3x3 neighbourhood grid (inspired by Conway's Game of Life) turns out to be great for this. Each click captures the 3x3 neighbourhood at the time of the click and uses that to generate a cellular automata rule that toggles the cell under the mouse and all others with the same neighbourhood.
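A minimal sketch of that mechanism (not the project's actual code): a rule is just a stored 3x3 neighbourhood, a click records one, and a step toggles every cell whose neighbourhood matches a recorded rule.

```python
def neighbourhood(grid, x, y):
    """The 3x3 patch around (x, y) as a hashable tuple, wrapping at edges."""
    h, w = len(grid), len(grid[0])
    return tuple(grid[(y + dy) % h][(x + dx) % w]
                 for dy in (-1, 0, 1) for dx in (-1, 0, 1))

def click(grid, x, y, rules):
    """Capture the neighbourhood under the mouse and record it as a rule."""
    rules.add(neighbourhood(grid, x, y))

def step(grid, rules):
    """Apply all rules at once: toggle each cell whose neighbourhood
    matches any recorded rule."""
    h, w = len(grid), len(grid[0])
    return [[grid[y][x] ^ (neighbourhood(grid, x, y) in rules)
             for x in range(w)] for y in range(h)]
```

Clicking on a blank grid records the all-empty neighbourhood, so the very first rule flips every cell on the board, which is exactly the "small click, large scale change" effect.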
This representation makes constructing rules easy, but it's quite hard to be exhaustive. For example, if you want to match all cells with 4 neighbours, you have to explicitly create every permutation as a rule. Still, there are a lot of fascinating patterns and interesting automata you can make with very few rules.