Current Reading List – Feb 2012

It has been a good many years since I have posted about my current reading list, so I thought it was about time to fire off another one. These are the books started, completed, being read or on my short-list to start (or in one case re-read) in the month of February.


The 4-Hour Workweek (completed)

I found this book amazingly insightful. Regardless of how much you implement in your own career, it’s a fantastic tome. Those I’ve gifted the book too have all said they really found it useful, interesting and a true paradigm shift in how they view life, career, family and finding a new balance between them that suits you!.

Tim laid out his own struggles in great candor, failures in life time management and how he found a way to over-come all of them. The book is also filled with testimonials from readers of his first edition. If you’re finding that you want more out of your life, struggling with the concept of retirement and wondering what you’ll do when you retire, this book may upset your world, but hopefully in do some will show you some options you might not have considered. Give this a read!


Design of Design (finishing up)
Over the many years in the role of software designer (originally trained in the 80’s, which is my biggest challenge to overcome), there has always been a nagging sense that some part of the process was not working for me. I adjusted, tried other methods, made adaptions, but the old Rational Model (aka Waterfall) of design always seemed to fail me. Now, I understand why! It’s a BAD MODEL. Dr. Frederick Brooks (father of the IBM 360) and now professor at University of North Carolina Chapel Hill, rips open the old concepts in this book of his essays on design.

Covering a variety of other design methodologies, this book is not only a theoretical read, but an empirical one. Many real-world examples of design program successes and failures are laid out, almost in a case study format. This is been a very educational read. Lessons learned from this book have been put into place in current projects, and the results are already starting to be seen. Now I just need to start educating my staff and colleagues on these findings. Recommended.


The Creative Priority
I originally read this book in the last 90’s and found it very useful in understanding the creative process. What drives creatives and how to foster a creative culture. Sadly, over the hears of the Dot-Bomb meat-grinder cultural immersion, these concepts and skills have been lost. So, I’m pulling this one back off the shelf for a re-read. I plan to report on it soon.


Cassandra High Performance Cookbook
This is the latest addition to the list, having just arrived this weekend. I’m currently running a project to investigate the suitability of Cassandra to solve problems in a client’s current relational database solution (see my previous post about Cassandra for background).

This book was recommended by the primary authors and maintainers of Cassandra. I look forward to cracking this open and going head-first into this technology.

Apache Cassandra Project – processing “Big Data”

Being an old-school OSS’er, MySql has been my go-to DB for data storage since the turn of the century. It’s great, I love it (mostly) but it does have it’s drawbacks. Largest of which is it’s now owned by Oracle which does a HORRIBLE JOB of supporting it. I have personal experience with this, as the results of a recent issue with InnoDB and MySQL.

In the mean time, some of the hot-shot up-and-commers in another department have been facing their own data processing challenges (with MySql and other DB’s), and have started to look at some highly scalable alternatives. One of the front-runners right now is Apache’s Cassandra database project.

The synopsis from the page is (as would be most marketing verbiage) very encouraging!

The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra’s support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages.

This sounds too good to be true. Finally a solution that we might be able to implement and grow, and one that doe not have the incredibly frustrating drawback of InnoDB and MySql’s fragile replication architecture. I’ve found out exactly how fragile it is, despite have a cluster of high-speed specially designed DB servers, the amount of down time we had was NOT ACCEPTABLE!).

With a charter to handle ever growing amounts of data and the need for ultimate availability and reliability, an alternative to MySQL is almost certainly required.

Of the items discussed on the main page, this one really hits home and stands out to me:

Fault Tolerant

Data is automatically replicated to multiple nodes for fault-tolerance. Replication across multiple data centers is supported. Failed nodes can be replaced with no downtime.

I recently watched a video from the 2011 Cassandra Conference in San Francisco. A lot of good information shared. This video is available on the Cassandra home page. I recommend muscling through the marketing BS as the beginning and take in what they cover.

Job graph for ‘Big Data’ is skyrocketing.

Demand for Cassandra experts is also skyrocketing.

Big data players are using Cassandra.

It’s a known issue that RDBM’s (ex. MySql) have serious limitations (no kidding).

RDBM’s generally have an 8GB cache limit (this is interesting, and would explain some issues we’ve had with scalability in our DB severs, which have 64GB of memory).

The notion that Cassandra does not have good read speed, is a fallacy. Version 0.8 read speed is at parity of the already considered fast 0.6 write speed. Fast!?

No global or even low-level write locks. The column swap architecture alleviates the need for these locks, this allows high-speed writes.

Quorum reads and writes are consistent across the distribution.

New feature of local LOCAL_QUORUM allows quorums to be established from only the local nodes, alleviating latency waiting for a quorum including remote nodes in other geographic locations.

Cassandra uses XML files for schema modifications. In version 0.7 provides new features to allow on-line schema updates.

CLI for Cassandra is now very powerful.

Has a SQL language capability (yes!).

Latest version provides much easier to implement secondary indexing (indexes other than the primary).

Version 0.8 supports bulk loading. This is very interesting for my current project

There is wide support for Cassandra in both interpreted and compiled OSS languages, including the ones I most frequently use.

CQL Cassandra Query Language.

Replication architecture is vastly superior to MySQLs transaction and log replay strategy. Cassandra uses an rsync style replication where hash comparisons are exchanged to find which parts of the data tree a given replication node (that is responsible for that tree of data) might need updating, then then transferring just that data. Not only does this reduce bandwidth, but this implies asynchronous replication! Finally! Now this makes sense to me!!

Hadoop support exists for Cassandra, BUT, it’s not a great fit for Cassandra. Look into Brisk if Hadoop implementation is desired or required.

Division of Real-Time and Analytics nodes.

Nodes can be configured to communicate with each other in an encrypted fashion, but in general inter-node communication across public-private networks should be established using VPN tunnels.

This needs further research, but it’s very, VERY promising!

NEXT: “Cassandra and Big Data – building a single-node ‘cluster’

Updated main website with Feed Dividers

Released enhancement today to display daily dividers in those RSS Feed aggregators that show data more than 1 day old. The most obivous of these are my Blog Updates and the USGS data feed.

This update works best in the USGS feed. The posting to my blogs is infrequent enough that the dividers are just as prevelant as the posts themselves.

Here is a screen shot showing this first implementation worked out. Loving it in the USGS Quakes parsers/agreegators but NOT loving it in my own Blog Roll.

New daily dividers
More changes coming, I’m fairly certain of that.

Philippines Earth Quakes, follows similar activity in Vanuatu and Papua New Guinea

In what looks like sympathetic seismic activity, there has been a burst of quakes in the Philippines, just days after a string of large quakes not all that far south off of Vanuatu.

M 5.2 — Negros – Cebu region, Philippines
M 5.8 — Negros – Cebu region, Philippines
M 6.0 — Negros – Cebu region, Philippines
M 5.6 — Negros – Cebu region, Philippines
M 6.7 — Negros – Cebu region, Philippines

One has to wonder what in the world is going on there. Is this the precursor to more activity world wide? Does it mean anything at all?

USGS PAGER Data

USGS - PAGER map of quake cluster

Other recent strings of activity:

Complete string of Moderate Quakes Worldwide

M 5.2 — Negros – Cebu region, Philippines
M 5.8 — Negros – Cebu region, Philippines
M 6.0 — Negros – Cebu region, Philippines
M 5.6 — Negros – Cebu region, Philippines
M 6.7 — Negros – Cebu region, Philippines
M 5.1 — Fiji region
M 5.2 — Mindanao, Philippines
M 6.0 — Vanuatu
M 5.2 — Kyrgyzstan
M 5.0 — southern Iran
M 5.0 — Vanuatu
M 6.1 — Vanuatu
M 5.2 — Kepulauan Kai, Indonesia
M 5.3 — Owen Fracture Zone region
M 5.7 — Vancouver Island, Canada region
M 5.4 — Samar, Philippines
M 5.1 — central Mid-Atlantic Ridge
M 5.2 — Vanuatu
M 5.6 — Samar, Philippines
M 5.8 — Tonga
M 5.4 — central Mongolia
M 5.1 — Taiwan region
M 5.1 — New Britain region, Papua New Guinea
M 5.7 — Vanuatu
M 5.4 — Vanuatu
M 6.0 — Vanuatu
M 5.1 — Vanuatu
M 5.1 — Vanuatu
M 5.2 — Vanuatu
M 5.5 — Vanuatu
M 5.0 — Vanuatu
M 5.3 — Vanuatu
M 7.1 — Vanuatu
M 5.5 — New Britain region, Papua New Guinea
M 5.0 — New Ireland region, Papua New Guinea
M 5.4 — New Britain region, Papua New Guinea
M 5.0 — near the coast of southern Peru
M 5.0 — west of Macquarie Island
M 5.3 — Easter Island region
M 5.6 — southern Sumatra, Indonesia
M 5.2 — Samar, Philippines
M 5.1 — Vanuatu

Massive Earth Quake Activity in Vanuatu this week

A series of large quakes has hit the area near Vanuatu.

Here is a snapshot of the activity tracked on my homepage [ LINK ]. There have been big quakes near there for some time, but this is cluster of very large events:

M 5.0 — Vanuatu
M 6.1 — Vanuatu
M 5.2 — Vanuatu
M 5.7 — Vanuatu
M 5.4 — Vanuatu
M 6.0 — Vanuatu
M 5.1 — Vanuatu
M 5.1 — Vanuatu
M 5.2 — Vanuatu
M 5.5 — Vanuatu
M 5.0 — Vanuatu
M 5.3 — Vanuatu
M 7.1 — Vanuatu
M 5.1 — Vanuatu

It would appear that there is a some significant seismic activity in that region. Other areas like New Zealand and the seemingly always active Papua New Guinea are also seeing events.

M 5.0 — Kermadec Islands, New Zealand
M 5.4 — Kermadec Islands, New Zealand
M 5.4 — Kermadec Islands, New Zealand
M 5.2 — Kermadec Islands, New Zealand

M 5.5 — New Britain region, Papua New Guinea
M 5.0 — New Ireland region, Papua New Guinea
M 5.4 — New Britain region, Papua New Guinea
M 5.1 — New Britain region, Papua New Guinea
M 5.3 — Papua, Indonesia

The largest of the recent events [ LINK ] happened Thursday, February 02, 2012 at 13:34:40 UTC.

As illustrated in this clipping from the USGS Map for the event, there has been a lot of large quake activity in the last 7 days.

Fortunately, despite the area having structures vulnerable to shaking damage, the estimated threat to population due to distance is virtually nil [ LINK ]. It’s estimated less that there was a 30% chance of any fatality, and a 4% chance of more than 30.

Epicenter for the quakes appears to be on a submarine ridge between two island chains. A very nice nursery for a tsunami, should there be significant under sea land movement from a large event.

As always, I continue to watch worldwide seismic events on my home page: [ LINK ].

UPDATE: – 4-FEB-2012 21:20

More activity near the epicenter of that large quake of Vanuatu… these in just the last few hours:

M 5.0 — Vanuatu
M 6.1 — Vanuatu
M 5.2 — Vanuatu

CIDR Calculator App picked up by Softpedia

This just in…. (seems like a good thing). Notice that my new App CIDR Calculator for the MAC (in the wide for barely 24 hours now), was found by Softpedia, and linked in their site.

Congratulations,

CIDR Calculator, one of your products, has been added to Softpedia’s
database of software programs for Mac OS. It is featured with a description
text, screenshots, download links and technical details on this page:
http://mac.softpedia.com/get/Utilities/DeMartini-CIDR-Calculator.shtml

The description text was created by our editors, using sources such as text
from your product’s homepage, information from its help system, the PAD
file (if available) and the editor’s own opinions on the program itself.

Nothing wrong with a little free exposure. No ratings so far, but I hope to get some good feedback. It’s already sold several units so I know someone is out there giving it a test.

If you want to learn more about this entry in Softpedia, [ HERE IS THE LINK ].

If you want to check out the App itself at the Apple MAC App Store.. just click on the button below!

Buy at the Mac App Store