<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-6975983</id><updated>2011-09-28T13:59:14.628-07:00</updated><category term='Mobile'/><category term='Analytics'/><category term='Internet'/><category term='Visualization'/><category term='Technology'/><category term='MapReduce'/><category term='Cloud Computing'/><category term='programming'/><category term='Entertainment'/><category term='UI'/><category term='Windows'/><category term='Security'/><category term='Search'/><category term='Web 2.0'/><category term='Business Intelligence'/><category term='Concurrency'/><category term='Open Source'/><category term='Sightings'/><category term='Apps'/><category term='Database'/><category term='Dismal Science'/><category term='software engineering'/><category term='DRM'/><category term='programming languages'/><category term='SDForum'/><category term='Media'/><category term='Institutional Corruption'/><category term='Books'/><category term='Web Analytics'/><title type='text'>Build and Break</title><subtitle type='html'>We do not really know how something works until we have taken it to pieces and put it back together again. Thus gaining knowledge involves breaking things. I deal mostly with software.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default?start-index=101&amp;max-results=100'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>298</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-6975983.post-7853559489676641524</id><published>2011-07-11T23:17:00.000-07:00</published><updated>2011-07-11T23:17:10.187-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Apps'/><category scheme='http://www.blogger.com/atom/ns#' term='Mobile'/><title type='text'>Will Microsoft Wipe Out the Google Android Advantage?</title><content type='html'>Microsoft seems to be working many different angles to wipe out any advantage that Google should get as the primary developer of the Android OS for Smartphones. The sneaky angle is to persuade the Smartphone makers to replace Google as the default search engine with Microsoft's own Bing search engine. For example, the LG Revolution has the Android OS, however as the PC Mag &lt;a href="http://www.pcmag.com/article2/0,2817,2386210,00.asp"&gt;review&lt;/a&gt; says, it has been thoroughly Binged. Bing is the default search engine and Bing Maps is the default mapping application. My son, who recently got a LG Revolution told me that he had to download Google Maps to get the mapping application he prefers.&lt;br /&gt;&lt;br /&gt;Microsoft has also been going after the various Android smartphone makers, getting them to pay royalties on Microsoft patents. &lt;a href="http://www.eweek.com/c/a/Mobile-and-Wireless/Apple-Microsoft-Oracle-Lead-Unholy-Patent-Alliance-Against-Android-446031/"&gt;Reputedly&lt;/a&gt;, HTC is paying Microsoft $15 per smartphone to license patents. It would not surprise me if Microsoft is making more money from licensing patents to Android handset makers than it is making from selling its own Windows 7 Mobile operating system to handset makers. They are certainly making more profit from Android than from Windows 7.&lt;br /&gt;&lt;br /&gt;Microsoft could apply pressure other smartphone makers to set the default search engine to Bing in exchange for reduced payments of royalties. If this becomes widespread, Google loses the advantage that it gets from having developed Android, and in the long run even threatens the existence of Android. If Google gets no advantage from developing Android why should it continue? This is only a tiny slice of what is going on with mobile patents, however the ball seems to be in Google's court and we are waiting to see what they do with it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-7853559489676641524?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/7853559489676641524/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=7853559489676641524' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7853559489676641524'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7853559489676641524'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/07/will-microsoft-wipe-out-google-android.html' title='Will Microsoft Wipe Out the Google Android Advantage?'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3322931224869253208</id><published>2011-07-01T00:09:00.000-07:00</published><updated>2011-07-01T00:09:55.940-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>Fired!</title><content type='html'>It happened yesterday. They just sent me an eMail. How is that a good way to be &lt;a href="http://www.halfwaytoconcord.com/amazon-fires-california-associates-ahead-of-internet-tax/"&gt;fired&lt;/a&gt;? Anyway, from today onwards I can tell you that I am no longer an Amazons.com Associate. I cannot fault Amazon's reasons for firing me. Yesterday Governor "Moonbeam" Brown signed a law that because Amazon has associates in California they would have to collect sales taxes from California residents, and Amazon, clinging to "&lt;a href="http://news.cnet.com/8301-13578_3-10220649-38.html"&gt;the internet is tax free&lt;/a&gt;" mantra fired all their associates that are California residents.&lt;br /&gt;&lt;br /&gt;Also, I cannot really fault Governor Brown. He has a big budget gap to fund, an intransigent legislature and the determination that he is going to fix the budget problem properly, or at least better than his predecessors. So Brown signed legislation that would provide California with more revenue, or at least try to plug a gap in state revenue, or at least be a stage along the way to plugging the internet sales tax hole. This legislation on its own is not going to generate any extra revenue, however if &lt;a href="http://www.frugal-families.com/blog/373/i-was-fired/"&gt;enough&lt;/a&gt; &lt;a href="http://radar.oreilly.com/2010/03/amazon-fires-its-colorado-asso.html"&gt;other&lt;/a&gt; &lt;a href="http://breningstall.typepad.com/breningstall-on-typepad/2011/03/amazon-should-be-excoriated-fires-all-its-illinois-associates-to-avoid-paying-state-sales-tax-amazon.html"&gt;states&lt;/a&gt; with a sales tax pass similar legislation maybe enough companies will throw in the towel and start complying with state sales tax collection.&lt;br /&gt;&lt;br /&gt;What am I going to do? Well I am not going to leave California. The weather is good, there are plenty of good jobs that pay well enough and the other amenities, while not cheap, are well worth it. Also, I have to confess that I have not been a very good Amazon Associate, making practically nothing from all those seductive links sprinkled throughout this blog. In truth, I did not become an Amazon Associate to make money. The real reason was that I was concerned that if I copied a link from the Amazon.com web site, and if you clicked on that &lt;a href="http://www.amazon.com/exec/obidos/subst/home/home.html/002-1597209-2460243"&gt;link&lt;/a&gt;, you might see a message at the top of your screen that said something like "Hello Richard Taylor. Have we got some recommendations for you." As an Associate, I could get a good clean link for a product without having to worry about the fact that the link could have other unwanted baggage attached.&lt;br /&gt;&lt;br /&gt;So adios Amazon. I will just have to go back to guessing how to fix up links to your site so that they do the right thing.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3322931224869253208?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3322931224869253208/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3322931224869253208' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3322931224869253208'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3322931224869253208'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/07/fired.html' title='Fired!'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6934836125939655086</id><published>2011-06-28T22:42:00.000-07:00</published><updated>2011-06-28T22:42:13.032-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Dismal Science'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>Bitcoin as an Economic Entity</title><content type='html'>Bitcoin is the new peer-to-peer virtual currency that I wrote about &lt;a href="http://bandb.blogspot.com/2011/06/bitcoin-peer-to-peer-virtual-currency.html"&gt;previously&lt;/a&gt;. This post evaluates Bitcoin as money from an economic point of view. I will write a separate post on technical and security aspects. Economists look at money as &lt;a href="http://en.wikipedia.org/wiki/Money"&gt;three things&lt;/a&gt;: a measure of value, a medium of exchange and as a commodity, more commonly and politely stated as a store of value. Here is how Bitcoin measures up to these three function.&lt;br /&gt;&lt;br /&gt;One function of money is as a measure of value. When we use money to measure value, we do not mean that the money exists, rather that the asset, good or service is worth or cost the sum of money. Thus when we say that someone is a millionaire, this means that the sum of all they own minus their debts is more than a million dollars. It does not mean that they have a million dollars in bills stuffed into a mattress.&lt;br /&gt;&lt;br /&gt;The men with the green eyeshades often talk about this purpose of money as "unit of account", thinking about it as a measure of value gets to the essence more quickly. So, when I am in a computer store trying to decide whether I should buy the $500 laptop or the $1000 laptop, I use money as a comparative measure of value, by asking whether the $1000 laptop is really worth twice the $500 laptop and an absolute measure of value by asking whether I can afford the $1000 laptop that I really want or whether I should make do with the $500 laptop and save the difference for other needs. &lt;br /&gt;&lt;br /&gt;For a measure of value, the best currency is the currency we are familiar with, that we are paid in and that we use every day. Anyone who has been abroad knows the difficulty of commerce with an unfamiliar currency. At first, after every transaction the thought lingers in the back of your mind, did we just get a deal, or were we robbed? However, with repeated use you pick up a new currency. By the end of a vacation you are starting to be able to predict what goods and services will cost in the new currency. When I played World of Warcraft (WOW), I quickly learned the value of WOW Gold through having to work with it all the time. &lt;br /&gt;&lt;br /&gt;Bitcoin has another problem as a measure of value, its volatile exchange rate with other currencies. Since its introduction, it has appreciated against all other currencies by about &lt;a href="http://www.smartmoney.com/invest/stocks/the-currency-thats-up-200000-1307029053200/?link=SM_hp_ls4e"&gt;200,000%&lt;/a&gt;. Recently, heavy selling on a Bitcoin exchange caused its value to &lt;a href="http://blog.zorinaq.com/?e=55"&gt;fluctuate&lt;/a&gt; between $0.01 and $17.00 over the period of a day. This volatility makes it difficult to use as a measure of value because its value is uncertain. Most currencies are managed by a central bank and one of the purposes of a central bank is to keep the currency stable with respect to other currencies so that it can be safely used for all the three functions on money. On the other hand, the essence of Bitcoin is that it is completely distributed with no central authority. As it is unmanaged, we can expect its exchange rate to be somewhat more volatile than other currencies.&lt;br /&gt;&lt;br /&gt;Another function of money is as a medium of exchange. Before money existed, trading was difficult. If I led a cow to market with the intent on trading it for grain, I might come to an agreement with another farmer that my cow is worth 8 sacks of grain, except that I only want one sack of grain the other farmer only has 5 sacks of grain to trade and he does not want a cow anyway. With money, I can sell the cow for money to someone who wants a cow, buy just as much grain as I need and save any leftover money for other transactions in the future. Money as a medium of exchange greases the wheels of commerce by acting as an intermediary and thus removing barriers to trade.&lt;br /&gt;&lt;br /&gt;Bitcoin scores high as a medium of exchange. It can be securely and anonymously traded on the internet for other goods and services. Also it is almost infinitely divisible so it serves for small exchanges. There are two caveats. Firstly a Bitcoin transaction takes about 10 minutes to confirm, so sellers may be unwilling to accept it for immediate transactions where there is no recourse. That is, Bitcoin is good for selling Alpacca socks over the internet, but not for selling hot-dogs at Coney Island. As Bitcoin is an internet currency, this is only of concern to someone who sells virtual goods over the internet without recourse. The &lt;a href="https://en.bitcoin.it/wiki/FAQ"&gt;Bitcoin FAQ&lt;/a&gt; addresses this issue, saying:&lt;br /&gt;&lt;blockquote&gt;&lt;b&gt;Do you have to wait 10 minutes in order to buy or sell things with BitCoin?&amp;nbsp;&lt;/b&gt;&lt;/blockquote&gt;&lt;blockquote&gt;No, it's reasonable to sell things without waiting for a confirmation as long as the transaction is not of high value.&amp;nbsp;&lt;/blockquote&gt;&lt;blockquote&gt;When people ask this question they are usually thinking about applications like supermarkets or snack machines, as discussed in this thread from July 2010. Zero confirmation transactions still show up in the GUI, but you cannot spend them. You can however reason about the risk involved in assuming you will be able to spend them in future. In general, selling things that are fairly cheap (like snacks, digital downloads etc) for zero confirmations will not pose a problem if you are running a well connected node."&lt;/blockquote&gt;The second caveat is that we typically maintain a reserve of any currency that we regularly use as a float to smooth out transactions. Anyone concerned with the volatility of the value of Bitcoin may be unwilling to maintain a float in Bitcoin and therefore not have a convenient reserve of Bitcoin for doing transactions. If Bitcoin continues to have a volatile exchange rate with other currencies and users do not keep a reserve of Bitcoin for doing transactions, it becomes more cumbersome to use and therefore less useful as a medium of exchange. The end result is that Bitcoin is only used when there is no alternative method of payment. The conclusion is that Bitcoin, or any other currencies usefulness as a medium of exchange does depend on it having a reasonably stable value.&lt;br /&gt;&lt;br /&gt;The final function of money is as a commodity like Gold, Oil or Frozen Concentrated Orange Juice (&lt;a href="http://www.imdb.com/title/tt0086465/"&gt;FCOJ&lt;/a&gt;). Currencies are commodities that are traded like other commodities for good legitimate reasons. For example, a company that contracts to buy a good that is priced in another currency may want to buy insurance against a change in the exchange rate that would cause the good to become more expensive than when they made the original commitment. Financial companies create and sell instruments that provide this insurance and then trade currencies as commodities to protect their position. &lt;br /&gt;&lt;br /&gt;First some words about commodities in general. Owning a commodity does not produce any value. Stocks and bonds may pay a dividend, while a commodity does not, so the only reason for owning a commodity as an investor is the hope that its value will increase so that it can be sold at a profit. In practice owning a commodity is an even worse proposition because money is tied up in owning the commodity that could be otherwise earning interest, so even owning a commodity is a losing proposition unless the commodity increases in value. Then there is a cost for every trade which further saps profits. Thus people who are not Bitcoin speculators will not want to hold more Bitcoin than they need for their day to day needs.&lt;br /&gt;&lt;br /&gt;Commodity trading creates a market for the commodity that sets its price. The first test of a commodity is that there is a market where the commodity can be traded efficiently. Bitcoin passes this test as there are &lt;a href="https://en.bitcoin.it/wiki/Selling_bitcoins"&gt;several markets&lt;/a&gt; for Bitcoin, although a recent attack against the &lt;a href="https://mtgox.com/"&gt;MtGox&lt;/a&gt;, the largest Bitcoin exchange may reduce confidence. As an example of the efficiency of trading Bitcoin, MtGox charges a 0.65% charge against every trade. &lt;br /&gt;&lt;br /&gt;When evaluating a commodity, we consider how it is used to understand the supply and demand that determines its fundamental price. Bitcoin is a fiat currency which has value because people find it useful as a medium of exchange like the other like other fiat currency: Dollar, Pound, Euro or Yen. The key to understanding the value of Bitcoin like any other currency is money supply, the sum of all the money that people keep in their bank accounts and wallet to smooth out their transactions and grease the wheels of commerce as discussed previously. However there is one difference. With other currencies there is a central bank that manages the money supply to keep the value of the currency stable. With Bitcoin, there is no central bank, rather the amount of Bitcoin is circulation is stable. Thus the base value of Bitcoin depends on demand for its money supply. &lt;br /&gt;&lt;br /&gt;The base demand for Bitcoin is to use it as a medium of exchange. If more people regularly do Bitcoin transactions and keep it in their wallet to smooth out their transactions, or they tend to keep more Bitcoin in their wallet because they expect to use it for more transactions, there is more demand for the stable supply of Bitcoin and therefore its price rises. Conversely, if less people keep Bitcoin in their wallet or people keep less money in their wallet the price falls. On top of this base demand, there is demand from speculators who expect the price of Bitcoin to rise and therefore hold it in investment level quantities. The base demand for Bitcoin will tend to keep the price stable, while the speculative demand is likely to make the price more volatile.&lt;br /&gt;&lt;br /&gt;Another consideration is whether there are any risks associated with owning the commodity. Bitcoin is a virtual currency and &lt;a href="http://bandb.blogspot.com/2009/05/virtual-worlds-real-metrics.html"&gt;a problem&lt;/a&gt; with other virtual currencies has been hyperinflation, caused by someone discovering a software bug that allows them to generate huge amounts of the virtual currency without much effort. This has happened in several Massive Multiplayer Online Games (MMOG), but in each case the game has had a central server that hands out money and a game mechanism that is designed with a specific rate of exchange in mind. Bitcoin is different in that it does not have a central authority and it is traded in a free and open market that sets its value. An attack on Bitcoin could reduce its value, however this could be self defeating as it immediately reduces the value of the attack. I will write a separate post on the security considerations, however it is safe to say that as there is a vibrant market for Bitcoin, it is reasonably safe.&lt;br /&gt;&lt;br /&gt;In summary, Bitcoin's purpose is to be used as a medium of exchange for transactions over the internet. Its base value comes from small amounts of it being held in a large number of users wallets because they regularly use it as a medium of exchange. If Bitcoin is heavy used as a medium of exchange, this will tend to stabilize its exchange rate against other currencies and make it more useful as a currency when measured against all the functions of money.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6934836125939655086?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6934836125939655086/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6934836125939655086' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6934836125939655086'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6934836125939655086'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/06/bitcoin-as-economic-entity.html' title='Bitcoin as an Economic Entity'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2581808992934381468</id><published>2011-06-28T00:02:00.000-07:00</published><updated>2011-06-28T00:02:27.312-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><category scheme='http://www.blogger.com/atom/ns#' term='Entertainment'/><category scheme='http://www.blogger.com/atom/ns#' term='Media'/><title type='text'>eReaders Arrive</title><content type='html'>After writing about the &lt;a href="http://www.amazon.com/gp/product/B002Y27P3M/ref=as_li_ss_tl?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=217145&amp;amp;creative=399373&amp;amp;creativeASIN=B002Y27P3M"&gt;Kindle&lt;/a&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.com/e/ir?t=&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=B002Y27P3M&amp;amp;camp=217145&amp;amp;creative=399373" style="border: none !important; margin: 0px !important;" width="1" /&gt; for some &lt;a href="http://bandb.blogspot.com/2009/10/e-readers-for-all.html"&gt;time&lt;/a&gt;, I can let you know that I am now a proud owner of one. I can also tell you that it is a wonderful device, even more wonderful than I imagined, when used for reading the &lt;a href="http://bandb.blogspot.com/2010/01/two-kinds-of-book-readers.html"&gt;right kind&lt;/a&gt; of book, that is, the page-turner kind of book where you start on page 1 and keep turning pages until you get to the end. The other kind of book, the kind of book where you start with the index or table of contents and then jump around has been subsumed by the web with search and hyperlinks to the point where it is redundant anyway. Thus the Kindle is the perfect device for reading the only kind of book that is left, the kind of book that you read straight through.&lt;br /&gt;&lt;br /&gt;I am not the only person who has recently bought an eReader. Today a Pew Internet research &lt;a href="http://pewinternet.org/Reports/2011/E-readers-and-tablets/Report.aspx"&gt;study&lt;/a&gt; showed that eReader ownership has doubled in the last 6 months. It is now up to 12% in the US and is currently growing faster than Tablet ownership. eReaders have been around for longer than the current incarnation of Tablets, and seem to be arriving at the period of mass adoption. Also, given the current price there is little reason not to own one.&lt;br /&gt;&amp;nbsp; &lt;br /&gt;An objection to the Kindle has always been that it is not a converged device. It is good for reading one kind of book and little else. Many commentators wanted it to be good at everything, and argued that otherwise it is just another device that you need to carry around. I particularly like that it is not a converged device. When I am reading a book on my Kindle, it will not interrupt my train of thoughts to announce that an email or twitter has arrived, or tempt me to play a silly game or fiddle with a Facebook page. With the Kindle I can walk away from the computer and read a book without all those interruptions and distractions that make life a disconnected &lt;a href="http://bandb.blogspot.com/2011/05/living-in-stream.html"&gt;stream of consciousness&lt;/a&gt;.&amp;nbsp;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2581808992934381468?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2581808992934381468/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2581808992934381468' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2581808992934381468'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2581808992934381468'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/06/ereaders-arrive.html' title='eReaders Arrive'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3415114155455014569</id><published>2011-06-19T22:09:00.000-07:00</published><updated>2011-06-19T22:09:24.109-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Dismal Science'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>Bitcoin, a Peer to Peer Virtual Currency</title><content type='html'>&lt;a href="http://www.bitcoin.org/"&gt;Bitcoin&lt;/a&gt; is a peer-to-peer virtual currency that seems to &lt;a href="http://www.smartmoney.com/invest/stocks/the-currency-thats-up-200000-1307029053200/?link=SM_hp_ls4e"&gt;pop up&lt;/a&gt; in the &lt;a href="http://www.wired.com/threatlevel/2011/06/silkroad/"&gt;conversation&lt;/a&gt; &lt;a href="http://www.grc.com/sn/sn-287.htm"&gt;everywhere&lt;/a&gt; I &lt;a href="http://www.econtalk.org/archives/2011/04/andresen_on_bit.html"&gt;look&lt;/a&gt;. A virtual currency is is a currency that is created on computers and traded on the internet. A couple of examples of virtual currencies are Linden Dollars in the online world Second Life and Gold in the massive multiplayer online game World of Warcraft (WOW). People in third world countries play WOW to collect WOW Gold and sell it for real money to players in the first world so that they can buy more powerful armor, weapons and spells to use in the game. Bitcoin is different in that its purpose is to be a currency like dollars, euros or pounds, whereas Linden Dollars and WOW Gold are an element of their games and have no real purpose or value outside of the game.&lt;br /&gt;&lt;br /&gt;The other aspect of Bitcoin is that it is a peer-to-peer currency. Bitcoin is created by mining for it against a cryptographic algorithm. Once Bitcoins are created they are traded on a peer-to-peer network. When a transaction has taken place, it is broadcast to the peers on the network, they confirm that the transaction is valid and has taken place. The peer computers add the transaction to the history so that the transaction becomes permanent. There is no central authority that creates or manages Bitcoin, it manages itself through its network of peer computers all running the same software. &lt;br /&gt;&lt;br /&gt;One feature of Bitcoin that has excited interest is that it promises secure anonymous transactions, like cash, but over the internet. While this may seem like a good thing, it is also a problem as it means that Bitcoin is an extremely useful currency for people who want to get around the law. Bitcoin has the problem that it needs to establish itself as useful currency with a legitimate reason to be. If the major use of Bitcoin turns out to be to abet criminal activity it may find itself &lt;a href="http://news.cnet.com/8301-13506_3-20070268-17/senators-target-bitcoin-currency-citing-drug-sales/"&gt;under attack&lt;/a&gt; from governments that want to suppress it. &lt;br /&gt;&lt;br /&gt;I am going to do a couple of posts on Bitcoin, one examining the economic aspects, and the other looking the technical and security aspects. In the mean time here are a number of links on related issues. My interest in a virtual currency comes from several direction. In the past I have written in this blog about both &lt;a href="http://bandb.blogspot.com/2007/06/virtual-goods.html"&gt;Virtual Goods&lt;/a&gt; and &lt;a href="http://bandb.blogspot.com/2009/05/virtual-worlds-real-metrics.html"&gt;Virtual Economies.&lt;/a&gt; &lt;br /&gt;&lt;br /&gt;A big question at the moment is the whole issue of what is Money. Some politicians, concerned about monetary policy have &lt;a href="http://www.politico.com/news/stories/0611/57049.html"&gt;called&lt;/a&gt; for a return to the Gold standard, which has resulted in others asking this question. This American Life did a &lt;a href="http://www.thisamericanlife.org/radio-archives/episode/423/the-invention-of-money"&gt;Podcast&lt;/a&gt; on that subject and came to the conclusion that Money is much more ephemeral than we may think. Planet Money did a related &lt;a href="http://www.npr.org/blogs/money/2011/02/15/131934618/the-island-of-stone-money"&gt;story&lt;/a&gt; where they looked at the small Pacific island of Yap where they used giant round stones as money. When a stone changes hand because of a payment, as the stone is large and heavy, the stone remains where it is and everyone on the island just knows it belongs to someone different. If you think that is strange, it is not that different from the way we manage gold. The gold bars sit in a bank vault and their ownership is digital bits recorded on a disk that is revolving at 7200 RPM. When the gold changes hands, a new record of ownership is written to the disk, however the gold remains exactly where it is. I will have to write more about virtual goods in real economies another time.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3415114155455014569?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3415114155455014569/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3415114155455014569' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3415114155455014569'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3415114155455014569'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/06/bitcoin-peer-to-peer-virtual-currency.html' title='Bitcoin, a Peer to Peer Virtual Currency'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4785701917995068011</id><published>2011-05-25T00:07:00.000-07:00</published><updated>2011-05-25T00:07:34.119-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><category scheme='http://www.blogger.com/atom/ns#' term='programming'/><title type='text'>On Copyright and Open Source</title><content type='html'>Copyright is a key part of an &lt;a href="http://bandb.blogspot.com/2004/10/open-source-dilemma.html"&gt;Open Source&lt;/a&gt; or &lt;a href="http://bandb.blogspot.com/2004/09/there-is-no-such-thing-as-free.html"&gt;Free Software&lt;/a&gt; project. It may sound like copyright is antithetical to Free and Open software, but if &lt;a href="http://www.fsf.org/about/leadership/"&gt;Richard Stallman&lt;/a&gt;, President of the Free Software Foundation (FSF) thinks that ownership of copyright is an important part of Free Software, then we should believe him. A couple of things have led me to these conclusions. Firstly, at the February meeting of the Business Intelligence SIG, Ian Fyfe &lt;a href="http://bandb.blogspot.com/2011/04/business-of-open-source-suites.html"&gt;discussed&lt;/a&gt; the business of Open Source suites and how Pentaho is able to offer a suite of Open Source projects as a commercial produce by controlling the Open Source projects, and in particular copyright to the code.&lt;br /&gt;&lt;br /&gt;The other clue to the importance of copyright came by accident as I was looking at the difference between the emacs editor and the XEmacs editor. Emacs was an open software project that forked in the early 1990's before the terms Free Software and Open Source had even been invented. One of the criticisms that Stallman, speaking for the emacs project levels against the XEmacs project is that they have been sloppy about the ownership of the code and not always got the "legal papers" that assign ownership of the contribution to the project. On &lt;a href="http://www.xemacs.org/About/XEmacsVsGNUemacs.html"&gt;this&lt;/a&gt; web page about XEmacs versus emacs, Stallman says:&lt;br /&gt;&lt;blockquote&gt;"XEmacs is GNU software because it's a modified version of a GNU program. And it is GNU software because the FSF is the copyright holder for most of it, and therefore the legal responsibility for protecting its free status falls on us whether we want it or not. This is why the term "GNU XEmacs" is legitimate.&lt;br /&gt;&lt;br /&gt;"But in another sense it is not GNU software, because we can't use XEmacs in the GNU system: using it would mean paying a price in terms of our ability to enforce the GPL. Some of the people who have worked on XEmacs have not provided, and have not asked other contributors to provide, the legal papers to help us enforce the GPL. I have managed to get legal papers for some parts myself, but most of the XEmacs developers have not helped me get them."&lt;/blockquote&gt;Note that GNU is the FSF "brand" for its software. The legal papers that Stallman references assign ownership and copyright of a code contribution to the FSF. Because the FSF owns the code it can enforce its right as owner on anyone who breaks its license. Also it can change the terms of the license, and license the code to another party under any other license that it sees fit. The FSF has changed the license terms of the code that it owns. As new versions of the GNU Public License (GPL) have emerged the FSF have upgraded the license to the latest version.&lt;br /&gt;&lt;br /&gt;Copyright and Open Source is a study in contradictions. On the one hand, Richard Stallman has "campaigning against both software patents and dangerous extension of copyright laws". On the other hand, he uses ownership of copyright to push his agenda through the GNU Public License which has a viral component so that the source code of any software that is linked with GNU licensed software must be published as open source software. I will write more about this issue.&lt;br /&gt;&lt;br /&gt;A good Open Source project needs to make sure that everyone who contributes code to the project signs a document that assigns copyright of their contribution to the project. Unless care is taken to make all the code belong to a single entity, each person who has contributed to the code owns their contribution. If the project wants to be able to do anything with the code other than passively allow its distribution under its existing license, the code must be owned by a single entity. As Stallman says, the project may not be able to defend its own rights unless the code has clear ownership.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4785701917995068011?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4785701917995068011/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4785701917995068011' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4785701917995068011'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4785701917995068011'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/05/on-copyright-and-open-source.html' title='On Copyright and Open Source'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1024566654101901354</id><published>2011-05-18T23:23:00.000-07:00</published><updated>2011-05-18T23:23:09.522-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>The Facebook PR Fiasco</title><content type='html'>Last week came the &lt;a href="http://www.thedailybeast.com/blogs-and-stories/2011-05-12/facebook-busted-in-clumsy-smear-attempt-on-google/#"&gt;revelation&lt;/a&gt; that Facebook had secretly hired a prestigious Public Relations (PR) firm to plant negative stories about Google and its privacy practices. This is a completely ridiculous thing to have done and wrong in so many ways that it is difficult to know where to begin. Here are some of the top reasons as to why it was a bad idea.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Firstly, the idea that Facebook should be accusing anyone of of playing fast and loose with peoples privacy is a severely hypocritical. Just last year, Mark Zuckerberg told us that "&lt;a href="http://www.readwriteweb.com/archives/facebooks_zuckerberg_says_the_age_of_privacy_is_ov.php"&gt;the age of privacy is over&lt;/a&gt;". Now he is trying to say that Google is worse for privacy than Facebook! And by the way, this revelation comes at the same time as Symantec has &lt;a href="http://www.symantec.com/connect/blogs/facebook-applications-accidentally-leaking-access-third-parties"&gt;discovered&lt;/a&gt; a serious and longstanding security hole in the Facebook App API that allows a users private data to to leak. The only cure is to change your Facebook password, so if you are a Facebook user, go and change your password now!&lt;/li&gt;&lt;li&gt;Secondly, we come to the oxymoronic idea of a secret PR campaign. Anyone who thinks that a PR campaign can be secret does not understand PR. &lt;/li&gt;&lt;li&gt;Thirdly, a competent let alone "prestigious" PR firm should have understood that the ruse was bound to be discovered and that the fallout would be much worse publicity than anything negative that they could promulgate. Thus anyone who claims to understands PR should have guided their client to do something less radical and refused to get involved in the PR campaign. As it is, the PR firm of Burson-Marsteller has lost a lot of their credibility by being involved in the fiasco, and in PR credibility is everything.&lt;/li&gt;&lt;li&gt;Fourthly, the whole idea of a secret PR campaign against another company seems sophomoric, as if Facebook is run by a bunch of undergraduates who have little real world experience, and think that they will be able to get away with a jape like this. No wait …&lt;/li&gt;&lt;li&gt;Finally, if Facebook does want to launch a PR campaign on privacy they should do so openly by generating positive press that compares their supposedly good privacy policies with others less good privacy policies and behavior. As Machiavelli said "A prince also wins prestige for being a true friend or a true enemy, that is, for revealing himself without any reservation in favor of one side against another" and goes on to explain why openness and taking sides leads to better outcomes than pretended neutrality. As Facebook did their PR campaign in secret, we conclude that they could not have done it in public and therefore their privacy practices are no better than that of Google or anyone else.&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Note:&lt;/b&gt; I was going to call this post "Pot hires PR firm to secretly call kettle black" until I read this &lt;a href="http://www.theatlantic.com/technology/archive/2011/05/google-doesnt-laugh-saving-witty-headlines-in-the-age-of-seo/238656/"&gt;article&lt;/a&gt; from the Atlantic about Search Engine Optimization (SEO) and the fact that as search engines do not have a sense of humor, humorous headlines do not work in the online world.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1024566654101901354?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1024566654101901354/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1024566654101901354' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1024566654101901354'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1024566654101901354'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/05/facebook-pr-fiasco.html' title='The Facebook PR Fiasco'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-8642164377461874708</id><published>2011-05-07T18:00:00.000-07:00</published><updated>2011-05-07T18:00:51.308-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>Living In the Stream</title><content type='html'>It used to be that "stream of consciousness" was a pejorative. It was a phrase you used to put down the type of person who talked endlessly with little connection between what they said and what anyone else said or even between what they had just said. Nowadays, the way live our lives is in a stream of consciousness.&lt;br /&gt;&lt;br /&gt;Text messages demand to be answered. If you do not answer a text within ten or fifteen minutes the sender complains that you are ignoring them. Emails keep arriving, and a popup in the corner of the screen heralds their arrival. The popup contains an excerpt of the message designed to make you read the whole thing immediately, even although you know that it is junk or something that you should handle later. Instant message boxes pop up whenever you are on line and cannot be ignored. Sometimes people call you on the phone, although good form these days is to IM someone first to see if you can call them on the phone. Finally there are the two great streams of consciousness that captivate our attention: Facebook and Twitter. Random stuff arrives in a random order and as you have subscribed to the feeds you keep looking at them to see if anything interesting happened. In practice it is most likely to be a video of a cute animal doing something stupid.&lt;br /&gt;&lt;br /&gt;How people survive and get anything done with these constant streams of distraction is a mystery to me. I do software, and sometimes I need to concentrate on a problem for a good period of time without interruption. It is not that I am necessarily thinking hard all the time, just that it can take time to investigate a problem or think through all the ramifications of a solution and any distraction just breaks the groove, meaning I have to start over. When this happens endlessly in a day my rate of getting stuff done drops towards zero.&lt;br /&gt;&lt;br /&gt;So how do we fight back against constant disruption? The answer is to take control and do not let others dictate the agenda. Firstly, establish that there are periods when you are off-line. I do not take my phone to the bathroom, or when I work out or when I go to bed. Also, I do not answer the phone when driving alone, and have my passenger answer when I am not alone. All our means of communication apart from voice have a buffer so that they do not need to be answered immediately, for voice there is a thing called voicemail. On the other hand, voicemail introduces us to the game of telephone tag which is fun for those who like playing it and exceedingly annoying for the rest of us.&lt;br /&gt;&lt;br /&gt;Secondly, you do need to "&lt;a href="http://www.collegehumor.com/video/5439990/girlfriend-doesnt-realize-boyfriend-is-on-vacation"&gt;return your calls&lt;/a&gt;" as they used to say. Which brings to the crux of the matter. If you want to be part of the conversation, you need to take part in it. Unfortunately, these days what you have to do is "return your calls", respond to your texts, answer your emails, react to IMs, post to Facebook and Twitter to show that you are a conscious sentient being, and finally do something to make a living. So it comes down to picking conversations, and thinking hard about which conversations you want to join. Do this right and we become &lt;a href="http://www.amazon.com/gp/product/0743253426/ref=as_li_ss_tl?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=217145&amp;amp;creative=399349&amp;amp;creativeASIN=0743253426"&gt;Islands in the Stream&lt;/a&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.com/e/ir?t=&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0743253426&amp;amp;camp=217145&amp;amp;creative=399349" style="border: none !important; margin: 0px !important;" width="1" /&gt;, which is the most we can hope to be these days.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-8642164377461874708?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/8642164377461874708/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=8642164377461874708' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8642164377461874708'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8642164377461874708'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/05/living-in-stream.html' title='Living In the Stream'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6463486555268977951</id><published>2011-05-01T22:54:00.000-07:00</published><updated>2011-05-01T22:54:33.427-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concurrency'/><category scheme='http://www.blogger.com/atom/ns#' term='MapReduce'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><title type='text'>Understanding MapReduce Performance: Part 2</title><content type='html'>Getting good performance out of MapReduce is a matter of understanding two concepts. I discussed the first one, that MapReduce is designed to run on large clusters, in a &lt;a href="http://bandb.blogspot.com/2011/04/understanding-mapreduce-performance.html"&gt;post&lt;/a&gt; last week. Here is the second concept and it is something that everyone who uses MapReduce needs to grapple with. MapReduce works by breaking the processing task into a huge number of little pieces so that the work can be distributed over the cluster to be done in parallel. Each Map task and each Reduce task is a separate task that is can be scheduled to run in parallel with other tasks. For both Map and Reduce, the number of tasks needs to be much larger than the number of nodes in the cluster. &lt;br /&gt;&lt;br /&gt;The archetypal &lt;a href="http://hadoop.apache.org/common/docs/r0.20.2/mapred_tutorial.html#Example%3A+WordCount+v1.0"&gt;example&lt;/a&gt; of MapReduce is to count word frequency in a large number of documents. A Map task reads a document and outputs a tuple for each word with the count of occurrences of the word in the document. A Reduce task takes a word and accumulates a total count for the word from the per document count produced by each Map tasks. In this example, there are a large number of documents as input to the Map tasks and presumable a large number of words so that there are a large number of Reduce tasks. Another illustration of this principle is found in the Sort Benchmark &lt;a href="http://sortbenchmark.org/Yahoo2009.pdf"&gt;disclosure&lt;/a&gt; that I discussed in the previous post. For the Gray sort, the 100 TB of data is broken into 190,000 separate Maps and there are 10,000 Reduces for a cluster of 3400 nodes. &lt;br /&gt;&lt;br /&gt;While most users of MapReduce get the idea that MapReduce needs its input data broken into lots of little pieces so that there are many Map tasks, they forget about the same requirements for Reduce tasks. Searching the internet it is easy to find examples of MapReduce with a small number of Reduce tasks. One is a &lt;a href="http://pages.cs.wisc.edu/%7Egibson/mapReduceTutorial.html"&gt;tutorial&lt;/a&gt; from the University of Wisconsin where there is ultimately only one Reduce task. It is particularly galling that this example comes from the University of Wisconsin where they have a large and prestigious program on parallel database system research. In their defense, the tutorial does show how to do intermediate reduction of the data, but that does not prevent it from being a bad example in general.&lt;br /&gt;&lt;br /&gt;Sometimes the problem is too small. What do you do if the problem you are working on now just involves the computation of a single result? The answer is to enlarge the problem. In a large cluster it is better to compute more results even although they may not be of immediate use to you. Lets look at an example. Say you want to analyze a set of documents for the frequency of the word 'the'. The natural thing to do is process all the documents and in the Map function filter for the word 'the' and count the results in the Reduce function. This is how you are taught to use "valuable" computing resources. In practice, with MapReduce it is better to count the frequency of all the words in the documents and save the results. It is not a lot more effort for the MapReduce engine to count the frequency of all the words in the documents and if you then want to know how many uses there are of 'a' or any other word, they are there for you immediately.&lt;br /&gt;&lt;br /&gt;A common &lt;a href="http://bandb.blogspot.com/2009/06/databases-in-cloud.html"&gt;analogy&lt;/a&gt; is MapReduce as a freight train as opposed to a relational database which is a racing car. The freight train carries a huge load but is slow to start and stop. A race car is very fast and nimble but it carries only one person. Relational database systems rely on you to use the where clause to reduce the data that it has to analyze, and in return gives you the answer in a short time. MapReduce does not give you an answer as quickly but it is capable of effectively processing a lot more data. With MapReduce you should process all the data and save the results, then use them as you need them. We can sum the way of thinking about how to use MapReduce with the slogan "no where clauses".&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6463486555268977951?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6463486555268977951/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6463486555268977951' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6463486555268977951'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6463486555268977951'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/05/understanding-mapreduce-performance.html' title='Understanding MapReduce Performance: Part 2'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5509642011534961400</id><published>2011-04-28T00:05:00.000-07:00</published><updated>2011-04-28T00:05:08.847-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concurrency'/><category scheme='http://www.blogger.com/atom/ns#' term='MapReduce'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><title type='text'>Understanding MapReduce Performance: Part 1</title><content type='html'>Currently MapReduce is riding high on the hype cycle. The other day I saw a presentation that was nothing but breathless exhortation for MapReduce as the next big thing and that we had better all jump on the bandwagon as soon as possible. However, there are rumblings of performance problems. At the recent Big Data Camp, Greenplum &lt;a href="http://bandb.blogspot.com/2011/02/greenplum-at-big-data-camp.html"&gt;reported&lt;/a&gt; that their MapReduce was 100 times slower than their database system. Searching the web finds many people &lt;a href="http://rickosborne.org/blog/2010/02/napkin-math-for-mongodb-performance/"&gt;complaining&lt;/a&gt; about MapReduce performance, particularly with NoSQL systems like MongoDB. That is a problem because MapReduce is the data analysis tool for processing NoSQL data. For MongoDB, anything more than the most trivial reporting will require the use of MapReduce.&lt;br /&gt;&lt;br /&gt;At the same time there is plenty of evidence that MapReduce is no performance slouch. The &lt;a href="http://sortbenchmark.org/"&gt;Sort Benchmark&lt;/a&gt; is a prime measure of computer system performance and currently the Hadoop MapReduce system holds two out of 6 titles for which it is eligible. One title is the Gray test for sorting 100 Terabytes (TB) of data in 173 minutes. The other title is the Minute test for sorting 500 Gigabytes (GB) of data in under a minute. These results are as of May 2010 and the Sort Benchmark is run every year, so we can expect better performance in the future. &lt;br /&gt;&lt;br /&gt;Understanding MapReduce performance is a matter of understanding two simple concepts. The first concept is that the design center for MapReduce systems like Hadoop is to run large jobs on a large distributed cluster. To get a feel of what this means, look at the Hadoop &lt;a href="http://sortbenchmark.org/Yahoo2009.pdf"&gt;disclosure document&lt;/a&gt; for the Sort Benchmark. The run for sorting 100 TB was made on a cluster of about 3400 nodes. Each node had 8 cores, 4 disks, 16 GB of RAM and 1GB ethernet. For the Minute sort, a smaller cluster was used with 1400 node systems with the same configuration except 8GB of RAM on each node. That is not to say that MapReduce will only work on thousand node systems. Most systems are much smaller than this, however Hadoop is particularly designed so that it will scale to run on a huge cluster. &lt;br /&gt;&lt;br /&gt;One problem with a large cluster is that nodes break down. Hadoop has several features that transparently work around the problem of broken nodes and continue processing in the presence of failure. From the Sort Benchmark disclosure, for the Gray sort run, every processing task is replicated. That is, for every processing task, two nodes are assigned to do it so that should a node break down, the sort can still continue with the data from the other node. This was not used for the Minute test because the likelihood of a node breaking down in the minute while the test is running is low enough to be ignored.&lt;br /&gt;&lt;br /&gt;Another large cluster feature that has an important effect on performance is that all intermediate results are written to disk. The results of all the Mappers are written to disk and the sorted data for Reducers is written to disk. This is done so that if a node fails only a small piece of work needs to be redone. By contrast, relational database systems go to great length to ensure that after data has been read from disk, it does not touch the disk again before being delivered to the user. If a node fails in a relational database system, the whole system goes into an error state and then does a recovery which can take some time. This is extremely disruptive to performance when a node fails and much better for performance when there is no failure. Relational database systems were not designed to run on thousands of nodes so they treat the problem of a node failure as a very rare event whereas Hadoop is designed as if it a commonplace. The consequence is that Hadoop performance can look slow when compared to a relational database on a small cluster.&lt;br /&gt;&lt;br /&gt;Unfortunately, there is not a lot that a user can do about this, except look for a bigger cluster to run their analysis on, or look for bigger data to analyze. That is the subject for the second part of this post where I will talk about the other simple concept for understanding MapReduce performance.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5509642011534961400?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5509642011534961400/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5509642011534961400' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5509642011534961400'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5509642011534961400'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/04/understanding-mapreduce-performance.html' title='Understanding MapReduce Performance: Part 1'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-258412784100177178</id><published>2011-04-24T21:14:00.000-07:00</published><updated>2011-04-24T21:14:27.981-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Apps'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><title type='text'>The Truth about Smartphone Location Tracking</title><content type='html'>There is a wave of outrage over the internet about revelation that iPhones has a file with tracking information recording all the places it has been. How dare Apple track users of their products! I am afraid that this is a extremely naive attitude. The fact is that everybody is tracking you on iPhone and not only on a iPhone but on all smartphones and on many less than smart phones as well. Let me count the ways, starting off with the benign and moving to the egregious.&lt;br /&gt;&lt;br /&gt;Firstly the carriers and handset makers collect data from phone to help improve their service. Last week we has a joint meeting of the SDForum Business Intelligence and Mobile SIGs on "&lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13876"&gt;Mobile Analytics&lt;/a&gt;". At that meeting Andrew Coward of CarrierIQ described how they embed their software in phones, usually at the carriers direction, to collect information that can be used to improve service. For example, he told us for example that it is quite normal for them to report to a carrier that their dropped call rate is 6% whereas the carrier's own engineers are telling management that their dropped call rate is 1%. They collect data on location so that the carrier knows where their users are using their phones from so that they can improve their service to that area.&lt;br /&gt;&lt;br /&gt;In Europe, CDR laws require phone carriers to retain their Call Data Record (CDR) for all calls for a period of 1 or 2 years. The police can and do request information on all the calls made to or from a number to help with their enquiries into crime. While a CDR record does not usually contain specific location information, it can identify the cell tower and thus the approximate location of the caller. Police have successfully used location based CDR data to help with their investigations for well over a decade. &lt;br /&gt;&lt;br /&gt;With the users permission, Google collects information from Android phones about their location. Google is the ultimate data collection company and I am always amazed at the creative ways they find for using that data. One Google service is the Traffic overlay on their Maps. This is derived from observing the change in location of Android phones. However, while Google says that they do not collect personally identifying information, they do need to distinguish between phones to make this application work, so they are tracking the movements of individuals, if only to provide the rest of us generic information on traffic flows. Google has plenty of other uses for this data. For example, they keep a database that locates every Wi-Fi hotspot is so that they can identify your location based on the Wi-Fi hotspot you using. Google can use data from Android phones to validate and update that database.&lt;br /&gt;&lt;br /&gt;Mobile analytics and Apps is where the use of location based information starts to get interesting. Last year Flurry &lt;a href="http://bandb.blogspot.com/2010/10/accidental-data-empires.html"&gt;presented&lt;/a&gt; to the Business Intelligence SIG and we heard about their run in with Steve Jobs. You can read their &lt;a href="http://www.mobilecrunch.com/2010/01/24/breaking-flurry-notices-cupertino-based-users-testing-apps-on-apple-tablet/"&gt;press release&lt;/a&gt; to get the full story of what they did. In short Flurry has a free toolkit that developers install into their mobile Apps that collects information and sends the data back to Flurry. The developer can then access analytics reports about their app at the Flurry web site. However, Flurry retains the data that has been collected from the App, including location based data. &lt;br /&gt;&lt;br /&gt;In January 2010, a couple of days before the iPad was announced, Flurry issued a press release saying that they saw a new Apple device that was was only being used in the Apple headquarters in Cupertino and gave some statistics on the number of different Apps that were being tested on this device. At this Steve Jobs blew his top and tried to get Flurry completely banned from iPhone Apps. Eventually Flurry and Apple settled their &lt;a href="http://moconews.net/article/419-how-a-company-recovers-after-pissing-off-steve-jobs/"&gt;differences&lt;/a&gt;. The conclusion was that in the words of the iPhone developer agreement "The use of third party software in Your Application to collect and send Device Data to a third party for processing or analysis is expressly prohibited."&lt;br /&gt;&lt;br /&gt;So lets parse this. Flurry is a company that has no direct relationship with the carriers, handset makers or the users of Apps, yet is is collecting data from all the Apps that it is included in. The data is available for use by the App developer and by Flurry. At the time of the iPad release they could identify that the device was different from all other devices and identify its location to within one set of buildings. Now, I am not trying to pick on Flurry specifically, there are several companies in this area. At the Business Intelligence SIG last week we heard from Apsalar, a recent start up in the same space, however, Flurry is the largest company that provides mobile analytics. Flurry estimates that they are included in up to 1 in 5 mobile Apps for the iPhone and Android. Because they are in so many Apps, they can provide aggregate data on all App usage. &lt;br /&gt;&lt;br /&gt;The point of this is that we want location aware Apps, however we also want to preserve our privacy. As Apps are, these two goals are incompatible. To be location aware, the App has to know your location, and if the App knows your location, it can transmit that information back to the App developer or aggregator of analytics for the App developer. Thus they know where you are whether you want to or not. &lt;a href="http://developer.android.com/guide/topics/location/obtaining-user-location.html"&gt;Android&lt;/a&gt;, has a profile that determines which information an App can access that is set when the App is installed. If it is allowed to access location information on installation, it can continue to do so until it is uninstalled.&lt;br /&gt;&lt;br /&gt;Compared to what Apps know about what you are doing while you use the App, the location database that the iPhone is collecting seems to be a small matter. In fact it seems to be a good reason to limit the number of Apps that you can be running at any one time. At least if only one App is running then only one App knows where you are at any particular time.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-258412784100177178?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/258412784100177178/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=258412784100177178' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/258412784100177178'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/258412784100177178'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/04/truth-about-smartphone-location.html' title='The Truth about Smartphone Location Tracking'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2595482006052283350</id><published>2011-04-12T23:30:00.000-07:00</published><updated>2011-04-12T23:30:04.073-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><title type='text'>The Business of Open Source Suites</title><content type='html'>I have often wondered how a commercial company builds an Open Source Suite out of a collection of open source projects. At the last BI SIG &lt;a href="http://bandb.blogspot.com/2011/02/agile-bi-at-pentaho-ian-fyfe-chief.html"&gt;meeting&lt;/a&gt; Ian Fyfe Chief Technology Evangelist at &lt;a href="http://www.pentaho.com/"&gt;Pentaho&lt;/a&gt; told us how they do it and gave some interesting insights on how Open Source really works. Pentaho offers a Open Source Business Intelligence suite that includes the Kettle data integration project, the Mondrian OLAP project and the WETA data mining project amongst other projects. &lt;br /&gt;&lt;br /&gt;As Ian explained, Pentaho controls these Open Source projects because it employs the project leader and major contributors to each of the projects. In some cases Pentaho also owns the copyright of the code. In other cases, any ownership is in doubt, because there have been too many contributors and or what they have contributed has not been managed well enough to be able to say who owns the code. Mondrian is an example of an Open Source project where there have been enough contributors that it is not possible to take control of the whole source code and exert any real rights over it.&lt;br /&gt;&lt;br /&gt;The real control that Pentaho exerts over the Open Source components of its suites is that it gets to say what their roadmap is and how they will evolve in the future. As I noted, Pentaho is driving the various projects to a common metadata layer so that they can become integrated as a single suite of products.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2595482006052283350?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2595482006052283350/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2595482006052283350' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2595482006052283350'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2595482006052283350'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/04/business-of-open-source-suites.html' title='The Business of Open Source Suites'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5725113591566003185</id><published>2011-04-09T15:06:00.000-07:00</published><updated>2011-04-09T15:06:04.640-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Dismal Science'/><title type='text'>The Fable of the Good King and the Bad King</title><content type='html'>A long time ago there were two countries. Each country had a King. One King was a good King and the other King was a bad King as we will find out. Now, as you all know a Kings main job is to go out and make war on his enemies. It is the reason that Kings exist. If a king is not out making war against his enemies, he will go out hunting and make war on the animals of the forest. A good war will enlarge the kingdom, enhance the King fame and gives him more subjects to rule over. But before a King can make war, he should make sure that his subjects provided for. For while the subjects of a King owe everything that they have to their King, the King is also responsible for the welfare and good being of his subjects.&lt;br /&gt;&lt;br /&gt;There are many parts to taking care of subjects: making good laws, passing down sound judgements, but the most important one is making sure that the granaries are filled in times of plenty. For as surely as fat times follow lean times, lean times follow fat times. In times of plenty, the excess harvest should be saved so that in times of need the subjects do not starve. Subjects who are starving are weak and cannot praise their King nor defend his kingdom.&lt;br /&gt;&lt;br /&gt;Now in our two countries, these were years of plenty, and the Kings knew that they would go to war. The good King also knew that it was his duty to make sure the granaries were filled, and so he did. However, the bad King wanted to win the battle so badly that he sold off all the grain in his granaries to buy expensive war machines. A little incident happened, it was blown up into a huge crisis and the two countries went to war. Each King assembled his army and let it to the battleground at the border of their countries as had happened so many times before. The armies were evenly matched and they fought all day. At the end of the day the army of the bad King held its ground and he was declared the victor. The expensive war machines had helped, but less than hoped for. However, both armies were so weakened and exhausted by the fight that they turned around and went home, as they had so many times before.&lt;br /&gt;&lt;br /&gt;The years after this battle were years of want. The harvest had failed and both kingdoms suffered. However, the kingdom of the bad King suffered much more than the kingdom of the good King for there was no grain in their granaries. When the little incident happened that blew up into a huge crisis, both Kings assembled their armies and marched to the battleground on the border. This time the good King won the battle because his men were stronger.&lt;br /&gt;&lt;br /&gt;The good King advanced his army into the country of the bad King. They may not be able to take the whole country, but the good King had to let his men do a little rape and pillage as a reward for winning the battle. The bad King realizing his precarious position came out to parley with the good King. The bad King had nothing to offer the good King but some used war machines and the hand of his daughter in marriage. The good King accepted that the daughter of the bad King should should marry his son and that when the two Kings has passed on the greater battleground in the sky, the son of the good King would rule both countries. Thus the two kingdoms would become one united country. A country that would be large and strong enough to make war on the countries on the far side of the mountains.&lt;br /&gt;&lt;br /&gt;The moral of this story is that in times of plenty, make sure that the granaries are filled, for as surely as fat times follow lean times, lean times follow fat times, and the best protection against lean times are full granaries. On this matter, a King must beware of false council. When times are good, the false council will say "What could possibly go wrong? The times are fat and everyone is happy. Make the populace more happy by selling off the grain in the granary and rewarding the citizens each according to what they have contributed."  Even worse, when times are lean the false council will say "Times are awful and getting worse, we must take the grain out of the peoples mouths and put in in the granaries for the harvest next year could be even worse than this year." The point of a granary or any store of wealth is to save the excess during the fat years so that they can be used during the lean years.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5725113591566003185?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5725113591566003185/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5725113591566003185' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5725113591566003185'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5725113591566003185'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/04/fable-of-good-king-and-bad-king.html' title='The Fable of the Good King and the Bad King'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5847694216418701859</id><published>2011-03-30T23:04:00.000-07:00</published><updated>2011-03-30T23:04:43.281-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Cloud Security</title><content type='html'>Security is not only the the number one concern for adopting cloud computing, it is also a serious barrier to the adopt-ability of cloud computing. Also, security considerations are causing the Virtual Machine (VM) operating system to evolve. All this came out at the SDForum &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Page.viewPage&amp;amp;pageId=652&amp;amp;parentID=483&amp;amp;nodeID=1"&gt;Cloud SIG&lt;/a&gt; night on Cloud Security (the presentations are on the SIG page). There were three speakers and a lot was said. I am just going to highlight a few things that struck me as important.&lt;br /&gt;&lt;br /&gt;Firstly, Dr Chenxi Wang from &lt;a href="http://www.forrester.com/"&gt;Forrester Research&lt;/a&gt; spoke on cloud security issues and trends. She highlighted the issue of compliance to various regulations and how it clashes with what the cloud providers have to offer. One concern is where data is stored, as countries have different regulations for data privacy and record keeping on individuals. If data from one country happened to be stored in another country, that could create a problem with complex legal ramifications that would be expensive to resolve. On the other side of the equation are the cloud system vendors who want to provide a generic service with as few constraints as possible. Having to give a guarantee about where data is stored would make their service offering more complicated and expensive to provide.&lt;br /&gt;&lt;br /&gt;Another more specific example of the clash between compliance and what cloud vendors provide is with the PCI security standard in credit card card industry. One PCI requirement is that all computer systems used for PCI applications are scanned for vulnerabilities at least ever three months. Most cloud vendors are unwilling to have their systems scanned for vulnerabilities for a variety of reasons, one of which I will discuss shortly. The solution may be specialized cloud services that are aimed at specific industries. IBM is experimenting with a cloud service that they claim is PCI compliant. These specific services will be more expensive and we will have wait and see whether they succeed.&amp;nbsp; &lt;br /&gt;&lt;br /&gt;Chris Richter from &lt;a href="http://www.savvis.com/en-US/Pages/Home.aspx"&gt;Savvis&lt;/a&gt;, a cloud provider spoke next. He mentioned standards as a way to resolve the issued described above. The International Standards Organization is creating the ISO 27000 suite of standards for information security. So far ISO 27001 "Information security management systems — Requirements" and ISO 27002 "Code of practice for information security management" are the most mature and relevant standards. As with other ISO standards like ISO 9000 quality standard, there is certification process which will allow cloud providers to make standards based security claims about the service that they provide.&lt;br /&gt;&lt;br /&gt;Finally, Dave Asprey from &lt;a href="http://us.trendmicro.com/us/home/"&gt;Trend Micro&lt;/a&gt; discussed the evolving nature of the VM technology that underlies cloud computing offerings. The original VMware vision was that a virtual machine would be used to develop software for a real physical machine so they spent a lot of time and effort on faithfully replication every aspect of a physical machine in their virtual machine. Now the use case has shifted to making more efficient use of resources. However, a problem is that common operations can bring a set of virtual machines to a standstill if they all decide to do the same common operation at the same time. &lt;br /&gt;&lt;br /&gt;Again, vulnerability scanning shows the problem. If the company default is that the anti-virus scan is scheduled for lunchtime Wednesday, then the whole virtual machine infrastructure can be brought to its knees when everyone's VM starts its scan at the same time. Furthermore, because many of the files being scanned may be shared by all the virtual machines, having each VM scan them is a huge waste of resources. Anti-virus software companies are working with the VM software vendors to provide a vulnerability scan that is VM aware and that uses new VM APIs to perform its function is an efficient and non-disruptive way. While this is necessary it seems to run counter to the original notion that each VM is an entirely separate entity that is completely unaware that other VMs exist.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5847694216418701859?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5847694216418701859/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5847694216418701859' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5847694216418701859'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5847694216418701859'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/03/cloud-security.html' title='Cloud Security'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-8365220713288120869</id><published>2011-03-13T15:20:00.000-07:00</published><updated>2011-03-13T15:20:14.040-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concurrency'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Database System Startups Capitulate</title><content type='html'>In the last decade, there have been many database system startups, most of them aimed at the analytics market. In the last year, several of the most prominent ones have sold out to large companies. Here are my notes on what has happened.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Netezza to IBM&lt;/b&gt;&lt;br /&gt;Netezza is database appliance that uses hardware assistance to do search. Recently it has been quite successful, with revenues getting into the $200M range. Netezza was founded in 2000 and sold out to IBM for $1.7B. The deal closed in November 2010. The Netezza hardware assistance is a gismo near the disk head that decides which data to read. Many people, myself included, think that special purpose hardware in this application is of marginal value at best. You can get better price performance and much more flexibility with commodity hardware and clever software. IBM seems to be keeping Netezza at arms length as a separate company and brand, which is unusual as IBM normally integrates the companies it buys into its existing product lines.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Greenplum to EMC&lt;/b&gt;&lt;br /&gt;Greenplum is a massive multi-processor database system. For example, Brian Dolan &lt;a href="http://bandb.blogspot.com/2010/08/mad-skills-for-big-data.html"&gt;told&lt;/a&gt; the BI SIG last year how Fox Interactive Media (MySpace) used a 40 host Greenplum database system to do their data analytics. The company was founded in 2003. The sale to EMC closed in July 2010. The price is rumoured to be somewhere at the top of the $300M to $400M range. EMC is a storage system vendor that has been growing very fast, partly by acquiring successful companies. EMC owns VMWare (virtualization), RSA (security) and many other businesses. The Greenplum acquisition adds big data to big storage. &lt;br /&gt;&lt;br /&gt;&lt;b&gt;Vertica to HP&lt;/b&gt;&lt;br /&gt;Vertica is a columnar database system for analytics. The privately held company started in 2005 with respected database guru Mike Stonebreaker as a founder. The sale was announced in February 2011. The sale price has not been announced. I have heard a rumour of $180M which seems low, although the company received only $30M in VC funding. Initially Vertica seemed to be doing well, however in the last year it seems to have lost momentum.&lt;br /&gt;&lt;br /&gt;The other interesting part of this equation is HP which used to be a big partner with Oracle for database software. When Oracle bought HP hardware rival Sun Microsystems in 2009, HP was left in a dangerous position as they did not have a database system to call their own. I was surprised that nobody &lt;a href="http://gigaom.com/2009/04/20/oracle-to-buy-sun-for-74-billion/"&gt;commented&lt;/a&gt; on this at the time. In the analytics area, HP tried to fill in with the NeoView database system, which proved to be such a &lt;a href="http://www.dbms2.com/2011/01/19/sound-bites-on-hpmicrosoft-and-neoview/"&gt;disaster&lt;/a&gt; that they recently cancelled it and bought Vertica instead. NeoView was based on the Tandem transaction processing database system. Firstly, it is difficult to get database system that is optimized for doing large numbers of small transactions to do large analytic queries well, and the Tandem system is highly optimized for transaction processing. Secondly, the Tandem database system only ran on the most expensive hardware that HP had to offer so it was very expensive to implement.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Aster Data Systems to Teradata&lt;/b&gt;&lt;br /&gt;Aster Data is a massive multi-processor database system, which in theory is a little more flexible about using a cluster of hosts than Greenplum. The company was founded in 2006 and sold out to Teradata for about $300M in March 2011. Teradata, founded in 1979 and acquired by NCR in 1991 was spun out of NCR in 2007 and since then has been sucessfully growing in the data warehouse space. It is not clear how Aster Data and Teradata will integrate their product lines. One thing is that Aster data gives Teradata a scalable offering in the cloud computing space. Teradata has been angling to get into this space for some time as we &lt;a href="http://bandb.blogspot.com/2010/07/data-management-in-cloud.html"&gt;heard&lt;/a&gt; last summer when Daniel Graham spoke the the BI SIG.&lt;br /&gt;&lt;br /&gt;Recently there have been a lot of database systems startups, and several of them are still independent. On the other side, there are not a lot of companies that might want to buy a database systems vendor. Furthermore, there is a strong movement to NoSQL databases which are easier to develop and where there are several strong contenders. The buyout prices are good, but apart from Netezza the prices are no blowout. The VCs behind these sales probably decided that they do not want to be left standing when the music stops and so sold out for a good but not great profit.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-8365220713288120869?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/8365220713288120869/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=8365220713288120869' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8365220713288120869'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8365220713288120869'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/03/database-system-startups-capitulate.html' title='Database System Startups Capitulate'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4449108323684600823</id><published>2011-02-26T21:17:00.000-08:00</published><updated>2011-02-26T21:17:12.595-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Apps'/><title type='text'>The App Store Margin</title><content type='html'>Recently there has been a &lt;a href="http://techcrunch.com/2011/02/21/readability-app-rejection/"&gt;lot&lt;/a&gt; of &lt;a href="http://blogs.forrester.com/james_mcquivey/11-02-17-apples_30_and_googles_10_fees_are_too_high"&gt;discussion&lt;/a&gt; about the Apple announcement that they are taking a 30% margin for selling subscriptions through their App store, and that Apple will also take a 30% margin for Apps that sell virtual products and subscriptions through the App. Unfortunately most of the discussion has been heat without light. That is there have been no facts to back up the arguments on either side. I had been curious about the margin in selling goods anyway, so as I had the data, I computed gross margin for publicly traded US companies in the various different retail categories.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-PmfLPZngVtQ/TWnclNu-y6I/AAAAAAAAACE/g5ZDrcDce4M/s1600/margins.JPG" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="163" src="https://lh4.googleusercontent.com/-PmfLPZngVtQ/TWnclNu-y6I/AAAAAAAAACE/g5ZDrcDce4M/s320/margins.JPG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;As you can see the margin varies between 20% and 40%. The overall average is about 25%, dominated by the Grocery and Department &amp;amp; Discount categories. For retailers working in the real world, after paying for their goods, they have to pay for their properties, staff and marketing so their net margin is considerably less. On the other hand Apple is just processing payments and delivering virtual goods over the internet. On this basis, a 30% margin seems to be on the high side, although not completely out of line.&lt;br /&gt;&lt;br /&gt;Galen Gruman at Infoworld &lt;a href="http://www.infoworld.com/d/mobilize/apples-ipad-subscriptions-troubles-all-around-832"&gt;points out&lt;/a&gt; that a higher margin tends to favor small app and content providers because they would have high distribution costs anyway. On the other hand, a large content provider resents having to hand over 30% of their revenue to Apple for not doing a lot of work. For this reason, I expect that large content providers campaign for a bulk discount on the cost of distributing their content. Thus a good and hopefully likely outcome is a sliding scale. For example, a 30% margin on the first $20,000 per month, 20% on the next $20,000, 10% on the next $20,000 and so on (I have no insight on the business so these numbers are invented as an illustration rather than a suggestion as to what the numbers should be).&lt;br /&gt;&lt;br /&gt;Part of the resentment with Apple is that they have a captive market and their behavior in stating terms appears dictatorial. They would have been much better to follow the standard politically correct procedure. That is, to put out a discussion document and then after some to and fro, imposed their terms as they always intended. It has the same end result while creating good will through a patina of choice and consultation.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4449108323684600823?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4449108323684600823/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4449108323684600823' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4449108323684600823'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4449108323684600823'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/02/app-store-margin.html' title='The App Store Margin'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='https://lh4.googleusercontent.com/-PmfLPZngVtQ/TWnclNu-y6I/AAAAAAAAACE/g5ZDrcDce4M/s72-c/margins.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-565119977467449004</id><published>2011-02-19T11:50:00.000-08:00</published><updated>2011-02-19T11:58:04.508-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><title type='text'>Agile BI at Pentaho</title><content type='html'>Ian Fyfe, Chief Technology Evangelist at &lt;a href="http://www.pentaho.com/"&gt;Pentaho&lt;/a&gt; showed us what the Pentaho Open Source Business Intelligence Suite is and where they are going with it when he spoke to the February meeting of the SDForum Business Intelligence SIG on "&lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventId=13836&amp;amp;pageId=620"&gt;Agile BI&lt;/a&gt;". Here are my notes on Pentaho from the meeting.&lt;br /&gt;&lt;br /&gt;Ian started off with positioning. Pentaho is an open source Business Intelligence Suite with a full set of data integration, reporting, analysis and data mining tools. Their perspective is that 80% of the work in a BI project is acquiring the data and getting it into a suitable form and then other 20% is reporting and analysis of the data. Thus the centerpiece of their suite is the Kettle data integration tool. They have a strong Mondrian OLAP analysis tool and Weka Data Mining tools. Their reporting tool is perhaps not quite as strong as other Open Source BI suites that started from a reporting tool. All the code is written in Java. It is fully embeddable in other applications and can be branded for that application.&lt;br /&gt;&lt;br /&gt;Ian showed us a simple example of loading data from a spreadsheet, building a data model from the data and then generating reports from the data. All of these things could be done from within the data integration tool, although they can also be done with stand alone tools. Pentaho is working in the direction of a fully integrated set of tools with common metadata between them all. Currently some of the tools are thick clients and some web based clients. They are moving to have all their client tools be web based.&lt;br /&gt;&lt;br /&gt;We had come to hear a presentation on agile BI and Ian gave us the Pentaho view. In an enterprise, the task of generating useful business intelligence is usually done by the IT department in consultation with the end users who want the product. The IT people are involved because they supposedly know the data sources and they own the expensive BI tools. Also, the tools are complicated and using them is usually too difficult for the end user. However, IT works to their own schedule, through their own processes and take their time to produce the product. Often, by the time IT has produced a report, the need for it has moved on.&lt;br /&gt;&lt;br /&gt;Pentaho provides a tightly integrated set of tools with a common metadata layer so there is no need to export the metadata from one tool and import it into the next one. The idea is that that the end to end task of generating business intelligence from source data can be done within a single tool or with a tightly integrated suite of tools. This simplifies and speeds up the process of building BI products to the point that it can be delivered while it is still useful. In some cases, the task is simplified to such an extent that it may be done by a power user rather than being thrown over the wall to IT.&lt;br /&gt;&lt;br /&gt;The audience was somewhat sceptical of the idea that a sprinkling of common metadata can make for agile BI. All the current BI suites, commercial and open source, have been pulled together from a set of disparate products and they all have rough edges in the way the components work together. I can see that deep and seamless integration between the tools in a suite will make the work of producing Business Intelligence faster and easier. Whether it will be fast enough to call agile we will have to learn from experience.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-565119977467449004?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/565119977467449004/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=565119977467449004' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/565119977467449004'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/565119977467449004'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/02/agile-bi-at-pentaho-ian-fyfe-chief.html' title='Agile BI at Pentaho'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1995308460567618797</id><published>2011-02-06T23:07:00.000-08:00</published><updated>2011-02-06T23:13:16.283-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><title type='text'>Revolution: The First 2000 Years of Computing</title><content type='html'>For years, the &lt;a href="http://www.computerhistory.org/"&gt;Computer History Museum&lt;/a&gt; (CHM) has a open storage area where they put their collection of old computers, but without any interpretation except for docent led tours. I had no problem wandering through this treasure trove because I knew a lot about what they had on show, from slide rules and abacuses to the Control Data 6600 and the Cray machines. Even then, a docent could help by pointing out features that I would miss, such as the ash tray on each workstation of the Sage early warning computer system.&lt;br /&gt;&lt;br /&gt;Now the CHM has opened their "Revolution: The First 2000 Years of Computing" &lt;a href="http://www.computerhistory.org/visit/"&gt;exhibition&lt;/a&gt;, and I recommend a visit. They still have all the interesting computer hardware as they had in the visible storage area, however it is placed in a larger space and there is all kind of interpretive help from explanation of the exhibits to video clips that you can browse. In my visit, I saw a lot of new things and learned much.&lt;br /&gt;&lt;br /&gt;For example, &lt;a href="http://en.wikipedia.org/wiki/Napier%27s_bones"&gt;Napier's Bones&lt;/a&gt; are an old time calculation aid that turns long multiplication into addition. The Napier's Bones exhibit explains how they work and allows you to do calculations using a set. The exhibit on computers and rocketry has the guidance computer for a large missile arrayed in a circle around the inside of the missile skin leaving an ominously empty space in the middle for the payload. In the semiconductor area they had examples of silicon wafers that ranged from the size of a small coin from the early days to a current wafer that is the size of a large dinner plate. There is also an interesting video discussion of the marketing of the early microprocessors like the 8086, the Z8000, the M68000 and the absolute importance of landing the design win for the IBM PC that led to the current era where Intel is biggest and most profitable chip maker. These are just a sample of the many fascinating exhibits there.&lt;br /&gt;&lt;br /&gt;I spent over 2 hours in the exhibition and only managed to get through half of it. I am a long time member of the museum and can go back any time, so this is a warning to non-members to allow enough time for their visit.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1995308460567618797?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1995308460567618797/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1995308460567618797' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1995308460567618797'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1995308460567618797'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/02/revolution-first-2000-years-of.html' title='Revolution: The First 2000 Years of Computing'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6393843267312741808</id><published>2011-02-05T15:58:00.000-08:00</published><updated>2011-02-05T16:07:51.619-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Greenplum at Big Data Camp</title><content type='html'>I was at the Big Data Camp at the Strata Big Data conference the other day and one of the breakout sessions was with &lt;a href="http://www.greenplum.com/"&gt;Greenplum&lt;/a&gt;. They had several interesting things to say on performance in the cloud, map-reduce and performance. Greenplum is a parallel database system that runs on a distributed set of servers. To the user, Greenplum looks like a conventional database server except that it should be faster and able to handle large data because it farms out the data and the workload over all the hosts in the system. Greenplum also has a map-reduce engine in the server and distributed Hadoop file system. Thus the user can use Greenplum both as a big data relational database and as a big data NoSQL database.&lt;br /&gt;&lt;br /&gt;Map-reduce is good for taking semi-structured data and reducing it to more structured data. The &lt;a href="http://bandb.blogspot.com/2008/11/understanding-map-reduce.html"&gt;example&lt;/a&gt; of map-reduce that I gave some time ago does exactly that. Thus a good use of Map Reduce is to do the Transformation part of ETL (Extract-Transform-Load), which is the way data gets into a data warehouse. The Greenplum people confirmed that this is a common use pattern for map-reduce in their system.&lt;br /&gt;&lt;br /&gt;Next was a discussion of performance. Greenplum has compared performance and asserted that their relational database is 100 times faster than their map-reduce engine for doing the same query. I was somewhat surprised by the magnitude of this number, however I know that at the low end of system and data size, a relational database can be much faster than map-reduce and at the high end there places you can go with map-reduce that conventional database servers will not go, so it is never a level comparison. I will write more on this in another post.&lt;br /&gt;&lt;br /&gt;Finally we got to performance on Virtual Machines (VMs) and in the &lt;a href="http://en.wikipedia.org/wiki/Cloud_computing"&gt;cloud&lt;/a&gt;. Again Greenplum had measured their performance and offered the following. In a place where where conditions are well controlled like a private cloud, they expect to see a 30% performance reduction from running on VMs. In a public cloud like the &lt;a href="http://aws.amazon.com/ec2/"&gt;Amazon EC2&lt;/a&gt; server cloud, they see a 4 times performance reduction. The problem in a public cloud is inconsistent speed for data access and networks. They see both an overall speed reduction and inconsistent speeds when the same query is run over and over again.&lt;br /&gt;&lt;br /&gt;It is worth remembering that Greenplum and other distributed database systems are designed to run on a set of servers with the same performance. In practice this means that the whole database system tends to run at the speed of the slowest instance. On the other hand, map-reduce is designed to run on distributed systems with inconsistent performance. The workload is dynamically balanced as the map-reduce job progresses, so map-reduce will work relatively better in a public cloud than a distributed database server.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6393843267312741808?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6393843267312741808/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6393843267312741808' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6393843267312741808'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6393843267312741808'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/02/greenplum-at-big-data-camp.html' title='Greenplum at Big Data Camp'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4164935396869247174</id><published>2011-01-31T22:54:00.000-08:00</published><updated>2011-02-02T00:31:29.586-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><title type='text'>Security in the Cloud</title><content type='html'>Although I am not an expert, I have been asked more than once about security in the cloud. Now I can help because last week I got an education on best security practices in the cloud at the SDForum Cloud SIG &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13803"&gt;meeting&lt;/a&gt;. Dave Asprey VP of Cloud Security at &lt;a href="http://us.trendmicro.com/us/home/"&gt;Trend Micro&lt;/a&gt; gave us 16 best practices for ensuring that data is safe in a public cloud like the Amazon cloud services. I will not list all of them, but here is the gist.&lt;br /&gt;&lt;br /&gt;Foremost is to encrypt all data. The cloud is a very dynamic place with instances being created and destroyed all over the place, and your instances or data storage may be moved about to optimize performance. When this happens, the residual copy of your data can be left behind for the next occupier of that space to see. Although this would happen by accident, you do not want to expose confidential data for other to see. The only cure is to encrypt all data so that whatever may be left behind is not recognizable. Thus you should only use encrypted file systems, encrypt data in shared memory and encrypt all data on the network.&lt;br /&gt;&lt;br /&gt;Management of encryption keys is important. For example, you should only allow the decryption key to enter the cloud when it is needed, and make sure that it is wiped from memory after it has been used. Passwords are a thing of the past. Instead of a password, being able to provide the key to decrypt your data is sufficient to identify you to your cloud system. There should be no password based authentication and access to root privileges should not be mediated by a password, but should be enabled as needed by mechanisms like encryption keys.&lt;br /&gt;&lt;br /&gt;Passive measures are not the end of cloud security. There are system hardening tools and security testing services. Also use an active intrusion detection system, for example &lt;a href="http://www.ossec.net/"&gt;OSSEC&lt;/a&gt;. Finally, and most importantly, the best advice is to "Write Better Applications!"&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4164935396869247174?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4164935396869247174/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4164935396869247174' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4164935396869247174'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4164935396869247174'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/01/security-in-cloud.html' title='Security in the Cloud'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5929655456384834278</id><published>2011-01-17T22:56:00.000-08:00</published><updated>2011-01-17T23:11:11.630-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='DRM'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><category scheme='http://www.blogger.com/atom/ns#' term='Media'/><title type='text'>The Steve Jobs Media Playbook</title><content type='html'>Information wants to be free. Steve Jobs is not usually associated with setting information free, however he set &lt;a href="http://www.guardian.co.uk/technology/2007/feb/07/stevejobs.copyright"&gt;music free&lt;/a&gt; and may well be on the way to set more media free. Here is the playbook that he used to set music free, and an examination of whether he can set other media free.&lt;br /&gt;&lt;br /&gt;Back at the turn of the millennium digital music was starting to make waves and Apple &lt;a href="http://www.apple.com/pr/products/ipodhistory/"&gt;introduced&lt;/a&gt; their first iPod in 2001. At the beginning, it was not a great seller. Next year the second  generation iPod that worked with Microsoft Windows came out and sales started to take off. The next problem with promoting sales of the iPod was to let people buy music directly. In those days, to buy music you had to buy a CD, rip it onto a computer and then sync the music onto the iPod.&lt;br /&gt;&lt;br /&gt;The record companies did not like digital music. It was in the process of destroying their business model of selling physical goods, that is CDs, which had been plenty profitable until the internet and file sharing had come along. Thus the record companies knew that if they were going to allow anyone to sell digital music, the music content had to be protected by a strong Digital Rights Management (DRM) system. Basically DRM encrypts digital content so that it can only be accessed by a legitimate user on a accredited device.&lt;br /&gt;&lt;br /&gt;Now there is one important thing about any encryption, it depends upon a secret key to unlock the content. If too many people know a secret, it is no longer a secret. So it made perfect sense for Apple to have their own DRM system and be responsible for keeping their secret safe. The only problem was that Apple effectively controlled the music distribution channel because of the DRM system and its secret. By providing exactly what the music business had asked for, Apple managed to wrest control of the distribution channel from them.&lt;br /&gt;&lt;br /&gt;In the past I have &lt;a href="http://bandb.blogspot.com/2007/04/times-they-are-changing.html"&gt;joked&lt;/a&gt; about the music business controlling the industry by controlling the means of production. In fact they controlled the business by controlling the distribution channel between the artists and the record stores who sold the music. When the iTunes store became the prime music distribution channel it was game over for the recording industry. They had to climb down and offer their music without DRM to escape from its deadly embrace. DRM free music has not stopped iTunes but it does open up other sales channels.&lt;br /&gt;&lt;br /&gt;The remaining question is what will happen with other media? Apple will not dominate the tablet market as it has the music player market so it will not be able to exert the same influence. On the other hand, other media is not a collectible as music. We collect music because we want to listen to it over and over again. With most other media, we are happy to consume it once and then move on. Thus we do not feel the need to own the media in the same way. I have some more thoughts that will have to wait for another time.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5929655456384834278?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5929655456384834278/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5929655456384834278' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5929655456384834278'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5929655456384834278'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2011/01/steve-jobs-media-playbook.html' title='The Steve Jobs Media Playbook'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4475227693574476080</id><published>2010-12-31T16:36:00.000-08:00</published><updated>2010-12-31T16:56:26.127-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Entertainment'/><title type='text'>The Year in Posts</title><content type='html'>Looking back at posts in this blog over the last year I see a couple of themes emerge. Firstly there were many posts on technology and media, in particular several on the &lt;a href="http://bandb.blogspot.com/2010/09/understanding-ipad.html"&gt;iPad&lt;/a&gt; which has had an extraordinary effect as the first device specifically designed for consuming media. Other issues of concern included &lt;a href="http://bandb.blogspot.com/2010/04/and-future-of-subscription-television.html"&gt;television&lt;/a&gt;, &lt;a href="http://bandb.blogspot.com/2010/02/avatar-issues.html"&gt;3D&lt;/a&gt;, &lt;a href="http://bandb.blogspot.com/2010/09/tablet-aspect-ratios.html"&gt;aspect ratios&lt;/a&gt; and the problem of &lt;a href="http://bandb.blogspot.com/2010/11/registration-dilemma.html"&gt;registration&lt;/a&gt; at web sites. We are going through huge changes in the media world as digitialization and the internet delivery system changes everything. I have written many posts on this in the &lt;a href="http://bandb.blogspot.com/2004/09/comedy-of-commons.html"&gt;past&lt;/a&gt; and I will continue to do so.&lt;br /&gt;&lt;br /&gt;The SDForum Business Intelligence SIG that I chair had a banner year with so many memorable meetings, it is difficult to pick out the best one. A fantastic talk from Google Analytics Evangelist Avinash Kaushik on "&lt;a href="http://bandb.blogspot.com/2010/04/web-analytics-20.html"&gt;Web Analytics 2.0&lt;/a&gt;" drew by far the biggest crowd. We had two great big data talks: "&lt;a href="http://bandb.blogspot.com/2010/06/winning-with-big-data.html"&gt;Winning with Big Data&lt;/a&gt;" from Michael Driscoll of Dataspora and "&lt;a href="http://bandb.blogspot.com/2010/08/mad-skills-for-big-data.html"&gt;Mad Skills for Big Data&lt;/a&gt;" from Brian Dolan, both very impressive. Donovan Schneider from SalesForce.com spoke on "&lt;a href="http://bandb.blogspot.com/2010/06/reporting-from-production-database.html"&gt;Real Time Analytics&lt;/a&gt;" and Dan Graham from Teradata spoke on "&lt;a href="http://bandb.blogspot.com/2010/07/data-management-in-cloud.html"&gt;Data Management in the Cloud&lt;/a&gt;". Finally Peter Farago and Sean Byrnes of Flurry talked about the &lt;a href="http://bandb.blogspot.com/2010/10/accidental-data-empires.html"&gt;extraordinary information&lt;/a&gt; they collect about smartphone usage that they collect from their Mobile App analytics platform. Co-chair &lt;a href="http://ororke.com/paul/blog/"&gt;Paul O'Rorke&lt;/a&gt; who organized several of these meetings has stepped down and we will miss him greatly.&lt;br /&gt;&lt;br /&gt;Finally, Blogger started collecting statistics in May of this year. Looking at the page views on this blog, my last post on "&lt;a href="http://bandb.blogspot.com/2010/12/windows-file-type-fail.html"&gt;Windows File Type Fail&lt;/a&gt;" has generated a lot of interest in the few days since it was posted. The most viewed post is a 2009 post on "&lt;a href="http://bandb.blogspot.com/2009/04/ruby-versus-scala.html"&gt;Ruby versus Scala&lt;/a&gt;" followed closely by the Windows post. In my view, the post last year about the &lt;a href="http://bandb.blogspot.com/2009/12/die-autorun-die.html"&gt;Windows Autorun&lt;/a&gt; feature is a better rant than the current one. You can feel the veins bulging in that rant whereas this years rant is very laid back in comparison. Do not worry, there are many more misfeatures of Microsoft Windows to rant about so I am not going to run out of material for a long time.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4475227693574476080?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4475227693574476080/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4475227693574476080' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4475227693574476080'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4475227693574476080'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/12/year-in-posts.html' title='The Year in Posts'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6029532242789332898</id><published>2010-12-28T22:43:00.000-08:00</published><updated>2010-12-28T23:29:44.505-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Windows'/><category scheme='http://www.blogger.com/atom/ns#' term='software engineering'/><title type='text'>Windows File Type Fail</title><content type='html'>It is that time of year when I rant &lt;a href="http://bandb.blogspot.com/2009/12/die-autorun-die.html"&gt;about&lt;/a&gt; an awful, awful, awful feature of the Microsoft Windows operating system. This year the subject of my diatribe is &lt;a href="http://en.wikipedia.org/wiki/Filename_extension"&gt;file types&lt;/a&gt;. You see, Windows thinks that every file has a type and the type connects the file to a program that can handle it. Like many "features" in Windows, file types are intended to make your life easier while in practice doing the opposite. Note that some time ago, I wrote about file systems and Content  Management as opposed to a file type manager. I still think there are  some good ideas in &lt;a href="http://http//bandb.blogspot.com/2005/11/consumer-content-management_20.html"&gt;there&lt;/a&gt; that need to be explored.&lt;br /&gt;&lt;br /&gt;If you do not know what a file type is, here is a primer. Every file has a name. The file type is a usually 3 letter extension to the name. So for example, the program for Windows Explorer, is called "explorer.exe", the dot is a separator and exe is the file type. The type exe means a program that Windows can run. To look at all the file types on an XP system, bring up the control panel, select Folder Options and then click the File Types tab. On Vista and 7, the path through the control panel is slightly different. The dialog shows a huge list of registered file types and the programs that will handle them. Note that the first few entries in the list are not representative, go down to the middle or bottom of the list so see what it is really all about.&lt;br /&gt;&lt;br /&gt;Windows goes to great length to hide file types from you. By default they are not shown anywhere and you can go for a long time without even knowing that files have types. One way to run into file types is to double click on a file with a type that Windows does not know about. Windows shows a dialog asking you what program you want to use with it. You can either look up the file type on the web or select a program from a list. The most annoying aspect is that when you select a program from a list, there is a little check box that says "Always use the selected program to open this type of file." If you test a program that does not work without unchecking the box the mistake is remembered and thereafter every time you open a file of that type, the wrong program is chosen. If you uncheck the box, a mistake is not remembered, however neither is a success. Either way, you can lose. Moreover, to recover from a mistake, you have to find the entry for the file extension in the File Types window discussed above and delete it, which is not a trivial task, given the number of file types.&lt;br /&gt;&lt;br /&gt;Another little problem with file types is that they can be wrong, confused or direct Windows  to do the wrong thing. I wrote about a &lt;a href="http://bandb.blogspot.com/2008/08/windows-woes.html"&gt;problem&lt;/a&gt; with .avi files from a Canon camera breaking Windows Explorer. There are security issues where Windows is penetrated because it trusts the file type information and then does the wrong thing with a broken file.&lt;br /&gt;&lt;br /&gt;However, the real problem with file types appears when you install a new program. Programs are greedy. They want to control as much of your experience as possible so they will try to register as many different file types as they can. If you have one program that deals with a type of file and you install another program that deals with similar files, the new program should pop up a dialog asking you which types of files it should handle. Then you have to make all sorts of complicated decisions about which file types the new program should handle.&lt;br /&gt;&lt;br /&gt;Programs for handling media are the worst in this respect because there are lots of different media types and it is common to have several media players installed to handle different special cases. For example, on my home computer I have Windows Media Player and a DVD player because they came with the system. Then there is iTunes for my iPod, the QuickTime video player that comes with iTunes, a RealPlayer for the BBC iPlayer and finally a program for ripping and burning CDs and DVDs. There may well be other media players amongst the shovelware preinstalled on the box. There are also programs for editing specific media types like at least two picture editors and a video editor or several.&lt;br /&gt;&lt;br /&gt;A typical scenario is that you are installing a new media player program because you want to use it to view a particular type of media. Unfortunately, the program installer knows about all the media types that it can handle and asks you to chose what media types types it should handle. Thus you have to disengage your thoughts from the one media type that is the object of your attention and instead start to think about all those other media types that you are not interested in. Unfortunately, there is the worry that if you give in to the new media player and let it handle certain types of media, other things will stop working. Maybe you will not be able to watch videos, or maybe videos will stop syncing with your portable media player because you changed the program associations with a particular file type. Given the complexity of these systems, who knows what may go wrong.&lt;br /&gt;&lt;br /&gt;I said that the media player installer should ask you which file types you want associated with the program. A few years ago, Real managed to destroy much of their franchise by not playing nice and fair with file types. The RealPlayer installer switched all file types that it could handle to use the RealPlayer without bothering to ask or notify. Worse, if you went in and installed another program that changed the file type associations or even tried used the File Types dialog screen to change file type associations, it would just change them back to the RealPlayer, again without a notification. When this came to light, many people, myself included, uninstalled RealPlayer and swore never to install any software from Real again. Recently I caved on this resolution so that I could listen to old BBC radio shows like "The Goon Show" with the BBC iPlayer which it turns out to be just a rebadged Real player.&lt;br /&gt;&lt;br /&gt;Since the RealPlayer imbroglio, installer programs have been a lot more careful about asking users about file types, but that just throws the problem back to the user. As the whole point of file types is to hide system complexity from the user, this it is no solution at all. A better path is to do without file types. Why are they necessary? Do they really serve a purpose? Other operating systems get along fine without file types, so why does Windows need them. Lets just throw them out and make life easier.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6029532242789332898?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6029532242789332898/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6029532242789332898' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6029532242789332898'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6029532242789332898'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/12/windows-file-type-fail.html' title='Windows File Type Fail'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4195154531032455296</id><published>2010-12-20T22:21:00.000-08:00</published><updated>2010-12-20T22:35:21.954-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='UI'/><title type='text'>Is That Annoying Modal Caps Lock Key Going Away?</title><content type='html'>So Google came out with their new Chrome Operating System, loaded it onto a laptop and gave the whole caboodle of people to play with and comment on. While Chrome OS has generated a lot of comments, the largest and most active discussion has been about the Caps Lock key. You see, Google has changed the behavior of the key that used to be Caps Lock to instead call up a search page. I am sure this change was made to pander to keyboard weenies who want to Google without having to lift their hands from the keyboard. Anyway, the change has backfired. Instead of talking about Chrome OS, everyone is engaged in a &lt;a href="http://hardware.slashdot.org/story/06/08/16/1225239/War-Declared-on-Caps-Lock-Key"&gt;furious&lt;/a&gt; &lt;a href="http://www.slate.com/id/2277926/"&gt;discussion&lt;/a&gt; of why the Caps Lock is either essential or should have been disposed of a long time ago.&lt;br /&gt;&lt;br /&gt;I have two problems with the Caps Lock, no make that three. The first problem is that it sits right between two important keys. Below is the Shift key whose importance needs no explanation. Above it is the Tab key, used for next field, command completion, automatic indent and plenty of other useful purposes. In the middle sits Caps Lock just waiting to be hit by accident. This brings to the next problem, Caps Lock is modal. Hit the Caps Lock key by accident and you do not make just one typing mistake, rather the whole keyboard is shifted into a new mode and the error compounds. By the time I look at the screen, I have typed half a sentence in the wrong case.&lt;br /&gt;&lt;br /&gt;I am a member of the tribe that hates modal user interfaces with a passion. Some of my compatriots physically remove the Caps Lock key or reprogram their keyboard to reduce typing errors. I have only gone as far as to disable that other annoying modal key. The Insert key is used by many editors to switch between insert mode and overtype mode. If you hit Caps Lock by accident, the result is obvious, if you hit Insert by accident you can go on for some time before you realize that you are seriously damaging the document that you are trying to fix up. Of course, the Insert key is slightly off the main keyboard, right above the really useful Delete key and just waiting to be hit by accident.&lt;br /&gt;&lt;br /&gt;My final problem with the Caps Lock key is that if you are in Caps Lock mode and you press shift, it reverts back to entering lower case. This means that when I hit cAPS lOCK by accident every key I type is in the wrong case, not just some of them. I happen to have an old typewriter from the 1930's so I know what shift really means. The Shift key causes the whole paper carriage and platen to move so that when the typebar comes down a different type piece strikes the ink ribbon and paper. Shifting the platen is why it is called the Shift key and it is a heavy key to hold, so there is a Shift Lock key that is a mechanical lock to hold the platen in the shifted position. With the platen locked in the shift position, hitting the shift key does nothing, so why has someone gone to the trouble of programming bogus behavior in out modern and supposedly more convenient keyboards?&lt;br /&gt;&lt;br /&gt;Now, I know that there are people who love the Caps Lock key and who use it all the time. For my part, given the choice between a key that causes a small typing mistake  every time I hit it by accident and a key that brings up a new web page by accident, I will choose the Caps Lock function every time. Caps Lock is annoying but I have lived with it for a long time and it is much smaller surprise than a new page that I do not want.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4195154531032455296?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4195154531032455296/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4195154531032455296' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4195154531032455296'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4195154531032455296'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/12/is-that-annoying-modal-caps-lock-key.html' title='Is That Annoying Modal Caps Lock Key Going Away?'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5921327010532360312</id><published>2010-12-18T13:12:00.000-08:00</published><updated>2010-12-18T13:51:06.054-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>The Gawker Password Fiasco</title><content type='html'>Last month I wrote about &lt;a href="http://bandb.blogspot.com/2010/11/write-down-your-password.html"&gt;password security&lt;/a&gt;, just a little too soon. This month the popular blog site owner Gawker admitted to a huge security breach where hackers had broken into their web servers and stolen their entire database of user account names with email addresses and passwords. The attack has brought password security to every ones attention, with people reporting that their email and other accounts have been compromised. There are a lot of discussions of protocols for password security with good information, and unfortunately there is also a lot of misinformation. Here is my take.&lt;br /&gt;&lt;br /&gt;The Forbes magazine web-site has a clear &lt;a href="http://blogs.forbes.com/firewall/2010/12/13/the-lessons-of-gawkers-security-mess/"&gt;description&lt;/a&gt; of the attack on Gawker, (although their discussion of the password encryption is not correct). The short story is that the break-in was done by a hacker group called Gnosis who were annoyed by Gawker. Frankly, given Gawker's arrogant style, who has not been annoyed by them at some time? Gnosis first broke in to Gawker in July and got the passwords to accounts for Nick Denton and 16 other staffers there. In November, Denton noticed some possible tampering in a web account, and finally in December Gnosis announced their break in and released data they had gathered.&lt;br /&gt;&lt;br /&gt;Although, Gawker had used encryption to hide the users passwords, they are susceptible to a brute force attack and many passwords have been broken. Gawker lost over 1 million accounts and more than 100,000 passwords have been cracked and published so far. The Wall Street Journal has a &lt;a href="http://blogs.wsj.com/digits/2010/12/13/the-top-50-gawker-media-passwords/"&gt;nice analysis&lt;/a&gt; of the most popular passwords including a frequency graph.&lt;br /&gt;&lt;br /&gt;There is a lot of misunderstanding about how passwords are stored on a web site and how a brute force attack takes place. For example, the Forbes article I mentioned earlier obviously does not have a clue. I do not know for certain how Gawker protects their passwords, however the best practice is to use a salted hash. With this technique, the web-site chooses a salt, which is just a random string of characters. When a user sets a password, the salt is appended to the password and the whole string is hashed with a &lt;a href="http://en.wikipedia.org/wiki/Cryptographic_hash_function"&gt;cryptographic hash&lt;/a&gt; function like SHA-1. The resulting hash value is a seemingly random string of bits, and this is stored as the encrypted users password. When the user wants to log in, the salt is added to the supplied password, the resulting string hashed, and the hash value compared to the saved hash. If they are the same, the user must have provided the correct password and is allowed to log in. By using a salted hash, the web-site does not save the users password, they just save a cryptographic hash that is used to confirm that the user knows their password. To make things more secure, the web-site can save a different salt for each user or just add the user name to a common salt so that even if two users have the same password, the salted hash of their passwords are not the same.&lt;br /&gt;&lt;br /&gt;In a brute force attack the attacker knows the algorithm used to generate the salted hash and has the salted hash of the password. The attacker generates a list of potential passwords, applies the password checking algorithm to each password and if the results are the same, they have guessed the users password. If the attacker can try 20 passwords a second, they can test well over a million passwords a day on a single computer.&lt;br /&gt;&lt;br /&gt;It is very easy to generate a list of potential passwords. One good starting point is a list of broken passwords, such as published by Gnosis from the attack on Gawker. The next step is a dictionary of common words and proper names. Many applications have a spelling dictionary that can be used as a starting point. Then try some simple variations like adding a number to the beginning or of words, capitalizing letters in the word and make common substitutions for letters such as 1 for the letter 'i' and 5 or $ for 's'.&lt;br /&gt;&lt;br /&gt;So now that you now how it is done, think about your passwords and how easy they can be attacked by brute force, and excuse me while I go and change some of mine.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5921327010532360312?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5921327010532360312/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5921327010532360312' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5921327010532360312'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5921327010532360312'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/12/gawker-password-fiasco.html' title='The Gawker Password Fiasco'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2182460490641514509</id><published>2010-12-11T16:09:00.000-08:00</published><updated>2010-12-11T16:28:26.545-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><category scheme='http://www.blogger.com/atom/ns#' term='Visualization'/><title type='text'>Now You See It: the Book</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_rAvqFiewcf4/TQQWx1XbCeI/AAAAAAAAAB0/uRyDLIG1iOE/s1600/31Drlf3dfSL._SL160_.jpg"&gt;&lt;img style="float: left; margin: 0pt 10px 10px 0pt; cursor: pointer; width: 121px; height: 160px;" src="http://1.bp.blogspot.com/_rAvqFiewcf4/TQQWx1XbCeI/AAAAAAAAAB0/uRyDLIG1iOE/s320/31Drlf3dfSL._SL160_.jpg" alt="" id="BLOGGER_PHOTO_ID_5549585686222014946" border="0" /&gt;&lt;/a&gt;If you are of a data analytics bent or know someone who is and are looking for a book to put on the Christmas list, consider &lt;a href="http://www.amazon.com/gp/product/0970601980?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0970601980"&gt;Now You See It: Simple Visualization Techniques for Quantitative Analysis&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0970601980" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;  by Stephen Few. This is a beautiful book that would not look out of place on a coffee table, yet at the same time, is full of practical information about how to do analytics with charts, graphs and other visual tools.&lt;br /&gt;&lt;br /&gt;The book is divided into three sections. The first section covers visual perception and general visualization techniques for looking at data. Then the second section goes into more detail with chapters on specific techniques for different types of analysis including time-series analysis, ranking analysis, deviation analysis and multivariate analysis amongst others. Each chapter in this section ends with a summary of the techniques and best practices for that type of analysis. Finally the book ends with a shorter section that looks at promising new trends in visualization.&lt;br /&gt;&lt;br /&gt;There are copious examples of graphs and charts drawn by different software tools. While some of these graphs come from high end tools like &lt;a href="http://bandb.blogspot.com/2007/01/tableau-software.html"&gt;Tableau&lt;/a&gt; and &lt;a href="http://bandb.blogspot.com/2008/11/spotfire.html"&gt;Spotfire&lt;/a&gt;, others are drawn by Microsoft Excel. In fact there are several specific procedures for using features of Excel to do sophisticated analytics. That is not to say that the book suggests that you can do everything with a spreadsheet. The first part shows you what to look for in visual analytics software and it essential reading before going out and choosing which tool to use.&lt;br /&gt;&lt;br /&gt;So, if you are looking for a quality and practical gift for an analytician, choose "Now You See It".&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2182460490641514509?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2182460490641514509/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2182460490641514509' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2182460490641514509'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2182460490641514509'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/12/now-you-see-it-book.html' title='Now You See It: the Book'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_rAvqFiewcf4/TQQWx1XbCeI/AAAAAAAAAB0/uRyDLIG1iOE/s72-c/31Drlf3dfSL._SL160_.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3546002779840215811</id><published>2010-11-30T23:13:00.000-08:00</published><updated>2010-11-30T23:49:57.889-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>The Registration Dilemma</title><content type='html'>To register or not to register, that is the question:&lt;br /&gt;Whether 'tis better to create a new online account,&lt;br /&gt;or just make do with with the existing ones,&lt;br /&gt;and so lead a slightly less ennobled life.&lt;br /&gt;&lt;br /&gt;Online account registration is a barrier, something that we are all thinking about as this is the season for buying stuff. As I said &lt;a href="http://bandb.blogspot.com/2010/11/write-down-your-password.html"&gt;previously&lt;/a&gt;, I have about 70 online accounts where I actively maintain a user identity, and I have created many many more. Thus every time I am presented with the choice of registering for a new site, I stop and think, do I really want to create another account? In the past couple of weeks I have decided to forgo on creating 3 new online accounts and just stick to my well traveled paths.&lt;br /&gt;&lt;br /&gt;Registration is not always thought of as a bad thing. For example, &lt;a href="http://500hats.typepad.com/"&gt;Dave McLure&lt;/a&gt;, Master of 500 Hats, micro Venture Capitalists and relentless promoter of analytics to improve web based businesses, has Activation as the second of his 5 step program to web enterprise success. Now Activation does not necessarily imply Registration, however Registration is the most common and strongest form of Activation. Dave's perspective is that to succeed on the net, your product needs to be strong enough to overcome any barriers to Activation.&lt;br /&gt;&lt;br /&gt;There have been many initiatives to vault over the registration hurdle. The most promising one is &lt;a href="http://openid.net/"&gt;OpenId&lt;/a&gt;, an open system that allows you to use your account at one web site to log onto other web sites. A couple of years ago I thought that this was a good solution to the Single Sign-on problem and worth promoting. Now OpenId seems to be moribund and it is not widely used. I am not sure what happened, but I did hear rumors of a argument and a split which diminished the organization.&lt;br /&gt;&lt;br /&gt;One of the problems with OpenId and any other system is that it tends to favor and strengthen the big players like Yahoo and Google. Another idea the people often trot out is some form of micro-payments system that would obviate the need for registration at many sites. There are a couple of problems. Firstly, any payment is its own barrier, and creating many little barriers instead of one is not a path that is likely to lead to success. For a broader discussion of this issue I recommend the book &lt;a href="http://www.amazon.com/gp/product/B00342VEP6?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=B00342VEP6"&gt;Free&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=B00342VEP6" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt; by Chris Anderson.&lt;br /&gt;&lt;br /&gt;The second problem is that a successful micro-payment system will favors and strengthen the big players that operate it. It has to be a big player as no one is going to trust their payments to some small and unknown start-up. In practice, the only really successful micro-payment site is iTunes, and it shows up all these &lt;a href="http://bandb.blogspot.com/2007/04/times-they-are-changing.html"&gt;problems&lt;/a&gt;. In the beginning we all cheered as Steve Jobs took on the record companies. Now that iTunes is the leading purveyor of music, many people have taken to railing against the power of Apple.&lt;br /&gt;&lt;br /&gt;The Registration Dilemma is this. We can either continue with the current system that has a chaos of millions of sites, each with their own registration that we need to manage, or we can give in to consolidation and just deal with a few giants. Every time I think about it, I end up siding with chaos.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3546002779840215811?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3546002779840215811/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3546002779840215811' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3546002779840215811'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3546002779840215811'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/11/registration-dilemma.html' title='The Registration Dilemma'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6326568260624217522</id><published>2010-11-16T21:33:00.000-08:00</published><updated>2010-11-16T22:08:19.582-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><title type='text'>Yeah, Yeah, Yeah</title><content type='html'>This morning I woke up to the local newspaper headline "&lt;span style="font-weight: bold;"&gt;Do you want to know a Secret?&lt;/span&gt;", and knew that something was going on. Later they changed their &lt;a href="http://www.mercurynews.com/business/ci_16626983?nclick_check=1"&gt;tune&lt;/a&gt; to something more like the The Wall Street Journal which starts their &lt;a href="http://online.wsj.com/article/SB10001424052748703326204575617004052395816.html?mod=e2tw"&gt;piece&lt;/a&gt; "Steve Jobs is nearing the end of his &lt;span style="font-weight: bold;"&gt;long and winding&lt;/span&gt; pursuit of the Beatles catalog." Other newspapers had headlines like "&lt;a href="http://www.sunherald.com/2010/11/16/2645601/all-you-need-is-itunes.html"&gt;&lt;span style="font-weight: bold;"&gt;All you need is&lt;/span&gt;&lt;/a&gt; iTunes", "&lt;a href="http://www.irishtimes.com/newspaper/features/2010/1117/1224283516673.html"&gt;&lt;span style="font-weight: bold;"&gt;Let it be&lt;/span&gt;&lt;/a&gt; Available" and "Apple and The Beatles finally &lt;a href="http://www.usatoday.com/tech/news/2010-11-16-beatles-itunes_N.htm"&gt;&lt;span style="font-weight: bold;"&gt;come together&lt;/span&gt;&lt;/a&gt; on iTunes". All in all, it seems like bunch stupid headline tricks from the old media, a sure sign that they are getting past it.&lt;br /&gt;&lt;br /&gt;Meanwhile the new media is a lot more standoffish. Wired News is like "&lt;a href="http://www.wired.com/threatlevel/2010/11/beatles-itunes-yawn/"&gt;Yawn&lt;/a&gt;". TechCrunch is all &lt;a href="http://techcrunch.com/2010/11/16/17-beatles-albums-top-100/"&gt;business&lt;/a&gt; with "All 17 Beatles Albums Are In The Top 100 On iTunes". Of course Fake Steve Jobs had a &lt;a href="http://www.fakesteve.net/2010/11/you-will-never-forget-this-day-even-if-you-live-a-thousand-years.html"&gt;field day&lt;/a&gt;, providing by far the best commentary on the whole event.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6326568260624217522?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6326568260624217522/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6326568260624217522' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6326568260624217522'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6326568260624217522'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/11/yeah-yeah-yeah.html' title='Yeah, Yeah, Yeah'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5182865927833193139</id><published>2010-11-15T23:37:00.000-08:00</published><updated>2010-11-15T23:49:19.025-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><title type='text'>Open Source Coopetition</title><content type='html'>&lt;a href="http://en.wikipedia.org/wiki/Coopetition"&gt;Coopetition&lt;/a&gt; is the driving force behind many of the best Open Source projects. In the past, I have written about several &lt;a href="http://bandb.blogspot.com/2009/02/open-source-business-models.html"&gt;different&lt;/a&gt; &lt;a href="http://bandb.blogspot.com/2004/10/open-source-dilemma.html"&gt;reasons&lt;/a&gt; that Open Source projects exist. There are business models like the low cost sales channel. Open Source can act as a home for old software that is still useful, but not commercially essential to a business. There have been attempts to use Open Source as a weapon, to suck the air out of a competitors lungs, by devaluing the intellectual property of the competitor, although many of these attempts have been less successful than their originator hoped.&lt;br /&gt;&lt;br /&gt;A &lt;a href="http://www.slideshare.net/hadoop/practical-problem-solving-with-apache-hadoop-pig"&gt;presentation&lt;/a&gt; on Hadoop got me thinking about Coopetition and Open Source. &lt;a href="http://hadoop.apache.org/"&gt;Hadoop&lt;/a&gt; is a big Open Source project to implement all the components of what I have called the &lt;a href="http://bandb.blogspot.com/2008/10/google-database-system.html"&gt;Google Database System&lt;/a&gt; and a lot more. The major contributors to Hadoop are Yahoo!, Facebook and Powerset - now a part of Microsoft. While these companies are related in that Microsoft owns a stake in Facebook, has tried to buy Yahoo! and now Yahoo! uses Microsoft's Bing search engine, they are also competitors, fighting each other for the attention of web users.&lt;br /&gt;&lt;br /&gt;So is it strange that these three companies should cooperate to build Hadoop, an incredibly useful and widely used Open source project? Firstly, the genius of Open Source is that they are not cooperating directly with each other they are all contributing code to a third party, the non-profit Apache foundation that oversees the Hadoop project. Secondly, by spreading the cost of the software over many contributors, they all gain much more than they contribute. Finally, many eyes and the public nature of the code tends to make it better than code that is bottled up in secret and protected from prying eyes. Because the Open Source model allows for the kind of coopetition that brings us software like Hadoop, we all benefit.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5182865927833193139?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5182865927833193139/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5182865927833193139' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5182865927833193139'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5182865927833193139'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/11/open-source-coopetition.html' title='Open Source Coopetition'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6295898878367523461</id><published>2010-11-11T23:25:00.000-08:00</published><updated>2010-11-11T23:42:19.787-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>Write Down Your Password</title><content type='html'>If Bruce Schneier &lt;a href="http://www.schneier.com/blog/archives/2010/11/changing_passwo.html"&gt;says&lt;/a&gt; that you should write down your password, then write down your password. What he means is that given the choice between having a weak password that is so easy to remember that you do not need to write down and a strong password that you do need to write down to remember, it is better to go for the strong password. However, the problem of online identity management is much more complicated. Note that even the terminology is broken. We need to distinguish "online reputation management" which is about managing your personal brand online, with "online identity management" which is about managing how you authorize yourself with websites. Often, the term online identity management is &lt;a href="http://en.wikipedia.org/wiki/Online_identity_management"&gt;used&lt;/a&gt; for online reputation management.&lt;br /&gt;&lt;br /&gt;The problems of online identity management starts long before you need to provide a password. First you have to provide a user name. Each site has its own rules about what your user name should be. About half of web sites use an email address as a user identifier, while the other half insist that you play the game of user name roulette where you have to keep guessing a user name until you find one that has not been used. I have enough different user names that I have to write down my user name for each site, before even thinking about writing down a password.&lt;br /&gt;&lt;br /&gt;Next problem is the large number of sites where you have an account. I have about 70 sites where I actively maintain a user identity, and there are many more sites where I have registered an identity and then abandoned. Of those 70 site, about 15 are sites like banking sites that are important to protect with a strong password. &lt;br /&gt;&lt;br /&gt;One site that is particularly important to protect is your email account. Use a strong password with your email account and do not use that password on any other account. If your email account is compromised, you are in a lot of trouble. For example, many sites allow you to reset your password by mailing you a new one. Remember, an attacker who gains access to your email account is able to read your email including emails from other sites where you are registered. Many sites store your email address and password, so if they are compromised, and you use the same password for all accounts, the attacker has got your email address and the password to the account.&lt;br /&gt;&lt;br /&gt;Another serious &lt;a href="http://bandb.blogspot.com/2008/05/software-tools-used-by-criminals.html"&gt;problem&lt;/a&gt; is any account that gives you access after answering security questions. The security questions are effectively another password and they encourage answers that are easy to guess. You are better giving nonsense answers to security questions, except for the fact that you now need to write down the answers to those questions as well. All in all, online identity management is a pain.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6295898878367523461?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6295898878367523461/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6295898878367523461' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6295898878367523461'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6295898878367523461'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/11/write-down-your-password.html' title='Write Down Your Password'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3389487473182929826</id><published>2010-10-31T21:25:00.000-07:00</published><updated>2010-10-31T21:47:38.708-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Concurrency'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><title type='text'>The New OLAP</title><content type='html'>Just as there are new approaches to database management with the NoSQL movement, so is there a move to a new OLAP, although this movement is just emerging and has not taken a name yet. This month at the SDForum Business Intelligence SIG meeting, Flurry &lt;a href="http://bandb.blogspot.com/2010/10/accidental-data-empires.html"&gt;talked&lt;/a&gt; about how they put their data on mobile app usage in a giant data-cube. More recently, Chris Riccomini of LinkedIn &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13769"&gt;spoke&lt;/a&gt; to the SDForum SAM SIG about the scalable data cubing system that they have developed. Here is what I learned about Avatara, the LinkedIn OLAP server. DJ Cline has also written a &lt;a href="http://www.djcline.com/2010/10/30/oct-27-2010-sdf-linkedin-avatara/"&gt;report&lt;/a&gt; of the event.&lt;br /&gt;&lt;br /&gt;If you do not know what OLAP is, I had hoped to just point to an online explanation, but could not find any that made sense. The Wikipedia &lt;a href="http://en.wikipedia.org/wiki/OLAP_cube"&gt;entries&lt;/a&gt; are pretty deplorable, so here is a short description. Conceptually, &lt;a href="http://www.olapcouncil.org/research/glossaryly.htm"&gt;OLAP&lt;/a&gt; stores data in a multi-dimensional data cube, and this allows users to look at the data from different perspectives in real time. For example, take a simple cube of sales data has three dimensions, a date dimension, a sales person dimension, and a product dimension. In reality, OLAP cubes have more than dimensions than this. Each dimension contains a hierarchy, so the sales person dimension groups sales person by state then sales region, then country. At the base level the cube contains a data point called a measure for each sale of each product made by each sales person and the date when the sales was made. OLAP allows the user to look at the data in aggregate, and then drill down on the dimensions. In the example cube, a user could start by looking at the sales of all products grouped by quarter. Then they could drill down to look at the sales in the most recent quarter divided by sales region. Next they could drill down again to look at sales in the most recent quarter by sales person, comparing say the the Northern region to the Western region, and so on.&lt;br /&gt;&lt;br /&gt;The new OLAP is enabled by the same forces that are changing databases with &lt;a href="http://nosql-database.org/"&gt;NoSQL&lt;/a&gt;. Firstly, the rise of commodity hardware that runs Linux, the commodity operating system, allows the creation of cheap server farms that encourages parallel distributed processing. Secondly, the inevitable march of &lt;a href="http://bandb.blogspot.com/2005/10/moores-law.html"&gt;Moore's law&lt;/a&gt; is increasing the size of main memory so that now you can spec a commodity server with more main memory that a commodity server had in disk space 10 years ago. An OLAP data cube can be partitioned along one or more of its dimensions to be distributed over a server farm, although at this stage partitioning is more of a research topic than standard practice. Huge main memory allows large cubes to reside in main memory, giving near instantaneous response to queries. For another perspective on in memory OLAP, &lt;a href="http://www.bi-verdict.com/fileadmin/FreeAnalyses/Comment_InMemBI.htm"&gt;read&lt;/a&gt; the free commentary by Nigel Pendse at the BI-Verdict (it used to be called the OLAP Report) on "What in-memory BI ‘revolution’?"&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.linkedin.com/"&gt;LinkedIn&lt;/a&gt; is a fast growing business oriented social networking site. They have developed Avatara to support their business needs and currently run several cubes on it. The plan is to to open source the code later this year. Avatara is an in memory OLAP server that uses partitioning to provide scalability beyond the capabilities of a single server.&lt;br /&gt;&lt;br /&gt;The presentation was fast paced and it has taken me some time to appreciate the full implications of what was said. Here are some thoughts. Avatara offers an API that is reminiscent of &lt;a href="http://www.jcp.org/en/jsr/detail?id=69"&gt;Jolap&lt;/a&gt; rather than the &lt;a href="http://msdn.microsoft.com/en-us/library/ms145506.aspx"&gt;MDX&lt;/a&gt; language that is the standard way of programming OLAP, probably because an API is easier to implement than a programming language. Avatara does not support hierarchies in its dimensions, but the number of dimension in a typical cube seems to be higher than usual. It may be that they use more dimensions rather than hierarchies within a dimension to represent the same information. This is a trade off of roll-up within the cube for slicing of dimensions. Slicing is probably more efficient and easier to implement while a hierarchy is easier for the user to understand as it allows for drill up and down.&lt;br /&gt;&lt;br /&gt;Chris mentioned that most dimensions are small and that can be true, however the real problems with OLAP implementations start when you have more than one large dimension and you have to deal with the issue of sparsity in the data cube. Chris spent some time on the problem of a dimension with more than 4 billion elements and this seems to be a real requirement at LinkedIn. Current OLAP servers seem to be &lt;a href="http://en.wikipedia.org/wiki/Comparison_of_OLAP_Servers"&gt;limited&lt;/a&gt; to 2 billion elements in a dimension, so they are going to be even more constraining than Avatara.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3389487473182929826?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3389487473182929826/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3389487473182929826' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3389487473182929826'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3389487473182929826'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/10/new-olap.html' title='The New OLAP'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1698700752385926712</id><published>2010-10-24T23:32:00.000-07:00</published><updated>2010-10-24T23:48:53.645-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Apps'/><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><title type='text'>Accidental Data Empires</title><content type='html'>In the new world of big data and analytics a winning business model is to find a novel way to collect interesting big data. Once you have the data, the ways to exploit it are endless. It is a phenomenon that I have seen several times, the latest example is &lt;a href="http://www.flurry.com/"&gt;Flurry&lt;/a&gt;, a company that collects and aggregates data from mobile applications. Peter Farago, VP Marketing, and Sean Byrnes, CTO abd Co-founder of Flurry spoke to the October &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13765"&gt;meeting&lt;/a&gt; of the SDForum Business Intelligence SIG on "Your Company’s Mobile App Blind Spot".&lt;br /&gt;&lt;br /&gt;The Flurry proposition is simple, they offer a toolkit that an app developer combines with their mobile app. The app developer goes to the Flurry website, creates a free account and downloads the toolkit. Whenever an instance of the app with the Flurry code is activated or used, it collects information about the usage that is sent back to the Flurry. The amount of information is small, usually about 1.2 kB compressed, so the burden of collection is small. At Flurry, the data is collected, cleansed and put in a gigantic data cube. At any time, an app developer can log into the Flurry website and get reports on how their application is being used. You can get a feel for their service by taking the short Analytics developer &lt;a href="http://www.flurry.com/product/analytics/tour.html"&gt;tour&lt;/a&gt;. Flurry have committed that their Analytics service will always be free.&lt;br /&gt;&lt;br /&gt;While there are some issues with data collection that Flurry deals with, the quality of the data is great. Every mobile phone has a unique identifier so there is no problem with identifying individual usage patterns. As the service is free, there is very little friction to its use. Flurry estimates that they are in one in five mobile apps that are out there. In fact, for an app developer, the only reason for not using Flurry is that they have chosen to use a rival data collection service.&lt;br /&gt;&lt;br /&gt;In the end however, the big winner is Flurry, who collect huge amounts of information about mobile app and phone usage. In the meeting Peter Farago gave us many different analyses of where the mobile smartphone market is and where it is going, including adoption rates for iPhones versus Android based phones and how the follow on market for apps on each platform is developing. You can get a mouthwatering feel for the information they presented by looking at their blog in which they publish a series of analyses from their data. As I write their latest &lt;a href="http://blog.flurry.com/"&gt;post&lt;/a&gt; shows a graph on the "Revenue Shift from Advertising to Virtual Goods Sales" which shows that apps are growing their revenue from sales of virtual goods, while advertising revenue seems to be stagnant.&lt;br /&gt;&lt;br /&gt;With data aggregators, there is always something creepy when you discover just how much data they have on you. Earlier this year there was an incident where a Flurry blog &lt;a href="http://blog.flurry.com/bid/30019/Apple-Tablet-The-Second-Stage-Media-Booster-Rocket"&gt;post&lt;/a&gt; described some details of the iPad a few days before it was announced that they had gleaned from apps running on these new devices in the Apple offices. Steve Jobs was so provoked by this that he called out Flurry by name and changed the iPhone app developer terms of service to prevent apps from collecting certain sorts of data. You can read more about this incident in the blog &lt;a href="http://ororke.com/paul/blog/?p=743"&gt;report&lt;/a&gt; on the meeting by my colleague Paul O'Rorke.&lt;br /&gt;&lt;br /&gt;The title of this piece is a reference to the entertaining and still readable book &lt;a href="http://www.amazon.com/gp/product/0887308554?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0887308554"&gt;Accidental Empires&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0887308554" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt; by Robert X. Cringely about the birth of the personal computer industry and the rivalry between Steve Jobs and Bill Gates.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1698700752385926712?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1698700752385926712/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1698700752385926712' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1698700752385926712'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1698700752385926712'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/10/accidental-data-empires.html' title='Accidental Data Empires'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3611511091745125181</id><published>2010-10-13T23:45:00.000-07:00</published><updated>2010-10-14T00:02:47.691-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><category scheme='http://www.blogger.com/atom/ns#' term='programming languages'/><title type='text'>A Critique of SQL</title><content type='html'>SQL is not a perfect solution as I told the audience at the SDForum Business Intelligence SIG September &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13757"&gt;meeting&lt;/a&gt;, where I spoke about "Analytics: SQL or NoSQL". The presentation discusses the difference between SQL and structured data on the one hand versus the NoSQL movement and semi-structured data on the other hand. There is more to the presentation than I can fit in one blog post, so here is what I had to say about the SQL language itself. I will write more about the presentation at another time. You can &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Page.ViewPage&amp;amp;PageID=620"&gt;download&lt;/a&gt; the presentation from the BI SIG web site.&lt;br /&gt;&lt;br /&gt;Firstly the good. SQL has given us a model of a query language that seems so useful as to be essential. Every system that provides persistence has developed a query language. Here are a smattering of examples. The Hibernate object persistence system has Hibernate Query Language (HQL) which has been developed into the Java Persistence Query language (JPQL). Other Java based object oriented persistence systems either use JPQL or their own variant. Hive is a query interface built on top of the Hadoop Map-Reduce engine. Hive was initially developed by Facebook as a simplified way of accessing their Map-Reduce infrastructure when they discovered that many of the people who need to write queries did not have the programming skills to handle a raw Map-Reduce environment. XQuery is a language for querying a set of XML documents. It has been adopted into the SQL language and is also used with stand alone XML systems. If data is important enough to persist, there is almost always a requirement to provide a simple and easy to use reporting system on that data. A query language handles the simple reporting requirements easily.&lt;br /&gt;&lt;br /&gt;On the other hand, SQL has many problems. Here is my thoughts on the most important ones. The first problem is that SQL is not a &lt;a href="http://en.wikipedia.org/wiki/Turing_completeness"&gt;programming language&lt;/a&gt;, it is a data access language. SQL is not designed for writing complete programs, it is intended to fetch data from the database and then anything more than a simply formatted report is done in another programming language. This concept of a data access language for accessing a database goes back to the original concept of a database as promulgated by the &lt;a href="http://en.wikipedia.org/wiki/CODASYL"&gt;CODASYL&lt;/a&gt; committee in the late 1960's.&lt;br /&gt;&lt;br /&gt;While most implementations of SQL add extra features to make it a complete programming language, they do not solve the problem because SQL is a language unlike any of the other other programming language we have. Firstly, SQL is a relational language. Every statement in SQL starts with a table and results in a table. (Table means a table like in a document, a fixed number of columns and as many rows as are required to express the data.) This is a larger chunk of data than programmers are used to handling. The procedural languages that interface to SQL expects to deal with data at most a row at a time. Also, the rigid table of SQL does not fit well into the more flexible data structures of procedural languages.&lt;br /&gt;&lt;br /&gt;Moreover SQL is a declarative language where you specify the desired results and the database system works out the best way to produce them. Our other programming languages are procedural where you describe to the system how it should produce the desired result. Programming SQL requires a different mindset from programming in procedural languages. Many programmers, most of whom just dabble in SQL as a sideline, have difficulty making the leap and are frustrated by SQL because it is just not like the programming languages that they are used to. The combination of a relational language and a declarative language creates a costly mismatch between SQL and our other programming systems.&lt;br /&gt;&lt;br /&gt;Finally, SQL becomes excessively wordy, repetitive and opaque as the queries becomes more complicated. Sub-Queries start to abound and the need for correlated sub-queries, outer joins and pivoting data for presentation cause queries to explode in length and complexity. Analytics is the province of complicated queries so this is a particular problem for data analysts. In the past I have &lt;a href="http://bandb.blogspot.com/2009/02/do-we-need-new-programming-language.html"&gt;suggested&lt;/a&gt; that persistence is a ripe area for a new programming language, however although there are many new languages being proposed none of them are concerned with persistence or analytics. The nearest thing to an analytics programming language is &lt;a href="http://bandb.blogspot.com/2008/07/gentle-introduction-to-r.html"&gt;R&lt;/a&gt; which is powerful but neither new nor easy to use.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3611511091745125181?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3611511091745125181/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3611511091745125181' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3611511091745125181'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3611511091745125181'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/10/critique-of-sql.html' title='A Critique of SQL'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-512741971283758868</id><published>2010-10-06T23:35:00.000-07:00</published><updated>2010-10-06T23:51:48.940-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><title type='text'>Vertical Pixels are Disappearing</title><content type='html'>The quality of monitors for PCs is going backwards. A few years ago, noticing the rise of the widescreen monitor and fearful that all reasonably proportioned monitors would soon disappear, I bought a Samsung Syncmaster 204B (20.1" screen, 1600x1200 pixels). Last night it popped and stopped working. When I went online to research a replacement monitor, the full gravity of the situation became obvious.&lt;br /&gt;&lt;br /&gt;Not only is it virtually impossible to find a monitor that is not widescreen, almost all monitors that you can buy, whatever the size of their screen, are 1920x1080 pixels. In the years since I bought the 204B, the number of pixels that we get in the vertical direction has shrunk from 1200 to 1080!  Funnily enough, there is a &lt;a href="http://hardware.slashdot.org/story/10/10/06/1522206/Why-Are-We-Losing-Vertical-Pixels"&gt;post&lt;/a&gt; on Slashdot this morning titled "Why are we losing our vertical pixels" about this topic. The post has drawn many more that the usual number of comments.&lt;br /&gt;&lt;br /&gt;For me, the vertical height of the screen is important. I use my computer for reading, writing, programming, editing media and some juggling with numbers. For each activity, having a good height to the screen helps, and width after a certain point does not add much.  A Television uses a 1920x1080 pixels for a full 1080p display. The monitor manufacturers are just giving us monitors made from cheap LCD panels designed for televisions. When I watch TV, I have a much larger screen in another room with more comfortable chairs and more room between me and the screen. Thus, I do not need or want a computer monitor that is expressly designed for watching TV.&lt;br /&gt;&lt;br /&gt;The real problem is that 1920x1080 monitors are so ubiquitous that it is difficult to find anything else. After a lot of searching I only found a couple of alternatives. Apple has a 27" widescreen monitor that is 2560x1440 pixels at a cost of ~$1000, and only works well with some recent Apple systems. Dell has a 20" monitor in their small business section that is 1600x1200 and costs ~$400. However, Dell seems to vary the type of LCD panel that they use between production runs and one type of panel is a lot better than the other. Unfortunately, you do not know which type of panel you are going to get until it arrives at your door. Neither alternative gets me really excited. One thing is certain, technology is supposed to be about progress, and I am not going backwards and accepting less pixels in any dimension for my next monitor.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-512741971283758868?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/512741971283758868/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=512741971283758868' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/512741971283758868'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/512741971283758868'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/10/vertical-pixels-are-disappearing.html' title='Vertical Pixels are Disappearing'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3791979556624118177</id><published>2010-09-30T21:56:00.000-07:00</published><updated>2010-09-30T22:49:16.224-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Media'/><title type='text'>Tablet Aspect Ratios</title><content type='html'>One important issue with tablet computers that is getting little attention is the screen &lt;a href="http://en.wikipedia.org/wiki/Aspect_ratio_%28image%29"&gt;aspect ratio&lt;/a&gt;. Some time ago I wrote about "&lt;a href="http://bandb.blogspot.com/2005/08/aspect-ratio-hell.html"&gt;aspect ratio hell&lt;/a&gt;" while trying to decide how to crop holiday photographs. The answer seems to be that you have to crop each photograph independently for each way the photograph is going to be output or displayed. For photographs, the variety of different aspect ratios is a perplexing problem that has no good answer.&lt;br /&gt;&lt;br /&gt;Tablet computers have the same problem except that the responsibility lies with app developers who need to make their app work well with the aspect ratios of their target platforms. Aspect ratios for a tablet needs to take into consideration that it will be used in both portrait and landscape mode. The iPad has an aspect ratio of 4:3 (AR 1.33...), which is the same as the iPod Classic while the iPhone and iPod touch have an aspect ratio of 3:2 (AR 1.5). Anyone trying to develop apps for Apple products needs to take this difference into account. On the other hand, both Blackberry and Samsung has announced their Android based &lt;a href="http://www.bbleaks.com/2010/09/in-depth-comparison-between-blackberry.html"&gt;tablets&lt;/a&gt; with a 7 inch screen which has an aspect ratio of 128:75 (AR 1.706...), which is close to 16:9 (AR 1.77...).&lt;br /&gt;&lt;br /&gt;When we look to media, television uses 16:9 and most cinema has a higher ratio  like 2.40:1 except for iMax (AR 1.44) which is much squarer. Books and newspaper use a 3:2 ratio (AR 1.5) while magazines tend to be broader with a lower aspect ratio. Frankly, anything with an aspect ratio of more than 3:2 tends to look unnaturally skinny when viewed in portrait mode. A cell phone can get away with a higher aspect ratio because it has to be pocketable, but larger devices meant for viewing both media in both landscape and portrait mode needs to keep its aspect ratio to 3:2 or less. For example, the &lt;a href="http://www.amazon.com/gp/redirect.html?ie=UTF8&amp;amp;location=http%3A%2F%2Fwww.amazon.com%2F%3Fie%3DUTF8%26redirect%3Dtrue&amp;amp;tag=buiandbre-20&amp;amp;linkCode=ur2&amp;amp;camp=1789&amp;amp;creative=390957"&gt;Kindle&lt;/a&gt;&lt;img src="https://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=ur2&amp;amp;o=1" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;, which is mostly used in portrait mode has an aspect ratio of 4:3 (AR 1.33...). From this point of view, the Samsung and Blackberry tablets seem to be designed to be used in landscape mode and not in portrait mode. I hope that other tablet makers do not make the same mistake.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3791979556624118177?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3791979556624118177/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3791979556624118177' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3791979556624118177'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3791979556624118177'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/09/tablet-aspect-ratios.html' title='Tablet Aspect Ratios'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3669257675366392857</id><published>2010-09-04T14:10:00.000-07:00</published><updated>2010-09-04T15:44:47.481-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><category scheme='http://www.blogger.com/atom/ns#' term='Entertainment'/><category scheme='http://www.blogger.com/atom/ns#' term='Media'/><title type='text'>Understanding the iPad</title><content type='html'>Some people still struggle to understand the iPad. When it was first announced, there were shrieks of &lt;a href="http://boingboing.net/2010/04/02/why-i-wont-buy-an-ipad-and-think-you-shouldnt-either.html"&gt;outrage&lt;/a&gt; from techies, complaining that it was not a free and open computer system and so nobody should buy one. Then it came out and was adopted by the millions. Steve Ballmer, CEO of Microsoft, expressed &lt;a href="http://tech.fortune.cnn.com/2010/07/30/steve-ballmer-on-the-ipad-the-transcript/"&gt;dismay&lt;/a&gt; that the iPad is easily outselling any tablet computer that Microsoft and ever had a hand in. More recently an executive from LG &lt;a href="http://online.wsj.com/article/SB10001424052748704476104575439394082294812.html"&gt;told&lt;/a&gt; the Wall Street Journal that they would bring out a Tablet that would be better than the iPad because it would be oriented towards content creation rather than content consumption.&lt;br /&gt;&lt;br /&gt;Then there are many people who get it. For example, Jerry Kaplan, founder of Go Computing, an early slate computer in an &lt;a href="http://www.mercurynews.com/chris-obrien/ci_15580520?nclick_check=1"&gt;interview&lt;/a&gt; with Chris O'Brian of the San Jose Mercury News understood that the iPad is oriented for media consumption as opposed to the more general purpose Go slate computer. My belief is that the iPad is a new category of device that addresses a new market.&lt;br /&gt;&lt;br /&gt;Last year I wrote about &lt;a href="http://bandb.blogspot.com/2009/08/media-convergence.html"&gt;Media Convergence&lt;/a&gt;, the idea that in the past, each type of media was different. Books were bound paper sold by booksellers, video was delivered as movies in movie theaters and broadcast as television, records were vinyl goods sold in record stores and heard over the radio, magazines were sold by booksellers or delivered by mail, newspapers had their own content delivery network to ensure that everybody got the previous days news by the following morning. With the digital revolution, all these different types of media are now the same. They are all just buckets of digital bits that are delivered through the Internet. Given this, the next thing we need are devices for consuming all this media. Audio just needs a device the size of your thumb and headphones, whereas video, books, magazines etc. need a screen that is big enough to see, and that is what the iPad is for.&lt;br /&gt;&lt;br /&gt;When thinking about these things, I find it useful to draw up some requirements and use cases and then see how the offered devices match those requirements. Here is what I want from my Personal Information Appliance (PIA - remember that acronym).&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Light enough that I can lie in bed and read or view media with it. &lt;/li&gt;&lt;li&gt;Instant on, long battery life, able to handle all media types.&lt;/li&gt;&lt;li&gt;Get media without having to plug it into anything else.&lt;/li&gt;&lt;li&gt;A screen large enough to read or view and small enough to make the device portable.&lt;/li&gt;&lt;/ol&gt;So how does the iPad match these requirements? At 1.5 pounds it is a little heavier than most "light" reading, but there are plenty of hardback books that weigh more. For the second requirement, Adobe Flash is the major missing media type, however there is probably an app to do that. As for screen size, we are going to have to resign ourselves to having multiple devices with different screen sizes until they work out the technology to project images directly onto the retina.&lt;br /&gt;&lt;br /&gt;The funny thing is that even although the iPad is speced as a device for  consuming media it turns out to be capable of much more. Computer games  are the newest type of media and the iPad is a great games platform  with a lot of future as Steve Jobs boasted in the recent iPod  announcement &lt;a href="http://www.engadget.com/2010/09/01/live-from-apples-fall-2010-event/"&gt;event&lt;/a&gt;. There are many instances in the business world where  it will be useful, for example in sales and marketing for giving a  presentation or demonstration to an individual. The other day I was  astonished to find my boss using his iPad for email while waiting for  his laptop to be repaired.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3669257675366392857?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3669257675366392857/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3669257675366392857' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3669257675366392857'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3669257675366392857'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/09/understanding-ipad.html' title='Understanding the iPad'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6744827626036761305</id><published>2010-08-31T23:17:00.000-07:00</published><updated>2010-08-31T23:26:39.598-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><category scheme='http://www.blogger.com/atom/ns#' term='Windows'/><title type='text'>Software Update Business Models</title><content type='html'>These days software updates are a fact of life. If we do not keep our software up to date we risk all sorts of horrendous infections and debilitating attacks. Unfortunately, the providers of our software know this and are starting to use software update to make money or at least remind us that they exist. I have done several software updates recently and noticed this in action.&lt;br /&gt;&lt;br /&gt;Adobe just wants to remind me of their presence, so they insist on putting a shortcut to the Adobe Reader on my desktop every time they update. This is relatively benign as it is a matter of a few seconds at most to confirm that it is a shortcut and delete it. Apple is more pushy. I expect to get a new version of iTunes any day now, and I will need to carefully uncheck boxes to ensure that I do not get several applications more than I want. Most insidious is Java, now owned by Oracle. On one system they offered me the Yahoo tool bar, on another system which already had the Yahoo tool bar, they offered me some other software, so they obviously look to see what is installed to guide the offer. Judging by the fact that these offers were for third party software, I am sure that they get some sort of compensation for it.&lt;br /&gt;&lt;br /&gt;Soon we will see advertisements and offers in the installer, and new ways to confuse us. The tactic that always gets me is to require some input that I forget to fill in, then when I go back to fill in this information, all the boxes I so carefully unchecked have been mysteriously filled in again. In a hurry, I just click "Install" not noticing that I am now getting all the extras that I had carefully tried to avoid. It is coming to a computer near you soon.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6744827626036761305?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6744827626036761305/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6744827626036761305' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6744827626036761305'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6744827626036761305'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/08/software-update-business-models.html' title='Software Update Business Models'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6422790277878809794</id><published>2010-08-28T12:30:00.000-07:00</published><updated>2010-08-28T14:49:49.164-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><category scheme='http://www.blogger.com/atom/ns#' term='Web Analytics'/><title type='text'>Mad Skills for Big Data</title><content type='html'>Big Data is a big deal these days, so it was with great interest that we welcomed Brian Dolan to the SDForum Business Intelligence SIG August &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13737"&gt;meeting&lt;/a&gt; to speak on "MAD Skills: New Analysis Practices for Big Data". MAD is an acronym for Magnetic Agile Deep, and as Brian explained, these skills are all important in handling big data. Brian is a mathematician who came to Fox Interactive Media as Lead Analyst. There he had to help the marketing group with deciding how to price and serve advertisements to users. As they had tens of millions of users that they often knew quite a lot about, and served billions of advertisements per day, this was a big data problem. They used a 40 node Greenplum parallel database system and also had access to a 105 node map reduce cluster.&lt;br /&gt;&lt;br /&gt;The presentation started with the three skills. Magnetic, means drawing the analyst in by giving them a free reign over their data and access to use their own methods. At Fox, Brian grappled with a button down DBA to establish his own his own private sandbox where he could access and manipulate his own data. There he could bring in his own data sets, both internal and external. Over time the analysts group established a set of mathematical operations that could be run in parallel over the data in the database system speeding up their analyses by orders of magnitude.&lt;br /&gt;&lt;br /&gt;Agile means analytics that adjust react and learn from your business. Brian talked about the virtuous cycle of analytics, where the analyst first acquires new data to be analyzed, then runs analytics to improve performance and finally the analytics causes business practices to suit. He talked through the issues at each step in the cycle and led us through a case study of audience forecasting at Fox which illustrated problems with sampling and scaling results.&lt;br /&gt;&lt;br /&gt;Deep analytics is about producing more than reports. In fact Brian pointed out that even data mining can concentrate on finding a single answer to a single problem where big analytics has the need to solve millions of problems at the same time. For example, he suggested that statistical density methods may be better at dealing with big analytics than other more focused techniques. Another problem with deep analysis of big data is that, given the volume of data, it is possible to find data that supports almost any conclusion. Brian used the parable of the &lt;a href="http://www.theparableteller.com/2010/07/american-university-professor-resigns.html"&gt;Zen Tea Cup&lt;/a&gt; to illustrate the issue. The analyst needs to be to approach their analysis without preconceived notions or they will just find exactly what they are looking for.&lt;br /&gt;&lt;br /&gt;Of all the topics that came up during the presentation, the one the caused most frissons with the audience was dirty data. Brian's experience has been that cleaning data can lose valuable information and that a good analyst can easily handle dirty data as a part of their analysis. When pressed by an audience member he said "well 'clean' only means that it fits your expectation". As an analyst is looking for the nuggets that do not meet obvious expectations, sanitizing data can lose those very nuggets. The recent trend to load data and then do the cleaning transformations in the database means that the original data is in the database as well as the cleaned data. If that original data is saved, the analyst can do their analysis with either data as they please.&lt;br /&gt;&lt;br /&gt;Mad Skills also refers to the ability to do amazing and unexpected things, especially in motocross motor bike riding. Brian's personal sensibilities were more forged in punk rock, so you could say that he showed us the "kick out the jams" approach to analytics. You can get the presentation from the BI SIG &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Page.viewPage&amp;amp;pageId=620&amp;amp;parentID=483&amp;amp;nodeID=1"&gt;web site&lt;/a&gt;. The original MAD Skills paper was presented at the 2009 VLDB conference and a version of it is available &lt;a href="http://db.cs.berkeley.edu/jmh/papers/madskills-032009.pdf"&gt;online&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6422790277878809794?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6422790277878809794/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6422790277878809794' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6422790277878809794'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6422790277878809794'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/08/mad-skills-for-big-data.html' title='Mad Skills for Big Data'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6697560365593918082</id><published>2010-08-23T23:01:00.000-07:00</published><updated>2010-08-23T23:13:21.350-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><title type='text'>End of Moore's Law</title><content type='html'>The recent announcement that Intel is buying McAfee, the security software company, has the &lt;a href="http://blogs.wsj.com/deals/2010/08/20/intel-mcafee-the-semiconductor-giant-needs-a-new-ceo/"&gt;analysts&lt;/a&gt; and &lt;a href="http://www.businessweek.com/idg/2010-08-20/mcafee-inside-how-intel-bid-could-shake-up-it-security.html"&gt;pundits&lt;/a&gt; &lt;a href="http://bits.blogs.nytimes.com/2010/08/23/a-different-take-on-the-intel-mcafee-deal/"&gt;talking&lt;/a&gt;. The ostensible reason for the deal is that Intel wants the security company to help them add security to their chips. Now, while security is important, I do not believe that is the reason Intel bought McAfee. In my opinion, this purchase signals that Intel sees the coming end of &lt;a href="http://en.wikipedia.org/wiki/Moores_Law"&gt;Moore's Law&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;In 2005, the Computer History Museum &lt;a href="http://bandb.blogspot.com/2005/10/moores-law.html"&gt;celebrated&lt;/a&gt; 40 years of Moore's Law, the technology trend that every 2 years, the number of transistors on a silicon chip, and thus its capabilities doubles. On the stage Gordon Moore told us that throughout the 40 years, "they have always been able to see out about 3 generations of manufacturing technology", where each generation is about 2 years. So Intel can see its technology path for about the next 6 years. At that time Moore told us that they could still see how they were going to carry on Moore's Law for the next three generations.&lt;br /&gt;&lt;br /&gt;Now what would happen if Intel looked 6 years into the future and saw that it was no longer there. That they could see the end of Moore's law and that meant that they would no longer have the ability to create new and more powerful chips to keep their revenue growing. I believe that they would start looking to buy up other profitable companies in related lines of business to diversify their revenue.&lt;br /&gt;&lt;br /&gt;McAfee is a large security software company, its main business is selling security solutions to large enterprises. If Intel had wanted to buy security technology they could have gone out and bought a security start-up with better technology than McAfee for a few hundred million dollars. Instead they are spending an expensive 8 billion dollars on an enterprise security software company. This deal does not make sense for the reasons given, however it does make sense if Intel wants to start buying its way into other lines of business.&lt;br /&gt;&lt;br /&gt;Now there are many reasons that Intel wants diversify their business. Perhaps they see the profitable sales of processor chips disappearing as chips gain so many transistors that they do not know what to do with them. However the most likely reason is that they can see the end of Moore's Law and that it is now time to move on and add some other lines of business.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6697560365593918082?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6697560365593918082/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6697560365593918082' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6697560365593918082'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6697560365593918082'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/08/end-of-moores-law.html' title='End of Moore&apos;s Law'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-45719010643299255</id><published>2010-08-14T12:15:00.000-07:00</published><updated>2010-08-14T12:20:25.693-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Analytics at Work</title><content type='html'>Analytics has become a major driving force for competitive advantage in business. The new book "&lt;a href="http://www.amazon.com/gp/product/1422177696?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=1422177696"&gt;Analytics at Work: Smarter Decisions, Better Results&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=1422177696" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;" by Thomas H. Davenport, Jeanne G. Harris and Robert Morison discusses what analytics can do for a business, how to manage analytics and how to make a business more analytical.&lt;br /&gt;&lt;br /&gt;Analytics at Work has a useful introductory chapter and then divides into two parts. The first part discusses five major aspects of analytics in a business environment. The second part looks at the lifecycle of managing analytics in a business. The organization is good and there is no overlap between the topics in each part, however the order in which the information is presented seems designed to put the reader off.&lt;br /&gt;&lt;br /&gt;The first part starts with a plodding chapter on what needs to be done to get the data organized and related topics, followed by a diffuse chapter called Enterprise. The interesting chapters in this part are the last two chapters. The Targets chapter discusses the important topic of picking targets for analytics. The Analysts chapter discusses how to effectively employ and organize analysts in a large enterprise. Similarly the second part of the book starts with a plodding chapter on how to Embed Analytics in Business Processes, followed by much more inspiring chapters on building an analytical culture, and the need to continually review a business comprehensively as part of an analytics push. If you find yourself stuck reading the book, try skipping to one of the interesting chapters that I have indicated.&lt;br /&gt;&lt;br /&gt;Scattered throughout the book are many useful tools. In the introductory chapter there are the six key questions that an analyst asks. We come back to these questions from several places in the book. Running throughout the book is a five step capability maturity model for judging how analytical an organizations is and showing the path to making the organization more analytical. Each chapter in the first part ends with a discussion on how to take that aspect of the organization through the five steps.&lt;br /&gt;&lt;br /&gt;It is important to understand the target audience. The book is aimed at senior management and executives, particularly in large enterprises. While the book contains many brief case studies as inspiration and it touches on all the important management issues that need to be considered, it does not go into great depth about what analytics is or the specific analytical techniques and how they can be used. This is not a book for analysts, unless they have ambitions to grow their career beyond analytics. I recommend this book to anyone in the target audience who wants to grow their organizations analytics capabilities.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-45719010643299255?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/45719010643299255/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=45719010643299255' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/45719010643299255'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/45719010643299255'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/08/analytics-at-work.html' title='Analytics at Work'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-7423078331756608680</id><published>2010-07-24T18:18:00.000-07:00</published><updated>2010-07-24T19:26:27.679-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Data Management in the Cloud</title><content type='html'>Over the last couple of years, I have seen &lt;a href="http://bandb.blogspot.com/2008/03/developing-on-cloud.html"&gt;several&lt;/a&gt; &lt;a href="http://bandb.blogspot.com/2009/06/databases-in-cloud.html"&gt;presentations&lt;/a&gt; on the computing Cloud and how it is the next big thing. I realized that I still have a lot to learn from Daniel Graham's presentation "&lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13720"&gt;Data Management in the Cloud&lt;/a&gt;" at the July meeting of the Business Intelligence SIG. Dan leads Active Data Warehouse marketing programs for &lt;a href="http://www.teradata.com/"&gt;Teradata&lt;/a&gt;. If you have been living under a rock and do not know what cloud computing is, Wikipedia has a reasonable &lt;a href="http://en.wikipedia.org/wiki/Cloud_computing"&gt;explanation&lt;/a&gt;. Dan distinguished between the public cloud as a rentable computing resource like Amazon's Elastic Computing Service and a private cloud which is your businesses computing resources in a datacenter behind the company firewall which uses virtualization software like VMWare to allow many applications to share hardware.&lt;br /&gt;&lt;br /&gt;The big picture that Dan painted is that cloud computing is coming and that you need to get ready for it. By 2015, 20% of computing resources worldwide will be in the cloud. Start now by getting experience with the cloud to find out what works, what needs to be changed to make it work and what does not work. Teradata has been experimenting with cloud computing and is working with hardware and software vendors like VMWare and Amazon to ensure that Teradata database systems work well in the cloud. Informatica is another example of a software vendor that is working to ensure that their data integration software works well in the cloud and between clouds. NetFlix is an example of a company that has adopted cloud computing and recently &lt;a href="http://www.enterpriseirregulars.com/17539/netflix%E2%80%99s-movie-cloud-is-moving-into-the-amazon-cloud/"&gt;announced&lt;/a&gt; that they were moving all their movie hosting into the Amazon computing cloud. The US Government is the leading user of cloud services having moved much of their computing needs into the cloud.&lt;br /&gt;&lt;br /&gt;Cloud computing uses commodity hardware, which combined with the overhead of virtual machine software will not give you the best performance, however it is "good enough" for most applications. Dan took the well known quote from the movie Forrest Gump and bent it to his needs. “Clouds are like a box of chocolates. You never know what you're gonna get.” There is some high end software that is not suitable for cloud computing, the main problem coming from high IO requirements. The size and capabilities of a cloud computing host is often optimized to run a single instance Oracle database doing OLTP. In practice most applications are less demanding than this.&lt;br /&gt;&lt;br /&gt;There were many other interesting tidbits in the presentation. Here are some examples. It is more expensive to get data out of a cloud than to bring it in. Why is unknown, but something to take into consideration when using a cloud. An interesting application for cloud computing is what Dan called "Workload Isolation". The idea is that when you have partners or consultants who need access to your data it is often preferable to put the data they need in the cloud rather than let them inside your firewall. In all the examples that Dan showed of Business Intelligence applications in the cloud, he talked about a Data Mart with the implication that a full Enterprise Data Warehouse was too large and demanding an application for the cloud for now.&lt;br /&gt;&lt;br /&gt;The slides from the presentation are available at the SDForum Business Intelligence SIG &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Page.ViewPage&amp;amp;PageID=620"&gt;web site&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-7423078331756608680?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/7423078331756608680/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=7423078331756608680' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7423078331756608680'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7423078331756608680'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/07/data-management-in-cloud.html' title='Data Management in the Cloud'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-771590381115405802</id><published>2010-07-15T23:10:00.000-07:00</published><updated>2010-07-16T00:24:16.529-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><title type='text'>Another Angle on the iPhone Woes</title><content type='html'>In &lt;a href="http://www.crunchgear.com/2010/07/14/video-david-lettermans-top-10-signs-youve-purchased-a-bad-iphone-4/"&gt;all&lt;/a&gt; the &lt;a href="http://www.slate.com/id/2260465/"&gt;discussion&lt;/a&gt; over the antenna problems with the new Apple iPhone 4, there is one thing that I have not heard, and that is how few product lines that Apple has. The iPhone 4 is a prime example. It comes in two memory sizes and we are promised a second color real soon now. On the Apple &lt;a href="http://www.apple.com/iphone/"&gt;site&lt;/a&gt; you can still buy the previous generation iPhone 3, but that still makes it 3 models available with another one or two to come. Compare this with &lt;a href="http://na.blackberry.com/eng/devices/"&gt;BlackBerry&lt;/a&gt; which has six ranges and thirteen models. Blackberry is restrained in the number of models it offers compared to the likes of Motorola, Samsung or Nokia.&lt;br /&gt;&lt;br /&gt;The same its true in other areas. With the Mac, Apple has three ranges of laptops, three ranges of desktops and one rackable server. Compared to HP, Dell or Acer this is a ridiculously small number of product lines. Again with the iPod there are 4 product ranges each with a couple of different memory size and and more variation on color.&lt;br /&gt;&lt;br /&gt;There are a number of advantages in having a small number of product lines. Economy of scale will make the product cheaper to manufacture, however by the time you get to millions of devices, the additional advantage is not that great. More important are a brutally strong brand image and a lack of consumer confusion. There is no question about which version of the iPhone to get, the only question is whether you are willing to pay more for the extra memory.&lt;br /&gt;&lt;br /&gt;However, there is one big disadvantage to having a single product line like the iPhone, and that is that you have all your eggs in one basket. If the product should prove to have a flaw, there is no other product line to take up the slack. If a consumer want to buy an iPhone now, they either have to go ahead and take the risk that it might be a lemon or wait until the problem is fixed. They cannot go out and buy the other Apple phone because it does not exist. For a big consumer goods company, Apple has had remarkably few dud products, but their life depends on getting each one right.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-771590381115405802?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/771590381115405802/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=771590381115405802' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/771590381115405802'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/771590381115405802'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/07/another-angle-on-iphone-woes.html' title='Another Angle on the iPhone Woes'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1526362880404853264</id><published>2010-07-05T00:16:00.000-07:00</published><updated>2010-07-05T00:30:39.833-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Windows'/><title type='text'>The HP Tablet and the Elephant</title><content type='html'>Recently HP bought Palm and in the acquisition &lt;a href="http://www.hp.com/hpinfo/newsroom/press/2010/100701xa.html"&gt;press release&lt;/a&gt; announced that they are developing "... webOS based hardware products, from a robust smartphone roadmap to future slate PCs and netbooks". In &lt;a href="http://www.crunchgear.com/2010/07/02/3-lessons-hp-hopefully-learned-from-the-ipad/"&gt;all&lt;/a&gt; the &lt;a href="http://www.informationweek.com/news/mobility/smart_phones/showArticle.jhtml?articleID=224700269"&gt;discussion&lt;/a&gt; of this event, nobody seems to be discussing the elephant in the room, or more correctly, the elephant who is no longer in the room.&lt;br /&gt;&lt;br /&gt;Ten years ago, HP would not have dared announce that it was going produce its own operating system (OS) in competition with the dominant Microsoft Windows OS. Then, most hardware developers had been cowed by Microsoft's aggressive and successful response to any attempt to develop a rival operating system. To give a couple of examples, in the early 90's the &lt;a href="http://en.wikipedia.org/wiki/GO_Corporation"&gt;Go Corporation&lt;/a&gt; had developed its Penpoint OS for handheld computing. Then in 1992, Microsoft announced its own Windows for Pen Computing. Go Corporation faltered, was taken over by AT&amp;amp;T and then the project was shuttered. Another example is the fate of Be Inc. who had developed &lt;a href="http://en.wikipedia.org/wiki/BeOS"&gt;BeOS&lt;/a&gt;, initially to power their own hardware. In 2002, Be Inc. sued Microsoft claiming that Hitachi had been dissuaded from selling PCs loaded with BeOS, and that Compaq had been pressured to not market an Internet appliance in partnership with Be. The case was eventually settled out of court with no admission of liability on Microsoft's part. However by this time Be Inc had admitted defeat and sold its intellectual property to Palm Inc.&lt;br /&gt;&lt;br /&gt;In the late 90's Microsoft was so dominant that no Silicon valley Venture Capital firm would fund a start up that would have the remotest chance of challenging Microsoft in any way. Since then Microsoft seems to have been transformed from a lithe competitor into a stumbling giant. The Vista version of the Windows OS is widely regarded as a failure, and was quickly replaced by Windows 7. While the Windows Mobile OS for smartphones has been around for a long time and gone through several versions, it has been losing market share for some time. Recently Microsoft introduced a new smartphone, the Kin with much ballyhoo, only to &lt;a href="http://www.engadget.com/2010/07/02/life-and-death-of-microsoft-kin-the-inside-story/"&gt;give up&lt;/a&gt; on it six week later. There are plenty other examples of Microsoft's left hand not knowing what the right hand was doing, like the PlayForSure &lt;a href="http://reason.com/archives/2008/07/17/the-day-the-music-dies"&gt;debacle&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;We have come to the point where Microsoft is so crippled by its own self inflicted wounds that one of its most important OEM customers is going to use its own operating system on future slate PCs and netbooks. The elephant is no longer in the room.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1526362880404853264?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1526362880404853264/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1526362880404853264' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1526362880404853264'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1526362880404853264'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/07/hp-tablet-and-elephant.html' title='The HP Tablet and the Elephant'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2038185966391359265</id><published>2010-06-26T14:47:00.000-07:00</published><updated>2010-06-27T13:14:14.789-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><category scheme='http://www.blogger.com/atom/ns#' term='Visualization'/><title type='text'>Winning With Big Data</title><content type='html'>Michael Driscoll gave us Secrets of a Successful Data Scientist at  the June &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13698"&gt;meeting&lt;/a&gt; of the SDForum Business Intelligence SIG in his talk "Winning With Big Data". Michael is founder of a data consultancy &lt;a href="http://dataspora.com/"&gt;Dataspora&lt;/a&gt;, where he has done work on projects ranging to analyzing baseball pitchers through helping cell phone companies understand their customer churn. You can see slides for the talk &lt;a href="http://www.slideshare.net/dataspora/driscoll-bi-sig15jun2010"&gt;here&lt;/a&gt;, and follow Micheal's thoughts in his excellent blog on the Dataspora site.&lt;br /&gt;&lt;br /&gt;After Michael revved up the crowd by giving the Hal Varian &lt;a href="http://www.mckinseyquarterly.com/Strategy/Innovation/Hal_Varian_on_how_the_Web_challenges_managers_2286"&gt;quote&lt;/a&gt; that "... the sexy job in the next ten years will be statisticians", he went through 9 ways to win as a Data Scientist. His first suggestion is to use the right tools. Michael uses a variety of tools including database systems, Hadoop and the R language. Large data takes a long time to process and often we can gain insights by just working with a sample of the data, however you have to be careful when taking a sample to ensure that it makes sense and that the results will scale. Which leads us to the another way to win, which is to know, understand and use statistics.&lt;br /&gt;&lt;br /&gt;Statistics is a field of mathematics that is still developing and it is not easy, however statistics is a core competence of a Data Scientist. It is not enough to do the analysis, the Data Scientist has to be able to present the results and turn them into a compelling story. Both analysis and presentation requires good visualization tools and the knowledge of how to use them.&lt;br /&gt;&lt;br /&gt;To illustrate his ways to win, Michael led us through a specific example of a successful data analysis that he had done. He had been asked by a cell phone company to investigate customer churn. Although he looked at the data in several different ways, his successful analysis went as follows. The starting point was Call Data Record (CDR) which records each call that a customer makes. Cell phone traffic generates billions of CDRs, so Michael first cut the data set down to a more manageable size by just looking at the CDRs for a single city. He then created social graphs between customers that call one another frequently, and was able to show that if one customer dropped service it was a predictor that other customers in that social graph would also leave the service. The study ended with a clever visualization of connected customers leaving the cell phone provider.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2038185966391359265?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2038185966391359265/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2038185966391359265' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2038185966391359265'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2038185966391359265'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/06/winning-with-big-data.html' title='Winning With Big Data'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2456534458479408213</id><published>2010-06-24T22:52:00.000-07:00</published><updated>2010-06-24T23:50:01.977-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Which Cloud Standards Matter?</title><content type='html'>The SDForum Cloud Services SIG June &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13702"&gt;meeting&lt;/a&gt; was a panel session with multiple speakers devoted to the question "Which Cloud Standards Matter?". The answer came through loud an clear as speaker after speaker discussed Open Virtualization Format (&lt;a href="http://en.wikipedia.org/wiki/Open_Virtualization_Format"&gt;OVF&lt;/a&gt;). &lt;span class="appTitle1"&gt;No other standard got more than a mention or so.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;OVF is a container that defines the contents of a virtual machine. It is simply a set of file in a directory and an XML descriptor file. The standard is managed by the  Distributed Management Task  Force (&lt;a href="http://www.dmtf.org/home"&gt;DMTF&lt;/a&gt;). &lt;span class="appOutput"&gt;Panel speaker Priya Ketkar of Abiquo showed OVF being used to move a virtual machine from one cloud service provider &lt;/span&gt;to another. Winston Bumpus, the final panel speaker, is President of the of the DMTF and Director of Director of Standards Architecture for VMWare. He made a convincing case for DMTF and its management of the OVF standard.&lt;br /&gt;&lt;br /&gt;Another panel member James Urquhart of Cisco mentioned several standards including OVF, however he spent considerable time on XMPP, surely the most unlikely standard for cloud computing.  I &lt;a href="http://bandb.blogspot.com/2008/05/pull-dressed-as-push.html"&gt;discussed&lt;/a&gt; XMPP some time ago. It is a standard for exchanging instant messages and Twitter feeds between large service providers. While it is a useful standard I do not see its place in cloud computing. If you can explain how XMPP helps cloud computing, please enlighten me.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2456534458479408213?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2456534458479408213/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2456534458479408213' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2456534458479408213'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2456534458479408213'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/06/which-cloud-standards-matter.html' title='Which Cloud Standards Matter?'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5998122122726870151</id><published>2010-06-13T13:17:00.000-07:00</published><updated>2010-06-13T13:34:12.178-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><title type='text'>Reporting from the Production Database</title><content type='html'>Salesforce.com does their analytics directly out of their production database. For me, this was the interesting story that emerged from the &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13678"&gt;talk&lt;/a&gt; on "Real Time Analytics at Salesforce.com" at the May meeting of the SDForum Business Intelligence SIG. Note that this post is not a report on the meeting, rather it is a reflection on a topic that came up during the meeting. Both my co-chair Paul O'Rorke and SIG member James Downey have written &lt;a href="http://ororke.com/paul/blog/?p=722"&gt;great&lt;/a&gt; &lt;a href="http://cloudofinnovation.com/articles/real-time-analytics-at-salesforce-com/"&gt;summaries&lt;/a&gt; of the meeting.&lt;br /&gt;&lt;br /&gt;Directly reporting from a production database is an issue that comes up from &lt;a href="http://www.dwinfocenter.org/against.html"&gt;time&lt;/a&gt; to time. Deciding on whether to do it is a two step process. The first question is to ask whether it is possible. A database can be oriented to report the current state of affairs or alternatively to contain a record of how we got to the current state of affairs. In practice we need both views, and it is common to have a production database that is oriented to the maintaining the current status and a data warehouse that maintains the historical record. Typically an enterprise has several databases with production information and the historical record is combined in a single reporting data warehouse.&lt;br /&gt;&lt;br /&gt;The tension between the requirements for production and reporting databases shows up in a number of ways. Production needs a fast transaction execution. One way to achieve this is to make the database small, cutting out anything that is not really needed. On the other hand, we want to keep as much information as possible for reporting, so that we can compare this time period with a year ago or maybe even two years ago. Reporting wants a simple database structure like a star schema that makes it straightforward to write ad-hoc queries that that generate good answers. Production databases tend to have more interlinked structures.&lt;br /&gt;&lt;br /&gt;Salesforce.com is in the business of Customer Relationship Management (CRM), where it is useful to keep the  historical record of interactions with each customer. As Salesforce.com has the historical record in their production database, reporting from that database makes perfect sense. In fact much of the impetus for real time data warehousing has come from CRM like applications. One common example is where a business wants to drive call center applications from data in their data warehouse.&lt;br /&gt;&lt;br /&gt;The next question is whether it is a good idea to combine reporting and production queries in the same database. Production queries are short, usually reading a few records and then updating and inserting a few records. Reporting queries are read only, but they are longer running and may touch many records to produce aggregate results. A potential issue is that a longer running reporting query may interfere with production queries and prevent them from doing their job. This is the other major reason for doing reporting from a separate database than the production database.&lt;br /&gt;&lt;br /&gt;The Oracle database used by Salesforce.com has optimistic read locking so that read only queries do not lock out queries that update the database. Also, as came out in the presentation, Salesforce.com has a multi-tenant database where each customer customizes their use of data fields in a different ways. Because of this, they sometimes copy the data out of the big table into a smaller temporary table to transform the data into the form that the customers query expects. Making a copy of the relevant data for further massaging is a common tactic in data reporting tools so this is not unusual. It also gets the reporting data out of the way of production data so they two do not interfere with one another.&lt;br /&gt;&lt;br /&gt;Finally, Salesforce.com is large enough that they can afford a luxury of having a performance team whose sole purpose is to look at queries that take the longest to run or use up the most resources. Any database application requires some performance tuning, however it is especially important when doing reporting from a production database.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5998122122726870151?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5998122122726870151/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5998122122726870151' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5998122122726870151'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5998122122726870151'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/06/reporting-from-production-database.html' title='Reporting from the Production Database'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-8598749069796634473</id><published>2010-06-10T23:16:00.000-07:00</published><updated>2010-06-10T23:59:38.631-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>Google's Got Background</title><content type='html'>Go away for a few days and when I come back, Google looks like Bing. Instead of a restful blank page they had a background picture. Arrgh! Fortunately, it &lt;a href="http://techcrunch.com/2010/06/10/google-kills-its-homepage-background-image-experiment-early/"&gt;lasted&lt;/a&gt; for less than a day, and then we went back to the blank page we knew and loved.&lt;br /&gt;&lt;br /&gt;Actually it is very clever. Firstly it tells the people who might be attracted to Bing because they can customize how the page looks that they can do the same thing with Google. Secondly, and more importantly, it encourages people to &lt;a href="http://bandb.blogspot.com/2006/07/yahoo-business-model.html"&gt;create and log in&lt;/a&gt; to their Google account so that they can customize their Google home page. Google can give you a better search experience when it knows who you are, and it can make more money from the advertisements that are pitched at you when it knows who you are.&lt;br /&gt;&lt;br /&gt;I thought of trying to customize the page to something less distracting when I realized that I would have to give up something of my identity in exchange. On weighing this transaction I decided that what I would give up outweighed the benefit, particularly when the backlash would probably cause the background image to be a short lived experiment.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-8598749069796634473?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/8598749069796634473/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=8598749069796634473' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8598749069796634473'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8598749069796634473'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/06/googles-got-background.html' title='Google&apos;s Got Background'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-185824155861231557</id><published>2010-05-30T14:08:00.000-07:00</published><updated>2010-05-30T14:26:14.086-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Entertainment'/><category scheme='http://www.blogger.com/atom/ns#' term='Media'/><title type='text'>Dancing About Architecture</title><content type='html'>"Writing about music is like dancing about architecture." is one of these quotes that never seem to die. Last week I heard it again while listening to a podcast. The quote is &lt;a href="http://www.paclink.com/%7Eascott/they/tamildaa.htm"&gt;attributed&lt;/a&gt; to many people including Elvis Costello in a 1983 interview, although the origin seems to be older, perhaps much older than that. More interesting, from reading the linked piece is that someone tested dancing about architecture to see if it "was really so strange".&lt;br /&gt;&lt;br /&gt;Whoever said it, they certainly caught the truth that written words cannot adequately capture an aural sensation. A great illustration of this is the 1998 &lt;a href="http://www.npr.org/templates/rundowns/rundown.php?prgId=13&amp;amp;prgDate=02-Sep-2002"&gt;interview&lt;/a&gt;* of Ray Manzarek of The Doors by Terry Gross on the Fresh Aire radio program. Manzarek describes with the help of a keyboard how the Doors worked as a group and how they wrote the song "Light My Fire". A written transcript of this interview would be unintelligible, whereas the audio interview is a revelation. Terry Gross has recorded many interviews with musicians where they play their music and they are all worth hearing.&lt;br /&gt;&lt;br /&gt;Although I spend plenty of time listening, I have never found it useful to read about music. That is not to say that there cannot be good writing about music. In my experience the best has been in fiction, particularly the novels of Ian McEwan. In "&lt;a href="http://www.amazon.com/gp/product/1400076196?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=1400076196"&gt;Saturday&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=1400076196" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;", there are a few pages with a magical description of a rock band performing one song, followed by a meditation on how certain passages of music, and certain performances can affect us to the core.&lt;br /&gt;&lt;br /&gt;A good part of "&lt;a href="http://www.amazon.com/gp/product/0385494246?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0385494246"&gt;Amsterdam&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0385494246" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;" is about the process of composing a classical symphony. While I have never composed music, I do design and write software and there are similarities to the process. I will write more about this another time. Unfortunately the novel and the symphony are cut short by the books annoying ending.&lt;br /&gt;&lt;br /&gt;* Unfortunately, to listen to this piece, you have to have the Real Player. I have it because I have installed the BBC iPlayer to &lt;a href="http://www.bbc.co.uk/radio7/"&gt;listen&lt;/a&gt; to old comedy shows including the Goon Show. If you do not want the Real Player, &lt;a href="http://www.npr.org/templates/story/story.php?storyId=1081248&amp;amp;ps=rs"&gt;here&lt;/a&gt; is a another piece from NPR Music about "Light My Fire" with more palatable download requirements.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-185824155861231557?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/185824155861231557/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=185824155861231557' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/185824155861231557'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/185824155861231557'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/05/dancing-about-architecture.html' title='Dancing About Architecture'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4740747451298037919</id><published>2010-05-19T22:20:00.000-07:00</published><updated>2010-05-19T23:21:56.191-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Apps'/><category scheme='http://www.blogger.com/atom/ns#' term='Media'/><title type='text'>The App Economy</title><content type='html'>The evolution of the App Economy is a marvelous thing to watch. In March I &lt;a href="http://bandb.blogspot.com/2010/03/ipad-app-conundrum.html"&gt;questioned&lt;/a&gt; whether apps for the iPad would develop with the same strength as apps for the iPhone, because more content is accessible through the browser. Jacob Weisberg at Slate &lt;a href="http://www.slate.com/id/2253821/"&gt;discussed&lt;/a&gt; the same thing recently in more depth. He exhorts publishers to beware of getting tangled up with Apple for both monetary and censorship reasons.&lt;br /&gt;&lt;br /&gt;On the other hand, web content is not fully available on the iPad. Steve Jobs has denigrated Flash for being slow, buggy and inefficient, and has sworn that it will never be seen on the iPad. In its place Jobs suggests HTML5. The problem is that HTML5 does not do everything that Flash does. This recent piece on on Apple Insider &lt;a href="http://www.appleinsider.com/articles/10/05/13/hulu_has_no_plans_to_support_ipad_browser_with_html5.html"&gt;explains&lt;/a&gt; the shortcomings of HTML5 and why &lt;a href="http://www.hulu.com/"&gt;Hulu&lt;/a&gt; will not be using it any time soon for their video distribution.&lt;br /&gt;&lt;br /&gt;If Hulu cannot use Flash, then its only alternative is to develop an App, which it is reportedly doing. If Hulu has an App, it may charge a subscription as is being discussed. If Hulu charges a subscription, some of that revenue flows to Apple. By banning a rival development platform, Apple is encouraging the App Economy to its own advantage. Thus it is a pity that so many of the early publishing apps have received such bad reviews.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4740747451298037919?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4740747451298037919/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4740747451298037919' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4740747451298037919'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4740747451298037919'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/05/app-economy.html' title='The App Economy'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-7756009475719284490</id><published>2010-05-09T22:41:00.000-07:00</published><updated>2010-05-16T15:29:46.816-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Google Books Rocks</title><content type='html'>Awesome is too small a word to express what Google Books has achieved. Last year Google &lt;a href="http://books.google.com/googlebooks/agreement/"&gt;settled&lt;/a&gt; the class action law suit that allows them to index out of print books that they had digitized. As part of the settlement they also have to &lt;a href="http://www.thebigmoney.com/blogs/feeling-lucky/2010/05/05/google-books-what-we-know-and-what-we-dont"&gt;sell&lt;/a&gt; the books, which means that Google is now a bookseller. The most important part of the settlement is the Books Right Registry:&lt;br /&gt;&lt;blockquote&gt;"The agreement will also create an independent, not-for-profit Book  Rights Registry to represent authors, publishers and other  rightsholders.  In essence, the Registry will help locate rightsholders  and ensure that they receive the money their works earn under this  agreement.  You can visit the &lt;a href="http://books.google.com/booksrightsholders/"&gt;settlement  administration site&lt;/a&gt;, the Authors Guild or the AAP to learn more  about this important initiative."&lt;br /&gt;&lt;/blockquote&gt;One of the biggest practical issue with Intellectual Property is that it is impossible to use most Intellectual Property because you do not know who owns it, and therefore you do not know who to ask for permission to use it. &lt;a href="http://en.wikipedia.org/wiki/Lawrence_Lessig"&gt;Laurence Lessig&lt;/a&gt; has been talking about this for a long time as a part of his campaign to fix copyright laws. The establishment of a Book Rights Registry goes some way to address the problem with one type of Intellectual Property. Perhaps this will be the beginning of a trend.&lt;br /&gt;&lt;br /&gt;I will write more about this issue another day. For now, here is how I stumbled upon the awesomeness of Google Books. My father would often quote "but tomorrow by the living god, we'll try the game again" after some setback. I knew it was from a poem, but not much more. So the other day, I typed "but tomorrow by the living god" into Google and was astonished by the progress that has been made in search over the last few years. The first entry in the search results linked to a poetry anthology  in Google Books that has the full poem by John Masefield.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://theotherpages.org/poems/masef01.html"&gt;Masefield&lt;/a&gt; is best known for his poems "Sea Fever", "I must go down to the seas again, to the lonely seas and the sky, ..." and "Cargoes", "Quinquireme of Nineveh from distant Ophir, ..." For poem collectors, here is the rarely seen poem TOMORROW by John Masefield:&lt;br /&gt;&lt;blockquote&gt;Oh yesterday the cutting edge drank thirstily and deep,&lt;br /&gt;The upland outlaws ringed us in and herded us as sheep,&lt;br /&gt;They drove us from the stricken field and bayed us into keep;&lt;br /&gt;        But tomorrow&lt;br /&gt;By the living God, we'll try the game again!&lt;br /&gt;&lt;br /&gt;Oh yesterday our little troop was ridden through and through,&lt;br /&gt;Our swaying, tattered pennons fled a broken, beaten few,&lt;br /&gt;And all a summer afternoon, they hunted us and slew;&lt;br /&gt;       But tomorrow&lt;br /&gt;By the living God, we'll try the game again!&lt;br /&gt;&lt;br /&gt;And here upon the turret-top the bale-fires glower red,&lt;br /&gt;The wake-lights burn and drip about our hacked, disfigured dead,&lt;br /&gt;And many a broken heart is here and many a broken head;&lt;br /&gt;       But tomorrow&lt;br /&gt;By the living God, we'll try the game again!&lt;/blockquote&gt;In my original search results, there was a link to Google newspapers where a Virgin Islands Daily News edition from 1950 quotes part of the poem. This time when I did the search, that link did not come up. Instead there was a link to a 1991 zine for Vietnam War vets that quotes a verse of the poem. Who knows what you may find when you do the search.&lt;br /&gt;&lt;blockquote&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-7756009475719284490?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/7756009475719284490/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=7756009475719284490' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7756009475719284490'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7756009475719284490'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/05/google-books-rocks.html' title='Google Books Rocks'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1742752392415763894</id><published>2010-04-30T23:41:00.000-07:00</published><updated>2010-05-01T01:16:41.406-07:00</updated><title type='text'>More on The Big Short</title><content type='html'>When I &lt;a href="http://bandb.blogspot.com/2010/04/big-short.html"&gt;wrote&lt;/a&gt; that Michael Lewis had written an almost uplifting account of the financial crisis in "&lt;a href="http://www.amazon.com/gp/product/0393072231?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0393072231"&gt;The Big Short&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0393072231" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;" by concentrating on some of the winners, I did not consider that he was keeping something back. If you want to find out what he really thinks, read this &lt;a href="http://www.bloomberg.com/apps/news?pid=newsarchive&amp;amp;sid=aUpBrUJ.pIiA"&gt;interview&lt;/a&gt; on Bloomberg.com. He explains many of the choices that he made in the book, like for instance not including John Paulson who has been celebrated in other places for "&lt;a href="http://www.amazon.com/gp/product/0385529910?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0385529910"&gt;The Greatest Trade Ever&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0385529910" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;". He also expresses his outrage over what happened and suggests that part of the reason he left Wall Street in 1989 was because his job was basically "exploiting the idiocy of my customers". It is a long interview and well worth reading in its entirety.&lt;br /&gt;&lt;br /&gt;One issue that Lewis touches on is the fact that shorting the market is supposed to dampen the market and perhaps bring sanity into it, but in this case the structured investment vehicles like synthetic CDOs had the opposite effect of amplifying the market and making the subsequent downfall much worse. The "This American Life" radio show and podcast has a recent &lt;a href="http://www.thisamericanlife.org/radio-archives/episode/405/inside-job"&gt;segment&lt;/a&gt; where they discuss the role of the Magnetar Hedge Fund in creating many several subprime bonds and then making huge sums of money by shorting parts of them. Again well worth hearing.&lt;br /&gt;&lt;br /&gt;Finally, Lewis discusses the poisonous interface between the big Wall Street firms and their customers. If Goldman Sachs is responsible for defrauding its customers as the recent lawsuit suggests, there is the question of why anyone would want to do business with them. The Big Money blog posits that Goldman Sachs is losing its "Social License" to operate in an interesting &lt;a href="http://www.thebigmoney.com/articles/moneybox/2010/04/30/goldmans-social-license"&gt;post&lt;/a&gt;. Given their behavior, this may be a good thing.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1742752392415763894?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1742752392415763894/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1742752392415763894' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1742752392415763894'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1742752392415763894'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/04/thoughts-on-big-short.html' title='More on The Big Short'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1677023449116139301</id><published>2010-04-26T00:10:00.000-07:00</published><updated>2010-04-26T00:16:44.095-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><title type='text'>Business Rules OK!</title><content type='html'>Performance Management Systems collect the data to make decisions but they do not make decisions, they do not ensure that decisions get made or even track the results of the decision so made. James Taylor (no relation) called this the "over-instrumented" enterprise when he spoke to the the April &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13665"&gt;meeting&lt;/a&gt; of the SDForum Business Intelligence SIG on "Performance Management and Agility". James is CEO of &lt;a href="http://www.decisionmanagementsolutions.com/"&gt;Decision Management Solutions&lt;/a&gt; where he consults on using technology to better effect decision making.&lt;br /&gt;&lt;br /&gt;James divides the decisions that an organization makes into three levels: strategic, tactical and operational. He is interested in the operation decisions, the little decisions that are taken all the time. An example of an operational decision is what offer to make to a customer that has called a call center. Every enterprise has their own set of operational decisions, however they have the characteristic that is a large number of them that in aggregate they represent a lot of value, so they are well worth managing.&lt;br /&gt;&lt;br /&gt;Many operational decisions are or should be automated, and there are a set of principles that need to be recognized when decision making is automated. The first principle is that no decision is going to be forever, so the logic for making the decision should not be locked up into something inflexible such as program code. Much better to use a rules based decision engine which allows everybody to see the rules in a language that they can understand. Another principle is that making a decision is a business process and as such should be managed. A good business rules engine allows rules to be tested, measured and perhaps even simulated in action to understand what they are doing and how they can be optimized.&lt;br /&gt;&lt;br /&gt;According to James, the purpose of the information gathered for a Performance Management Systems is to make decisions, so it should be used to make decisions. Too many enterprises are over-instrumented. They have spent all their effort to get and present the data, however they have no measurable ability to turn that data into actions. You can read more about these ideas in the book &lt;a href="http://www.amazon.com/gp/product/0132347962?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0132347962"&gt;Smart Enough Systems: How to Deliver Competitive Advantage by Automating Hidden Decisions&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0132347962" alt="" style="border: medium none ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt; by James Taylor and Neil Raden. You can also read my co-chair Paul O'Rorke's take on the meeting in his &lt;a href="http://ororke.com/paul/blog/?p=326"&gt;blog&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1677023449116139301?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1677023449116139301/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1677023449116139301' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1677023449116139301'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1677023449116139301'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/04/business-rules-ok.html' title='Business Rules OK!'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4622788810338992569</id><published>2010-04-15T23:18:00.000-07:00</published><updated>2010-04-17T01:44:55.754-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><category scheme='http://www.blogger.com/atom/ns#' term='Entertainment'/><category scheme='http://www.blogger.com/atom/ns#' term='Media'/><title type='text'>And the Future of Television is ...</title><content type='html'>Wait for it, wait for it ... Sports! The path to this conclusion requires a couple of steps, so bear with me. The Convergence Consulting Group just published their annual report on "&lt;a target="_blank" href="http://www.convergenceonline.com/downloads/NAMNewContent2010.pdf"&gt;The  Battle for the North American Couch Potato&lt;/a&gt;", and several news sources and commentators immediately picked up on one element of their report. According to Convergence, by the end of 2009, 800,000 US households had cut the cable and that they expected this to double to 1.6 million households by 2011. Cutting cable means cutting subscription TV service like cable or satellite and getting all media content from the Internet, Netflix and over the air. I recently &lt;a href="http://bandb.blogspot.com/2009/12/television-in-trouble.html"&gt;wrote&lt;/a&gt; about Television being in trouble because of increasing subscription fees and less content. There has been a trickle of cable cutters for some time and now Convergence Consulting tells us that the numbers are starting to swell.&lt;br /&gt;&lt;br /&gt;So why should we not cut the cable? It turns out that sports is the only type of content where subscription television offers a compelling product that you cannot easily get if you cut the cable.  I came to this conclusion after skimming through the &lt;a href="http://techcrunch.com/2010/04/13/800000-households-abandoned-tvs-web/#comments"&gt;comments&lt;/a&gt; on TechCrunch post on the cable cutting story. The majority of comments are either from people who have cut the cable and the only thing they miss is sports, or from people who say that they cannot cut the cable because they would not be able to get the sports that they want to see. The fact that sports is the only type of content mentioned is quite startling.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4622788810338992569?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4622788810338992569/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4622788810338992569' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4622788810338992569'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4622788810338992569'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/04/and-future-of-subscription-television.html' title='And the Future of Television is ...'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4132401962702582241</id><published>2010-04-11T15:33:00.000-07:00</published><updated>2010-04-11T16:01:22.450-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><category scheme='http://www.blogger.com/atom/ns#' term='Dismal Science'/><title type='text'>The Big Short</title><content type='html'>What is the best way to write an uplifting book about the recent financial crisis? In his new book, &lt;a href="http://www.amazon.com/gp/product/0393072231?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0393072231"&gt;The Big Short&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0393072231" alt="" style="border: medium none ! important; margin: 0px ! important;" width="1" border="0" height="1" /&gt;, Michael Lewis has taken the approach of following the winners, the people who saw that the bubble would burst and made a huge sums of money by betting on it bursting. Along the way we also meet some of the people who took the other side of the bet and lost big. When you compare these two groups, the losers come across as your average every day kind of person while the winners area strange group of outsiders.&lt;br /&gt;&lt;br /&gt;The book follows three groups of people. There is the Frontpoint Partners hedge fund led by Steve Eisman, who as a stock analyst had been best known for correctly trashing companies that he followed. Then there is Mike Burry, a one eyed doctor living in San Jose who took up investing because it allowed him to get away from having to interact with other people. Finally there are the guys at the garage band hedge fund who turn $100,000 in to more than $100 million and whose main problem is being taken seriously by the big Wall Street Firms.&lt;br /&gt;&lt;br /&gt;Along the way we see scenes of madness from the financial machine that created the bubble. There is the explanation for why a Mexican strawberry picker with no English and an income of $14,000 per year could be loaned every penny he needed to buy a house for $724,000 in Bakersfield California. As it turns out, because he had no debt and no credit history, he has a relatively high credit rating, and that credit rating was needed to balance out the low credit rating of some deadbeat American when their home loans were packaged together with many others into a mortgage bond.&lt;br /&gt;&lt;br /&gt;Another scene is the American Securitization Forum, the annual conference of the of the subprime mortgage industry. In early 2007 the conference takes place in the Venetian Hotel in Las Vegas. The Frontpoint Partners and the garage band hedge fund are both there trying to get more information to substantiate their huge bets against the subprime mortgage market. By this time the cracks were beginning to show. For example, the CEO of the Option One mortgage corporation gave a reassuring speech even although Option One was in trouble because they had made loans to people who could not afford to make even the first payment on the loan. When his partner asks "Who takes out a home loan and doesn't make the the first payment?" Steve Eisman responds "Who the #$%^ lends money to people who can't make the first payment?"&lt;br /&gt;&lt;br /&gt;Michael Lewis knows what he is talking about in writing about Wall Street because he started his career as a bond salesman for Salomon Brothers as hilariously told in his first book, &lt;a href="http://www.amazon.com/gp/product/039333869X?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=039333869X"&gt;Liar's Poker&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=039333869X" alt="" style="border: medium none ! important; margin: 0px ! important;" width="1" border="0" height="1" /&gt;. In The Big Short he successfully continues that tradition. The Big Short is full of interesting characters, amusing insights, clear explanations and some genuine tension as towards the end you wonder whether the hero's will get their big payoff or whether Wall Street will totally collapse taking down everybody with them. Like of his other books, &lt;a href="http://www.amazon.com/gp/product/0393072231?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0393072231"&gt;The Big Short&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0393072231" alt="" style="border: medium none ! important; margin: 0px ! important;" width="1" border="0" height="1" /&gt; is highly recommended.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4132401962702582241?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4132401962702582241/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4132401962702582241' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4132401962702582241'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4132401962702582241'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/04/big-short.html' title='The Big Short'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2157698893763084926</id><published>2010-04-04T14:11:00.000-07:00</published><updated>2010-04-04T15:00:30.223-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Web Analytics'/><title type='text'>Web Analytics 2.0</title><content type='html'>Web analytics is changing fast as we discovered at the March &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13635"&gt;meeting&lt;/a&gt; of the SDForum Business Intelligence SIG. Avinash Kaushik, Analytics Evangelist for Google, spoke on 'Web Analytics 2.0: Rethinking Decision Making in a "2.0" World'. Avinash started off by telling us how he became well known as a web analytics guru. A few years ago he started writing his blog "&lt;a href="http://www.kaushik.net/avinash/"&gt;Occam's Razor&lt;/a&gt;". It soon gathered a large readership and a publisher approached him to write a book. His first book "&lt;a href="http://www.amazon.com/gp/product/0470130652?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0470130652"&gt;Web Analytics: An Hour a Day&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0470130652" alt="" style="border: medium none ! important; margin: 0px ! important;" width="1" border="0" height="1" /&gt;" was a distillation of his blog posts. The book is a best seller even although much of its contents is available for free on the web. His second book "&lt;a href="http://www.amazon.com/gp/product/0470529393?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0470529393"&gt;Web Analytics 2.0&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0470529393" alt="" style="border: medium none ! important; margin: 0px ! important;" width="1" border="0" height="1" /&gt;" came out recently.&lt;br /&gt;&lt;br /&gt;Avinash is an excellent communicator with a strong personal style. One aspect of that style, quite obvious from his blog posts, is the urge to create lists of ideas. For his presentation, Avinash offered us a list of simple ideas on web sites metrics and analytics. Here are some of the ideas that he presented.&lt;br /&gt;&lt;br /&gt;The first idea is simple and direct - Don't Suck. The suckage of a web page can be measured by a metric called &lt;a href="http://en.wikipedia.org/wiki/Bounce_rate"&gt;Bounce&lt;/a&gt;. This is a relatively new metric that we had not previously heard discussed at the Business Intelligence SIG. Bounce measures the users whose experience of the web page and site is, as Avinash put it "I came. I puked. I left." and he showed us some pretty pukey pages that might back up this behavior. A typical analysis is to look at the pages with the highest bounce rate, determine why they cause that behavior and what can be done about it.&lt;br /&gt;&lt;br /&gt;His next idea is Segment or Die. Analytics is about aggregating data to make sense of large datasets, however over-aggregation results in a single number and nothing to compare it with. Segmenting the data gives us a number of data items that we can compare. Avinash showed us a simple example where he took a hospital web site and classified the content into 8 segments and then compared the amount of content against the number of page views in each segment. It was immediately apparent where the effort should go into adding and improving content.&lt;br /&gt;&lt;br /&gt;Analyzing your web logs only tells a part of the story, you also have to worry about what the analytics cannot tell you. Ex Defense Secretary Donald Rumsfeld is &lt;a href="http://www.slate.com/id/2081042/"&gt;infamous&lt;/a&gt; for having talked about "the known knowns, the known unknowns, and the unknown unknowns". The unknown unknowns are the things that you don't even know that you don't know and therefore the thing you should be most worried about. You can start to get a handle on what you do not know by looking at your performance relative to your competitors. This is known as Benchmarking, or in the case of a deep study as Competitive Intelligence. For an example of what can be done, a recent &lt;a href="http://www.kaushik.net/avinash/2010/02/competitive-intelligence-data-sources-best-practices.html"&gt;post&lt;/a&gt; on the Occam's Razor blog discusses 8 sources for Competitive Intelligence data.&lt;br /&gt;&lt;br /&gt;Most web site analytics only looks for the one big conversion from a web site, however there are many other small conversions that are tracked and worth evaluating because there may be hidden value lurking in the long tail. For example, recently Avinash wanted to know what his blog was worth so that he could defend taking time away from the family to write it. After determining a value for each reader, he started adding up all the other micro-conversion like people who subscribe to the RSS feed and advertisements for his books and the non-profit organizations that he supports. Overall he came up with a figure of about $26000 per month. Now Avinash does not make a penny from his blog, so this is notional money that adds to his personal brand value, but that value seems to make the effort of  writing the blog well worth the time spent.&lt;br /&gt;&lt;br /&gt;The next idea is Fail Faster. By this Avinash means do lots of different experiment, many of which will fail, to find out what works. He led us through an example from the Obama Presidential campaign. President Obama raised huge amounts of money from many small donations on his web site. The initial web page worked well. The experiments were to try some variations on the theme. Pages with video, and stirring video at that, did very badly. A simple picture of Obama with his family did a little better than the initial picture, so that one was chosen.&lt;br /&gt;&lt;br /&gt;Avinash showed us this example to make a number of points. The Obama analytics team was tiny. Often the best work is done by a small agile team that has the freedom to experiment. The team used free tools. Avinash believes that that good people are much more important than good tools. His suggestion for dividing up the analytics budget is to spend 90% on people and 10% on tools. Sometimes, a web site design feature starts from a HiPPO (Highest Paid Persons Opinion), which can be destructively bad, and difficult to get around because in all organizations the highest paid persons opinion is taken very seriously. The best way to counter a HiPPO is to show that other ideas work better through the results of experiments that produce hard evidence.&lt;br /&gt;&lt;br /&gt;While some may think that web analytics is a mostly solved problem, Avinash believes we are just starting to figure out what can be done, and that there is plenty of room for more innovation. I will continue to read Occam's Razor to find out where he takes us next.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2157698893763084926?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2157698893763084926/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2157698893763084926' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2157698893763084926'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2157698893763084926'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/04/web-analytics-20.html' title='Web Analytics 2.0'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3213727566820801091</id><published>2010-03-29T23:19:00.000-07:00</published><updated>2010-03-30T00:15:04.549-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Analytics'/><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><category scheme='http://www.blogger.com/atom/ns#' term='Dismal Science'/><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>How Data is Changing the Study of Economics</title><content type='html'>Andrew Leonard in How The World Works recently &lt;a href="http://www.salon.com/technology/how_the_world_works/2010/03/29/computers_and_economics/index.html"&gt;posted&lt;/a&gt; on how computers and the availability of data is changing the study of Economics, and I have to agree. There are a number of forces that are converging to make this happen right now.&lt;br /&gt;&lt;br /&gt;Open Government initiatives are making more data available and the internet makes it easier to get the data. Emerging movements like the &lt;a href="http://www.opendatacommons.org/"&gt;Open Data Commons&lt;/a&gt; emulate the Open Source movement that has made software more available. The &lt;a href="http://radar.oreilly.com/2010/03/truly-open-data.html"&gt;Open Data&lt;/a&gt; movements are concerned to not only make the data more openly available but to make the data better by providing tools to manage it and inspection so that problems with the data can be corrected.&lt;br /&gt;&lt;br /&gt;Web sites to make data available have been around for &lt;a href="http://bandb.blogspot.com/2007/01/visualization-for-all.html"&gt;some time&lt;/a&gt;. For example, &lt;a href="http://numbrary.com/"&gt;Numbary.com&lt;/a&gt; exists to make public data more available. Sites like &lt;a href="http://manyeyes.alphaworks.ibm.com/manyeyes/"&gt;Many Eyes&lt;/a&gt; and &lt;a href="http://www.swivel.com/"&gt;Swivel&lt;/a&gt; allow the user to upload data sets and analyze them. You do not need to find your own data sets because you can go to these sites and play around with data sets that others have uploaded.&lt;br /&gt;&lt;br /&gt;Several popular books have shown us what can be done. The best known example is &lt;a href="http://www.amazon.com/gp/product/B001L4NJ02?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=B001L4NJ02%22%3EFreakonomics%3C/a%3E%3Cimg%20src=%22http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=B001L4NJ02%22%20width=%221%22%20height=%221%22%20border=%220%22%20alt=%22%22%20style=%22border:none%20%21important;%20margin:0px%20%21important;%22%20/%3E"&gt;Freakonomics&lt;/a&gt;, which takes a number of interesting data sets and shows us how they can be analyzed to tell interesting and sometimes quite startling stories. Less flamboyant and more educational is &lt;a href="http://www.amazon.com/gp/product/B0027IQB60?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=B0027IQB60"&gt;Super Crunchers&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=B0027IQB60" alt="" style="border: medium none ! important; margin: 0px ! important;" width="1" border="0" height="1" /&gt;, subtitled "why thinking-by-numbers is the new way to be smart".&lt;br /&gt;&lt;br /&gt;Leonard suggests that the rise of large scale data analysis will displace the old guard who sit in their ivory towered and built model. I have to disagree. The economic model is the explanation of what is happening, the result of analysis. Building a model to explain some aspect of the data or behavior that is brought to light by the data is the result of Analytics. More and better data means that the models will be better, more definitive and most importantly in a fractious discipline, more defensible.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3213727566820801091?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3213727566820801091/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3213727566820801091' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3213727566820801091'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3213727566820801091'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/03/how-data-is-changing-study-of-economics.html' title='How Data is Changing the Study of Economics'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-7458351167757722728</id><published>2010-03-24T22:49:00.000-07:00</published><updated>2010-03-26T10:02:08.940-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><category scheme='http://www.blogger.com/atom/ns#' term='Media'/><title type='text'>The iPad App Conundrum</title><content type='html'>While the iPad looks like it is going to be successful, I think that there is a question over how the market for iPad Apps will develop. Apps for the iPhone are a stunning success that caught many by &lt;a href="http://www.cloudfour.com/iphone-app-store-the-surprise-of-2008/"&gt;surprise&lt;/a&gt;. I recall a post in TechCrunch when the iPhone App-store was about to be introduced. Using numbers that were probably leaked from Apple, the column predicted a substantial and valuable market for iPhone apps. The commenters were full of scorn, suggesting that the idea of anyone paying money for little apps was ridiculous. Apple showed us just how it could be done.&lt;br /&gt;&lt;br /&gt;On the other hand we have the iPad. Apps for the iPhone make sense because of its small screen. Each app makes the best use of the limited screen space for its own dedicated purpose. The iPad has a much larger screen where the browser with scripting and plug-ins can support most of the experience. Therefore the need for specialized apps is less compelling.&lt;br /&gt;&lt;br /&gt;There will still be iPad apps. For example I expect games to do well. However, media companies hoping to monetize their content through subscriptions have a tricky tightrope to walk. Most media companies now make their content available for free on the internet supported by advertising. They cannot afford to withdraw this content completely, but on the other hand if they want to monetize through the iPad, they have to provide a value added experience through their app if they expect to get people to pay. I look forward with interest to see how this all plays out.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Update:&lt;/span&gt; TechCrunch just &lt;a href="http://techcrunch.com/2010/03/26/magazine-cover-ipad/"&gt;posted&lt;/a&gt; on how magazines might work this issue with video on the cover.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-7458351167757722728?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/7458351167757722728/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=7458351167757722728' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7458351167757722728'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7458351167757722728'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/03/ipad-app-conundrum.html' title='The iPad App Conundrum'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4575293370017320889</id><published>2010-03-20T11:56:00.000-07:00</published><updated>2010-03-20T12:24:37.462-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Concurrency'/><category scheme='http://www.blogger.com/atom/ns#' term='programming languages'/><title type='text'>Emerging Languages Face Off</title><content type='html'>New programming languages are popping up all over the place. In March the SDForum Emerging Tech SIG held an "&lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13632"&gt;Emerging Languages Face Off&lt;/a&gt;" to try and make sense out of what is going on. The new languages represented at the meeting were Clojure, Scala and Go, with Ruby as a more established control language. The panel was moderated by Steve Mezak, author of "&lt;a href="http://www.amazon.com/gp/product/0977826805?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0977826805%22%3ESoftware%20without%20Borders%3C/a%3E%3Cimg%20src=%22http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0977826805%22%20width=%221%22%20height=%221%22%20border=%220%22%20alt=%22%22%20style=%22border:none%20%21important;%20margin:0px%20%21important;%22%20/%3E"&gt;Software without Borders&lt;/a&gt;" and CEO of Accelerance, Inc.&lt;br /&gt;&lt;br /&gt;The night kicked off with Amit Rathore, Chief Software Architect at Runa, Inc. speaking for &lt;a href="http://clojure.org/"&gt;Clojure&lt;/a&gt; (pronounced closure). He told us that Clojure is a Lisp that runs on the Java Virtual Machine. Lisps are dynamic, functional languages with automatic garbage collection that have been around since the early 60's. Although Amit told us that Clojure programs contain less parenthesis than Java, the examples he showed us did not seem to bear this out. Clojure does try to control the amount of brackets by using both round and square ones. Apart from list, Clojure provides support for both common data structures like Map and lazy sequences.&lt;br /&gt;&lt;br /&gt;More importantly, Amit introduced what turned out to be a major theme of the evening, support for concurrency. Clojure has a surprisingly sophisticated (read complicated) support for concurrency. The basic idea is that reads are versioned to be lock free while writes are managed to ensure that they overlap properly. Existing data is immutable, updates are made by writing the new data to new locations. Access to shared memory is delimited by transactions that correspond to the program block structure (good). There are 4 ways of referencing data in a transaction that allow the different use cases each to be handled efficiently. If a transaction fails, it is automatically retried until it succeeds. Their implementation of concurrency goes under the banner of &lt;a href="http://en.wikipedia.org/wiki/Software_transactional_memory"&gt;Software Transactional Memory&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;While I will give Clojure two and a half cheers for trying, I am not a great fan of Lisp like languages. Amit touched on one of my bugaboos, the ability to change the meaning of the language by writing code. In my mind, this makes Lisp a low level language as any program requires close reading to discover what it might do. Also, my only practical experience of Lisp is Emacs configuration, a scary mess of global variables and functions.&lt;br /&gt;&lt;br /&gt;Next up, Evan Phoenix, lead developer of Rubinius spoke about &lt;a href="http://www.ruby-lang.org/en/"&gt;Ruby&lt;/a&gt;. Ruby is a dynamic language with automatic garbage collection and is built on the principal of least surprise. While there are several implementations of the language, these implementations have not provided the best performance, so Rubinius is working on a high performance implementation, where more of the implementation is in Ruby itself. The genesis of Ruby was with Lisp and Smalltalk, although the actual language went in a very different direction than these two languages.&lt;br /&gt;&lt;br /&gt;Evan admitted that Ruby does not have great support for concurrency. Ruby will work with green threads, that is cooperative multiprocessing that can exploit a single core. The problem of the "&lt;a href="http://en.wikipedia.org/wiki/Global_Interpreter_Lock"&gt;interpreter lock&lt;/a&gt;" means that a native threads support is not an immediate goal.&lt;br /&gt;&lt;br /&gt;After Evan, David Pollak, author of "&lt;a href="http://www.amazon.com/gp/product/1430219890?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=1430219890%22%3EBeginning%20Scala%3C/a%3E%3Cimg%20src=%22http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=1430219890%22%20width=%221%22%20height=%221%22%20border=%220%22%20alt=%22%22%20style=%22border:none%20%21important;%20margin:0px%20%21important;%22%20/%3E"&gt;Beginning Scala&lt;/a&gt;" and Benevolent Dictator for Life of the Lift Web Framework spoke on &lt;a href="http://www.scala-lang.org/"&gt;Scala&lt;/a&gt;. Scala is a hybrid object oriented/functional language with static typing and  garbage collection that runs on the Java Virtual Machine. In some ways it is like Java with less words, the type inference system eliminates the need to explicitly specify data types most of the time. The goal of Scala is to achieve the speed (and safety) of Java with the conciseness of Ruby.&lt;br /&gt;&lt;br /&gt;Scala has no specific built in support for concurrency, however the Actors paradigm has been successfully implemented on top of Scala. I have written about both &lt;a href="http://bandb.blogspot.com/2008/11/scala-programming-language.html"&gt;Scala&lt;/a&gt; and &lt;a href="http://bandb.blogspot.com/2009/06/actors-and-concurrent-computation.html"&gt;Actors&lt;/a&gt; with Scala previously, so I will say no more here. It is worth noting that both Clojure and Scala get full native threads support and many other benefits from running on the Java Virtual Machine.&lt;br /&gt;&lt;br /&gt;Finally Robert Griesemer spoke about the new &lt;a href="http://golang.org/"&gt;Go&lt;/a&gt; language. Robert is a member of the team developing Go at Google. Go is a statically typed language with automatic garbage collection that compiles down to native hardware. It is a system programming language with control over memory layout of the data. Robert listed the problems with current system programming languages. They are are verbose and repetitious, the data type system gets in the way, build time is slow, particularly compared to dynamic languages, and managing dependencies between modules is difficult. Go aims to be a simple and powerful language with fast tools.&lt;br /&gt;&lt;br /&gt;Go does not have inheritance or type hierarchies and is not object oriented, rather it aims to be more flexible. I was somewhat disturbed by this. Although type hierarchies can be misused, they are useful for helping to organizing large projects. On the other hand, for concurrency, Go offers lightweight processes that communicate via channels, which is a welcome move away from the threads paradigm with all its &lt;a href="http://bandb.blogspot.com/2006/05/taming-concurrency.html"&gt;problems&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The Go language is not quite complete yet. The language designers are still working on providing support for a number of features including generics, operators and exceptions. Robert told us that he expects the language to be complete and mature in 6 months to a year from now. Programming language design is not easy and needs to proceed at its own pace. It is worth remembering that the C++ language spend about 15 years in the state of being almost but not quite finished. We will have to wait and see how long it takes Go to mature.&lt;br /&gt;&lt;br /&gt;Overall there were two themes that emerge from the the new languages presented at the meeting. One is the desire to make programming simpler and more approachable. It is easier to start writing a program in a dynamic language. Both Scala and Go are statically typed languages with the goal of making the programming experience more like writing a program in a dynamic language. The other theme is the need for new programming languages to provide better support for concurrency. In particular language need to support something that is safer and more controlled than threads.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4575293370017320889?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4575293370017320889/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4575293370017320889' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4575293370017320889'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4575293370017320889'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/03/emerging-languages-face-off.html' title='Emerging Languages Face Off'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5996849363283693342</id><published>2010-02-27T18:06:00.000-08:00</published><updated>2010-02-27T20:04:47.332-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Entertainment'/><title type='text'>Avatar Issues</title><content type='html'>After &lt;a href="http://bandb.blogspot.com/2010/01/3d-not.html"&gt;complaining&lt;/a&gt; about the latest revival in 3D movies, I finally went to see Avatar in glorious IMAX 3D. The movie is a stunning spectacle, well worth the price of admission and even hanging around for half an hour to ensure that we got a good seat in a full cinema a couple of months after the films release. However there are issues.&lt;br /&gt;&lt;br /&gt;The first issue is that we had to sit through several trailers for 3D movies that are going to be released in the next few months. Hollywood seems to be more determined to make 3D movies work this time around by building a pipeline of 3D movies for us to go and see. On the other hand, all of the forthcoming 3D movies are animated, meaning that serious people do not need to watch them. To do digital animation properly, the film makers build 3D model of everything in the foreground of a scene, so it is not a huge amount of extra work to throw off a 3D version of an animated movie. Come to think of it, Avatar is mostly an animated movie with slightly different visual aesthetic and much more detail in the models and textures.&lt;br /&gt;&lt;br /&gt;The other issues relate to movie making. 3D is a different medium that requires different film making techniques. For example, limited &lt;a href="http://www.indie-film-making.com/depth-of-field/"&gt;depth of field&lt;/a&gt; is beloved technique for "art house" movies. In 3D all parts of the image need to be in focus all the time because you cannot resolve a 3D image that is out of focus, moreover it is likely to give the viewer a &lt;a href="http://www.shadowlocked.com/index.php?option=com_content&amp;amp;view=article&amp;amp;id=70:how-to-avoid-getting-a-3d-headache-while-watching-avatar&amp;amp;catid=41:feature"&gt;headache&lt;/a&gt;. The rule for 3D movies is &lt;a href="http://en.wikipedia.org/wiki/Group_f/64"&gt;f/64&lt;/a&gt; all the way.&lt;br /&gt;&lt;br /&gt;Another issue is the foreground. 3D movies need to be very careful about composing the picture so the foreground does not protrude. There were only a couple of instances in Avatar where the foreground was a problem. The one noticeable incident had foreground leaves in the forest sticking well into the field of view combined with a camera movement that caused the fronds to move rapidly past the eye in a most distracting way.&lt;br /&gt;&lt;br /&gt;Depth of field and foreground are only a couple of issued with the language of 3D movies, there are many more need to be considered. One example is the &lt;a href="http://www.slashfilm.com/2009/05/31/3d-avatar-vs-2d-avatar-and-the-importance-of-aspect-ratios/"&gt;aspect ratio&lt;/a&gt; that the movie is made in. It seems that Avatar was made so that it could be viewed in several aspects ratios. All in all I question whether it is possible to make a movie that succeeds in both 2D and 3D.&lt;br /&gt;&lt;br /&gt;James Cameron has said that he did not want the 3D effects to distract the viewer from the movie. I still remember the scene in "&lt;a href="http://www.imdb.com/title/tt0071508/"&gt;Flesh for Frankenstein&lt;/a&gt;" where the children go into the belfry, bats fly around and one flies out of the screen and into your face. Fortunately, there were no such scenes in Avatar.&lt;br /&gt;&lt;br /&gt;Finally there is the political angle. &lt;a href="http://reader.creativeminorityreport.com/2009/12/shock-avatar-is-anti-american.html"&gt;Some&lt;/a&gt; &lt;a href="http://cbs2chicago.com/local/Avatar.Balcer.negative.2.1412100.html"&gt;people&lt;/a&gt; &lt;a href="http://www.weeklystandard.com/Content/Public/Articles/000/000/017/350fozta.asp?pg=2"&gt;have&lt;/a&gt; &lt;a href="http://www.nypress.com/article-20710-blue-in-the-face.html"&gt;complained&lt;/a&gt; that Avatar is anti-American. For anyone who thinks that the sub-&lt;a href="http://abcnews.go.com/Blotter/blackwater-senate-committee-contractor-armed-dangerous/story?id=9936807"&gt;Blackwater&lt;/a&gt; corporation mining Pandora, and walking over the natives without so much as a "by your leave", represents the American &lt;a href="http://www.ustraining.com/new/index.asp"&gt;ideal&lt;/a&gt;, I feel very sorry for them.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5996849363283693342?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5996849363283693342/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5996849363283693342' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5996849363283693342'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5996849363283693342'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/02/avatar-issues.html' title='Avatar Issues'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5326345371708579626</id><published>2010-02-14T14:03:00.000-08:00</published><updated>2010-02-14T15:25:28.724-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Dismal Science'/><title type='text'>Predatory Lending</title><content type='html'>While the practice of Predatory Lending is difficult to define, it is easy to see the results, people stuck with high priced loans that they cannot get out of. Today I saw a couple of references to an &lt;a href="http://www.washingtonpost.com/wp-dyn/content/article/2010/02/13/AR2010021303745.html"&gt;article&lt;/a&gt; in the Washington Post that reports more than half of of the mortgages in the US have an interest rate that is greater than 6% while for the last year the mortgage interest rate has been hovering around 5%. This means that over half the homeowners in the US are unable to refinance their mortgages to take advantage of a lower interest rate.&lt;br /&gt;&lt;br /&gt;Many people think of predatory lending as something like &lt;a href="http://www.predatorylendingassociation.com/"&gt;payday lending&lt;/a&gt; to the poor. In practice it happens at all levels of the economy. At the highest level there are the exploits of the "&lt;a href="http://en.wikipedia.org/wiki/Confessions_of_an_Economic_Hit_Man"&gt;economic hit man&lt;/a&gt;" whose job was to enable the selling of economic development loans to poor countries that could ill afford them. Recently, the economic woes of Greece may have been &lt;a href="http://www.calculatedriskblog.com/2010/02/wall-street-and-greece-debt.html"&gt;exacerbated&lt;/a&gt; by clever derivative swaps from Goldman Sachs, designed to hide the true nature of the debt that it owed.&lt;br /&gt;&lt;br /&gt;Predatory housing loans inflamed the housing bubble, sticking the middle classes with high priced loans cleverly disguised with low initial teaser rates. Now the middle classes are stuck with loans that they cannot refinance because their houses are underwater or because they do not have a job, a good enough job or the credit rating for the refinance to go ahead. This is a yet further drag on the economy, already in recession. As the Washington Post article says:&lt;br /&gt;&lt;blockquote&gt;"More refinancing activity would have helped household budgets, but also the national economy because homeowners might have spent some of the extra cash they pocketed, giving the recovery an added lift."&lt;/blockquote&gt;&lt;blockquote&gt;&lt;/blockquote&gt;Homeowners do have an option. As Roger Lowenstein &lt;a href="http://www.nytimes.com/2010/01/10/magazine/10FOB-wwln-t.html"&gt;writes&lt;/a&gt; in the New York Times "Walk Away From Your Mortgage". Part of his argument is that banks are &lt;a href="http://www.bloomberg.com/apps/news?pid=20601206&amp;amp;sid=aLYZhnfoXOSk"&gt;walking away&lt;/a&gt; from their mortgage obligations, and the American people should not feel obliged to behave better than the corporations who sold them their home loans in the first place. I would add to that argument that much of the vitality of the US economy comes from labor mobility. Having people stuck in a home that they cannot sell because their loan is underwater and unable to get a job nearby is yet another drag on the economy.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5326345371708579626?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5326345371708579626/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5326345371708579626' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5326345371708579626'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5326345371708579626'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/02/predatory-lending.html' title='Predatory Lending'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-8636938722361374028</id><published>2010-01-31T22:56:00.000-08:00</published><updated>2010-03-20T12:27:20.526-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Two Kinds of Book Readers</title><content type='html'>To get to the heart of &lt;a href="http://www.amazon.com/gp/product/B0015T963C?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=B0015T963C%22%3EKindle%3C/a%3E%3Cimg%20src=%22http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=B0015T963C%22%20width=%221%22%20height=%221%22%20border=%220%22%20alt=%22%22%20style=%22border:none%20%21important;%20margin:0px%20%21important;%22%20/%3E"&gt;Kindle&lt;/a&gt; - &lt;a href="http://www.apple.com/ipad/"&gt;iPad&lt;/a&gt; issue we need to understand that there are two kinds of books. One kind of book is what I will call "page turners". You start reading the book at page 1 and read sequentially turning the pages until you get to the end. The other type of book I will call "reference". You start reading the book by going to the index or the page of contents, finding the topic you are interested in and proceeding from there. Books such as dictionaries and encyclopedias are organized as indexes to make look up easier.&lt;br /&gt;&lt;br /&gt;Of course, nothing is ever completely black and white. There are plenty of books that fall between the two poles of page turners and reference books. For example, a book of poems may be organized to be read as a page turner, however it will also contain an index of poems and perhaps an index of first lines. Many technical books are written as both a page turner and a reference. Perhaps on first encounter you read it cover to cover and then use it as a reference book. The way we read magazines and newspapers is more like a reference book than a page turner. In a magazine we look at the index to find the article we want to read, or flip through the pages to find something eye catching. Scanning a newspaper is a similar act.&lt;br /&gt;&lt;br /&gt;The point of all this is that the Kindle book reader is a device that is optimized for reading page turners. Navigation is difficult. While the Kindle in theory can be used to read newspapers and magazines, it has not been very successful in this application. The iPad on the other hand has all the touch based navigation mechanisms and also the advantages of the web like hypertext and search engines.&lt;br /&gt;&lt;br /&gt;In summary the Kindle is a gadget for reading a specific type of book and nothing else. As such is is effective and well received by dedicated readers of page turner books, a small but devoted audience. The Kindle is limited by its niche. The iPad is a much more general device that does book reading as one of its many functions. Better navigation and the display means that it can be used to consume all media types, and it is particularly good for navigation heavy media such as newspapers and magazines and reference material.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-8636938722361374028?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/8636938722361374028/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=8636938722361374028' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8636938722361374028'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8636938722361374028'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/01/two-kinds-of-book-readers.html' title='Two Kinds of Book Readers'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2172441229734791572</id><published>2010-01-28T21:40:00.000-08:00</published><updated>2010-01-28T23:22:36.270-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><title type='text'>iPad Excitement</title><content type='html'>After weeks of excitement, build up suspense and leaks, Apple finally got around to announcing their large form iPod Touch, called the &lt;a href="http://www.apple.com/ipad/"&gt;iPad&lt;/a&gt;. Already it is being &lt;a href="http://www.linuxtoday.com/infrastructure/2010012703135NWRLAP"&gt;decried&lt;/a&gt; &lt;a href="http://i.gizmodo.com/5458382/8-things-that-suck-about-the-ipad"&gt;all&lt;/a&gt; over the internet for its many deficiencies. In summary, the complaints come from a bunch of geeks who say that the iPad is too deficient and locked down to be the kind of computer system that they can play with, and anyway they already have enough computer systems, so why do they need another one. The simple answer to these complaints is that nobody is forcing them to buy an iPad.&lt;br /&gt;&lt;br /&gt;I see the iPad as a lifestyle device. It is the thing I have in my living room so that while watching a movie, I can look it up on IMDB, perhaps to find out who is that familiar looking actor in a cameo role. In the kitchen I can use the iPad to look up a recipe, and in the breakfast nook to skim the morning newspaper headlines over a cup of coffee. Last thing at night, I can use the iPad to read blogs, news magazines or a few pages from a book before going to sleep.&lt;br /&gt;&lt;br /&gt;In my opinion, the most interesting thing about the iPad is that Apple is playing to their strength as a systems integrator. The iPad has lightning performance and good battery life because Apple has developed their own processor chip in tune with the software that runs on it. Admittedly the software comes with plenty of restrictions like a lack of multi-tasking. However, as is often the case with Apple, the end result is "&lt;a href="http://en.wikipedia.org/wiki/Minimalism"&gt;less is more&lt;/a&gt;".&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2172441229734791572?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2172441229734791572/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2172441229734791572' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2172441229734791572'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2172441229734791572'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/01/ipad-excitement.html' title='iPad Excitement'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3027970873536911754</id><published>2010-01-13T22:42:00.000-08:00</published><updated>2010-01-13T23:28:11.694-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><title type='text'>3D - Not</title><content type='html'>We are hearing big things about &lt;a href="http://www.huffingtonpost.com/steve-rosenbaum/can-you-see-in-3d-ces-say_b_417988.html"&gt;3D&lt;/a&gt;. The movie &lt;a href="http://www.imdb.com/title/tt0499549/"&gt;Avatar&lt;/a&gt; was recently released in two different 3D systems, and at the CES trade show, several television makers announced that they will have 3D TVs available later this year. Optimism &lt;a href="http://www.pcworld.com/article/186748/3d_tv_separating_the_hope_from_the_hype.html"&gt;abounds&lt;/a&gt;. Someone in the panel for the &lt;a href="http://twit.tv/230"&gt;This Week in Tech&lt;/a&gt; episode broadcast from the CES show in Las Vegas said they managed to watch a 3D demo for several minutes before they got a headache.&lt;br /&gt;&lt;br /&gt;The real problem is that we have been here before. Every few years there is a new 3D system that is going to change the world of visual media. 3D still pictures are 100 years old. The first 3D movies were released in the 1950's. I saw "&lt;a href="http://www.imdb.com/title/tt0071508/"&gt;Flesh for Frankenstein&lt;/a&gt;" in 3D more than 30 years ago. Since then we have had 3 or 4 more 3D hype cycles. The result has always been the same, a lot of huff and puff with no lasting result. I see no reason why 3D should fare any better this time than it has on all the previous occasions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3027970873536911754?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3027970873536911754/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3027970873536911754' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3027970873536911754'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3027970873536911754'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2010/01/3d-not.html' title='3D - Not'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4006563839671109657</id><published>2009-12-31T19:54:00.000-08:00</published><updated>2010-01-01T17:16:14.620-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><category scheme='http://www.blogger.com/atom/ns#' term='Entertainment'/><title type='text'>Television in Trouble</title><content type='html'>There are forces at work that are going to completely change the television business in the US. On the one side there are the major television networks who believe that it is their right to earn large sums of money from television, just because they have in the past. On the other side is the consumer who is tired of the cost and increasingly switching off. In the middle are the cable companies and the cable content companies.&lt;br /&gt;&lt;br /&gt;The consumers are fed up. Television has become unwatchable as the number and length of the commercial breaks has extended. We used to get 48 to 50 minutes of content in each hour, and now we get just 42 minutes. At that rate a season of 24 has less than 17 hours of content. The only way to watch a TV show is to record it on a DVR and watch later, skipping the commercials. Once we get in the habit of watching TV offline, it becomes much easier to cut the cable completely and just watch the web. Between Netflix, Hulu and YouTube there is quite enough stuff to keep entertained.&lt;br /&gt;&lt;br /&gt;Another source of complaint is the constantly rising cost of cable. This is caused by the cable company paying more and more for content. For example, the cable companies pay ESPN $4 per month per viewer to carry the channel, and that fee is rising. Other cable content companies are jumping into the valuable content pool. Ten years ago, the AMC channel used to show very old movies with no commercial breaks, now AMC puts on award winning shows like Mad Men full of commercials. Every cable channel seems to have its must see TV program from the BBC with Top Gear through the the USA network with Burn Notice.&lt;br /&gt;&lt;br /&gt;The cost of cable is about to go up sharply as the major TV networks &lt;a href="http://www.marketwatch.com/story/news-corp-time-warner-cable-extend-talks-2010-01-01?siteid=yhoof"&gt;demand&lt;/a&gt; commensurate fees for their programming from the cable companies. This does not seem like a winning idea in recessionary times. As fees rise more and more people will cut the cable. Either the cost of cable has to stabilize with cuts to content, or TV risks going the way of radio. (I hear that radio still broadcasts, but I do not listen to it, and nobody that I know still listens.) I think that we will see some big changes coming to TV business over the next year or so.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4006563839671109657?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4006563839671109657/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4006563839671109657' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4006563839671109657'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4006563839671109657'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/12/television-in-trouble.html' title='Television in Trouble'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1563542191491291636</id><published>2009-12-27T12:46:00.000-08:00</published><updated>2009-12-27T12:52:47.900-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Kindle Chronicles</title><content type='html'>Amazon &lt;a href="http://phx.corporate-ir.net/phoenix.zhtml?c=176060&amp;amp;p=irol-newsArticle&amp;amp;ID=1369429&amp;amp;highlight="&gt;announced&lt;/a&gt; that "On Christmas Day, for the first time ever, customers purchased more Kindle books than physical books." Well duh! If you want a physical book for Christmas, you have to buy it before Christmas day. On the other hand, every one who received a Kindle as a gift used the wireless book &lt;a href="http://bandb.blogspot.com/2009/10/e-readers-for-all.html"&gt;download&lt;/a&gt; feature to get a book to read on Christmas day. In the very same announcement, Amazon said that "Kindle has become the most gifted item in Amazon's history". Amazon's statement is a nice piece of spin but not a lot more.&lt;br /&gt;&lt;br /&gt;More interesting commentary on electronic book readers is found in the &lt;a href="http://www.thebigmoney.com/features/kindle-chronicles"&gt;Kindle Chronicles&lt;/a&gt; blog. In the early days of emusic, musicians generally stood by their record companies. Book authors seem to be a much more independent lot according to the most recent &lt;a href="http://www.thebigmoney.com/features/kindle-chronicles/2009/12/18/what-we-have-here-failure-communicate"&gt;post&lt;/a&gt; "What We Have Here Is a Failure To Communicate". The publishers have been trying to preserve their position by keeping the prices of ebooks high, while the authors want to be read and the books that sell most on the Kindle are the cheaper ones. Also authors do not see why the publishers should get such a large share of the revenue when there is no cost to their ebook inventory.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1563542191491291636?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1563542191491291636/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1563542191491291636' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1563542191491291636'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1563542191491291636'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/12/kindle-chronicles.html' title='Kindle Chronicles'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-8765242546263862160</id><published>2009-12-26T11:53:00.000-08:00</published><updated>2009-12-26T12:44:04.776-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Security'/><category scheme='http://www.blogger.com/atom/ns#' term='Windows'/><title type='text'>Die AutoRun Die</title><content type='html'>Another year has almost passed and I have not yet ranted about an &lt;a href="http://bandb.blogspot.com/search/label/Windows"&gt;awful&lt;/a&gt;, &lt;a href="http://bandb.blogspot.com/2006/03/hurray.html"&gt;unnecessary&lt;/a&gt; and &lt;a href="http://bandb.blogspot.com/2005/03/more-windows-fuglies.html"&gt;totally annoying&lt;/a&gt; feature of Microsoft Windows, so today I am going to tell you why AutoRun should die.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://en.wikipedia.org/wiki/AutoRun"&gt;AutoRun&lt;/a&gt; is the "feature" where you plug something into your computer and then stuff happens completely out of your control. The thing you plug in might be key drive, camera, iPod or whatever. Last Christmas I won a 4GB SanDisk Cruzer USB key drive as a door prize. When I plugged this horrible little thing into my computer it installed the &lt;a href="http://www.u3.com/"&gt;U3&lt;/a&gt; driver with useless and dangerous functions that I DO NOT WANT! To make matters worse, there is no obvious way to remove the driver or its annoying functionality. To top off the bad behavior, even although I immediately erased the entire contents of the drive, when it was plugged into another computer, it infected that computer with its unwanted drivers as well. I have thrown the key drive away to prevent further damage.&lt;br /&gt;&lt;br /&gt;The combination of USB key drives and AutoRun is a serious computer virus infection vector to the extent that key drives are being &lt;a href="http://www.scmagazineus.com/militarys-ban-of-usb-thumb-drives-highlights-security-risks/article/121326/"&gt;banned&lt;/a&gt; in critical places. However the problem is not just with key drives. I have not disabled AutoRun because I use it two to three times a week to sync my iPod with the latest podcasts. Recently my daughter plugged her iPod into my computer just to recharge the battery. First this caused iTunes to crash, then when I brought it back, it wanted to sync my stuff onto her iPod. My daughter does not want anything of mine on her iPod and I had to jump through hoops to prevent the sync.&lt;br /&gt;&lt;br /&gt;The problem is that iTunes and everyone else has totally bought in to the automagic nonsense of AutoRun behavior. A much simpler, safer and easier to use behavior is to have the user plug in a device and then bring up a program to use the device. Unfortunately the designers(?) of Windows decided to emasculate their users and instead give the device the power to decide what it wants to do. The subliminal message from Microsoft is that you are too stupid to operate you own computer so we are going do it for you, or let anyone else who might have more of a clue do it for you. The consequence of this design is that our computers do not belong to us, but to hackers who exploit these "features" as attack vectors to take control of them.&lt;br /&gt;&lt;br /&gt;If you sit back and think about it, Autorun is obviously ill conceived. The design center is that a single user is logged into their computer and actively using it. What does AutoRun do when nobody has logged into the computer, what does it do when two users are logged in? In the example that I gave above, my daughter plugged her iPod into my computer when two people were logged in and the screen saver had locked both accounts. Of course iTunes crashed, it did not know what to do.&lt;br /&gt;&lt;br /&gt;The iPod and iTunes is particularly annoying because it is unusable without AutoRun. On the iTunes support web site, the top support &lt;a href="http://www.apple.com/support/itunes/"&gt;issue&lt;/a&gt; is "iPod doesn't appear in iTunes" and the second issue is "iPhone does not appear in iTunes". However there is no button in iTunes to go and look for an iPod or iPhone, instead they rely on AutoRun with no easy fall back should that fail.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-8765242546263862160?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/8765242546263862160/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=8765242546263862160' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8765242546263862160'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8765242546263862160'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/12/die-autorun-die.html' title='Die AutoRun Die'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-7569806940593663559</id><published>2009-12-20T11:14:00.000-08:00</published><updated>2009-12-20T12:55:51.206-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><title type='text'>BI Megatrends: Directions for Business Intelligence in 2010</title><content type='html'>Every year &lt;a href="http://www.perceptiveinfo.com/"&gt;David Stodder&lt;/a&gt;, Research Fellow with &lt;a href="http://www.ventanaresearch.com/"&gt;Ventana Research&lt;/a&gt; and editor-at-large with &lt;a href="http://intelligent-enterprise.informationweek.com"&gt;Intelligent Enterprise&lt;/a&gt; writes a column on Business Intelligence Megatrends for the next year. This column looks back at what has happened in the last year and what he expects to happen in the next year. This year David also presented his thoughts to the December &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13606"&gt;meeting&lt;/a&gt; of the SDForum Business Intelligence SIG. David talked about many topics, here I will just cover what he said about the big players.&lt;br /&gt;&lt;br /&gt;Two years ago there was a huge wave of &lt;a href="http://bandb.blogspot.com/2007/11/olap-vendors-capitulate.html"&gt;consolidation&lt;/a&gt; in Business Intelligence when the major independent BI vendors were bought up by IBM, SAP and Oracle, who along with Microsoft are the major enterprise software vendors. In the last year SAP has integrated Business Objects with SAP software to the point that SAP is now ready to threaten Oracle.&lt;br /&gt;&lt;br /&gt;Consolidation has not finished. In 2009, two important mergers were announced. Firstly IBM bought SPSS to round out its analytics capabilities. This move threatens SAS which is in the same market, however SAS is a larger and more successful company that SPSS, also SAS is a private company which means that it does not necessarily need to respond to the pressures to consolidate.&lt;br /&gt;&lt;br /&gt;The other merger is Oracle's offer to buy Sun and the effect that has on Oracle's relationship with HP. HP and Sun are bitter rivals for enterprise hardware, and HP was the launch partner for Oracle Exadata, the high end Oracle database. Now Oracle is pushing Sun hardware with Exadata, leaving HP in the lurch. David pointed out that there are plenty of up and coming companies with scalable database systems for HP to buy up. That list includes Aster Data Systems, GreenPlum, Infobright, ParAccell and Vertica. Expect to see something happen in this area in 2010.&lt;br /&gt;&lt;br /&gt;Of the three major database vendors, Microsoft has the weakest offering, despite SQL Server 2008. However Microsoft does have the advantage of the Excel spreadsheet which remains the most used BI reporting tool. A new version of Excel is due in 2010. Also Microsoft is making a determined push in the direction of collaboration tools with SharePoint. As we heard at the BI SIG November &lt;a href="http://bandb.blogspot.com/2009/11/consumerization-of-it.html"&gt;meeting&lt;/a&gt;, collaboration is an important new direction for enterprise software capabilities.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-7569806940593663559?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/7569806940593663559/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=7569806940593663559' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7569806940593663559'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7569806940593663559'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/12/bi-megatrends-directions-for-business.html' title='BI Megatrends: Directions for Business Intelligence in 2010'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-9091010363794425483</id><published>2009-12-17T22:29:00.000-08:00</published><updated>2009-12-17T22:42:50.270-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><category scheme='http://www.blogger.com/atom/ns#' term='software engineering'/><title type='text'>A Systematic and Platform Independent Approach to Time and Synchronization</title><content type='html'>Managing time and synchronization in any software is complicated. &lt;a href="http://en.wikipedia.org/wiki/Leon_Starr"&gt;Leon Starr&lt;/a&gt;, a leading proponent of building executable models in UML, talked about the issues of modeling time and synchronization to the December &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13561"&gt;meeting&lt;/a&gt; of the SDForum SAM SIG. Leon has spoken to the SAM SIG &lt;a href="http://bandb.blogspot.com/2008/10/articulate-uml-modeling.html"&gt;previously&lt;/a&gt; on executable models. This time he brought along two partners to demonstrate how the the modeling technique can be applied to a broad range of problems.&lt;br /&gt;&lt;br /&gt;Leon started the meeting by talking through five rules for handling time and synchronization. The first and most important rule is that there is no global clock. This models real systems which may consist of many independent entities and allows for the most flexible implementation of the model on a distributed system. In practice, other rules are a consequence of this first rule.&lt;br /&gt;&lt;br /&gt;The next rule is that that the duration of a step is unknown. The rule does not imply that any step can take forever, its purpose is to say that you cannot make assumptions about how long a step may take. In particular, you cannot expect independent steps in the model to somehow interleave themselves in some magical way. The third rule is that busy objects are  never interrupted. This forces the modeller to create a responsive system by building it from many small steps so that an object is always available to handle whatever conditions that it needs to handle.&lt;br /&gt;&lt;br /&gt;The fourth rule is that signals are never lost. This is an interesting rule as it gets to an issue at the heart of building asynchronous systems. The rule implies that there is a handshake between sender and receiver. If the receiver is not ready, the sender may be held up waiting to deliver the signal. Perhaps the signal can be queued, but then there is the problem that the queue is not big enough to handle all the queued signals. In the end you have to build a system that can naturally handle all the events thrown at it, if it is a safety critical system, or that fails gracefully if it is not.&lt;br /&gt;&lt;br /&gt;The fifth rule is that there is no implicit order in the system, except that if one object sends signals to another object, the signals arrive in the order that they were sent. Note that I may have interpolated some of my own experience into this discussion of the rules. If you want to explore further watch &lt;a href="http://www.youtube.com/watch?v=mhFQduidH9M"&gt;this&lt;/a&gt; video on You-Tube and go to Leon's web &lt;a href="http://www.modelint.com/"&gt;site&lt;/a&gt; which leads to many interesting &lt;a href="http://knol.google.com/k/leon-starr/time-and-synchronization-in-executable/2hnjef6cmm97l/7#"&gt;papers&lt;/a&gt; and discussions.&lt;br /&gt;&lt;br /&gt;Next at the meeting, Leland Starr, younger brother of Leon, talked about a web application that he had been the lead on for his employer, TD Ameritrade. The online application is for arranging participants in online webinars. By using the UML modelling technique, he created a model that could be both used to explain how the system would worked to the business sponsors of the project and that could be executed to check that it worked as expected. Leland has a SourceForge &lt;a href="http://sourceforge.net/projects/scalastateexecu/"&gt;project&lt;/a&gt; for his work.&lt;br /&gt;&lt;br /&gt;Finally Andrew Mangogna talked about a very different class of applications. He builds software to control implanted medical devices like heart pacemakers. The two overriding concerns are that the medical device performs its function safely and that it runs for at least 5 years on a single battery charge. Compared to many of the applications that we hear about at the SAM SIG the implantable device applications feel like a throwback to an earlier and simpler age of computing. The applications are written in the C programming language and the code typically occupy 3 to 4 kilobytes. The program data is statically allocated and an application can use from 150 bytes to 500 bytes. Andrew also has a &lt;a href="http://tcl-cm3.sourceforge.net/"&gt;project&lt;/a&gt; on SourceForge for his work.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-9091010363794425483?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/9091010363794425483/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=9091010363794425483' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/9091010363794425483'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/9091010363794425483'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/12/systematic-and-platform-independent.html' title='A Systematic and Platform Independent Approach to Time and Synchronization'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-7767528031668538449</id><published>2009-12-04T23:56:00.000-08:00</published><updated>2009-12-06T21:13:22.769-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>Bandwidth Hogging</title><content type='html'>There are &lt;a href="http://yro.slashdot.org/story/09/12/04/1518221/Hunting-the-Mythical-Bandwidth-Hog"&gt;several&lt;/a&gt; &lt;a href="http://www.fiberevolution.com/2009/12/herding-the-bandwidth-hogs.html"&gt;discussions&lt;/a&gt; going on around the web about bandwidth hogging started by a &lt;a href="http://www.fiberevolution.com/2009/12/whats-a-bandwidth-hog-.html"&gt;post&lt;/a&gt; from Benoit Felten in the &lt;a href="http://www.fiberevolution.com/"&gt;fiberevolution&lt;/a&gt; blog. I wrote about this issue last month in my &lt;a href="http://bandb.blogspot.com/2009/11/vote-for-net-neutrality-now.html"&gt;post&lt;/a&gt; on net neutrality. The basic problem is that when the internet becomes congested the person who has created the most connections wins. Congestion can happen anywhere from your local head end through to a backbone and the backbone interconnects. Felten claims that there is no problem, and given the data, he is willing to do the data crunching to prove it, while others &lt;a href="http://www.nnsquad.org/archives/nnsquad/msg02403.html"&gt;disagree&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The problem is a classic &lt;a href="http://en.wikipedia.org/wiki/Tragedy_of_the_commons"&gt;Tragedy of the Commons&lt;/a&gt;. There is a shared resource, the internet, and some people use more of it than others. That is fine provided that they do not interfere with each other and there is enough resource to go around. As I explained, the problem is that when there are not enough resources to go around, the people who win are the people who create a large number of connections, and these tend to be the people who use the most bandwidth. The point of a torrent client creating a large number of connections is to ensure that that the client gets its "share" of the net whether there is congestion or not. The only viable response is for everyone else to create large numbers of connections to do whatever they want to do, be it download a web page or make a internet phone call. This is undesirable because it can only lead to more congestion and less efficient use of the shared resource.&lt;br /&gt;&lt;br /&gt;There are two parts to a solution. Firstly, the internet service providers have to keep adding more equipment to reduce congestion as internet usage grows. Everything would be fine if there were no congestion. Secondly, we need better algorithms to manage congestion. Penalizing people for using the bandwidth they were sold is not the answer, particularly when that is not the real problem. I have suggested that we should look towards limiting connections. Another thought is to kill the connections of the users with the largest numbers of connections to reduce congestion. Again, I am sure that this will have some unintended consequences.&lt;br /&gt;&lt;br /&gt;The real problem is that unless we can all agree to be good internet citizens and get along, the forces against Net Neutrality may win. Then large companies with deeply vested interests will get to decide who has priority. The recently &lt;a href="http://www.marketwatch.com/story/comcast-to-buy-nbc-stake-as-venture-formed-2009-12-03?siteid=yhoof"&gt;announced&lt;/a&gt; merger of Comcast, a large Internet Service Provider and NBC, a large content provider is exactly the sort of thing that we need to be wary of.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-7767528031668538449?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/7767528031668538449/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=7767528031668538449' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7767528031668538449'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7767528031668538449'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/12/bandwidth-hogging.html' title='Bandwidth Hogging'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-116811721486429797</id><published>2009-11-30T22:18:00.000-08:00</published><updated>2009-12-01T23:40:37.349-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='UI'/><title type='text'>Consumerization of IT</title><content type='html'>A new generation is entering the workforce and they are just not going to take it any more.  Brian Gentile, CEO of &lt;a href="http://www.jaspersoft.com/"&gt;Jaspersoft&lt;/a&gt;, did not say these exact words, but it conveys the intent of the introduction to his talk on "&lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13558"&gt;Consumerization of IT&lt;/a&gt;" at the November meeting of the SDForum Business Intelligence SIG.&lt;br /&gt;&lt;br /&gt;Brian was talking about Generation Y, the first generation to have grown up with computers and instant communication to the extent that they take them for granted. More that that they have expectations about these tools and what they can do with them. Unfortunately, enterprise software has often created systems that are slow, ugly and so difficult to use that it can requires weeks of training. While previous generations have put up with difficult software because they know no better, Gen Y does know that it can be better and is not going to put up with software that does not match up.&lt;br /&gt;&lt;br /&gt;Brian identified 4 characteristics that Business Intelligence, or any enterprise software must provide to meet the next generations expectations. They are:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Elegant presentation.&lt;/li&gt;&lt;li&gt;Easy access to data. &lt;/li&gt;&lt;li&gt;Extensive Customization.&lt;/li&gt;&lt;li&gt;Built In Collaboration.&lt;/li&gt;&lt;/ul&gt;To do collaboration properly, software applications must fit into a collaboration platform rather than have each application provide its own silo'ed collaboration mechanism.&lt;br /&gt;&lt;br /&gt;While I have heard people argue that current Business Intelligence software does not provide a good user experience, Brian put a positive light on this trend, as if the change is for the good and the right thing to do. He is certainly positioning JasperSoft to provide these features and meet the requirements of the next generation.&lt;br /&gt;&lt;br /&gt;Brian ended with another optimistic note. The cost of Information Technology is coming down with cheaper hardware and Open Source software. CIO's can direct the money they save to new innovative projects. A good example of this movement is Ingres talking about "&lt;a href="http://www.ingres.com/about/why-ingres/neit.php"&gt;The New Economics of IT&lt;/a&gt;" as they have been doing for some time.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-116811721486429797?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/116811721486429797/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=116811721486429797' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/116811721486429797'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/116811721486429797'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/11/consumerization-of-it.html' title='Consumerization of IT'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1706805893642693852</id><published>2009-11-15T22:12:00.000-08:00</published><updated>2009-11-15T23:35:17.503-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Institutional Corruption'/><title type='text'>Fight Instutional Corruption</title><content type='html'>Many people think of &lt;a href="http://www.lessig.org/"&gt;Lawrence Lessig&lt;/a&gt; as a radical with an anti-IPR (&lt;a href="http://en.wikipedia.org/wiki/Intellectual_property"&gt;Intellectual Property Rights&lt;/a&gt;) agenda. In practice he is no radical, in fact his mission is to find a defensible middle ground between the Intellectual Property right and the Free Culture left. One of my first blog posts discussed his talk: "&lt;a href="http://bandb.blogspot.com/2004/09/comedy-of-commons.html"&gt;The Comedy of the Commons&lt;/a&gt;".&lt;br /&gt;&lt;br /&gt;Recently, I have been following his &lt;a href="http://phobos.apple.com/WebObjects/MZStore.woa/wa/viewPodcast?id=272091509"&gt;podcasts&lt;/a&gt; which discuss his work on copyright as well as his newer work on institutional corruption. Note, while I find these podcasts interesting, they are not for everybody. They are mostly records of lectures given to various groups. While they are accessible, they are about serious policy matters and as many of the talks are on similar subjects, there tends to be some repetition.&lt;br /&gt;&lt;br /&gt;Recently Lessig has been working on a new project on institutional corruption called &lt;a href="http://change-congress.org/"&gt;Change Congress&lt;/a&gt;. The issue is that large sums of money fed through lobbyists seems to have an undue influence on the lawmakers in Congress and the Senate. The money appears to have such a large influence that lawmakers are voting against the clear wishes of their constituents.&lt;br /&gt;&lt;br /&gt;Change Congress works to highlight these cases of apparent institutional corruption. It is fighting for citizen funded elections so that the lawmakers are not under pressure to raise the money they need to get re-elected. Thus they will be less likely to be swayed by the lobbyists. Go to the &lt;a href="http://change-congress.org/"&gt;site&lt;/a&gt;, see what they have to say, and help them with their mission.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1706805893642693852?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1706805893642693852/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1706805893642693852' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1706805893642693852'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1706805893642693852'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/11/fight-instutional-corruption.html' title='Fight Instutional Corruption'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-7882601237314198542</id><published>2009-11-07T17:51:00.000-08:00</published><updated>2009-11-09T21:57:37.853-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Internet'/><title type='text'>Vote for Net Neutrality Now</title><content type='html'>There is a lot of talk about Net Neutrality now, and the issues are not completely clear cut as I will discuss later. However, there is also a big threat that needs to be addressed right now.&lt;br /&gt;&lt;br /&gt;Bills are being proposed in Washington  with friendly names like "The Internet Freedom Act" whose &lt;a href="http://boingboing.net/2009/10/26/yet-another-reason-m.html"&gt;effect&lt;/a&gt; would be to give more control of the internet to the big ISPs and take away power from the people who are giving us innovative services like Google, Skype and Amazon. While there is also a friendly bill, and the FCC is on the side of Net Neutrality, everyone needs to act to let their congressman know whose side they are on. Visit "&lt;a href="http://www.savetheinternet.com/"&gt;Save the Internet&lt;/a&gt;" and take action &lt;a href="https://secure.freepress.net/site/Advocacy?cmd=display&amp;amp;page=UserAction&amp;amp;id=356"&gt;now&lt;/a&gt;!&lt;br /&gt;&lt;br /&gt;Now that you have done your bit to save the internet, we can talk about the problem. When a node on the internet gets too much traffic, the traffic control algorithm will pick connections at random and kill them. While this is good for keeping the traffic flowing in the aggregate, it tends to favor one class of user over another. The disadvantage user is the one who is using a single connection to browse the web, download a song or make a voice call. The advantaged user is using Bit-torrent which opens a large number of connections to do a massive download. It does not matter if Bit-torrent loses a connection, it has many others to make up for it, but it does matter when a web browser, or Skype conversation loses a connection.&lt;br /&gt;&lt;br /&gt;One solution is to answer greedy software with greedy software. That is every internet application would emulate Bit-torrent and greedily create hundreds of connections in case any one of them gets stomped. While this solution puts all applications on an equal footing, it may strain resources leading to a "&lt;a href="http://bandb.blogspot.com/2004/09/comedy-of-commons.html"&gt;Tragedy of the Commons&lt;/a&gt;", something that should not be in our bright digital future.&lt;br /&gt;&lt;br /&gt;Another solution would be to limit the number  of simultaneous sessions a user can have.  I personally feel that this would be better than having Comcast or AT&amp;amp;T doing deep packet inspection of my packets. However a hard limit on the number of sessions may cause all sorts of problems with software that is not expecting it, leading to deadlock and other bad behavior. Does anyone have any other ideas?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-7882601237314198542?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/7882601237314198542/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=7882601237314198542' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7882601237314198542'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7882601237314198542'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/11/vote-for-net-neutrality-now.html' title='Vote for Net Neutrality Now'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6004803536863799069</id><published>2009-10-23T22:35:00.000-07:00</published><updated>2009-10-23T22:57:46.334-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Database Systems for Analytics</title><content type='html'>The question "what are the attributes of a database system for analytics?" came up during Omer Trajman's &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13532"&gt;talk&lt;/a&gt; to the October meeting of the SDForum Business Intelligence SIG. The talk was titled "The Evolution of BI from Back Office to Business Critical Analytics". In the talk Omer gave several examples of applications that use real time analytics and explained the special attributes of each application. As he runs field engineering for &lt;a href="http://www.vertica.com/"&gt;Vertica&lt;/a&gt;, a Database Systems vendor, I am sure that these examples were based on his experience with Vertica deployments, however Omer was careful to keep his talk vendor neutral.&lt;br /&gt;&lt;br /&gt;So what are the the attributes of a database system for analytics? Omer discussed three attributes. Firstly, an analytics database system cannot use the row level locking that is found in a traditional transaction processing database. The database system needs to provide snapshot isolation that gives a query a consistent view of the data while not preventing other operations like data loads. Having helped implement a system like this in the past, I am in total agreement with Omer.&lt;br /&gt;&lt;br /&gt;The second attribute is the need to allow concurrency between loading and querying data. While this is related to the first attribute, it also comes with its own issues. Bulk loads are more efficient (particularly for a columnar database like Vertica), however, if you want access to the most up to minute data you need to do loads in small increments so that the data is available for query as soon as it is loaded. Managing this balance is difficult and as yet it has not been completely solved. Again, I have worked on this issue in several different systems.&lt;br /&gt;&lt;br /&gt;The final attribute was scaleout, that is the ability to add more processing systems to handle more data and larger queries. We are building systems out of hundreds and thousands of computer systems. Scaleout is vital to effectively use these systems.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6004803536863799069?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6004803536863799069/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6004803536863799069' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6004803536863799069'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6004803536863799069'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/10/database-systems-for-analytics.html' title='Database Systems for Analytics'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3194146494966968616</id><published>2009-10-14T22:27:00.000-07:00</published><updated>2009-10-14T22:46:41.590-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>e-Readers for All</title><content type='html'>The e-Reader market is heating up, just in time for Christmas. Amazon is expanding features and bringing the &lt;a href="http://www2.blogger.com/%3Ca%20href=%22http://www.amazon.com/gp/product/B00154JDAI?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=B00154JDAI%22%3EKindle%3C/a%3E%3Cimg%20src=%22http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=B00154JDAI%22%20width=%221%22%20height=%221%22%20border=%220%22%20alt=%22%22%20style=%22border:none%20%21important;%20margin:0px%20%21important;%22%20/%3E"&gt;Kindle&lt;/a&gt; down the price curve. Today came &lt;a href="http://gizmodo.com/5380942/barnes-and-nobles-e+reader-like-a-kindleiphone-chimera-first-photos-and-details"&gt;word&lt;/a&gt; of the Barnes and Noble e-Reader with two screens, an e-ink screen for reading and a small LCD touch screen for interactivity.&lt;br /&gt;&lt;br /&gt;Also today I caught up with the "&lt;a href="http://twit.tv/twit"&gt;This Week in Tech&lt;/a&gt;" podcast from last weekend where they talked about the real killer features of the Kindle - wireless download and almost unlimited capacity. You can buy as many books as you want any time you want, which leads to buying many more books than you would otherwise buy. Imagine the scene, at dinner with your friends, you discuss books that you have recently read, and bam you buy the books they recommend  there and then. In fact there was even a cry in the podcast "Friends don't let friends use a Kindle while drunk" (for fear that the judgmentally impaired friend may buy too many books).&lt;br /&gt;&lt;br /&gt;When the original Kindle came out there was a tremendous outcry against it with people complaining of gadgets destroying their book reading experience and authors expecting to have their livelihood destroyed just as the music industry has been laid waste. Hint, musicians are doing just as well as they have always done, it is the music moguls with &lt;a href="http://www.songfacts.com/detail.php?id=1389"&gt;their&lt;/a&gt; "by the way, which one is Pink?" who have been laid waste. The Kindle stimulates the publishing industry and makes it much easier to buy books, leading to more sales where author gets a larger slice of the pie.&lt;br /&gt;&lt;br /&gt;Competition is good, particularly for the consumer. The e-Reader needs another generation or so to iron out the kinks and bring the price down to the mass market levels. I am waiting for the $149 price point (iPod Nano) which should come by next Christmas if not sooner.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3194146494966968616?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3194146494966968616/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3194146494966968616' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3194146494966968616'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3194146494966968616'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/10/e-readers-for-all.html' title='e-Readers for All'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-7838993912263851043</id><published>2009-10-03T18:30:00.000-07:00</published><updated>2009-10-03T22:24:41.766-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Search'/><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><category scheme='http://www.blogger.com/atom/ns#' term='UI'/><title type='text'>Search User Experience Innovations</title><content type='html'>Innovations in the Search User Experience was the &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13520"&gt;topic&lt;/a&gt; at the September meeting of the SDForum Search SIG. The distinguished panel from Microsoft, Google and Yahoo was chaired by Safa Rashtchy, a long time analyst and commentator on the Search scene.&lt;br /&gt;&lt;br /&gt;First, Sean Suchter General Manager of Microsoft's Search Technology Center Silicon Valley told us about the latest innovations in Bing. Sean started out with some numbers, showing that the Internet is still growing at a fast pace and that search is growing faster than the Internet in general. They measure their user's experience and see that about a quarter of searches are failures, resulting in an immediate click back. On the other hand, getting on for a half the search queries are further refined meaning that the user is engaged in a search session. Microsoft will recognize these sessions and use them to improve the user experience.&lt;br /&gt;&lt;br /&gt;To simplify the user experience, when they are confident about what a user is searching for, Bing will show one subject on the first page with a number of related links. Sean showed us two examples. Firstly for the search term "target", where they assume the person is looking for the Target chain of stores, they show a complete set of links to Target and shopping related pages with a single link to get other search results that are not related to Target stores. The second example was "ups" where they they only show links related to United Parcel Services and sending parcels on the first page.&lt;br /&gt;&lt;br /&gt;Next up was Johanna Wright, Director of Web Search Product Management at Google. Johanna started off by telling us that that 20% of searches have never been seen before, and that Google is dedicated to serving the long tail of web searches as well as more popular ones. To show us how far the search experience has come in the last few years, she applied the search term "how to tie a tie" to an index that they had saved from 2001, and compared it with what you get today. In 2001 you got a miscellaneous collection of links to sites like "The Indus Entrepreneur" with none about tying ties. Now you get relevant links along with image and video links, a tremendous improvement.&lt;br /&gt;&lt;br /&gt;Johanna talked about how speed is essential to a good user experience. A couple of years ago, they added related links to popular search terms like "target" to reduce the number of steps a user needs to make to get to the page they want. Google continues to work on helping users with query formulation. She showed us the options panel that you access by clicking the "search options" link on a search results page and how it can be used to refine a search.&lt;br /&gt;&lt;br /&gt;Finally, Dr. Larry Cornett, vice president of the Yahoo! Search Consumer Products division spoke. He started by reassuring us that Yahoo! is still in the search business and that if and when the planned combination of Yahoo! Search with Microsoft goes through, they will still provide their own front end and control their user's experience. Yahoo!'s goal has always been to personalize and structure the web. We saw  the new layout for Yahoo! search results in the typical Yahoo! busy style.&lt;br /&gt;&lt;br /&gt;After the demo's, the floor was thrown open to audience questions. Someone asked about natural language support for queries. Sean told the story as he has been in the search business for a long time. In the early days of search, natural language queries were considered important research area. Then the issue went away as providing relevant answers to queries became the dominant problem. Now that giving good answers is under control, natural language queries are making a comeback. Recently Microsoft bought Powerset to help them in this area.&lt;br /&gt;&lt;br /&gt;There were several questions about the sizes of market segments, and growth rates, particularly in the mobile space, to which the panel would not give answers. The audience did manage to uncover the fact that while adult searches are more prevalent than mobile searches,  mobile searches have been growing fast since the introduction of the iPhone and other smartphones.&lt;br /&gt;&lt;br /&gt;Another set of questions related to real time search. All three search engines have been working on improving the speed with which they update their indexes so that they are current. There is still an open question about whether the major search engines embrace real time search or make it a separate option.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-7838993912263851043?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/7838993912263851043/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=7838993912263851043' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7838993912263851043'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/7838993912263851043'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/10/search-user-experience-innovations.html' title='Search User Experience Innovations'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2151646105705298926</id><published>2009-09-07T11:01:00.000-07:00</published><updated>2009-09-07T11:54:45.547-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><title type='text'>Ikea Culture</title><content type='html'>We live in an Ikea world. I like to find excuses to visit the nearest Ikea in Palo Alto to lunch in their cafeteria, eating either a smoked salmon plate or Swedish meatballs with Lingonberry jam. The cafeteria has a great view over the South Bay and the East Bay hills. However the reason for this post is to note that Ikea has been popping up in the conversation all over the world.&lt;br /&gt;&lt;br /&gt;In China, the Ikea stores have become a great success, for the people, if not for Ikea. &lt;a href="http://www.latimes.com/business/la-fi-china-ikea25-2009aug25,0,7736661.story"&gt;This&lt;/a&gt; LA Times story reports that Chinese people are flocking to the local Ikea store, to test the bedding, hang out and eat in the cafeteria, maybe even buy some plates, just not to buy anything big.&lt;br /&gt;&lt;br /&gt;Meanwhile in LA itself, several young aspiring producers have noticed that an Ikea store is just like a movie studio with lots of little well lit sets showing off bedrooms, living rooms, kitchens. Just the place to make a short episode on the cheap. The actors mike up with wireless mikes outside, rush in and take a few shots and then rush out before any employees notice. &lt;a href="http://www.vimeo.com/4921692"&gt;Here&lt;/a&gt; is Ikea Hights, a soap opera, and &lt;a href="http://www.metacafe.com/watch/292691/the_real_world_ikea/"&gt;here&lt;/a&gt; is a send up of The Real World.&lt;br /&gt;&lt;br /&gt;Finally, as reported in the New York Times, there has been &lt;a href="http://www.nytimes.com/2009/09/05/arts/design/05ikea.html?hpw"&gt;outrage&lt;/a&gt; over the decision by Ikea to change the font in their latest catalog from Futura to Verdana. Futura is a well respected modern san-serif font that suits the Ikea style. Verdana is the generic Microsoft version of a san-serif font that comes on every computer with Windows. I am not sure why this is so important, are these people really complaining that Ikea has lowered its standards to encompass the lowest common denominator font?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2151646105705298926?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2151646105705298926/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2151646105705298926' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2151646105705298926'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2151646105705298926'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/09/ikea-culture.html' title='Ikea Culture'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1916224323713827488</id><published>2009-09-02T23:29:00.000-07:00</published><updated>2009-09-02T23:51:02.454-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><title type='text'>Project Voldemort</title><content type='html'>There were three interesting trends exposed in the talk about &lt;a href="http://project-voldemort.com/"&gt;Project Voldemort&lt;/a&gt; at the August &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13482"&gt;meeting&lt;/a&gt; of the SDForum SAM SIG. Firstly Voldemort is another tuple store as opposed to a relational database, the trend that interested me the most. The second trend is implementation of systems described in academic papers. The final trend is to use Open Source as a support mechanism for a large software project. Lets break down each of these trends one at a time. By the way, the &lt;a href="http://www.djcline.com/2009/08/28/aug-26-2009-sdf-linkedin-voldemort/"&gt;presentation&lt;/a&gt; was given by Bhupesh Bansal and Jay Kreps, of &lt;a href="http://www.linkedin.com/"&gt;LinkedIn&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The relational databases have been the reliable store for serious computing for the last 20 years, but recently tuple stores and tuple processing like Map-Reduce have appeared and are starting to challenge the relational database hegemony. In the simplest terms, a tuple store is just a very degenerate relational database. Relations are based on the n-tuple, that is each row in a table contains a number of data items whereas a plain tuple is two data items, a key and a value.&lt;br /&gt;&lt;br /&gt;As Jay Kreps explained, to get a web service application to scale, you need to distribute it over a over a cluster of computer systems, and to make this work with a relational database, you need to denormalize your database. The end point of database denormalization is the plain flat tuple store. Jay Kreps also complained that relational databases are not very good at handling data structures like the graphs of connections found in social networking applications, and semi-structured data like text.&lt;br /&gt;&lt;br /&gt;In my opinion, tuple stores are no better or worse than relational databases at dealing with graphs between tuples. Tuple stores are more flexible for handling semi-structured data, but again this depends on the application (for more, read my &lt;a href="http://bandb.blogspot.com/2008/11/map-reduce-versus-relational-database.html"&gt;comparison&lt;/a&gt; of Map-Reduce with relational databases). Tuple stores are certainly simpler, easier to use, more stable under load and cheaper than a relational database. I will write more about tuple stores at another time.&lt;br /&gt;&lt;br /&gt;The second notable trend is for groups to pick up on systems described in academic papers and just implement them. Voldemort is an implementation of the &lt;a href="http://www.allthingsdistributed.com/2007/10/amazons_dynamo.html"&gt;Amazon Dynamo&lt;/a&gt; system as described in their paper at the ACM Symposium on Operating Systems Principles. We have seen several other examples of this recently. Google released a set of papers about their data processing systems including Map-Reduce, that has created a number of projects to emulate their functionality. I have written about &lt;a href="http://bandb.blogspot.com/2008/12/map-reduce-sort-benchmark.html"&gt;Hadoop&lt;/a&gt; and &lt;a href="http://bandb.blogspot.com/2008/04/hypertable-massively-parallel-database.html"&gt;Hypertable&lt;/a&gt;, two examples, and there are others. These are systems for doing very large scale analytic data processing, while Amazon Dynamo and Voldemort are systems for supporting rapid access to large volumes of data such as is needed to support large and complex web sites.&lt;br /&gt;&lt;br /&gt;The final trend is Open Source as a support model. Voldemort was developed by LinkedIn, a company whose main business is providing a social and business network on the web. Their primary business is social networking, not writing and supporting a lot of complicated software. LinkedIn decided that they needed a tuple store like Amazon Dynamo and, as they could not buy it, they built it. However they decided they wanted help with support, so they released the software as an Open Source project. Now, Voldemort is being used by several organizations and at least half the people working on code are from outside LinkedIn. When Sandeep Giri started the OpenI project, I asked him why he was releasing it as an Open Source project and he gave the same &lt;a href="http://bandb.blogspot.com/2005/07/open-source-bi.html"&gt;reason&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1916224323713827488?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1916224323713827488/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1916224323713827488' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1916224323713827488'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1916224323713827488'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/09/project-voldemort.html' title='Project Voldemort'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3968249156344265281</id><published>2009-08-30T14:11:00.000-07:00</published><updated>2009-08-30T14:19:32.456-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><title type='text'>Augmented Reality</title><content type='html'>Earlier this month there was an explosion of posts and comments on the TechCrunch blog about Apple rejecting the Google Voice application for the iPhone. Michael Arrington wrote a &lt;a href="http://www.techcrunch.com/2009/08/21/the-simple-truth-whats-really-going-on-with-apple-google-att-and-the-fcc/"&gt;post&lt;/a&gt; about how Apple reasons for the rejection were misleading and untrue that got over 400 responses. At the time I did not understand the reasons for the intensity of the comments and responses, particularly in a quiet news month like August. Last week I went to the SDForum Virtual Worlds SIG to hear a talk about &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13481"&gt;Augmented Reality&lt;/a&gt;, and started to appreciate what is going on.&lt;br /&gt;&lt;br /&gt;The Augmented Reality presentation was given by Kari Pulli and Radek Grzeszczuk, researchers at the Nokia Research Center in Palo Alto. What they mean by Augmented Reality is that you point the camera in your smart phone at something and the phone displays more information about what you are looking at. For example, you point the phone at a building and it tells you which building you are looking at with perhaps a link to a map or information about the building. Alternatively, you could point the phone at a book cover and the phone will identify the book and give you links to reviews and a web site where you can buy the book.&lt;br /&gt;&lt;br /&gt;Someone in the audience asked the interesting question "Where do you get your data?" There are many different places to get data. For the demos, the book cover data had been scraped from Amazon.com. But when it comes to data, the elephant in the room is &lt;a href="http://www.google.com/intl/en/corporate/"&gt;Google&lt;/a&gt;, the company that promises to organize the worlds information. To organize the worlds information, they first have to collect it and then they have to have the computer systems and technology to organize it. Google has been busy doing that for many years now.&lt;br /&gt;&lt;br /&gt;The history of mobile appears to be going like this. First came the cellular network companies. They proved themselves incapable of providing anything more than voice and data services, so they are doomed to continue providing nothing but these basic services. As time goes on these services become less differentiated and eventually mere commodities.&lt;br /&gt;&lt;br /&gt;Next come the smart phones providers like Blackberry and Apple. They have opened up the cell phone business model to provide services that their users really want. But they still rely on others for data to run these services. When the data providers get a little too close to core functionality they back off their openness, as Apple has with the Google Voice application.&lt;br /&gt;&lt;br /&gt;The final step is for the data providers to take over, as Google is doing with the Google Android cell phone operating system. This is a play to reduce the devices to mere commodities and put the interesting business where it really belongs, with software and data.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3968249156344265281?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3968249156344265281/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3968249156344265281' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3968249156344265281'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3968249156344265281'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/08/augmented-reality.html' title='Augmented Reality'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1719912718331502134</id><published>2009-08-20T23:22:00.000-07:00</published><updated>2009-08-20T23:32:19.123-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><title type='text'>Media Convergence</title><content type='html'>The digital age has brought an extraordinary convergence of media that I have not seen remarked on anywhere. In the old world, each type of media was manufactured and delivered in its own different way. Movies were printed onto film and shown in movie theaters. Newspapers were printed on  newspaper printing presses and delivered through a content delivery network that ends up with the product being thrown onto driveways in the early morning hours. Books were printed on  book printing presses, bound and delivered through wholesalers to bookstores around the country. Records were printed in record presses, delivered to music wholesalers and then to record stores. Radio and TV were produced in studios, sometimes recorded and sent around the country to be broadcast on local transmitters.&lt;br /&gt;&lt;br /&gt;That has all changed. In the new digital world, each media type has the same underlying form. Spoken words, music, written words, pictures, moving pictures are all buckets of digital bits. While we can still get each type of media in its old form we can also get them all delivered to our computer, cell phone or media player through the internet or the cellular phone network. &lt;br /&gt;&lt;br /&gt;Even the devices we use to consume media are converged. Most of them can handle everything, so lets take an extreme example, the Amazon Kindle book reader. While the primary purpose of the Kindle is a book reader, it also has text to speech and handles audio files so that you can listen to music while reading. It will also display black and white pictures in the 3 common formats. So when you come to list the types of media that a Kindle can handle, it is quicker to say what it cannot do, that is color and moving pictures, than list all the things that it can do.&lt;br /&gt;&lt;br /&gt;This change to digital media is just upon us, so it is going to take some time for all the consequences to shake out. At the moment, there is great wailing and gnashing of teeth from the newspaper industry. Newspapers rely on advertising which always does badly in a recession, but this time they also have to deal with the air  being sucked out of their lungs by internet advertising and free listings. For some time, movie producers have been worried that they may be MP3ed like the music industry. More recently, book publishers have become aware that their business model is targeted and they are starting to behave like deer in the headlights as well.&lt;br /&gt;&lt;br /&gt;These are just media industries and their travails are just the price of doing business in a time of technological change. The interesting question is how it will affect culture. If all types of media are fundamentally equivalent, will our preferences, being unfettered, change? One change is that there is a move towards shorter forms. For example, online journalism is certainly shorter and more punchy than the printed equivalent. This is just one example of one direction that change could go in. I am sure that there will be more consequential changes, so let me know what you think.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1719912718331502134?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1719912718331502134/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1719912718331502134' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1719912718331502134'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1719912718331502134'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/08/media-convergence.html' title='Media Convergence'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1308738591814619325</id><published>2009-08-13T23:30:00.000-07:00</published><updated>2009-08-13T23:54:15.928-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Too Cheap to Meter</title><content type='html'>Last month Malcolm Gladwell wrote a snarky &lt;a href="http://www.newyorker.com/arts/critics/books/2009/07/06/090706crbo_books_gladwell"&gt;review&lt;/a&gt; of Chris Anderson's book &lt;a href="http://www.amazon.com/gp/product/1401322905?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=1401322905"&gt;Free: The Future of a Radical Price&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=1401322905" alt="" style="border: medium none  ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt; for the New Yorker. You will remember that Anderson's last book &lt;a href="http://www.amazon.com/gp/product/B001PTG4BO?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=B001PTG4BO"&gt;The Long Tail&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=B001PTG4BO" alt="" style="border: medium none  ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt; produced a wide range of reactions and "Free" will be no different. I did not like the Gladwell review. He picks up on a lot of little things while missing the big picture. On the other hand the book is somewhat carelessly written so that it is easy to find little things to criticize.&lt;br /&gt;&lt;br /&gt;An example is the discussion of the phrase "too cheap to meter". In the 1950's, Lewis Strauss, then head of the Atomic Energy Commission, predicted that atomic energy would make electricity so cheap to produce that there would be no need for electricity meters. Unfortunately too many people see that phrase and take it to mean that electricity would be free, which is not what Strauss was claiming as I will explain.&lt;br /&gt;&lt;br /&gt;The book "Free" has a chapter called "Too Cheap to Matter" that starts with Strauss's claim and goes on to Moore's Law and other laws of shrinking prices. Anderson seems to imply that electricity could be free, and in a long and rambling footnote still does not get to the point. Gladwell in his review of the book picks up on the implication and castigates Anderson for thinking that electricity could ever be free, using his own words against him.&lt;br /&gt;&lt;br /&gt;To understand Strauss, you need to look at a utility bill, where you will see that the charges come in two parts, a fixed component for providing the service and a variable component which is your actual metered use of the utility. Strauss was claiming that for electricity there would be no need for the variable part, all that would be needed was a fixed part to cover the cost of fixed generator plant, transmission and billing. Sorry Gladwell and Anderson, Lewis Strauss was not trying to say that electricity would ever be so abundant that it would be free.&lt;br /&gt;&lt;br /&gt;It is not unusual for utilities to be unmetered. Here are three examples of unmetered utilities from my personal experience. Firstly, I pay a fixed price for the broadband pipe of my internet service. Secondly, where I grew up, domestic water is plentiful enough that it is not metered, householders pay a fixed price for a 1/2 inch water main connection or somewhat more for a 3/4 inch water main connection. You could say that my garbage is too difficult to meter, so I just pay a fixed price for the weekly emptying of a 32 gallon garbage cart.&lt;br /&gt;&lt;br /&gt;Apart from some writing that seems to imply more than is actually there, I found Chris Anderson's new book to be forward looking and full of familiar arguments. Well recommended. I will write more on the subject.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1308738591814619325?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1308738591814619325/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1308738591814619325' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1308738591814619325'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1308738591814619325'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/08/too-cheap-to-meter.html' title='Too Cheap to Meter'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3099886610756465139</id><published>2009-07-26T22:57:00.001-07:00</published><updated>2009-07-26T23:41:07.782-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><title type='text'>Twittering Foodies</title><content type='html'>Given these difficult economic times, the latest trend in San Francisco dining is the unresturant, &lt;a href="http://www.sanfranmag.com/story/fork-road"&gt;according&lt;/a&gt; to San Francisco Magazine. That is a posh way of describing eating from a food cart or truck. For example: &lt;a href="http://spenceronthego.com/home.html"&gt;Spenser-On-The-Go&lt;/a&gt; serves Caper Braised Skate Cheeks or Frogs Legs and Curry from a converted Taco truck; Boccalone serves exquisite pulled pork sandwiches from a bicycle; the Creme Brulee Cart and Magic Curry Kart are just street carts.&lt;br /&gt;&lt;br /&gt;As the vendors come and go and many of them are not properly licensed, the only way to find out where they are going to be serving is to follow them on Twitter. At last! a purpose for Twitter, if you are a committed foodie. As I have not quite gotten to the Escargot Puffs level yet, I have not yet joined Twitter, although I can see a glimmer of hope. On the other hand David Letterman is still firmly in the camp that twitter is a colossal waste of time as this hilarious &lt;a href="http://blog.twitter.com/2009/07/letterman-discovers-twitter-via-spacey.html"&gt;segment&lt;/a&gt; with Kevin Spacey shows.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3099886610756465139?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3099886610756465139/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3099886610756465139' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3099886610756465139'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3099886610756465139'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/07/twittering-foodies.html' title='Twittering Foodies'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3121068098516187494</id><published>2009-07-26T00:16:00.000-07:00</published><updated>2009-07-26T01:19:25.678-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><title type='text'>LinkedIn's Data</title><content type='html'>LinkedIn has an extraordinary data resource. They have more than 40 million members and a complete job history of each member, in some cases going back 30 or 40 years. DJ Patil, Chief Scientist and Senior Director of Product Analytics at &lt;a href="http://www.linkedin.com/companies/linkedin"&gt;LinkedIn&lt;/a&gt; showed us some examples of their data when he spoke to a packed meeting of the SDForum Business Intelligence SIG on "&lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13425"&gt;The Analytics Behind LinkedIn&lt;/a&gt;" last week. Paul O'Rorke has written an excellent &lt;a href="http://ororke.com/paul/blog/2009/07/21/analytics-behind-linkedin/#more-225"&gt;account&lt;/a&gt; of the meeting and here I am just adding my impression to that record.&lt;br /&gt;&lt;br /&gt;DJ believes in the growing importance of the "data analyst" as a profession. He backed up that belief with some hard data when he shows us the growing importance of the job title over the last 35 years. Up to the mid 90's the appearance of that job title as a percentage of all job titles was flat, but since then it has been growing at a steady pace. As an aside, DJ told us that they use the Amazon &lt;a href="http://en.wikipedia.org/wiki/Amazon_Mechanical_Turk"&gt;Mechanical Turk&lt;/a&gt; service to do data cleansing of things like job titles. This is the first time I have heard of the service being used for this purpose.&lt;br /&gt;&lt;br /&gt;We were shown other interesting examples of LinkedIn analytics including the change in the top five job titles over the Dot Com bust and an excellent display of the volume of cross country links between LinkedIn members. The big problem with this data is that we cannot have access to it because it is private to LinkedIn and they will keep it private to protect the privacy of their members.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3121068098516187494?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3121068098516187494/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3121068098516187494' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3121068098516187494'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3121068098516187494'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/07/linkedins-data.html' title='LinkedIn&apos;s Data'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4726715550588842700</id><published>2009-07-11T11:47:00.000-07:00</published><updated>2009-07-12T13:06:24.193-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Graphs That Suck</title><content type='html'>Many years ago in the early days of the web, I learned about web site design by reading "&lt;a href="http://www.webpagesthatsuck.com/"&gt;Web Pages That Suck&lt;/a&gt;: Learn Good Design by Looking at Bad Design". It is a delightfully easy beginner level crawl through web site design, filled with examples ranging from excellent to awful with a capital 'A'. I would recommend the &lt;a href="http://www.amazon.com/gp/product/078212187X?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=078212187X"&gt;book&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=078212187X" alt="" style="border: medium none  ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt; today except that the examples that make up the bulk of the book are way out of date.&lt;br /&gt;&lt;br /&gt;For Business Intelligence the equivalent would be a book called something like "Graphs that Suck", and Stephen Few's Perceptual Edge blog is a good place to find examples of this genre. Recently they posted a &lt;a href="http://www.perceptualedge.com/blog/?p=533"&gt;spectacularly bad&lt;/a&gt; example, a pie chart put out by Business Objects to promote a user conference. I will not repeat the critique, however I will say that if this is an example of what Business Objects thinks their software should be used for, I would be leery of using it!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4726715550588842700?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4726715550588842700/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4726715550588842700' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4726715550588842700'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4726715550588842700'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/07/graphs-that-suck.html' title='Graphs That Suck'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3014064560471111616</id><published>2009-07-03T11:46:00.000-07:00</published><updated>2009-07-03T13:14:38.070-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><title type='text'>Musician Uses Twitter to Her Advantage, Shock Horror Probe</title><content type='html'>Technology is turning the music business upside down, like any other media business. Some people embrace the change and some people decry it. When I read a post like &lt;a href="http://mikeking.berkleemusicblogs.com/2009/06/23/how-an-indie-musician-can-make-19000-in-10-hours-using-twitter/"&gt;this one&lt;/a&gt; about using Twitter to make money, I &lt;a href="http://bandb.blogspot.com/2007/10/future-of-music-revisited_04.html"&gt;always read&lt;/a&gt; the comments. Whether the post is at the Berklee School of Music or &lt;a href="http://www.techcrunch.com/2007/10/04/the-inevitable-march-of-recorded-music-towards-free/"&gt;TechCrunch&lt;/a&gt;, the range of responses is wide and consistent. Some commenters accept the new world and cheer it on, while others complain bitterly. Typical complaints range from: "I cannot do that because I do not have any fans" through "people should respect copyright and give me the money I am due" to "the record company put you there so you should give it all back to them".&lt;br /&gt;&lt;br /&gt;The most ridiculous response is the complaint that a musician who spends time developing their fan base is wasting time that could be better spend on creative activities. The point of the Amanda Palmer post is that if you are properly organized, it does not take a lot of time or effort to keep in contact with your fans, particularly when using new instant communication tools like Twitter.&lt;br /&gt;&lt;br /&gt;Technology changes. Music is no longer distributed as sheets of paper or by stamping it on 5, 7 or 12 inch pieces of plastic. The business model must change with the times.&lt;br /&gt;&lt;blockquote&gt;The moving finger [of technology change] writes; and having writ,&lt;br /&gt;Moves on: nor all your piety nor wit&lt;br /&gt;Shall lure it back to cancel half a line,&lt;br /&gt;Nor all your tears wash out a word of it.&lt;/blockquote&gt;HT to &lt;a href="http://bloggershepherd.blogspot.com/"&gt;Roger&lt;/a&gt; for the Berklee post.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3014064560471111616?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3014064560471111616/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3014064560471111616' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3014064560471111616'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3014064560471111616'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/07/musician-uses-twitter-to-her-advantage.html' title='Musician Uses Twitter to Her Advantage, Shock Horror Probe'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5723231740432298550</id><published>2009-06-30T23:18:00.000-07:00</published><updated>2009-06-30T23:59:49.800-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Concurrency'/><category scheme='http://www.blogger.com/atom/ns#' term='programming languages'/><category scheme='http://www.blogger.com/atom/ns#' term='programming'/><title type='text'>Actors and Concurrent Computation</title><content type='html'>Carl Hewitt was in fine form when he spoke about the Actor model of concurrent computation at the SDForum SAM SIG &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13413"&gt;meeting&lt;/a&gt; on "Actor Architectures for Concurrent Computation". The meeting was a panel with three speakers. &lt;a href="http://carlhewitt.info/"&gt;Carl Hewitt&lt;/a&gt;, Emeritus Professor in Electrical Engineering and Computer Science at the Massachusetts Institute of Technology. Hewitt and his students invented Actors in the 1970's. Frank Sommers of &lt;a href="http://www.artima.com/index.jsp"&gt;Artima Software&lt;/a&gt;, is an active writer in the area of information technology and computing and is currently writing a book on Actors in the Scala programming language. Robey Pointer is a software engineer at &lt;a href="http://twitter.com/"&gt;Twitter&lt;/a&gt;, and is an &lt;a href="http://bandb.blogspot.com/2009/04/ruby-versus-scala.html"&gt;enthusiastic&lt;/a&gt; user of Scala and Actors at Twitter. Bill Venners is president of Artima Software and an author of a &lt;a href="http://www.artima.com/shop/programming_in_scala"&gt;book&lt;/a&gt; on Scala moderated the proceedings.&lt;br /&gt;&lt;br /&gt;Hewitt had retired when the advent of multi-core processors and distributed parallelism renewed interest in the Actor model. He has come out of retirement and is currently visiting Stanford. Hewitt described the genesis of the Actor methodology in the Alan Kay's Smalltalk 72, where every object was autonomous and you acted on an object by sending it messages. Later versions of Smalltalk moves in a different direction. The most important aspect of the Actor model is that it decouples the sender from the communications. In practice this allows the Scala implementation to scale to millions of Actors engaged in concurrent communication (more on this later).&lt;br /&gt;&lt;br /&gt;Hewitt spoke with great flourish on a number of other topics, including his determination that the &lt;a href="http://en.wikipedia.org/wiki/Actor_model"&gt;Actor model&lt;/a&gt; should be properly represented on Wikipedia and spooks and the internet archive. He spent some time with the unbounded non-determinism in the Actor model versus other concurrency formalisms that only support bounded non-determinism. An audience member challenged him to explain this better and citing Map-Reduce. Amusingly, Hewitt answered by describing the parallelism in Map-Reduce as like Stalin. Stalin has three deputies and each of those deputies has three deputies. Stalin tells his deputies what to do, and those deputies tell their deputies what to do and so on. Thus the business of the state can proceed in parallel. Map-Reduce is a sequential algorithm which is speeded up by parallelism. There is no non-determinism. This is parallelism as &lt;a href="http://blogs.sun.com/yuanlin/entry/concurrency_vs_parallelism_concurrent_programming"&gt;opposed&lt;/a&gt; to concurrency.&lt;br /&gt;&lt;br /&gt;Next Frank Sommers spoke on how Actors are used in &lt;a href="http://www.scala-lang.org/"&gt;Scala&lt;/a&gt;. The good news is that Actors are implemented in Scala and Hewitt much preferred the Scala implementation over the Erlang implementation of Actors. The bad news is that there are a number of issues with the Scala implementation. For example, a Scala program cannot exit from a "receive" statement. Another issue is that messages are supposed to be immutable, however the current implementation may not ensure this. These and other issues are being worked on, and the next version of Actors in Scala will be much better.&lt;br /&gt;&lt;br /&gt;Finally, Robey Pointer &lt;a href="http://robey.lag.net/2009/06/25/actors-talk.html"&gt;talked&lt;/a&gt; about how he is using Scala. He implements message queuing systems that deals with large numbers of long lived connections where each connection is mostly idle but has sporadic bursts of activity. Robey has implemented this system in many different ways. For example, a straight thread implementation and a lot of tuning got up to 5000 thread based connections working at one time, however this fell well short of his goal of supporting millions of connections. A thread pool implementation with a few hundred threads worked better but the code became unwieldy and more complicated than it should have been. Now he has an Actor based implementation in Scala that does scale to the millions of connections and yet the code remains straightforward and small.&lt;br /&gt;&lt;br /&gt;He also showed us how Actors can be mixed-in with thread based synchronization to solve problems for which even Actors are too heavyweight. I am in two minds about this. On the one hand, there are legitimate uses for this low level synchronization (as discussed in my PhD thesis). On the other hand, thread based concurrency is awful as I keep promising to explain in another post. Also to do it safely, you need to understand is great detail how Actors are implemented in Scala, and one reason for adopting a high level construct like Actors is that it should hide gory implementation details.&lt;br /&gt;&lt;br /&gt;After the meeting I spoke with Carl Hewitt. We agreed that sending a message needs to have a low overhead. It should have a similar costs to calling a procedure. Computers have specialized instructions to support procedure calls and they need specialized instructions to support message passing. We did this for the &lt;a href="http://www.wotug.org/transputer.shtml"&gt;Tranputer&lt;/a&gt;, although that was before its time, and it is eminently possible for Actors. All we need is for a high level concurrency formalism like Actors to get enough traction that the chip manufacturers become interested in supporting it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5723231740432298550?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5723231740432298550/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5723231740432298550' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5723231740432298550'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5723231740432298550'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/06/actors-and-concurrent-computation.html' title='Actors and Concurrent Computation'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3436262446234082825</id><published>2009-06-22T22:02:00.000-07:00</published><updated>2009-07-12T13:10:08.254-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Now You See It</title><content type='html'>While visualization can be an effective tool to understand data, too many software vendors seem to view visualization as an opportunity to "bling your graph" according to &lt;a href="http://www.perceptualedge.com/"&gt;Stephen Few&lt;/a&gt; author, teacher and consultant. Few has written a new book just published called "&lt;a href="http://www.amazon.com/gp/product/0970601980?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0970601980"&gt;Now You See It: Simple Visualization Techniques for Quantitative Analysis&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=0970601980" alt="" style="border: medium none  ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;". He spoke to the SDForum Business Intelligence SIG June &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13464"&gt;meeting&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Few took us on a quick tour of visualization. We saw a short Onion News Network &lt;a href="http://www.theonion.com/content/video/breaking_news_series_of"&gt;video&lt;/a&gt; that satirized graphics displays in news broadcasts, followed by examples of blinged graphs and dashboards that were both badly designed and misleading in their information display. Not all visualizations are bad. An example of good visualization is the work of Hans Rosling who is a regular speaker at the TED conference (his &lt;a href="http://www.ted.com/talks/hans_rosling_shows_the_best_stats_you_ve_ever_seen.html"&gt;presentations&lt;/a&gt; are well worth watching, and then you can go to &lt;a href="http://www.gapminder.org/"&gt;Gapminder.org&lt;/a&gt; and play with the data just as he does). Another example of visualization used effectively to tell a story is in the Al Gore documentary "An Inconvenient Truth".&lt;br /&gt;&lt;br /&gt;Next came a discussion of visual perception, leading up to the idea that we can only keep a few items in our short term memory at one time, however these items can be complex pieces of visual information. Given that data analysis is about comparing data, visual encoding allow us to see and compare more complex patterns than, for example, tabular data.&lt;br /&gt;&lt;br /&gt;Any data display can only show us a small part of the picture. An analyst builds understanding of their data set by building up complex visualizations of the data, piece at a time. We saw some examples of these visualizations. Software should support the data analyst as they build up their visualizations without getting in the way. Few told us that the best software is rooted in academic research. He recommend several packages including &lt;a href="http://www.tableausoftware.com/"&gt;Tableau&lt;/a&gt; and &lt;a href="http://spotfire.tibco.com/"&gt;Spotfire&lt;/a&gt;, both of whom have presented to the Business Intelligence SIG in the past.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3436262446234082825?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3436262446234082825/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3436262446234082825' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3436262446234082825'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3436262446234082825'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/06/now-you-see-it.html' title='Now You See It'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-3448743628996054013</id><published>2009-06-15T22:14:00.000-07:00</published><updated>2009-07-12T13:12:21.135-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Technology'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><category scheme='http://www.blogger.com/atom/ns#' term='Dismal Science'/><title type='text'>Free: The Future of a Radical Price</title><content type='html'>For some time I have intended to write a post on the economics of media now that the cost of manufacturing it has gone to nothing. Today I discovered that Chris Anderson, editor of Wired and author of The Long Tail" has written a book on the subject called "&lt;a href="http://www.amazon.com/gp/product/1401322905?ie=UTF8&amp;amp;tag=buiandbre-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=1401322905"&gt;Free: The Future of a Radical Price&lt;/a&gt;&lt;img src="http://www.assoc-amazon.com/e/ir?t=buiandbre-20&amp;amp;l=as2&amp;amp;o=1&amp;amp;a=1401322905" alt="" style="border: medium none  ! important; margin: 0px ! important;" border="0" width="1" height="1" /&gt;", available in July. I will write a post after reading the book, here is an outline of what I expect it to say.&lt;br /&gt;&lt;br /&gt;Firstly, as the &lt;a href="http://www.techcrunch.com/2009/06/15/chris-andersons-counterintuitive-rules-for-charging-for-media-online/"&gt;TechCrunch&lt;/a&gt; post says, "As products go digital, their marginal cost goes to zero." It is now economic to give the product away, and make it up on volume. Closely related is the network effect, the more widespread that some piece of media is, the more "valuable" that it becomes. Barriers to media becoming widespread reduce the likelihood that it is seen or heard. Cost is definitely a barrier.&lt;br /&gt;&lt;br /&gt;Moreover, putting a price on your media item creates the opportunity for others to price for free and undercut you. A good example is &lt;a href="http://www.craigslist.org/about/sites"&gt;craigslist&lt;/a&gt;. It may not be quite what you think of as media, but craigslist is in the process of decimating the newspaper industry by destroying their market for classified advertisements. Craigslist makes their money by selling access to narrow niche markets, so it seems to fit in perfectly with Anderson's thesis.&lt;br /&gt;&lt;br /&gt;In the past I have written about the &lt;a href="http://bandb.blogspot.com/2005/11/future-of-music.html"&gt;future of music&lt;/a&gt; and how musicians are moving to make their money from performance rather than from record sales. As goes music, so goes all media. My sister is currently writing a book. This last week she told me that she expects to make her living from touring to lecture on the books contents rather than from book sales.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-3448743628996054013?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/3448743628996054013/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=3448743628996054013' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3448743628996054013'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/3448743628996054013'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/06/free-future-of-radical-price.html' title='Free: The Future of a Radical Price'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5453357458920733818</id><published>2009-06-02T23:10:00.000-07:00</published><updated>2009-06-02T23:25:17.271-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Databases in the Cloud</title><content type='html'>Last week was a busy week, with Databases in the Cloud on Tuesday followed by Hadoop and MapReduce with Cascading on Wednesday. These were both must attend SDForum SIG meetings for anyone who wants to keep up with new approaches to database and analytics systems. The two meetings had very different characteristics. &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13367"&gt;MapReduce with Cascading&lt;/a&gt; was a technical presentation that required concentration to follow but did contain some real nuggets of information. The Cloud Services SIG &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13366"&gt;meeting&lt;/a&gt; on Tuesday Demo Night: Databases in the Cloud was more accessible. This post is about Databases in the Cloud.&lt;br /&gt;&lt;br /&gt;Roger Magoulas of &lt;a href="http://oreilly.com/"&gt;O'Reilly&lt;/a&gt; Research started the meeting by discussing big data and their experience with it. A useful definition of "Big Data" is that when the size of the data becomes a problem, you have Big Data. O'Reilly has about 6 TBytes of data in their Job database, that is more than a billion rows. The data comes from the web and it is messy. They use GreenPlum, a scalable MPP database system suitable for cloud computing. It also has built in MapReduce. Like many people doing analytics, they are not really sure what they are going to do with the data so they want to keep things as flexible as possible with flexible schemas. Roger and the O'Reilly team believe that 'making sense of "Big Data" is a core competency of the information Age'. On the technology side, Big Data needs MPP parallel processing and compression. Map-Reduce handles big data with flexible schemas and is resilient by design.&lt;br /&gt;&lt;br /&gt;After Roger came three demos. Ryan Barrett from Google showed us a &lt;a href="http://code.google.com/appengine/"&gt;Google App Engine&lt;/a&gt; application that uses the Google Store. Google App Engine is a service for building web applications that is free for building small applications, and paid when the application scales. The Google Store is BigTable, a sharded stateless tuple store for big data (see my previous posts on the Google Database System and Hypertable, a clone of BigTable). Like every other usable system, Google has its own high level language called GQL (Google Query language), whose statements start with the verb SELECT. To show that they are serious about supporting cloud applications, Google also provides bulk upload and download. Google App Engine is a service that allows you to build and test your cloud web application for free.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.cloudera.com/"&gt;Cloudera&lt;/a&gt; is a software start up that provides training and support for the Open Source Hadoop MapReduce project. Christophe Bisciglia from Cloudera gave a an interesting analogy. First he compared the performance of a Ferrari and a freight train. A Ferrari has fast acceleration and a higher top speed but can only carry a light load. A freight train accelerates slowly and has a lower top speed, but it can carry a huge load. Then he told us that a database system is like a Ferrari, while Map-Reduce is like the freight train. Map-Reduce does batch processing and is capable of handling huge amounts of data, but it is certainly not fast and agile like a database system, which is capable of giving answers in real time.&lt;br /&gt;&lt;br /&gt;Finally George Kong showed us the &lt;a href="http://www.asterdata.com/index.php"&gt;Aster Data Systems&lt;/a&gt; MPP database system with a Map-Reduce engine. They divide their database servers into three groups, the Queen that manages everything, Worker hosts that handle queries and Loader hosts that handle loading. This is a standard database system that works with standard tools such as Informatica, Business Objects, Microstratagy and Pentaho. It is also capable of running in the elastic cloud. For example, one of their customers is ShareThis which keeps a 10 TByte Aster Data Systems database in the cloud. This database uses Microstratagy and Pentaho for reporting.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5453357458920733818?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5453357458920733818/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5453357458920733818' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5453357458920733818'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5453357458920733818'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/06/databases-in-cloud.html' title='Databases in the Cloud'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-2703397522607858927</id><published>2009-05-29T18:47:00.000-07:00</published><updated>2009-05-30T01:23:19.891-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Using BI to Manage Your Startup</title><content type='html'>We heard several different perspectives on how Start Ups use Business Intelligence at the May meeting of the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;SDForum&lt;/span&gt; Business Intelligence SIG. The meeting was a panel, moderated by Dan &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;Scholnick&lt;/span&gt; of Trinity Ventures. Dan opened the meeting by introducing himself and then asking the panelists to introduce themselves.&lt;br /&gt;&lt;br /&gt;The first panelist was &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;Naghi&lt;/span&gt; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;Prasad&lt;/span&gt;, VP, Engineering &amp;amp; Operations at &lt;a href="http://www.offerpalmedia.com/"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;Offerpal&lt;/span&gt; Media&lt;/a&gt;, a start up that allows developers to monetize social applications and online games. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_5"&gt;Offerpal&lt;/span&gt; Media is a marketing company that does real time advertisement targeting and uses a variety of analytics techniques such as AB testing. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_6"&gt;Naghi&lt;/span&gt; told us that Business Intelligence is essential to the companies business and baked into their framework.&lt;br /&gt;&lt;br /&gt;Next up was Lenin &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_7"&gt;Gali&lt;/span&gt;, Director of Business Intelligence at &lt;a href="http://sharethis.com/"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_8"&gt;ShareThis&lt;/span&gt;&lt;/a&gt;, a start up that allows people to share content with friends, family and their network via Email, &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_9"&gt;SMS&lt;/span&gt; and social networking sites such as &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_10"&gt;FaceBook&lt;/span&gt;, Twitter, &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_11"&gt;MySpace&lt;/span&gt; and &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_12"&gt;LinkedIn&lt;/span&gt;. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_13"&gt;ShareThis&lt;/span&gt; also uses AB testing, and as a content network has to deal with large amounts of data.&lt;br /&gt;&lt;br /&gt;Third was Bill &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_14"&gt;Lapcevic&lt;/span&gt;, VP of Business Development at &lt;a href="http://www.newrelic.com/"&gt;New Relic&lt;/a&gt;, which provides Software as a Service (&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_15"&gt;SaaS&lt;/span&gt;) performance management for the Ruby on Rails web development platform. New Relic has acquired 1700 customers over its first year as a start up with a single sales person. Their customers are technical and they use their platform to track the addiction or pain of each customer, and to estimate their potential budget.&lt;br /&gt;&lt;br /&gt;The final panelist was Bill &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_16"&gt;Grosso&lt;/span&gt;, &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_17"&gt;CTO&lt;/span&gt; &amp;amp; VP of Engineering at &lt;a href="http://www.twofish.com/"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_18"&gt;Twofish&lt;/span&gt;&lt;/a&gt;, a start up that offers &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_19"&gt;SaaS&lt;/span&gt; based virtual economies for Virtual Worlds and Massive &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_20"&gt;Multiplayer&lt;/span&gt; Online Games (&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_21"&gt;MMOG&lt;/span&gt;). For the operator, a virtual economy is Sam Walton's analytics dream, as you see into every players wallet and capture their every purchase and exchange. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_22"&gt;TwoFish&lt;/span&gt; uses their experience with running multiple virtual economies to tell their customers what they are doing right and wrong in developing a virtual economy.&lt;br /&gt;&lt;br /&gt;Dan's first question was "What are some of the pitfalls of Business intelligence?" Bill &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_23"&gt;Lapcevic&lt;/span&gt; told us that they have a real time reporting system that can track can track revenue by the minute. The problem is that you can become addicted to data and spend too much time with it. Sometime you need to get away from your screen and talk to the customer. Lenin agreed with this and added that they have problems with data quality. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_24"&gt;Naghi&lt;/span&gt; told us that while a benefit is the surprises that they find from the data, a problem is that they are never finished with their analytics. Bill &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_25"&gt;Grosso&lt;/span&gt; was concerned with premature generalization. You need to wait until you have enough data to support conclusions and revisit the conclusions as more data arrives.&lt;br /&gt;&lt;br /&gt;There was a wide variety of answers to the question of which tools each panel member used. According to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_26"&gt;Naghi&lt;/span&gt; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_27"&gt;Prasad&lt;/span&gt;, "MySQL is a killer app, it will kill your app!" &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_28"&gt;Offerpal&lt;/span&gt; Media uses Oracle for their their database. While they like some of the features of Microsoft &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_29"&gt;SQL&lt;/span&gt; Server, they are constrained to have only one Database Administrator (&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_30"&gt;DBA&lt;/span&gt;) and &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_31"&gt;DBAs&lt;/span&gt; are best when they specialize in one database system. They use open source Kettle for &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_32"&gt;ETL&lt;/span&gt; and Microsoft Excel for data presentation. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_33"&gt;Naghi&lt;/span&gt; extolled the virtues of giving users data in a spreadsheet they were comfortable with and Excel pivot tables allows the user to manipulate their data at will. After surveying what was available, they implemented their own AB testing package.&lt;br /&gt;&lt;br /&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_34"&gt;ShareThis&lt;/span&gt; is on the leading edge of technology use. Lenin told us that they are 100% in the cloud, using the LAMP stack with MySQL and &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_35"&gt;PHP&lt;/span&gt;. They have a 10 Terabyte in an Aster Data Systems database, and use both &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_36"&gt;Microstrategy&lt;/span&gt; and &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_37"&gt;Hadoop&lt;/span&gt; with Cascading for data analysis and reporting. Running this system takes about 1.5 system admins.&lt;br /&gt;&lt;br /&gt;As might be expected, the New Relic system is built on Ruby on Rails and uses &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_38"&gt;sharded&lt;/span&gt; MySQL to achieve the required database performance. In their experience it is sometimes worth paying a little more for hardware than optimizing the last ounce of performance from a system. They have developed many of their own analytics tools that they expect to sell as product to their customers.&lt;br /&gt;&lt;br /&gt;As &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_39"&gt;TwoFish&lt;/span&gt; does accounting for virtual worlds, their servers are not in the cloud, rather they are locked in their cage in a secure data center. While Bill &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_40"&gt;Grosso&lt;/span&gt; lusts after some features in Microsoft &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_41"&gt;SQL&lt;/span&gt; Server, they use MySQL with Kettle for &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_42"&gt;ETL&lt;/span&gt;. They have developed their own visualization code that sits in front of the Mondrian &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_43"&gt;OLAP&lt;/span&gt; engine. They expect to do more with the R language for statistical analysis and data mining.&lt;br /&gt;&lt;br /&gt;Dan asked the panel how they get the organization to use their Business Intelligence product Bill &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_44"&gt;Grosso&lt;/span&gt; lead by saying that adoption has to come from the top. If the CEO is not interested in Business intelligence, then nobody else will be either. He also called for simple metrics that make a point. Bill &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_45"&gt;Lapcevic&lt;/span&gt; agreed that leadership should come from the top. The idea is to make the data addictive to users and to avoid to many metrics. Sharing data widely can help everyone understand how they can contribute to improving the numbers. Lenin thought that it was important to make decisions and avoid analysis paralysis. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_46"&gt;Naghi&lt;/span&gt; offered that Business Intelligence can scare non Business Intelligence users. You have to provide simple stuff, and make sure that you score some sure hits early on to encourage people. Finally remember that different people need different reports so make sure each report is specialized to the requirements of the person receiving it.&lt;br /&gt;&lt;br /&gt;There were more questions asked, too many to describe in detail here. All in all, we had an informative discussion throughout the evening with a lot of good information shared.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-2703397522607858927?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/2703397522607858927/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=2703397522607858927' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2703397522607858927'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/2703397522607858927'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/05/using-bi-to-manage-your-startup.html' title='Using BI to Manage Your Startup'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-8275577207588581162</id><published>2009-05-16T18:35:00.000-07:00</published><updated>2009-05-17T01:06:29.552-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Dismal Science'/><title type='text'>Virtual Worlds - Real Metrics</title><content type='html'>Avoid hyperinflation! This was the salient piece of advice for running a virtual economy that I got from Bill Grosso's &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13360"&gt;talk&lt;/a&gt; to the April meeting of the SDForum Emerging Technology SIG. The talk was entitled "Virtual Worlds and Real Metrics". Bill is CTO of &lt;a href="http://www.twofish.com/"&gt;TwoFish&lt;/a&gt;, a startup that provides virtual economies to online worlds. Bill has an undergraduate degree in Economics, so he has both academic knowledge and the practical experience of seeing the insides of virtual economies in online worlds.&lt;br /&gt;&lt;br /&gt;In case you are wondering what a &lt;a href="http://knowledge.wharton.upenn.edu/article.cfm?articleid=1302"&gt;virtual economy&lt;/a&gt; is, two leading examples are found in World of Warcraft and Second Life. Both are massive multiplayer online (MMO) worlds. World of Warcraft is a fantasy game based on combat where players receive rewards for gameplay. The economy greases interactions between players and allows them to exchange rewards. Although it is not the primary point of the game, the World of Warcraft economy is estimated to be larger than many small countries around the world. Second Life is an online world where you can establish a second life for yourself. Like all the other aspects of Second Life, the economy is intended to provide a virtual reflection of a &lt;a href="http://gawker.com/tech/second-life/virtual-worlds-supposed-economy-is-a-pyramid-scheme-230813.php"&gt;real economy&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Although it is attractive to think that virtual economies are like real economies, Bill spent quite some time disabusing us of this notion. Unlike real money, virtual money may have different costs in different localities, sometimes you get a bulk discount on large purchases of virtual money and in some cases old money may expire. Another issue is that money does not circulate in the same way are real money. In the real world money, once set free, circulates from person to person, business to business. In a virtual world, money is usually created by the game, flows through a player and then back to the game. It seems that the degree to which money circulates between players is a good measure of how "real" the economy is in a virtual world.&lt;br /&gt;&lt;br /&gt;For measuring money velocity, virtual worlds do have an advantage. In the real world, statisticians estimate the money supply, then estimate the total number of number of transactions in the economy and divide one by the other to get money velocity. In a virtual world, the man behind the curtain sees into every players wallet and every transaction. The calculation of money velocity is exact. Moreover, by linking demographics and other knowledge to players you have a Business Intelligence analytics wonderland, precise reports on every aspect of the economy that can be used for all sorts of marketing purposes.&lt;br /&gt;&lt;br /&gt;Hyperinflation &lt;a href="http://news.bbc.co.uk/2/hi/technology/2345933.stm"&gt;has been&lt;/a&gt; and always will be a pitfall of virtual economies. It can come from bad design of the virtual economy, but is it more likely to come from players finding and exploiting bugs in the game, and there are always bugs in a game. Eternal vigilance is the price for avoiding hyperinflation, so the analytics reports are an important part of managing this part of the economy as well.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-8275577207588581162?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/8275577207588581162/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=8275577207588581162' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8275577207588581162'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8275577207588581162'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/05/virtual-worlds-real-metrics.html' title='Virtual Worlds - Real Metrics'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4963472487441250503</id><published>2009-05-03T21:38:00.000-07:00</published><updated>2009-05-18T22:47:20.052-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><title type='text'>Gartner BI Summit</title><content type='html'>Suzanne Hoffman, VP Sales at &lt;a href="http://www.staranalytics.com/"&gt;Star Analytics&lt;/a&gt;, recently attended the Gartner BI Summit and she gave the SDForum Business Intelligence SIG a short overview of the conference at the April meeting. You can get her presentation from our &lt;a href="http://www.sdforum.org/bisig"&gt;web site&lt;/a&gt;. There were several threads that ran through her presentation, one is the scope of Business Intelligence (BI). Another related thread is Business Intelligence and Performance Management (PM or EPM). Note that when "E" is prefixed to a acronym, it stands for Enterprise.&lt;br /&gt;&lt;br /&gt;Although the title of the conference is the Gartner BI Summit, there seemed to be a lot of concern for placing BI within the context of Information Management (IM or EIM), where Information Management includes both search and social software for collaboration as well as BI and PM. On a side note, I have always found enterprise search to be terrible, as &lt;a href="http://bandb.blogspot.com/2004/08/search-alone.html"&gt;discussed &lt;/a&gt;previously. Anything that can be done to improve enterprise search is worthwhile, and both analytics and social efforts like bookmarking or tagging can and should be applied to make it better.&lt;br /&gt;&lt;br /&gt;Gartner sees BI evolving from something that is mostly pushed by Information Technology (IT) to something that is more broadly focused on Performance Management driving Business Transformation. There has been tension between the terms Business Intelligence and Performance management for some time. For example, I wrote a semi-serious &lt;a href="http://bandb.blogspot.com/2004/09/bi-is-back.html"&gt;post&lt;/a&gt; on the subject in 2004. At the BI SIG we have always used BI an umbrella term that encompasses customer and supply chain analytics and management. On the one hand, maybe BI is too associated with IT and not enough with the end users such as business analysts. On the other hand Performance Management may have a narrower focus on financial analysis, which is a large and important part of analytics, but not the whole enchilada by any means.&lt;br /&gt;&lt;br /&gt;Which ever term is chosen as the umbrella, we will continue to call our SIG the Business Intelligence SIG for as long as Gartner has BI Summits.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4963472487441250503?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4963472487441250503/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4963472487441250503' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4963472487441250503'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4963472487441250503'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/05/gartner-bi-summit.html' title='Gartner BI Summit'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-4664962729825522467</id><published>2009-05-02T00:51:00.000-07:00</published><updated>2009-05-02T01:22:16.774-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><title type='text'>The Next Revolution in Data Management</title><content type='html'>Cringely wrote a great post today called "&lt;a href="http://www.cringely.com/2009/05/the-sequel-dilemma/"&gt;The Sequel Dilemma&lt;/a&gt;". His point is that we are in the midst of a revolution in the way we do data management, the database is the like a horse and buggy soon to be run over by the next generation of data management tools like, for example, the &lt;a href="http://bandb.blogspot.com/2008/10/google-database-system.html"&gt;Google database system&lt;/a&gt; that I wrote about last year. I particularly liked his comment:&lt;br /&gt;&lt;blockquote&gt;Right now almost every web application has an Apache server fronting a database box running MySQL or its closed source equivalent like Oracle, DB2, or SQL Server.  The data bottleneck in all those applications is the SQL box, which is generally doing a very simple job in a very complex manner that made total sense for minicomputers in 1975 but doesn’t make as much sense today. &lt;/blockquote&gt;&lt;blockquote&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-4664962729825522467?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/4664962729825522467/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=4664962729825522467' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4664962729825522467'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/4664962729825522467'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/05/next-revolution-in-data-management.html' title='The Next Revolution in Data Management'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6224409874566814956</id><published>2009-04-25T15:08:00.000-07:00</published><updated>2009-04-25T15:24:58.599-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Mahout on Hadoop</title><content type='html'>No, this is not a tale of an elephant and his faithful driver, I am talking about &lt;a href="http://lucene.apache.org/mahout/"&gt;Mahout&lt;/a&gt;, an Open Source project that is building a set of serious machine learning and analytic algorithms to run on the Hadoop Open Source Map-Reduce platform. We learned about this at the April meeting of the SDForum Business Intelligence SIG where Jeff Eastman &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13329"&gt;spoke&lt;/a&gt; on "BI Over Petabytes: Meet Apache Mahout".&lt;br /&gt;&lt;br /&gt;As Jeff explained, the Mahout project is a distributed group of about 10 committers who are working on implementing different types of analytics and machine learning algorithms. Jeff's interest is in clustering algorithms that are used for various purposes in analytics. One use is to &lt;a href="http://en.wikipedia.org/wiki/Collaborative_filtering"&gt;generate&lt;/a&gt; the "customers who bought X also bought Y" come on that you see at an online retailer. Another use of Clustering is to create a small number of large groups of similar behavior to understand patterns and trends in customer purchasing behavior.&lt;br /&gt;&lt;br /&gt;Jeff showed us the all the Mahout clustering algorithms, explaining what you need to provide to set up the algorithm and giving graphical examples of how they behaved on a example data set. He then went on to show how one algorithm was implemented on Hadoop. This implementation shows how flexible the Map Reduce paradigm is. I showed a very simple &lt;a href="http://bandb.blogspot.com/2008/11/map-reduce-versus-relational-database.html"&gt;example&lt;/a&gt; of Map-Reduce when I wrote about it last year so that I could compare it to the same function implemented in SQL. Clustering using Map-Reduce is at the other end of the scale, a complicated big data algorithm that also can effectively use the Map-Reduce platform.&lt;br /&gt;&lt;br /&gt;Most Clustering algorithms are iterative. From an initial guess at the clusters, an iteration moves data points from one cluster to another to make better clusters. Jeff suggested that a typical application may use 10 iterations or so to converge to a reasonable result. In Mahout, each iteration is a Map-Reduce step. He showed us the top level code for one clustering algorithm. Building on the Map-Reduce framework and the Mahout common libraries for data representation and manipulation, the clustering code itself is pretty straightforward.&lt;br /&gt;&lt;br /&gt;Up to now, it has not been practical to do sophisticated analytics like clustering on datasets that exceed a few megabytes, so the normal approach is to sample the dataset to get a small representative sample and then do the analytics on that sample. Mahout enables the analytics on the whole data set, provided that you have the computer cluster to do it.&lt;br /&gt;&lt;br /&gt;Given that most analysts are used to working with samples, is there any need for Mahout scale analytics? Jeff was asked this question when he gave the presentation at Yahoo, and he did not have a good answer then. Someone in the audience suggested that analytics on the &lt;a href="http://bandb.blogspot.com/2004/10/long-tale.html"&gt;long tail&lt;/a&gt; requires the whole dataset. After thinking about it, processing the complete dataset is also needed for collaborative filtering like the "customers who bought X also bought Y" example given above.&lt;span class="appOutput"&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Note that at the BI SIG meeting Suzanne Hoffman of Star Analytics also gave a short presentation on the Gartner BI Summit. I will write about that in another post.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6224409874566814956?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6224409874566814956/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6224409874566814956' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6224409874566814956'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6224409874566814956'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/04/mahout-on-hadoop.html' title='Mahout on Hadoop'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-8447152714379265643</id><published>2009-04-08T22:13:00.000-07:00</published><updated>2009-04-11T01:44:32.325-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='programming languages'/><category scheme='http://www.blogger.com/atom/ns#' term='programming'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><title type='text'>Ruby versus Scala</title><content type='html'>An interesting spat has recently emerged in the long running &lt;a href="http://bandb.blogspot.com/2007/06/programming-language-wars.html"&gt;story&lt;/a&gt; of Programming Language wars. The Twitter team, who had long been exclusively a Ruby on Rails house, came out with the "shock" &lt;a href="http://www.artima.com/scalazine/articles/twitter_on_scala.html"&gt;revelation&lt;/a&gt; that they were converting part of their back end code to use the Scala programming language. Several &lt;a href="http://blog.obiefernandez.com/content/2009/04/my-reasoned-response-about-scala-at-twitter.html"&gt;Ruby&lt;/a&gt; &lt;a href="http://unlimitednovelty.com/2009/04/twitter-blaming-ruby-for-their-mistakes.html"&gt;zealots&lt;/a&gt; &lt;a href="http://www.theregister.co.uk/2009/04/01/twitter_on_scala/"&gt;immediately&lt;/a&gt; &lt;a href="http://ikaisays.com/2009/04/02/twitter-ruby-on-rails-scala-and-people-who-dont-rtfa/"&gt;jumped&lt;/a&gt; in saying that the Twitter crew obviously did not know what they were doing because they had decided to turn their backs on Ruby.&lt;br /&gt;&lt;br /&gt;I must confess to being amused by the frothing at the mouth from the Ruby defenders, but rather than laughing, lets take a calm look at the arguments. The Twitter developers are still using Ruby on Rails for its intended purpose of running a web site. However they are also developing back-end server software and have chosen the Scala programming language for that effort.&lt;br /&gt;&lt;br /&gt;The Twitter crew offer a number of reasons for choosing a strongly typed language. Firstly, dynamic languages are not very good for implementing the kind of long running processes that you find in a server. I have experience with writing servers in C, C++ and Java. In all these languages there are problems with memory leaks that cause the memory footprint to grow over the days, weeks or months that the server is running. Getting rid of memory leaks is tedious and painful, but absolutely necessary. Even the smallest memory leak will be a problem with heavy usage and if you have a memory leak, the only cure is stopping and restarting the server. Note that garbage collection does not do away with memory leaks, it just changes the nature of the problem. Dynamic languages are designed for rapid implementation and hide the boring details. One detail that is missing is control over memory usage and memory usage left on its own tends to leak.&lt;br /&gt;&lt;br /&gt;Another issue is concurrency. Server software needs to exploit concurrency, particularly now in the era of multi-core hardware. Dynamic languages have problems with concurrency. There are a bunch of issues, too many to discuss here. Sufficient to say that in the past Guido van Rossum has prominently &lt;a href="http://www.devshed.com/c/a/Python/Basic-Threading-in-Python/"&gt;argued&lt;/a&gt; against putting threads into Python, another dynamic language, and both Python and Ruby implementations suffer from poor thread performance.&lt;br /&gt;&lt;br /&gt;A third issue is type safety. As the Twitter crew say, they found themselves building their own type manager into their server code. In a statically typed language, the type management is done at compile time, making the code more efficient and automatically eliminating the potential for a large class of bugs.&lt;br /&gt;&lt;br /&gt;Related to this, many people &lt;a href="http://topsecretproject.finitestatemachine.com/2009/04/the-great-twitter-ruby-vs-scala-war-debate/"&gt;commented&lt;/a&gt; on the revelation that the Twitter Ruby server code was full of calls to the Ruby kind_of method. It is normally considered bad form to have to use kind_of or its equivalent in other languages like the Java instanceof operator. After a few moments thought I understood what the kind_of code is for. If you look at any real server like a database server's code, it is full of assert statements. The idea is that if you are going to fail, you should fail as fast as you can and let the error management and recovery system get you out of trouble. Failing fast reduces the likelihood that the error will propagate and cause real damage like corrupting persistent data. Also with a fast fail it is easier to figure out why the error occurred. In a language with dynamic typing, checking parameters with a kind_of method is the first type of assert to put in any server code.&lt;br /&gt;&lt;br /&gt;So the Twitter developers have opted to use Ruby on Rails for their web server and Scala for their server code. In the old days we would have said "&lt;a href="http://www.phrases.org.uk/bulletin_board/13/messages/690.html"&gt;horses for courses&lt;/a&gt;" and everyone would have nodded their heads in understanding. Nowadays , nobody goes racing, so nobody knows what the phrase means. Can anyone suggest a more up to date expression?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-8447152714379265643?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/8447152714379265643/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=8447152714379265643' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8447152714379265643'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8447152714379265643'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/04/ruby-versus-scala.html' title='Ruby versus Scala'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-1581867482105216393</id><published>2009-04-05T20:30:00.000-07:00</published><updated>2009-04-05T22:19:09.069-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Cloud Computing'/><title type='text'>Cloud to Ground</title><content type='html'>&lt;a href="http://www.slate.com/id/2212467/"&gt;&lt;span style="font-style: italic;"&gt;Everybody&lt;/span&gt;&lt;/a&gt; &lt;a href="http://bandb.blogspot.com/2009/03/understanding-salesforcecom.html"&gt;is&lt;/a&gt; &lt;a href="http://chris-richardson.blog-city.com/"&gt;talking&lt;/a&gt; &lt;a href="http://richard-treadway.blogspot.com/2009/04/ria-and-cloud-computing.html"&gt;about&lt;/a&gt; &lt;a href="http://www.techcrunch.com/2009/04/02/with-hadoop-amazon-adds-a-web-scale-file-system-to-its-cloud-computer/"&gt;Cloud&lt;/a&gt; &lt;a href="http://radar.oreilly.com/2008/10/web-20-and-cloud-computing.html"&gt;Computing&lt;/a&gt;, the idea that your computing needs can be done by utility computing resources out there on the internet. One sometimes overlooked &lt;a href="http://www.ebizq.net/blogs/linthicum/2009/04/data_integration_lacking_from.php"&gt;issue&lt;/a&gt; with Cloud Computing is how do you get your data out of the cloud, summed up in the phrase "Cloud-to-Ground".&lt;br /&gt;&lt;br /&gt;The issue is that you have your data in the cloud, but you need it down here in your local computing systems so that for example, you can prepare a presentation for the board, or generate a quarter end report, or confirm that a new customer can get the telephone support they just paid for. While it is not a hugely different from other data integration problem, it is one more thing to put on your check list when you think about how you are going to use Cloud Computing.&lt;br /&gt;&lt;br /&gt;I first heard the phrase last year when Mike Pittaro of SnapLogic spoke to the SDForum Business Intelligence SIG on &lt;a href="http://bandb.blogspot.com/2008/09/saas-data-integration.html"&gt;SaaS Data Integration&lt;/a&gt;. It was only later that I discovered the origin of the phrase is describing a type of &lt;a href="http://www.nssl.noaa.gov/primer/lightning/ltg_basics.html"&gt;lightning&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-1581867482105216393?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/1581867482105216393/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=1581867482105216393' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1581867482105216393'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/1581867482105216393'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/04/cloud-to-ground.html' title='Cloud to Ground'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-5746428819973874881</id><published>2009-03-29T23:03:00.000-07:00</published><updated>2009-04-01T22:28:36.106-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><title type='text'>Understanding Salesforce.com</title><content type='html'>I was out of town and could not attend the SDForum SAM SIG meeting on the Salesforce.com architecture, which was a shame as it seems to have been a fascinating &lt;a href="http://ororke.com/paul/blog/2009/03/25/the-data-architecture-of-forcecom/"&gt;presentation&lt;/a&gt;. I have been following Salesforce.com for some time. We had them present to the SDForum Business Intelligence SIG in 2002. In 2006, Ken Rudin an early Salesforce.com employee gave an interesting &lt;a href="http://bandb.blogspot.com/2006/05/uncommon-saas-wisdom.html"&gt;presentation&lt;/a&gt; on his experience to the SDForum SaaS SIG.&lt;br /&gt;&lt;br /&gt;On the one hand &lt;a href="http://www.salesforce.com/"&gt;Salesforce.com&lt;/a&gt; has built the first really successful Software as a Service (SaaS) application and continue to grow the company year after year. On the other hand there is a certain amount of hype surrounding the company. Here is the unvarnished story of what they do. In the USA, there are tens of thousands of companies with distributed sales forces. Each company has to keep in contact and track what its salespeople are doing, where each salesperson works out of their home or an anonymous office suite far from headquarters. Salesforce.com provides the application to manage a distributed sales force. It is as simple as that.&lt;br /&gt;&lt;br /&gt;Salesforce.com is the perfect web based SaaS application. There is a large client base. Each client's problem is to keep contact with a distributed sales force, dictating a web enabled application. There are many small clients who do not have the resources to implement their own sales force management application. In practice each client needs the same basic functions in their application, with some minor variations.&lt;br /&gt;&lt;br /&gt;Salesforce.com started out by offering their application to the smaller clients who needed a few seats and would have the most difficulty in implementing their own stand alone software application. With experience they made their application economic to medium sized clients with hundreds of seats. Eventually they got to the point where they could effectively support the largest clients like &lt;a href="https://www.salesforce.com/company/news-press/press-releases/2007/02/070227.jsp"&gt;Merrill Lynch&lt;/a&gt; with 25000 seats.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-5746428819973874881?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/5746428819973874881/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=5746428819973874881' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5746428819973874881'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/5746428819973874881'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/03/understanding-salesforcecom.html' title='Understanding Salesforce.com'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6914707283227733900</id><published>2009-03-19T22:26:00.000-07:00</published><updated>2009-03-19T23:37:42.437-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><title type='text'>Google Blog Search is a Misnomer</title><content type='html'>Last week I saw one news item in &lt;a href="http://richard-treadway.blogspot.com/"&gt;passing&lt;/a&gt; that seemed to confirm my previous post about Twitter, and that was the news that Twitter Search is about to overtake Google search. At least that is what I thought it said, but it seemed so unlikely that I just dismissed it. Later, when I went back and looked again I discovered that the headline breaking news is that Twitter Search is about to overtake Google &lt;span style="font-weight: bold;"&gt;Blog&lt;/span&gt; Search.&lt;br /&gt;&lt;br /&gt;I have tried to use Google Blog Search a couple of times and found it to be completely useless. It seems to order search results in time rather than by relevance. Others have complained about this as well. For example, Michael Arrington of TechCrunch made the same &lt;a href="http://www.techcrunch.com/2005/09/14/google-blog-search-first-impressions/"&gt;complaint&lt;/a&gt; when the service debuted in 2006. It was only after a conversation last weekend that I came to realized that the point of Google Blog Search is to provide search results in time order rather than in relevance order (in practice, you can sort by relevance or by time, but the relevancy order seems to be heavily influenced by recency).&lt;br /&gt;&lt;br /&gt;Which brings me to my point. When I think of Google Search I think of the wonderful way that it always shows me relevant results in the first page. By ordering results by recency, the name Google Blog Search certainly does not explain its purpose and is thus a confusing misnomer. Ultimately the name dilutes the Google brand. I cannot offer any good suggestions, but Google Blog Search should change its name to something that clearly explains what it is &lt;a href="http://www.labnol.org/internet/who-needs-google-blog-search/7956/"&gt;for&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6914707283227733900?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6914707283227733900/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6914707283227733900' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6914707283227733900'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6914707283227733900'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/03/google-blog-search-is-misnomer.html' title='Google Blog Search is a Misnomer'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-9060003397279519968</id><published>2009-03-14T23:10:00.000-07:00</published><updated>2009-03-15T14:49:00.374-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sightings'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><title type='text'>Twitter is the new Black</title><content type='html'>This is the age of Twitter. In the last couple of weeks, both &lt;a href="http://www.comicsutopia.com/doonesbury-takes-on-twitter/"&gt;Doonesbury&lt;/a&gt; and &lt;a href="http://www.thedailyshow.com/video/index.jhtml?videoId=219519&amp;amp;title=twitter-frenzy"&gt;Jon Stewart&lt;/a&gt; have made fun of it. Members of the technorati who have been using Twitter for some time and have built up a solid following have suddenly found their lead in the number of followers eviscerated as Twitter goes mainstream and people we have all heard of become the most popular. As might be expected, the loudest &lt;a href="http://workbench.cadenhead.org/news/3498/size-your-twitter-makes-me-feel"&gt;complaints&lt;/a&gt; have come from Dave Whiner.&lt;br /&gt;&lt;br /&gt;I have &lt;a href="http://bandb.blogspot.com/2008/05/pull-dressed-as-push.html"&gt;written&lt;/a&gt; about Twitter in the past and how it relates to various feed technologies. However I must confess that I do not use the service for the simple reason that I have no use it. We live in a very noisy world. I want to keep the noise level down, and Twitter seems to just increases the noise.&lt;br /&gt;&lt;br /&gt;For example, when I first started using RSS to follow stuff, I subscribed to feeds from all the places that I regularly follow. Then I realized that there is no point in using RSS to follow a site that publishes several times a day. If I want to see what they have to say, I can just go to the site at any time and see their latest stuff. So I cut my RSS subscriptions back to the sites that publish infrequently. That way a quick daily look at my RSS feeds allows me to catch up with all sorts things without being overloaded.&lt;br /&gt;&lt;br /&gt;That is not to say that I do not see a value in Twitter, it is just that I do not see a use case for my using Twitter now. Twitter is of most use for people whose job is about communications. My job is to get things done, and to do that, I often need to switch off the outside world to reduce the noise.&lt;br /&gt;&lt;br /&gt;This attitude to Twitter could easily change. Many years ago, a friend suggested that I use Instant Messaging (IM). At the time I had no use for it and did not subscribe. A couple of years later, my manager asked that all reports  be available through IM whenever they were working, so that, for example the manager could ask a question of anyone from a meeting. Ever since then I have been online in IM whenever I have been at work. At a minimum, it shows my colleagues in a distributed organization that I am at work and available. Just as I found a use for IM, I could find a use for Twitter.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-9060003397279519968?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/9060003397279519968/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=9060003397279519968' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/9060003397279519968'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/9060003397279519968'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/03/twitter-is-new-black.html' title='Twitter is the new Black'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-8712548520352318686</id><published>2009-02-25T22:58:00.000-08:00</published><updated>2009-03-15T14:48:12.707-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><category scheme='http://www.blogger.com/atom/ns#' term='Database'/><category scheme='http://www.blogger.com/atom/ns#' term='UI'/><title type='text'>Open Source Business Models</title><content type='html'>The previous &lt;a href="http://bandb.blogspot.com/2009/02/open-source-business-intelligence.html"&gt;post&lt;/a&gt; on Open Source reminds we that we have looked at Open Source business models &lt;a href="http://bandb.blogspot.com/2008/04/open-source-10-years-later.html"&gt;several&lt;/a&gt; &lt;a href="http://bandb.blogspot.com/2004/10/open-source-dilemma.html"&gt;times&lt;/a&gt; in this blog. Here are a couple of case studies in why software became Open Source.&lt;br /&gt;&lt;br /&gt;Apache &lt;a href="http://db.apache.org/derby/"&gt;Derby&lt;/a&gt; is an Open Source Java database with an interesting history. Cloudscape, was an early Bay area Java startup. Founded in 1996, Cloudscape came out with its first database product in 1997. In 1999 Informix, bought Cloudscape, and in 2001 IBM bought the database side of Informix,  including Cloudscape.  Under IBM, Cloudscape development continued, and it was used as an embedded database within several IBM Java products and middleware.  In 2004 IBM contributed the code to the Apache Software Foundation as Derby where it has found acceptance.&lt;br /&gt;&lt;br /&gt;IBM had acquired Cloudscape by accident when it had acquired Informix, and some time after that acquisition IBM had to decide what to do with Cloudscape. On the one hand, Cloudscape did not fit into IBM product hierarchy. They already had a mobile version of DB2, the database brand and did not need another one. Also Cloudscape did not generate enough revenue to support continued development or be an interesting business. On the other hand, Cloudscape had been adopted for use in many IBM middle tier Java projects. By many I mean at least 70 internal projects were using Cloudscape.&lt;br /&gt;&lt;br /&gt;In the end, IBM decided to Open Source Cloudscape under the Apache organization with whom they already had a significant relationship. This &lt;a href="http://www.ibm.com/developerworks/data/library/techarticle/dm-0410prial/"&gt;solved&lt;/a&gt; several problems at once. Cloudscape no longer competed with the DB2 brand. A well established Open Source project would provide long term support for Cloudscape at less expense than doing it in house. A donation of valuable software would help with IBM's standing in the Open Source community and the Apache foundation with which it already had a significant relationship.&lt;br /&gt;&lt;br /&gt;Eclipse &lt;a href="http://www.eclipse.org/birt/phoenix/"&gt;BIRT&lt;/a&gt; (Business Intelligence and Reporting Tool) is an Open Source reporting application that is based on the Eclipse Java Open Source framework. I &lt;a href="http://bandb.blogspot.com/2006/08/birt-strategy.html"&gt;wrote&lt;/a&gt; about BIRT in 2006 about 18 months after the project was launched. In that post, I speculated on why Actuate had Open Sourced BIRT. Apart from the reasons that I gave then, all of which still stand, a couple more reasons come to mind.&lt;br /&gt;&lt;br /&gt;On the positive side, an open source project could establish the BIRT and Actuate methodology for defining reports as a de facto standard. Standards create defense in depth from competition and and can provide great profitability for those who control them. On the other side, an strong Open Source project with a pliable license is the best possible way to suck the air out of the competitors lungs and prevent any new competitors from springing up.&lt;br /&gt;&lt;br /&gt;Finally, releasing BIRT as open Source software has not harmed Actuate. Since the release, Actuate has not grown revenue significantly, but it has become profitable with strong cash flows. In its most recent quarter Actuate &lt;a href="http://phx.corporate-ir.net/phoenix.zhtml?c=64401&amp;amp;p=irol-newsArticle&amp;amp;id=1218026"&gt;reported&lt;/a&gt; that BIRT contributed a respectable 12% of revenue.&lt;br /&gt;&lt;br /&gt;In summary, there are a lot of reasons that software becomes Open Source. It can be a new home for old software that does not fit, or it can be weapon in the fight to the top of the business totem pole. Whatever reason, be sure you understand why the software is Open Source before committing to it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-8712548520352318686?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/8712548520352318686/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=8712548520352318686' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8712548520352318686'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/8712548520352318686'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/02/open-source-business-models.html' title='Open Source Business Models'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6975983.post-6387400675387027570</id><published>2009-02-22T12:46:00.000-08:00</published><updated>2009-02-22T15:11:17.759-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='SDForum'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='Open Source'/><title type='text'>Open Source Business Intelligence</title><content type='html'>We had a great February &lt;a href="http://www.sdforum.org/index.cfm?fuseaction=Calendar.eventDetail&amp;amp;eventID=13309"&gt;meeting&lt;/a&gt; of the SDForum Business Intelligence SIG where C0-Chair Paul O'Rorke spoke on "BI on a Budget - Open Source BI". Paul's talk broke down into two parts. In the first half he talked about Open Source Licenses and to a lesser extent business models. In the second half he did a survey of Open Source Business Intelligence tools and platforms.&lt;br /&gt;&lt;br /&gt;My discussion of licenses must include the "I Am Not A Lawyer" (IANAL) disclaimer. Paul is not a lawyer either as he told us before launching into his license discussion. Open Source &lt;a href="http://en.wikipedia.org/wiki/Open_source_license"&gt;licenses&lt;/a&gt; generally fall into three categories. From a business point of view, the most restricting is the GNU General Public License (GPL). Any code that links with GPL licenses code must be released as Open Source. This is called &lt;a href="http://www.gnu.org/copyleft/"&gt;Copyleft&lt;/a&gt;, a play on Copyright. The least restrictive licenses are those like the BSD License and the Apache Software License that require little other than you acknowledge that you are using their software. In the middle sits the so called  "weak Copyleft" licenses like the Mozilla Public License and the Eclipse Public License.&lt;br /&gt;&lt;br /&gt;A good example of what these licenses mean in practice is found by looking at the two leading Open Source database systems PostgreSQL and MySQL. PostgreSQL was originally developed at the University of Berkeley under the leadership of Michael Stonebreaker and is released under the unrestrictive BSD License. Because of this, it is the basis of many recent commercial database systems including Netezza, GreenPlumb and ParAccell.&lt;br /&gt;&lt;br /&gt;On the other hand, MySQL is released by a company that makes money by selling software licenses. There is a "community" version of MySQL that is licensed under the GPL. However you can also buy a license for MySQL, in which case you are not required to release any of your code to Open Source. A cynic might say that the community edition is there for promotion. You get to try the product for free but when you go to use it, you find that there are good reasons to buy a license.&lt;br /&gt;&lt;br /&gt;Open Source license issues are complex. The above discussion is just the tip of the iceberg when it comes to understanding the implications of an open source license. Paul brought up plenty of other issues that need to be considered. One example is patent protection. This covers whether you are liable to be sued for using Open Source code, as well as issues of protecting your own intellectual property if your code is linked to Open Source code.&lt;br /&gt;&lt;br /&gt;One of the good things about the meeting was its interactive nature. Sandeep Giri of the &lt;a href="http://www.openi.org/"&gt;OpenI&lt;/a&gt; project gave us some insights as to why chose the Mozilla Public License for his project. When Paul got to discussing the &lt;a href="http://www.eclipse.org/birt/phoenix/"&gt;BIRT&lt;/a&gt; Open Source Reporting project that is sponsored by Actuate, &lt;a href="http://bandb.blogspot.com/search?q=star+analytics"&gt;Suzanne Hoffman&lt;/a&gt; told us that the decision to Open Source some of Actuate's code caused dissent within the company to the extent that some people left. &lt;a href="http://bandb.blogspot.com/2008/07/gentle-introduction-to-r.html"&gt;Jim Porzak&lt;/a&gt; had some input on the R programming language and helped us understand the difference between using a statistical programming language and a Data Mining system like Weka. Many other audience members also joined in.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6975983-6387400675387027570?l=bandb.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://bandb.blogspot.com/feeds/6387400675387027570/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=6975983&amp;postID=6387400675387027570' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6387400675387027570'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6975983/posts/default/6387400675387027570'/><link rel='alternate' type='text/html' href='http://bandb.blogspot.com/2009/02/open-source-business-intelligence.html' title='Open Source Business Intelligence'/><author><name>Richard Taylor</name><uri>http://www.blogger.com/profile/14036876973506495788</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://1.bp.blogspot.com/_rAvqFiewcf4/THoPUR_znhI/AAAAAAAAABM/aM86_GizId4/S220/RichardPix.jpg'/></author><thr:total>0</thr:total></entry></feed>
