NearWalden Rotating Header Image

Expense Tool Done Right

Yesterday I went looking for a simple tool to do expense tracking while I travel around, and found a great fit in Xpenser. You can post expenses over IM, SMS, email and through the web. When using email you can even attach a photo of the receipt, presumably taken with your phone.

All it expects is a text stream of the form “{activity} {amount} {tags}”, such as “Dinner with Jim 46.80 projectx”, which will (as you would hope) file an expense into the ProjectX report for $46.80 with the note “Dinner with Jim”.

They also have an open API. Since I needed to catch up on some expenses I decided that command-line entry would be easiest, so I whipped up this script (note: it doesn’t deal with consecutive spaces correctly):

#!/bin/bash
#
# this script writes an expense to xpenser.com
# change email address and password to match xpenser acct
LOGIN="address@email.com:password"
URLSTART="http://xpenser.com/api/simple/?q="
URLEND="&format=json"
Q=$1
curl -i -u $LOGIN $URLSTART${Q// /+}$URLEND

I named the command ‘xpens’, so now I just type xpens “Dinner with Jim 46.80 projectx” and I’m good!

Next, a search for a simple CRM app….

Green Education: What We Need

Over the holidays USA Today had an article talking about the sudden rise of green-oriented minor and major programs at universities. According to Paul Rowland, Executive Director of Association for the Advancement of Sustainability in Higher Education, two factors are driving the surge: students want the courses, and employers want the trained students.

When I give talks on our book “Citizen Engineer” at universities this topic always comes up. In specific, we discuss what employers are looking for in these students.

The first point is clear from the article: energy is a “sweet spot”. The examples at Illinois State, MIT and UC Berkeley all centered on energy. For the first time in decades there is a wave of innovation in this sector, and not only in companies that generate energy, but also those that depend on it for their operations or products. We certainly fall into this latter bucket at Sun, and more of our engineering jobs come with a requirement of understanding electrical energy and how our electricity infrastructure works in real life.

The second point relates to general sustainability degrees. While these were barely mentioned in the article, we are starting to see some students come from programs with majors in sustainability, and I hear of many universities considering adding such a degree.

My feeling on this has become pretty clear cut: a minor degree in sustainability is a wonderful idea, a major is not. We need more awareness of environmental needs and solutions in all of our roles in business: our engineers, chemists, lawyers, business people and operations teams. But the key is that these are all highly specialized roles, and the people need to be able to do these first. I want the major to be in these areas, and will highly value a minor in sustainability.

The proof, of course, is in the pudding. At Sun we have no sustainability generalists, and I don’t anticipate that we would ever hire one. In every real-world case we’ve always needed someone with training or experience in a specific expertise.

To sum up, my advice to universities is always the same: energy is a great major or minor, but be careful of majors in general sustainability. Be world class in the things you already do, and layer in sustainability minor to make those folks even more marketable.

Some Data Released in UK

Steve McIntyre reports that “the UK Met Office has released a large tranche of station data, together with code”.

Notes:

  • The Met Office says that this is not a complete set of data, but it is unclear what is missing.
  • This is the processed data – the raw data is claimed to have been deleted.
  • Over at The Air Vent, Jeff Id discusses some of the details.

In my recent discussions (1, 2) of open climate science, I have stressed the importance of understanding licensing terms when discussing specific code or data. In this case there appears to be no license info available for the data (Note: you can not assert a copyright on facts, e.g. raw data, but given that this is processed data it can have a copyright and license. The Met also seems to believe this, as it is their excuse for not releasing other parts of the complete data set).

On the code side, there are two files of perl code which are downloadable from the site (1, 2). The site says:

This code is released under an Open Source licence that is contained as comments in the code. By running the code you indicate your acceptance of the licence.

However, there is no license in the comments of either file. As those of us in software know, there are big differences between open source licenses, so it would be good do know what license they were planning to use.

Specifying Open Climate Science: A First Attempt

In my last post, I used lessons from the open source software community and the Creative Commons effort explore what we mean by “open climate science”. In this post I’m going to take the next step and propose a specification for open climate science. Finally, in the next installments I will look at the how to implement this specification using our current intellectual property legal framework.

Before I dive in, it is worth reiterating that I am not a scientist (and, by logical extension, not a climate scientist). I have a lot of experience with open source through my job at Sun, and I believe that much it is applicable to this situation. However, I’m sure there are subtleties that I will miss in this effort. Hopefully, though, it is complete enough to help facilitate a broader discussion.

The first step is to define some terms that describe the process of climate science (I’m open to suggestions for these). I’ll use the following to describe the process itself:

Climate science consists of running a calculation across one or more data sets and producing a result data set. Scientific conclusions or observations are based on the result data set

Beyond that we will also need some terms to help us talk about the mechanics of doing the science as described above:

  • Some data sets are a raw, meaning that they are taken directly from human or machine based observations.
  • A calculation is done using an algorithm that is embodied as software.
  • Software can exist in source and binary form (generally, source code is the software humans write and can read, binary code has been compiled into 1’s and 0’s that a computer understands)
  • The term metadata will describe any additional information beyond the software and data sets which is required to understand or accurately recreate a result. This includes the schema of the data (e.g. units of the numbers), required software tools or libraries, including version numbers, models and calibration of sensors, etc.

The second step is be to identify the use cases that we’d like the concept of “open climate science” to support. While ‘use case’ may sound benign, they are in fact the most critical part of this discussion. If there isn’t agreement whether these use cases embody the philosophy of ‘open’ that we are looking for, then the rest of the discussion is pointless. So far I’ve been able to identify three main use cases:

  1. Allow others to reproduce and verify the result data set. An example might be a scientist or other interested party who looks at a result and says “That surprises me. I wonder if there’s something funny in the data, or if they made a mistake in the code?”

  2. Allow others to run variations on the calculation. For example, another scientist might want to run the a variation of the calculation on the same data sets in order to see what the impact is on the result.

  3. Allow others to combine some or all elements of the experiment with other elements and produce a different experiment. For example, another scientist may want to take a calculation which measures the correlation of two data sets and apply it to a third data set to see if the resulting correlation is similar.

Some may say that use cases #2 and #3 could be difficult to tell apart in some situations, and they would be correct. This is also a common issue in music and literature: as many artists and authors “sample” other people’s work, frequently the question arises whether the new work is just a variation on the earlier work, or is in fact a new work that is distinct of the earlier one. Since copyright law distinguishes between these two cases, understanding where that line is may be important in these situations. However, if we agree that both #2 and #3 are important aspects of open climate science, then defining the line between them becomes less important.

Its also important to note that we haven’t yet talked about avoiding undesirable use cases. For example, I’d rather not have someone take my results, write a paper about it and imply that they created the data themselves. Or I may be upset if someone puts my result data sets on a CD and sells them to others. These are important, and we will start the process of dealing with those below.

Finally, let’s take the use cases and definitions above and write a proposed specification of open climate science.

For the purpose of reproducing results, running variations of the calculations, and creating derived calculations, the science is open if:

  1. All input data sets are freely available and include all relevant metadata necessary to perform the desired calculations.

  2. All calculations are freely available in source code form and include all relevant metadata necessary to execute the code as-is or make reasonable modifications.

  3. The results data set(s) are freely available and include all relevant metadata.

  4. All necessary tools are either freely available or commercially available

For the purpose of this specification, freely available means that digital copies are available for download at no cost (copies on physical media may or may not be available, and may or may not be free). Commercially available means that it is available for sale, such as a specific computer or software package, but does not require that the cost be zero.

In order to protect the creators of the open science, we need to add some optional specifications which restrict the users. These can be used at the discretion of the creator.

  1. All data sets and software are available under these terms for non-commercial use only. These may or may not be available for commercial use, and other terms may apply.

  2. Any results published using the data sets and software under these terms must acknowledge the source of the assets which were used.

Another thing creators may want to do is control the permissions around derived works. We’ll look at that more closely in the legal discussion.

That completes the proposed specification. Its not very long, but does ask a lot of the open climate scientist.

Before we move on to the next installment, there is one possible change which would strengthen this spec further, and that is to require that all input data sets are either a) raw and freely available, or b) open, using the above definition. This means that no climate science is open until everything it depends upon is also open. In the software world this is true, but I will leave it as a question whether that makes sense in the case of climate science as well.

In the next installment we’ll use this specification and intellectual property law to create a license which can legally formalize this specification.

Towards Open Climate Science

The events that have transpired (physically) at University of East Anglia and (virtually) around the globe have raised the important question of whether climate science is open and transparent enough. This has led, naturally, for a call for “open source” science.

Personally, this discussion links two amateur passions of mine, climate science and open source. Coincidentally these are central themes of Greg Papadopoulos’ and my book, “Citizen Engineer”, not because we miraculously anticipated this particular point in time, but because we saw these as the two largest knowledge gaps in today’s engineers.

I could write a long article about why an open approach to climate science is the right thing. Even acknowledging short term issues that it will create, such as the one raised by Roger Pielke Jr in his response to Andy Revkin, I can argue strongly that its the right thing in the long run. So instead of adding to that discussion, I want to move on and talk about what happens next, and propose two activities that can lay the ground work for the future.

The next important step in this conversation may not be obvious, but it is to formally define what we mean by “open source science”. Its easy to say that the raw data and code for all peer reviewed work should be publicly available. But working day-in and day-out in the world of open source software, I know, firsthand, that reaching a clear, usable definition is far harder than you might believe.

First, there are a series of practical questions that need to be answered, such as how soon data and code needs to be available. Is the live stream from satellites available on the web? Is it OK to sit on the data for 6 months? How about waiting until papers using the data have been published? And as people start to the work with the data there will inevitably be demands for related data. For example, how much other data on the real-time operation of a satellite will people want in order to do their own calibration of a raw data product?

But beyond the practical issues there’s a more subtle question of licenses. Your average person on the street would assume that open source software is just plain freely available (i.e. in the public domain), but almost all software that is considered ‘open source’ comes with a license that seriously restricts how you can use it. For example, the license may dictate whether the code can be used in a commercial product, or whether the copyright holder will relieve the user of any patents that they may have that relate to the software. cc.logo.large.pngTrademarks and attribution may also play a role. To see some easy to understand license options, take a look at the excellent site CreativeCommons.org, which provides a tool for creating your own, custom license for your website, blog, music, etc.

As you can imagine, the wide array of possible licenses leads to a long, heated and contentious discussion over what is truly “open” and what isn’t. OSI-logo-100x117.png In the software world the Open Source Initiative is a non-profit that was formed for this purpose, and is generally recognized by the open source community as the standards-bearer of the definition. As you can see on their site, they also maintain a list of widely used licenses and how they stack up with the OSI standard.

One of the most intriguing aspects of the open source licensing world is a class of license which are referred to as “viral” or “reciprocal”. These licenses place requirements on derived works, often that the derived work is placed under the same license. heckert_gnu.small.png The father of this type of license is the GNU Public License, or GPL. This clever license uses the US copyright system,not to prevent others from using a work, but instead to propagate free and open software. In other words, it says that you get the benefit of using this work, but in exchange you have to share your work which used this in the same way.

Its not hard to imagine using a GPL-like license in climate science. A data set could come with the requirement that results based on the data set also be freely available. Similarly, code used in an algorithm could have the same restriction (note that the algorithm is difficult to control, but code is covered under copyright law and can have an attached license). As you can see, this subtle idea could have broad, and lasting implications for the use of data and code in climate science.

So as you can see, the question of “what is open climate science” is less well defined than many would imagine, which leads me to two proposed actions.

First, is to find a formal home for the definition of “open climate science”. This important activity needs a home, just as open source software has OSI, which can manage the process of creating and maintaining the definition. This process will take some time, but if the climate science community is serious about transparency and openness, executing on this process will be required to making true progress. (Note: the Creative Commons project [Science Commons(http://sciencecommons.org/) may be useful here, but I don’t know much about it)

The second proposed action is simpler and can happen quickly. This activity is to publicly document (presumably on a website) basic facts about the ‘openness’ of the top sources of climate data and algorithm. Is the raw data available? Are the algorithms and code available? Who can have access? How do you get them?

While I’m not equipped to spearhead the first activity, I can certainly help get the second underway. Anyone else interested?

The Unasked Questions

Tom Fuller at Examiner.com ran a survey on various aspects of climate change and policy for the last few days (sorry, the survey is now closed, but Tom says he’ll be writing up the results soon).

At the end he asked for any questions that he might have missed. Usually by that point in a survey I’ve lost interest, but this one was pretty well designed, so I submitted the two key questions I thought that he’d missed:

Q: Considering the cap and trade policies as described in the draft Waxman-Markey and Kerry-Boxer bills, what impact on real US green house gas emissions do you believe they will have by 2025?

a) we will meet the emissions targets in the bills

b) we will have a reduction in emissions, but not close to the targets

c) they will have no effect on the US GHG emissions

Q: Again, considering the cap and trade policies as described in the draft Waxman-Markey and Kerry-Boxer bills, what impact on the US economy do you believe they will have by 2025?

a) the economy will be heavily, negatively impacted (> 1% of GDP)

b) little or no impact to the economy

c) cap and trade will cause growth in the economy (> 1% of GDP)

You could imagine these to actually be more of a sliding scale, and trying to figure out where on the chart you think we’ll end up.

GHG-cost-benefit.png

The standard rhetoric from Washington is that the system will have little economic downside (“the cost of one postage stamp per household per day”), while obviously meeting its goals since its a legally binding cap. On top of that, they will tell you, there will be tons of new jobs created.

How would you answer these questions?

I’ll Be the Judge of That

(note: this is reposted from my Sun blog )

Two different news items recently led me into the same train of thought: we are all increasingly in the business of judging Goodness, and of being so judged. I purposefully capitalized Goodness here, because I mean it in the highest sense of the word. This probably sounds vague, so let me use the examples to explain.

The first item is from the NYTimes, and discusses the conundrum caused by a proposed solar plant in Nevada. While the plant will produce copious free energy, it will also require over a billion gallons of fresh water a year, or over 20% of the water supply for the valley in which it is to be cited.

This is a classic tradeoff that we’re going to hear more and more about. What are we willing to give up in exchange for cleaner energy? Many types of solar power need water, as shown above. But how much water is too much? How many dead migratory birds are too many at a potential wind farm location? How much car safety should we sacrifice for better mileage? How much can the view from the shoreline be impacted by offshore wind farms before it crosses the line? There are tons of these questions, there are going to be more, and they are going to get more and more complex.

Presumably you can see why I described this as judging Goodness. We might try to reduce decisions such as these to financial terms, but its hard to put a price on scenic beauty or a single bird. And your answer may not be the same as someone else’s. I’ll react differently to a wind farm I can see from my deck than one that’s a thousand miles away in a place I’ll likely never visit. How we approach these decisions reflect our personal or group morals.

The second example isn’t about tradeoffs, but about people trying to quantify the Goodness of others. You see this all of the time: green rankings of companies, ethical lists of schools, etc. For example, Sun was recently included in the Newsweek green ranking, coming in 14th out of 500. (Note: I’m not singling out the Newsweek ranking – there are dozens of examples I could have used and this just happened to be a recent, well publicized one.)

This, however, is even trickier than the task above. We’re not talking about an either/or situation, we’re talking about trying to capture all of the factors that make up the green-ness or ethical-ness of an organization in a single number. And to make matters worse, these the organizations being compared usually don’t even do the same thing. How do you compare an airline to a consulting company? A manufacturer to a non-manufacturer? A small school in the north woods to big one in Manhattan?

But as we’ve seen, there’s no shortage of people who feel they are up to the task. They’re willing to put relative weights and scores on various sources of quantitative and qualitative data, deciding what the underlying components of “good”, “green” or “ethical” should be. Sometimes we get to understand these weights and data sources, but usually we don’t.

In the Newsweek case, one of the three major factors in the ranking was a “reputation survey” audit they did using unnamed CEOs and other “experts”. Since Sun Microsystems is not a consumer brand and we don’t advertise much, its not hard to predict that we may not score as high in that as organizations that are household names, or are use big ad budgets to tout their sustainability. Sure enough, Wal-Mart tops the list in that category followed by GE, Coke and Nike. (BTW, Wal-Mart is doing some outstanding sustainability work, but that doesn’t validate the scoring methodology.)

Don’t get me wrong – I’m not saying that Sun deserves to be scored or ranked higher. In fact, I’ll go a step further and say that I have no moral grounds on which to judge the ethics or greenness of any organization, Sun included. I’m confident in telling you that we’re using less fresh water and emitting less GHG than we were a year ago, but that’s as far as I’m comfortable going. In short, who am I to judge?

So what’s the alternative to rankings? I’m a huge proponent of measuring things and making the data public – that’s what we’ve been trying to do at Sun through our CSR report, annual CDP response, OpenEco.org data and other avenues. I hope that companies, investors, and consumers are using that data to understand what we’re up to, and are making better informed decisions.

As organizations are trying to exhibit more social responsibility, there is a necessary increase in moral judgement in business decisions. In light of this, my advice is simple. First, recognize that you’re using moral in your decisions. Second, figure out what’s important to you (not Newsweek or anyone else) for the specific decisions you have to make. Finally, gather your own data.

There are lots of things that are OK to outsource – your moral standards aren’t one of them.

Trip Report: A Visit to the Capitol

This was written for my Sun blog and cross-posted here

Last week I was down in DC with a group of investors and business execs, many of whom were in the green space under the banner wecanlead.org, a collaboration between Ceres and the Clean Economy Network. John Doerr was the headliner of the group, but there were CEOs of some hot company like A123 and Seventh Generation. The motivation for us all to be there was to impress upon our legislators that well-constructed, comprehensive legislation could be good for business.

(Aside: one of the dangers of these events is that the “well-constructed” part of the message is ignored or left to be defined by the audience, so the message can be interpreted as “pass anything!”. My experience with business folks who are savvy about climate policy is that they are fairly particular about what “well-constructed” means. Another risk is that the press and others will try to figure out what your profit motive is, and often zero in on one part of your group, such as happened here. So whenever agree to participate in one of these missions you have to think through risks like this.)

Overall we were well received, and the trip garnered good press attention. In particular, the executive branch pulled out all of the stops, allocating time from three Department Secretaries: Sec’y Salazar (Interior), Sec’y Chu (Energy) and Sec’y Locke (Commerce). We also heard from Carol Browner, Director of the White House Office of Energy and Climate Change Policy, and several other members of the administration.

With all of these meetings we were able to get a sense of the administration’s overall mood and approach to climate change legislation. These impressions were probably slanted somewhat by the fact that the administration knew it had a favorable audience. The good news was that, overall these folks all knew their stuff. Obviously Sec’y Chu is deep into the science side of energy and Director Browner into policy, but Sec’ys Salazar and Locke both exhibited a deep knowledge and personal passion on environmental issues.

Compared to past trips to DC, the biggest change for me was a new focus on clean energy competition with China. Particularly Sec’y Locke and Director Browner emphasized China above all other reasons to get legislation done, and done soon. Personally I think this is a good change. I believe it can get more broad-based support, and will focus the discussion more on innovation. However, if China truly takes center stage as the driver for legislation, it can’t help but change the focus on the individual elements of the package. In particular, is cap and trade a central mechanism in a competitive agenda as opposed to a climate change agenda? This will be interesting to watch.

Beyond the emergence of China as a motivator, there were some other notes of interest:

  • The administration isn’t waiting for climate legislation to get started. They discussed what they were doing with stimulus money and within the jurisdictions of their own departments. Earlier in the week the President had signed an executive order to drive the greening the federal government.
  • The message on timing for climate legislation is “as soon as possible”. But it was pretty clear that a climate bill is second fiddle to health care, and everyone was very careful to avoid making commitments about Copenhagen. Carol Browner was particularly careful with her words, and I was left with the sense was they’re willing to let the timetable slide past December if the higher priorities aren’t complete yet.
  • Nuclear is back on the agenda. Sec’ys Locke and Chu both talked about an increase in US nuclear capability in a manner that assumes its a done deal. There was none of the hedging about the usual concerns, no hint of upcoming deliberation, etc. There will be more nukes.
  • Public lands will be used for renewables. Similar to the discussion on nukes, Sec’y Salazar spoke with a confident certainty about opening up public lands for renewables, including solar in the southwest and wind farms on the continental shelf off of the Carolinas. Again, not even a nod to the expected concerns.

Finally, when one talks about “comprehensive climate policy”, the scope of what we heard in DC and what’s in the proposed legislation is certainly comprehensive in the sense that there are lots and lots of programs there. But listening to two days of discussions it is still very hard to see how the decarbonization math adds up in order to meet the goals that people are proposing. As someone commented “it’s a mosaic, but there’s no picture”.

I remain particularly concerned about the lack of an R&D plan to support the innovation that is required to meet these goals. Lots of faith is being put into “the market” and the effects of a cap and trade system, but so far that faith eludes me.

Citizen Engineer: The Book is Out!

This week our book, Citizen Engineer, made it out the online bookstores and is supposed to hit the bricks-and-mortar stores next week. The book was written by myself and Sun’s CTO Greg Papadopoulos, with tons of help from John Boutelle. Of course its also important to recognize our publisher, Greg Doench, at Prentice Hall.

The website for the book is citizenengineer.org, and there are links there for buying the book at Amazon (paper or Kindle), from the publisher, and other book sellers. PDF’s of the book are also available there under a Creative Commons license.

CEFinal_cover_crop_2.5in.jpg

We also plan to use the website to publish further content, publicize events related to the book, and provide forums for discussion of the book’s topics (some of this is there today, and the rest will be very soon).

So why did we write this book?

After the turn of the century, Greg and I began to notice some recurring themes in our work with the engineers at Sun Microsystems and elsewhere.

First, an increasing number of engineers, especially those right out of school, were expressing a desire to “make a difference”. Some had a hobby or activity they were already invested in outside of the office, while others were searching for something that would make them more fulfilled. Many were also bringing that sense into the office, and trying to see how they could use their job to make a difference beyond the bottom line of Sun.

Second, the world at large was asking more of engineers. Public knowledge about topics like recycling, copyright, privacy, and climate change translated into new demands on manufacturers of products and services, which, in turn, required new ways of thinking in engineering.

Generally this was good news: we have engineers who want to make a difference, and a public which has rising expectations of them. In 2006 Greg wrote the first blog post about this entitled Charting a Course from Recent Grad to “Citizen Engineer”, which launched a more focused discussion of Citizen Engineers and their role in the coming years.

However, as we began to discuss this more, it quickly became clear that engineers, including ourselves, had missed out on some important topics during our educations. In particular, engineers were being asked to make increasingly complex decisions about environmental impact and intellectual property, but had never had any formal training in either area.

So we set out to write Citizen Engineer with a couple of goals in mind. First, we wanted to promote the idea that engineers could, and should, take a more visible role in shaping our future world. Second, we wanted to fill in some of the basic knowledge gaps that we found to be widespread through the engineering community. Finally, we had a point of view about the role of engineering and how to approach these complex issues which we wanted to get across.

While we feel like the book does a great job of meeting these goals, engineers are in a rapidly changing environment. We hope to use the website, along with updates to the book, to try to move the discussion along and keep it current.

Pielke: The Australian Gambit

Roger Pielke, Jr summarizes the state of climate legislation in Australia, and speculates on what it could mean for the US. If this is the next outcome here I would think it quite positive – strengthen the renewable energy and efficiency efforts, and drop the poorly designed cap and trade for now.