Categories

 
GD Star Rating
loading...

As many would know, Debian GNU/Linux is one of the oldest, and the largest Linux distributions that is available for free. Since it was first released in 1993, several people have analysed the size and produced cost estimates for the project.

In 2001, Jesús M. González-Barahona et al produced an article entitled “Counting Potatoes“, an analysis of Debian 2.2 (code named Potato). When Potato was released in June 2003, it contained 2,800 source packages of software, totalling around 55 million lines of source code. When using David A. Wheeler’s sloccount tool to apply the COCOMO model of development, and an average developer salary of US$56,000, the projected development cost that González-Barahona calculated to start-from-scratch and build Debian 2.2 in 2003 was US$1.9 billion.

In 2007 an analysis entitled ‘Macro-level software evolution: a case study of a large software compilation‘ by Jesús M. González-Barahona, Gregorio Robles, Martin Michlmayr, Juan José Amor and Daniel M. German was released. It found that Debian 4.0 (codename Etch released April 2007) had just over 10,000 source packages of software and 288 million lines of source code. This analysis also delved into the dependencies of software packages, and the update flow between Debian release (not all packages are updated with each release).

Today (February 2012) the current development version of Debian, codenamed Wheezy, contains some 17,141 source packages of software, but as it’s still in development this number may change over the coming months.

I analysied the source code in Wheezy, looking at the content from the “original” software that Debian distributes from its upstream authors without including the additional patches that Debian Developers apply to this software, or the package management scripts (used to install, configure and de-install packages). One might argue that these patches and configuration scripts are the added value of Debian, however the in my analysis I only examined the ‘pristine’ upstream source code.

By using David A Wheeler’s sloccount tool and average wage of a developer of US$72,533 (using median estimates from Salary.com and PayScale.com for 2011) I summed the individual results to find a total of 419,776,604 source lines of code for the ‘pristine’ upstream sources, in 31 programming languages — including 429 lines of Cobol and 1933 lines of Modula3!

In my analysis the projected cost of producing Debian Wheezy in February 2012 is US$19,070,177,727 (AU$17.7B, EUR€14.4B, GBP£12.11B), making each package’s upstream source code wrth an average of US$1,112,547.56 (AU$837K) to produce. Impressively, this is all free (of cost).

Zooming in on the Linux “Kernel”

In 2004 David A. Wheeler did a cost analysis of the Linux Kernel project by itself. He found 4,000,000 source lines of code (SLOC), and a projected cost between US$175M and US$611M depending on the complexity rating of the software. Within my analysis above, I used the ‘standard’ (default) complexity with the adjusted salary for 2011 (US$72K), and deducted that Kernel version 3.1.8 with almost 10,000,000 lines of source code would be worth US$540M at standard complexity, or US$1,877M when rated as ‘complex’.

Another Kernel Costing in 2011 put this figure at US$3 billion, so perhaps there’s some more variance in here to play with.

Individual Projects

Other highlights by project included:

Project Version Thousands
of SLOC
Projected cost
at US$72,533/developer/year
Samba 3.6.1 2,000 US$101 (AU$93M)
Apache 2.2.9 693 US$33.5M (AU$31M)
MySQL 5.5.17 1,200 US$64.2M (AU$59.7M)
Perl 5.14.2 669 US$32.3M (AU$30M)
PHP 5.3.9 693 US$33.5M (AU$31.1M)
Bind 9.7.3 319 US$14.8M (AU$13.8M)
Moodle 1.9.9 396 US$18.6M (AU$17.3M)
Dasher 4.11 109 US$4.8M (AU$4.4M)
DVSwitch 0.8.3.6 6 US$250K (AU$232K)

Debian Wheezy by Programming Language

The upstream code that Debian distributes is written in many different languages. ANSI C with 168,536,758 is the dominant language (40% of all lines), followed by C++ at 83,187,329 (20%) and Java with 34,698,990 (8%).

Line chart

Break down of Wheezy by Language

If you are intersted in finding the line count and cost projections for any of the 17,000+ projects, you will find them in the raw data CSV.

Other Tools and Comparisons

Ohcount is another source code cost analysis tool. In March 2011 Ohcount was run across Debian Sid: its results are here. In comparison, its results  appear much lower than the sloccount tool. There’s also the Ohloh.net Debian Estimate which only finds 55 Million source lines of code and a projected cost of US$1B. However Ohloh uses Ohcount for its estimates, and seems to be to be around 370 million SLOC missing compared to my recent analysis.

Summary

Over the last 10 years the cost to develop Debian has increased ten-fold. It’s intersting to know that US$19 billion of software is available to use, review, extend, and share, for the bargain price of $0. If we were to add in Debian patches and install scripts then this projected figure would increase. If only more organisations would realise the potential they have before them.

Need help with Linux (including Debian), Perl, or AWS? See www.jamesbromberger.com.

GD Star Rating
loading...
Debian Wheezy: US$19 Billion. Your price... FREE!, 4.6 out of 5 based on 26 ratings

  25 Responses to “Debian Wheezy: US$19 Billion. Your price… FREE!”

  1. [...] Nesta semana destaco a notícia de um estudo efectuado por James Bromberger developer da distro de GNU/Linux Debian (não comercial e uma das mais antigas, da qual nasceram a Ubuntu e a Mint entre muitas outras) e CPAN que demonstra, tal como há tempos havia feito para o Kernel Linux David Wheeler, como sairia a preços verdadeiramente astronómicos, Developer values Debian at £12.1 billion, o desenvolvimento por uma entidade/empresa privada de um sistema operativo como a Debian assim como das suas quase 30mil aplicações que a compõem. In my analysis the projected cost of producing Debian Wheezy in February 2012 is US$19,070,177,727 (AU$17.7B, EUR€14.4B, GBP£12.11B), making each package’s upstream source code wrth an average of US$1,112,547.56 (AU$837K) to produce. Impressively, this is all free (of cost). vi@Debian Wheezy: US$19 Billion. Your price… FREE! [...]

  2. [...] recently read on Jeb’s blog an estimation on how much is the cost of one of the major Free Software project ef…According to James E. Bromberger – the whole Debian project was estimated to be at the [...]

  3. [...] | James Bromberger (nessun voto) 1 [...]

  4. [...] contains 17,141 packages of software, or 419,776,604 lines of code. With that figure, James Bromberger estimates that Debian would cost about $19.1 billion to produce. Bromberger also looks at the cost of individual projects like PHP, Apache and MySQL. Even at more [...]

  5. [...] ("wheezy") contains 17,141 packages of software, or 419,776,604 lines of code. With that figure, James Bromberger estimates that Debian would cost about $19.1 billion to produce. Bromberger also looks at the cost of individual projects like PHP, Apache and MySQL. Even at more [...]

  6. [...] contains 17,141 packages of software, or 419,776,604 lines of code. With that figure, James Bromberger estimates that Debian would cost about $19.1 billion to produce. Bromberger also looks at the cost of individual projects like PHP, Apache and MySQL. Even at more [...]

  7. [...] contains 17,141 packages of software, or 419,776,604 lines of code. With that figure, James Bromberger estimates that Debian would cost about $19.1 billion to produce. Bromberger also looks at the cost of individual projects like PHP, Apache and MySQL. Even at more [...]

  8. [...] یک آنالیز از توسعه دهنده‌ی CPAN و دبیان، جیمز برومبرگر – James Bromberger – اینطور نتیجه‌گیری می‌کند که توسعه و ساخت نرم‌افزارهایی که در حال حاضر در دبیان Weezy (نسخه ۷.۰) وجود دارد، از ابتدا، چیزی در حدود ۱۹.۱ میلیارد دلار هزینه بر خواهد داشت. برای این آنالیز از برنامه‌ی Sloccount برای شمارش خطوط کدهای برنامه‌هایی که در Wheezy وجود دارد استفاده شده است؛ او محاسبه کرده که چه مقدار هزینه باید صرف کرد که ۴۲۰ میلیون خط کد را با حقوق متوسط یک برنامه‌نویس نوشت. [...]

  9. [...] de los autores analizaron el uso de diferentes lenguajes de programación de distribución [...]

  10. [...] no projekta Debian izstrādātājiem mēģinājis sarēķināt, cik varētu izmaksāt Debian GNU/Linux analoģiska produkta izstrāde no nulles. [...]

  11. [...] lenguajeDebian 7 ha sido valorado simbólicamente en 19.000 millones de dólares (US$19 billones) por el desarrollador James Bromberger basándose en datos anteriores hechos a Debian y utilizando el programa Sloccount para contar las [...]

  12. [...] no projekta Debian izstrādātājiem mēģinājis sarēķināt, cik varētu izmaksāt Debian GNU/Linux analoģiska produkta izstrāde no nulles. [...]

  13. [...] In: Без рубрики Один из разработчиков проекта Debian попытался оценить стоимость разработки с нуля продукта, [...]

  14. [...] Dollar wert sein – umgerechnet etwa 14,4 Milliarden Euro. Das hat der Debian-Entwickler James Bromberger errechnet. Dafür hat Bromberger zunächst mit dem Werkzeug Sloccount alle Zeilen des gesamten Quellcodes in [...]

  15. [...] из разработчиков проекта Debian попытался оценить стоимость разработки с нуля продукта, [...]

  16. [...] из рaзрaбoтчикoв прoeктa Debian пoпытaлcя oцeнить cтoимocть рaзрaбoтки c нуля прoдуктa, aнaлoгичнoгo Debian [...]

  17. [...] Aufschluss über die genaue Berechnung und weiterführende Informationen findet man unter golem.de oder JEB's Blog. [...]

  18. [...] Article « Debian Wheezy: US$19 Billion. Your price… FREE!« , de James E. Bromberger (administrateur système). [...]

  19. [...] el esfuerzo, trabajo y ingenio que se invierten en cualquier desarrollo Open Source, pero ahora ha aparecido un estudio que nos permite hacernos una idea de lo que realmente podría costar un desarrollo Open [...]

  20. [...] are the numbers according to research completed by Perth-based programmer James [...]

  21. [...] primeira vez que fizeram isso foi em 2001 e o valor era dez vezes menor (1.9 bilhões de dólares). Uma década depois o valor foi atualizado e agora está em 19bilhões de dólares. Em que patamar este valor vai parar de crescer eu não sei, mas o que sei é que tudo isso é de [...]

   
© 2011 JEB's Blog Suffusion theme by Sayontan Sinha