Blog

Programming Languages for Pipelined Analytics: Conclusion

Thu 14 January 2016 by Jeff Fischer

This is part 4 of my series on pipelined analytics language choices. In this part, we will run performance benchmarks of the file crawler programs and then conclude the series. Here is the outline of the entire series:

  1. Introduction
  2. Commentary on the Python and Java implementations
  3. Commentary on the Go …

Programming Languages for Pipelined Analytics: Go and OCaml

Wed 06 January 2016 by Jeff Fischer

This is part 3 of my series on pipelined analytics language choices. In this part, we will look in more detail at Go and OCaml. Here is the outline of the entire series:

  1. Introduction
  2. Commentary on the Python and Java implementations
  3. Commentary on the Go and OCaml implementations (this post …

Programming Languages for Pipelined Analytics: Python and Java

Tue 10 November 2015 by Jeff Fischer

This is part 2 of my series on pipelined analytics language choices. In this part, we will look in more detail at Python and Java. Here is the outline of the entire series:

  1. Introduction
  2. Commentary on the Python and Java implementations (this post)
  3. Commentary on the Go and OCaml implementations …

Programming Languages for Pipelined Analytics

Mon 19 October 2015 by Jeff Fischer

At my startup, we do a lot of "pipelined analytics" — gathering and processing of data across multiple processes over multiple stages. Our current system uses Python, with some C extensions. We've been asked in the past, "Why Python?" The answer is partly developer productivity and partly the nature of the …

Can you do third-grade math? [part 2]

Tue 25 August 2015 by Jeff Fischer

In part 1, we looked at a math problem from my son's third-grade homework. We can formally state the stamps problem as:

Given a goal postage value V and a set of available stamp values S, find a multiset of stamps C = {c1, c2, ... c …