Try averaging over a number of runs. Building the sparsity pattern involves communication, and timings for such functions can be noisy on some systems (what system are you on)?
Make sure you're using the development version of DOLFIN - it has a number of scalability improvements over the 1.4 release.
We're interested in user experiences in parallel, so we'd be happy to hear about any results. You can post results to fenics@fenicsproject.org or on Google+ (https://plus.google.com/communities/105550716956576029273)