I have a term
h_avg/vnorm('+') * inner(jump(grad(u), u_old), jump(grad(v), u_old))*dS +...
in a quadratic form, where u,v,u_old are VectorFunctions, and it takes considerably more time to assemble the system ("Assembling matrix over interior facets ...") than to solve it using gmres (the final matrix is symmetric). Is it a standard behaviour, or can I use some setting to make the assembling faster?