Measuring Parsing Difficulty Across Treebanks