//These are patterns for sentence splits
//
// Valentin Tablan, 24 Aug 2007
//
//
// Lines starting with // are comments; empty lines are ignored

//2 or more new lines, optionally separated by other whitespace
(?:[\u00A0\u2007\u202F\p{javaWhitespace}&&[^\n\r]])*+(\n\r|\r\n|\n|\r)(?:(?:[\u00A0\u2007\u202F\p{javaWhitespace}&&[^\n\r]])*+\1)++

//the end of the document is also an external split, so that there is no
//orphaned text
\s*+\z