SAX parsing and encoding

A contact of mine is having trouble with SAX when parsing RSS and Atom files. According to him, text coming from the Item elements appears to be truncated at an apostrophe, or sometimes at an accented character. There also seems to be a problem with encoding. I've tried SAX myself and saw some truncation too, but I haven't been able to dig further. I'd appreciate suggestions from anyone who has tackled this before.

This is the code that's being used in the ContentHandler:

    public void characters(char[] ch, int start, int end) throws SAXException {
        //
        link = new String(ch, start, end);
    }

Edit: The encoding problem might be caused by storing the information in a byte array, since Java works in Unicode internally.
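One possible explanation, offered as a sketch rather than a confirmed diagnosis: a SAX parser is allowed to deliver the text of a single element across several characters() calls, and entity references or non-ASCII characters are common places for it to split. Code that assigns a new String on every call keeps only the last chunk. Note also that the third argument of characters() in org.xml.sax.ContentHandler is a length, not an end index. The example below accumulates chunks in a StringBuilder and hands the parser raw bytes so that it can read the encoding from the XML declaration itself. The class name ItemTextHandler, the helper parseTitles, and the choice of the title element are hypothetical, picked only for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: collects the text of every <title> element.
public class ItemTextHandler extends DefaultHandler {
    private final StringBuilder text = new StringBuilder();
    private final List<String> titles = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // Start a fresh buffer for each element (assumes no mixed content).
        text.setLength(0);
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // May be called several times per element, so append instead of assigning.
        text.append(ch, start, length);
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        // The element's text is complete only once its end tag is reached.
        if ("title".equals(qName)) {
            titles.add(text.toString().trim());
        }
    }

    // Hypothetical helper: parses raw bytes so the parser detects the encoding
    // from the XML declaration instead of relying on a pre-decoded String.
    public static List<String> parseTitles(byte[] xml) {
        ItemTextHandler handler = new ItemTextHandler();
        try {
            SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new ByteArrayInputStream(xml)), handler);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("Could not parse feed", e);
        }
        return handler.titles;
    }

    public static void main(String[] args) {
        String feed = "<?xml version=\"1.0\" encoding=\"UTF-8\"?>"
            + "<rss><channel><item><title>Caf\u00e9 d&apos;Or</title></item></channel></rss>";
        System.out.println(parseTitles(feed.getBytes(StandardCharsets.UTF_8)));
    }
}
```

If the feed bytes are decoded by hand before parsing (for example with the platform default charset), accented characters can be corrupted before SAX ever sees them, which would fit the encoding symptom described above. Passing an InputStream or a byte-backed InputSource avoids that step.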
