Darrell is president, developer, and most everything else at SynergiSystems, Inc. He launched SynergiSystems in 2007 in order to create software to make life easier for software developers.
So you're writing the mother of all text editors, and your rich editing features are working beautifully. Then you hit a serious snag as you start the code that reads and decodes existing files: character sets. How can your program tell which character encoding should be used to properly read each file?
Or perhaps you're writing a custom program to convert thousands of text documents to Unicode and archive them for your employer. The original documents are saved in many different encodings, and there is no easy way to correctly identify the character set for each one.
You do a little research and find that byte order marks (BOMs) might help you identify some of the UTF character sets, plus you learn some tricks that can help you recognize when a file might use the US-ASCII encoding. But these tricks aren't guaranteed; in fact, they'll probably fail as often as they work. Plus they don't help you at all with most of the two hundred or so other possible encodings.
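To make those tricks concrete, here is a minimal sketch of what such a first attempt might look like in Java. The class and method names (`BomSniffer`, `detectBom`, `looksLikeAscii`) are made up for illustration and are not part of any library. The sketch covers only the UTF-8 and UTF-16 BOMs; UTF-32 is left out, which is itself one of the ways this approach goes wrong.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Optional;

// Hypothetical helper, for illustration only.
final class BomSniffer {
    private BomSniffer() {}

    /** Returns the charset implied by a leading BOM, or empty if no known BOM is present. */
    static Optional<Charset> detectBom(byte[] data) {
        if (startsWith(data, 0xEF, 0xBB, 0xBF)) return Optional.of(StandardCharsets.UTF_8);
        if (startsWith(data, 0xFE, 0xFF)) return Optional.of(StandardCharsets.UTF_16BE);
        // Caveat: a UTF-32LE BOM (FF FE 00 00) also begins with FF FE,
        // so this simplified check would misreport such a file as UTF-16LE.
        if (startsWith(data, 0xFF, 0xFE)) return Optional.of(StandardCharsets.UTF_16LE);
        return Optional.empty();
    }

    /** True when every byte is below 0x80, i.e. the data is at least consistent with US-ASCII. */
    static boolean looksLikeAscii(byte[] data) {
        for (byte b : data) {
            if (b < 0) return false; // Java bytes are signed, so values 0x80..0xFF are negative
        }
        return true;
    }

    private static boolean startsWith(byte[] data, int... prefix) {
        if (data.length < prefix.length) return false;
        for (int i = 0; i < prefix.length; i++) {
            if ((data[i] & 0xFF) != prefix[i]) return false;
        }
        return true;
    }
}
```

A file saved without a BOM returns an empty result from `detectBom`, and a file that happens to contain only ASCII bytes passes `looksLikeAscii` even if it was written in ISO-8859-1, Windows-1252, or any other ASCII-compatible encoding.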
That just isn't good enough for your application. You need software that can accurately recognize the character encoding of a text file no matter what it is. As you begin to discover the wide array of character sets and encoding strategies and contemplate the complexities involved, you conclude you'd really rather not write it.
You need EncodingSleuth Text.
EncodingSleuth Text is a powerful Java library designed specifically with your application in mind. It examines files and byte streams to determine whether they contain encoded text, and identifies the character set most likely used to encode them.
EncodingSleuth Text uses several different statistical analysis techniques, called detectors, to analyze each possible character set that might be used to decode a file, and to score each one so that the correct character set obtains the highest score. It is configurable: you can selectively enable or disable each of the detectors to tailor its operation for your specific needs. It is also extensible: you can provide your own detector implementations should the need arise.
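The general idea of scoring candidate character sets with pluggable detectors can be sketched in a few lines of Java. This is not the EncodingSleuth Text API: the `Detector` interface, the two toy detectors, and the `best` method below are hypothetical names invented for this illustration, and the heuristics are far cruder than real statistical analysis.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.util.List;

// Hypothetical sketch of detector-based scoring; not a real library's API.
final class DetectorSketch {
    private DetectorSketch() {}

    /** Assumed contract: a higher score means the candidate charset fits the data better. */
    interface Detector {
        double score(byte[] data, Charset candidate);
    }

    /** Toy detector: 1.0 if the bytes decode without errors under the candidate, else 0.0. */
    static final Detector VALID_DECODE =
        (data, candidate) -> decodeStrict(data, candidate) != null ? 1.0 : 0.0;

    /** Toy detector: fraction of decoded characters that are letters, digits, or whitespace. */
    static final Detector TEXT_RATIO = (data, candidate) -> {
        String text = decodeStrict(data, candidate);
        if (text == null || text.isEmpty()) return 0.0;
        long plausible = text.chars()
            .filter(c -> Character.isLetterOrDigit(c) || Character.isWhitespace(c))
            .count();
        return (double) plausible / text.length();
    };

    /** Sums every enabled detector's score per candidate and returns the top scorer. */
    static Charset best(byte[] data, List<Charset> candidates, List<Detector> detectors) {
        Charset best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (Charset candidate : candidates) {
            double total = 0.0;
            for (Detector detector : detectors) {
                total += detector.score(data, candidate);
            }
            if (total > bestScore) { // ties keep the earlier candidate
                bestScore = total;
                best = candidate;
            }
        }
        return best;
    }

    /** Decodes strictly, returning null on malformed or unmappable input. */
    private static String decodeStrict(byte[] data, Charset charset) {
        try {
            return charset.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT)
                .decode(ByteBuffer.wrap(data))
                .toString();
        } catch (CharacterCodingException e) {
            return null;
        }
    }
}
```

In this sketch, enabling or disabling a detector amounts to adding it to or leaving it out of the list passed to `best`, and a custom detector is just another implementation of the interface.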
With licensing options that allow royalty-free redistribution within your applications, and even deployment within server applications, and a price that's a fraction of the cost to develop your own encoding recognition technology, EncodingSleuth Text offers a complete and robust answer to your need.
You can download EncodingSleuth Text, request a free full-featured trial license, and peruse the documentation at http://www.encodingsleuth.com.