Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
In my C# project, I have been dealt with the task of parsing an SGML file and have tried, very naively, to use XmlReader, and this has led to some interesting revelations (i.e., the difference between SGML and well-formed XML, etc.)
So I am thinking that I just need a good SGML parser which converts it to an XML file and go from there. In my search, I have found two SGML parsers that can integrate with my C# project:
MSDN's SgmlReader, and
James Clark's SP SGML parser.
Any other recommendations?
Apparently SgmlReader's updated here:
https://github.com/MindTouch/SGMLReader
HTML is an implementation of SGML. If you want to parse HTML properly, you will need an SGML parser. SGMLreader appears to fit those needs well, and I plan to use it myself. I would suggest using HTML tidy. It is a native application, but .net bindings for it do exist. If you need entirely managed code, then the SGMLreader is the way to go.
Related
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I'm looking to parse a C# code base within a Java app. I need to know where classes, methods, members, etc are defined (both filename and line number). What are the best tools/libraries to do this.
It looks like the Doxygen executable could be invoked and then its output XML files could be read.
Are there any other solutions out there?
I'm doing this in Java because its for a specialized application.
From here, there is a CSharp Grammer for ANTLR. I don't know how complete it is, but it includes a preprocessor, parser and lexer.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
Is there a jQuery equivalent for C#? Basically, I want to download some html from a site and parse it the same way jQuery works. I realize that .Net has its own xml parser but it's not the same thing.
There is a library called Html Agility Pack: http://html-agility-pack.net
You can parse HTML and then select nodes, and work them... not the same as jQuery but it is a very near match.
One major benefit of HAP is, it will take malformed HTML and massage it into XHTML and then a traversable DOM, so is very useful for the HTML you find in the wild.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
Are there any libraries which can convert Markdown to PDF? Or a complete markdown parser which generates tokens instead of HTML directly?
I first generated HTML using MarkdownSharp and then PDF using https://wkhtmltopdf.org/.
Any other answer which links to a token based parser or a library which could generate PDF directly will be awarded the answer.
You might want to check for LaTeX extensions for your markdown parser, which can easily be converted into PDF. There apparently is for the python-markdown parser. For example, https://github.com/justinvh/Markdown-LaTeX.
It also looks like you can use pandoc to convert markdown to LaTeX:
https://pandoc.org/
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I'm trying to create an ebook reader for WP7. So far I haven't been able to find any API to read ePub books with.
Thanks,
Zain
For DRM-free html based ePubs(there are also dtbook based ePubs, but I've never seen one) you can simply use a few libraries to build a reader:
An html render control, since the content is based on xhtml 1 and css
A zip library because the container is a zip archive
An xml library to parse the meta data files
For 2. and 3. there are many libraries, some of which support silverlight/WP7. No idea about 1, but I suspect WP7 already offers such a control.
EPUB sharp beta - http://sourceforge.net/projects/epubsharp/
Also, you can check out the draft of the ePub 3 spec here
Edit: Fixed the link
EPUB Sharp has not been updated since a long time.
Here's a step by step procedure in building your own (for iPhone, IMO it's good starting point).
I would imagine that extracting the contents of the book, as string or simple text and passing it into the Microsoft Speech SDK (SAPI) would work. Functions therein can be called from within c#. Did you try that already?
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
Looking for an open source library, for C++, Java, C# or Python, for reading the data from Quicken .qdf files.
#Swati: Quicken .qif format is for transfer only and is not kept up to date by the application like the .qdf file is.
QDF is proprietary and not really meant for reading other than my Quicken, probably for a reason as it is messy.
I would recommend finding a way to export the qdf into an OFX (Open Financial Exchange) or qif file. I have done some financial and quickbooks automation and I did something similar. The problem is if you don't export to an exchange format, each version differs and strange things happen for many conditions that since they aren't documented (QDF) it becomes a bad situation for the programmer.
OFX is what allows online banking, brokerages and apps like mint.com securely get financial data. It is a standard and consistent. Finding a way to this is much better if at all possible.
http://www.west-wind.com/Weblog/posts/10491.aspx
And i know one other blog where the author was developing a parser for qfx/qif... lemme look it up... googling hasnt helped yet :(
Update: Found one more:
http://blogs.msdn.com/lucabol/archive/2007/08/31/parsing-qif-quicken-files-in-c.aspx
Check out http://qif.codeplex.com/
You may want to check the license before use. Thanks