Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
Is there a jQuery equivalent for C#? Basically, I want to download some html from a site and parse it the same way jQuery works. I realize that .Net has its own xml parser but it's not the same thing.
There is a library called Html Agility Pack: http://html-agility-pack.net
You can parse HTML and then select nodes, and work them... not the same as jQuery but it is a very near match.
One major benefit of HAP is, it will take malformed HTML and massage it into XHTML and then a traversable DOM, so is very useful for the HTML you find in the wild.
Related
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I'm looking to parse a C# code base within a Java app. I need to know where classes, methods, members, etc are defined (both filename and line number). What are the best tools/libraries to do this.
It looks like the Doxygen executable could be invoked and then its output XML files could be read.
Are there any other solutions out there?
I'm doing this in Java because its for a specialized application.
From here, there is a CSharp Grammer for ANTLR. I don't know how complete it is, but it includes a preprocessor, parser and lexer.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 8 years ago.
Improve this question
Is there anything as an HTML5 libtidy .NET wrapper ?
I have been looking around and all I can find is Mark Beaton's Tidy Managed.
It doesn't seem to support HTML5 gracefully. I have been using it in the past with great success, but it's no longer a good fit.
Then there is this HTML Tidy with HTML5 support.
But I was hoping there is a .NET wrapper for it (than I can't seem to find, no matter what).
Also it needs to build on x64 so I understand the chances are close to zero :)
You can try the NewInlineTags property. Works for me:
var td = Document.FromFile(InFileName);
// new HTML5 tags
td.NewInlineTags = "section, aside, header, nav, footer";
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I have seen many questions on this but still couldn't find my answer.
I want to be able to convert a docx file to a pdf file without having word installed. Is there maybe a free library out there? I really don't want to pay for this.
I read a post about XSL-FO and i found a XSL-FO to pdf converter library, not sure if that works.
So without using ofMicrosoft.Office.Interop.Word I want to convert docx to pdf
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
Are there any libraries which can convert Markdown to PDF? Or a complete markdown parser which generates tokens instead of HTML directly?
I first generated HTML using MarkdownSharp and then PDF using https://wkhtmltopdf.org/.
Any other answer which links to a token based parser or a library which could generate PDF directly will be awarded the answer.
You might want to check for LaTeX extensions for your markdown parser, which can easily be converted into PDF. There apparently is for the python-markdown parser. For example, https://github.com/justinvh/Markdown-LaTeX.
It also looks like you can use pandoc to convert markdown to LaTeX:
https://pandoc.org/
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
In my C# project, I have been dealt with the task of parsing an SGML file and have tried, very naively, to use XmlReader, and this has led to some interesting revelations (i.e., the difference between SGML and well-formed XML, etc.)
So I am thinking that I just need a good SGML parser which converts it to an XML file and go from there. In my search, I have found two SGML parsers that can integrate with my C# project:
MSDN's SgmlReader, and
James Clark's SP SGML parser.
Any other recommendations?
Apparently SgmlReader's updated here:
https://github.com/MindTouch/SGMLReader
HTML is an implementation of SGML. If you want to parse HTML properly, you will need an SGML parser. SGMLreader appears to fit those needs well, and I plan to use it myself. I would suggest using HTML tidy. It is a native application, but .net bindings for it do exist. If you need entirely managed code, then the SGMLreader is the way to go.