Converting from PDF to HTML [closed] - c#

As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance.
Closed 11 years ago.
Is there a .dll I can use which uses a PDF file as an input and HTML file as an output?
I want to convert from PDF to HTML. My colleague says it's very difficult to go step by step, extracting the text, fonts, images, margins, links, etc. from the PDF and then creating a new HTML file with the same content. He says it's nearly impossible. So I was wondering whether there is a DLL I can add as a reference to do that.

Writing a program to do it is definitely not trivial. If you don't find any .NET Library to do this (I couldn't, at least not free), I would just download this and invoke it programmatically to get my html.
If you have the time to spare and/or PDFToHtml does not produce acceptable output for you, you could use iText to write the program yourself. It's a very mature free pdf library. I've used it in the past to manipulate PDFs (merge, create, etc).
UPDATE
As noted in the comment by Quandary, the PDFSharp library offers a more relaxed license (MIT) compared to the commercial or AGPL license offered by iText. Keep this in mind when choosing your library. I have not used the PDFSharp library myself and I don't know how they compare in terms of functionality.

You can download this free tool: PDFToHTML
Then in your program just fork a new process and run the executable passing the PDF file. I just tested it now and it seems to work ok.
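A minimal sketch of launching such a converter from C#, assuming the executable takes the input PDF path followed by the output HTML path. The class name, method names, and argument order are illustrative assumptions, not taken from the tool's documentation, so check them against the tool you actually install.

```csharp
using System;
using System.Diagnostics;

public static class PdfToHtmlRunner
{
    // Builds the start info without launching anything, so it can be inspected.
    // Assumption: the converter is invoked as  <exe> "<input.pdf>" "<output.html>".
    public static ProcessStartInfo BuildStartInfo(string exePath, string pdfPath, string htmlPath)
    {
        return new ProcessStartInfo
        {
            FileName = exePath,
            // Quote both paths so that spaces in file names survive.
            Arguments = "\"" + pdfPath + "\" \"" + htmlPath + "\"",
            UseShellExecute = false,      // required for stream redirection
            RedirectStandardError = true, // capture the tool's error messages
            CreateNoWindow = true
        };
    }

    // Runs the converter and throws if it reports a non-zero exit code.
    public static void Convert(string exePath, string pdfPath, string htmlPath)
    {
        using (Process process = Process.Start(BuildStartInfo(exePath, pdfPath, htmlPath)))
        {
            // Read stderr before waiting so a full pipe cannot block the child.
            string errors = process.StandardError.ReadToEnd();
            process.WaitForExit();
            if (process.ExitCode != 0)
            {
                throw new InvalidOperationException(
                    "Converter exited with code " + process.ExitCode + ": " + errors);
            }
        }
    }
}
```

Calling PdfToHtmlRunner.Convert with the path to the executable, the PDF, and the desired HTML file would then run one conversion. Whether a non-zero exit code reliably signals failure depends on the tool.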

If you don't mind paying, Aspose offers a very good solution; this is what we use at my company.
http://www.aspose.com/categories/.net-components/aspose.pdf-for-.net/key-features.aspx

Related

how to export datagrid to pdf winform [duplicate]

Closed 11 years ago.
Can you recommend any PDF API for C#? Free is best, but I don't mind paying for it.
I used PdfSharp. It's free, open source and quite convenient to use, but I can't say whether it is the best or not, because I haven't really used anything else.
Update:
I'm not sure when or if the license changed for the iText# library, but it is licensed under AGPL which means it must be licensed if included with a closed-source product. The question does not (currently) require free or open-source libraries. One should always investigate the license type of any library used in a project.
I have used iText# with success in .NET C# 3.5; it is a port of the open source Java library for PDF generation and it's free.
There is a NuGet package available for iTextSharp version 5, and the official developer documentation, as well as C# examples, can be found at itextpdf.com.
My workplace uses Winnovative's PDF generator. (We've used it mainly to convert HTML to PDF, but you can generate PDFs in other ways as well.)

Generate PDF using Asp [closed]

Closed 10 years ago.
Does anyone here know how to generate a PDF using ASP? My system is an online car rental site. After the user chooses a car, date, and services, there will be a 'CONFIRM' button that should generate a PDF when clicked. I'm not asking you to write the code for me; I just need some help, tips, and references, as I am not familiar with this. Looking forward to hearing from you. Have a good one. Cheers.
You can use iTextSharp from http://sourceforge.net/projects/itextsharp/ to generate the PDF.
This tutorial will help you get started:
http://www.mikesdotnetting.com/Article/80/Create-PDFs-in-ASP.NET-getting-started-with-iTextSharp
This is a list of free libraries that help you implement PDF generation in any type of application (including ASP).
A bit of reading is required, but in the end it's your choice which one you use:
http://csharp-source.net/open-source/pdf-libraries
From personal experience I can recommend ABCpdf.NET (paid) and iTextSharp (free).
Use EVO PDF
http://www.evopdf.com/
Or
Use Winnovative
http://www.winnovative-software.com
Use the following to generate PDFs. It handles multilingual content with great ease, and it is the best free DLL I have used so far:
wkhtmltopdf, which converts an HTML file to PDF

What is a good C# PDF library for form-heavy webapp? [closed]

Closed 10 years ago.
I am in need of recommendations for a good C# .NET PDF library for a web application that will make heavy use of PDF forms. First, the library needs to have commercial licensing. Also, it needs to have robust features for merging data into PDF forms, extracting data, flattening form fields that have data, etc. If the library includes barcoding, that'd be great.
iTextSharp is fairly popular around here. It is available under AGPL and commercial licenses (with source of course).
It'll do just about anything that can be done with AcroForms-based PDF forms (including flattening), and can get/set values with LiveCycle Designer forms. It also supports the following symbologies:
Codabar
Code 39 (and variants)
Code 128
DataMatrix (2d)
EAN (8 & 13)
Interleaved 2 of 5
PDF417 (2d)
Postnet
UPCA and UPCE
Huh. I thought it supported code 93, but I don't see anything in the code here to back that up.
The book iText In Action 2nd Edition is pretty good. Commercial support is available (included with a commercial license), and the help here and on their mailing list is quite good.
Disclaimer: iTextSoftware pays me from time to time for services rendered. Whether or not you decide to use iText will not affect that amount.

C# PDF Printing Library [closed]

Closed 10 years ago.
Are there any free libraries that would "print" to a PDF without actually having to install a PDF printer on the system? I want something that can be completely self-contained in my application. The reason I say I want it to "print" is that I've tried and tried to find a solution for converting directly from HTML with CSS to PDF, but it doesn't seem feasible. So I want to use the System.Windows.Forms.WebBrowser control to render the page first and then output that rendering to PDF. I just don't want the user to be required to install a PDF printer.
PDFsharp
Does it have to be free? Last time I looked at ABCpdf it looked quite good, and claims to support css (here).
HTMLDOC should support CSS at some point, but last time I checked the CSS support was still in beta, and it has been for "some time"; I've given up waiting... It isn't .NET, but you can use Process to invoke it.
http://www.html-to-pdf.net/ - This isn't free, but it's pretty cheap and effective. I am using it now to convert HTML with CSS to PDF. Essentially we display the page to the user, and then with a click of a button they can download a PDF of the page.
http://itextsharp.sourceforge.net/ is really excellent. You can see how to use it in the repository at stochfit.sourceforge.net
Another tool (not free): http://www.winnovative-software.com/

Digitally sign PDF files [closed]

Closed 9 years ago.
I have a digital certificate that identifies a user. I need to use it to digitally sign PDF files.
Does anyone have an example that does not use a third-party component? I need to get this done, but it would be nice to fully understand how things are done.
C# Examples please :)
The open source iTextSharp library will allow you to do this. Here's a post explaining how to digitally sign a PDF file. If you don't want to use a third-party library, you can implement it yourself, but it could be a tough task; you can start by reading the PDF specification (8.6MB).
Proper PDF signing is a very sophisticated task. There are a number of files in circulation that don't conform to the PDF specification (broken xrefs, etc.), and your code must handle all of them. In addition, various Acrobat versions treat certain things in signed fields differently. So if you need to get the task done (rather than study how it works), you should rely on a third-party solution, such as our PDFBlackbox components.
Digitally signing a PDF document without using a third-party component entails a great deal of work and is generally best avoided.
Components do all the hard work for you, so you don't have to. You should find there are some excellent free PDF components available that will suit your needs.
The following example written in C# shows how simple it is to digitally sign a PDF document using ABCpdf:
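// Load a PDF that already contains a signature field named "Signature".
Doc theDoc = new Doc();
theDoc.Read(Server.MapPath("../Rez/Authorization.pdf"));
Signature theSig = (Signature)theDoc.Form["Signature"];
// Descriptive details recorded with the signature.
theSig.Location = "Washington";
theSig.Reason = "Schedule Agreed";
// Sign with the certificate in the PFX file; the second argument is its password.
theSig.Sign(Server.MapPath("../Rez/JohnSmith.pfx"), "111111");
theDoc.Save(Server.MapPath("Signed.pdf"));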
Source: ABCpdf documentation - Sign method
Lost my first answer. You may want to give DocQ a try. They have their own cert and can seal and encrypt PDFs for you for free or cheaply. They also have an API you can use.
