Question:
TexT mining or extraction to Excel?
vahid s
2010-11-04 02:46:31 UTC
Hi all dears
It's 3rd week that I am try to process some text but have some problem
there is 10000 notepad (TXT) (Web pages that has been saved as txt )file and each of them has some info (some repeatable information) like First name , last name , Location , Tellphone number
I have to Export all of them to excel but I cant do it manually is there any way for do it Automatically ?

Good luck thanks for your time .
Three answers:
?
2010-11-04 04:30:06 UTC
It comes down to whether all files have the same formatting. Meaning, if the names are always proceeded by the word "Name", then you could find the name of the person following this word in every text file relatively easily. Or if the names always appear in the same location of the file...



You can use almost any language to extract the names.



Another option, if you have SQL server enterprise edition, then you can use the data mining in the SSiS package to extract the names. I haven't worked wih it, but read up on it some. It may still need some type of location or key to reliable extract the names from the text files. However, you could then store the names into a comma delimited file and import it into excel quite easily at that point. I would still try the SSIS package even if you don't have enterprise edition and see what you can pull from the file using expressions.



Otherwise, i suppose you could write something that would reject all common words (the, and, or, have, is, etc), dump the result into another file or table, and whittle away at the remaining words until you get most of the names out of it. Separate the names by commas and import into excel.



ODesk also has freelance programmers, along with Elance. I've found both of these sites have good rating systems for contractors, along with additional tests that can help people determine which coder has proven skill sets for the job.
?
2010-11-04 03:22:18 UTC
if all the files are of the same format you could try and write a simple application using .net or any other langauage for that matter, that will open the files, get the required data and populate either a database or out put to a csv file depending on your preference.



you can then import all the data in to a single excel file.



If your not a developer but still want to go down this route please feel free to contact me or you could perhaps post the work on rentacoder.com. Please not this will cost you a we bit of money depending on the complexity of the extraction.



Good Luck
2016-12-04 02:19:13 UTC
once you definitely love them . It makes me experience kinda awkward sending it to absolutely everyone else . yet I do say '' i such as you '' to my family contributors and my close acquaintances . BQ : No . to me it ability a cool friendly hug . BQ2 : i think of its a much bigger hug . BQ3: who's down there ?! hi ??? P.S : some human beings upload xxx whilst ending a sentence and the variety of xs exhibits the importance and the firmness of the factor . I had a German buddy who used to try this each and all of the time . she might placed x's on the top of each paragraph . she used to try this..


This content was originally posted on Y! Answers, a Q&A website that shut down in 2021.
Loading...