How to preprocess monolit text sequences in python?

I have dataset, which strcture contain 3 part [label,type,random-sequence]. label is constant, there are 5-10 values of type and random-sequence is random field. Dataset example you can find below:


Main goal is to find label, type and random-sequence. In the future data will change. So text processing method have to be universal. Is there way to devide this text sequence in words? So result should be like this:

field1: HTTP; field2: TRACE; field3: b615c083-0ddf-4d69-aa8e-aeff2c1c2a62;

Future data set can look like.


Read more here:

Content Attribution

This content was originally published by leafar_giraphick at Recent Questions - Stack Overflow, and is syndicated here via their RSS feed. You can read the original post over there.

%d bloggers like this: