Hi all,
I have been trying to use file upload method to upload the source data, however, i often find that the original event log has a lot of duplicated activities (same timestamp or near same-time timestamp) which created a lot of "noises" in the analysis, for example:
- Create Purchase Order 2021-01-01 01:00:00
- Cerate Purchase Order 2021-01-01 01:00:01
This kind of "duplicated" activities are less likely to be performed by human, would you please advise if there is a way to de-duplicate records that happened within a very short timeframe (say 1 s) with all other fields having the same value?
Also, as I have been trying using different file and thus this occupy much of my data consumption, may I know is it possible to remove the file uploaded and release some capacity for other analysis? thanks in advance!