1000 lines of logs, rookie number. I was once given 400000 lines of customer data told to find a pattern of discrepancy based on logs. Both files were 400000 lines. Python cannot be in my company due to security reasons as they were financial data, I used java for regex.
Wait Ive never heard about python not being used due to security concerns, could you expand?
They had restrictions, plus i only had like 4 months of experience in Java , I was a fresher and I was crying 🥲.
Bravo, you pulled it off beautifully.
I suppose it could be dependency injection and the greater potential for breaking out of restricted environments.
Also, it's an interpreted language which is a bit less safe than a straight compile.
Also also, python is what they use for the most common hacking tools. Has good potential for privilege escalation.
Probably more "hasn't been approved" than "has been banned".
Sounds like it might be a fintech company, in which case, do not expect there to be a logical, modern, coherent reason.
I consulted for 14 years and will never do fintech again unless it’s a scrappy consumer-focused org with a low headcount. One company, to work on their iOS code, I had to remote from a perfectly good Mac to a windows machine in the cloud to another Mac. In New Zealand.
Guess no Python interpreter made it into the corporate whitelist?
It's a lot of work to make Python function in a whitelist security policy environment. Approving PyCharm is one thing, but you'd have to maintain an internal PyPI mirror with individually approved packages, and that's where an understaffed corporate infosec department would likely nope out.
Wonder if PyPI-whitelisting-as-a-service could be a viable business model.
How does python impose a security risk?
Don't try to reason with corporate
Every python function call you make is sent to a private server where Roko’s Basilisk reads and learns. Why did you think the language is called Python?
Maybe it was not validated in that environment and thus they could not know if it imposed a security risk or not?
But the mere presence of a programming language be deemed as a security risk is what’s interesting to me. If Python is said to be a risk then why not Java?
They’re aaaall a security risk, honestly. Nothing unique about python. Unless maybe the fact that anti-virus programs can’t really analyze code as well as they can a compiled executable.
That's what I was told, I was not allowed to use python.
Supply chain attacks can and do happen regularly against python's pypi which is why management would restrict the use of it.
Well if you have an idea of what you are looking for or at least when you are looking for, no problemo.
Otherwise, just take the day off and tell them you found nothing.
EOD stands for ‘End of Dignity’ when dealing with 10000 lines of code
SRE here. My applications log in PROD millions lines per hour and we keep them for 6 weeks. Not that hard to analyse if you use the right tools. IMHO this is a skill issue.
10 M well structured lines can be easier than 10 k ad hoc lines.
So this happened to me. And while my manager was showing logs to me and my junior asked us to analyse the logs and find the problem by EOD.
I was losing my shit like how can you expect us to find it in less than 5 hours. And he was saying bs like you can do it. You got to believe in yourself.
I saw the issue, i found the bug. And I asked him to stop.
And he with pride said this is why I come to you.
I knew I had done myself dirty.
ChatGPT or other IA
Exactly. It has saved me so much time
Better hope there’s nothing confidential in those logs huh
A friend developed a "language" for highlighting in Notepad++, so he could collapse the stacks in the logs. After that, he scrolled through the logs via the preview and looked, if he could see any usual pattern, like longer lines or shorter ones.
10000 lines of log files is easy mode. They probably have a reasonable text encoding, line breaks and everything..
Humans are……
Stick them into log insights in cloud watch -> find patterns -> check weird patterns
Make your machine analyze it for you dummy. If you don’t spend 4 hours automating a 10 minute task, can you even call yourself a software engineer?
Or are we dancers?
Depends what you mean by analysis. Really, whatever you're doing it shouldn't matter if it's 10k lines or 10 million lines, you just filter out the noise and either find the exact logs you're looking for, or write a script to extract the data your boss wants.
I don’t have imposter syndrome. These posts are made by imposters.
Damn the low quality effort is getting worse
grep | awk
And do some magic
I lost a position that was legit 80% reading through logs and trying to figure out what went wrong. It was awful
Small log files (<100,000 lines) I just search through in VS Code, but for any large ones I strongly recommend https://github.com/variar/klogg That will open a multi-gigabyte text file with no problem.
I'll just feed it to LLM hahahaha
Trace32.exe -> look for red -> LGTM
Depends on the logs. I was at the customer's site and hat to analyze 600k wireshark packets. Reproduced and found the error in a few minutes. Filtering is the key.
Ctrl c Ctrl t c h a t g p t . c o m enter Ctrl v enter
Microsoft logparser if you want a quick way to use SQL queries against CSV, XML files
Laughs maniacally in regex
your grep game is weak
One of the perfect use cases for AI but this sub will just call me a vibe coder
Grep
Type"Error"
No results found
Send a message back "Looks good to me
It isn't that complicated
I couldn't find the Ctrl key
Presses C + T + R + L instead
"Why doesn't this work?"
I did that in high school (to be fair it was my 7th class for the day and I was on auto pilot) I had to press Ctrl+F5, but I pressed Ctrl+F+5, the teacher even said people like me usually fail the class. I got an A and work in programming just to show her.
Lol
That's an actually good app idea, if a bit demented
🤣🤣 +1 plus if the log are bad, dev are bad
You caould search for a "Warning" too if you feeling spicy.
Nah, it is the reason why there are 10000 lines in the first place.