NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Following Up on the Python JIT (lwn.net)
klooney 2 days ago [-]
Is LWN the only professional media organization that covers this sort of thing? It seems like core, important news relevant to a large number of well heeled professionals, it's a little wild there's so little competition.
rs186 2 days ago [-]
Most professional media organizations, including tech focused media, don't have a writer that remotely understands what's going on here.

I know Ars Technica has a guy who can go deep into how a security attack works. That's the only one I am aware of from a well-known publication. Even so, if you look at the comment section, most people have no clue about technical details and are just talking about the story.

Also bear in mind -- media in general is not a good industry to be in. Making money out of writing news articles is getting increasingly difficult. Most articles try to optimize for reach and clicks. Something like this one is not going to attract a lot of readers. I have no idea how LWN can be sustainable, but I can assure you that it's the exception, not the norm.

Other than LWN, your best bet is reading this from someone's blog, who spends hours writing about it, trying to explain it in an understandable way and avoid mistakes, expecting almost nothing in return, just as a hobby.

Your next best bet is reading someone's 12 disjointed tweets, possibly riddled with errors.

That's just the world we are living in.

wmanley 2 days ago [-]
> I have no idea how LWN can be sustainable

Speculation: it’s because Linux (the kernel) is a large centralised project, so there’s a critical mass of people willing to pay for Linux Weekly News.

From there the proven quality allowed the publication to cover other OSS projects. I suspect that without Linux LWN would not be sustainable.

sophacles 2 days ago [-]
> I have no idea how LWN can be sustainable, ...

Speculation: LWN is sustainable because there are enough people out there who recognize the value of such a source of information and are willing to pay for a subscription.

>... but I can assure you that it's the exception, not the norm.

Anecdata: I credit a good portion of my success to knowledge and insight I've gained from lwn articles.

If you're in a position to do so, I always recommend getting a paid membership, particularly if you've found their articles helpful to your tech journey.

(i am not affiliated with LWN, just a happy subscriber)

Sesse__ 2 days ago [-]
Given that even LWN, with its consistently high-quality reporting, is struggling to make it, would there be room for a second one?
klooney 2 days ago [-]
Probably not, which is sad.
satellite2 2 days ago [-]
100% or even 200% seems nice, but at least at the time when I compared with JavaScript it was 4200% faster than would have been needed. Let's not even mention go.

At least for now it seems that Python can still only be used to call some Pascal or cuda bindings if one needs performance

pjmlp 2 days ago [-]
Back in the 2000's I was on a startup whose main language was Tcl, and we would write C extensions for any performance critical command.

The experience lead me to avoid any language without JIT or AOT, unless forced upon me.

Hence why I have used Python since version 1.6 only for OS scripting tasks.

mananaysiempre 2 days ago [-]
I don’t know the timeline off-hand but I’m guessing this was before Tcl 8, at a time when everything in Tcl was a string not only logically, but also in the actual VM? There’s a whole chasm of implementation tradeoffs between that and a straight-up JIT.
pjmlp 1 days ago [-]
Tcl 8 was just released, and yes we were using it, it hardly mattered that much.

Our product was inspired by AOLServer and Vignette, providing similar kind of capabilities, we had Rails in Tcl, but we were not a famous SV startup, rather mostly serving the Portuguese market.

The founders then went on to create OutSystems, built on top of .NET and Java, nowadays using other stacks, while offering the same kind of RAD tooling.

glenstarchman 19 hours ago [-]
Starmedia (Periscopio) by chance?
pjmlp 3 hours ago [-]
Intervento, acquired by EasyPhone in 1999, which alongside other acquisitions became Altitude Software in 2000.

The platform was originally called Safelayer, and later renamed for Altitude Software consulting projects.

v3ss0n 1 days ago [-]
PyPy is average 4x faster yet 95% of python community ignored. Its already feature parity to 3.12 and most of the pypi libs works.
iberator 2 days ago [-]
"For native profilers and debuggers, such as perf and GDB, there is a need to unwind the stack through JIT frames, and interact with JIT frames, but "the short answer is that it's really really complicated"

I'm homeless and broken and I just spent like 2 weeks developing low level python bytecode tracer, and it SEEMS that this gonna ruin everything.

This is hilarious - as its my first project in like 2 years

achierius 2 days ago [-]
It's not impossible, JavaScript engines have the same challenge but are able to handle it. You do need to dump a lot of extra info but there's more or less a standard for this now -- look up JITDump
miohtama 2 days ago [-]
What's up with PyPy lately? Anyone using it in production?
cpburns2009 1 days ago [-]
I've been using it in production for 5 years now for an e-commerce website, and a lot of ETL and web apps for work. It runs flawlessly. The only real annoyance I run into is having to have the Rust toolchain installed in order to build the cryptography package.
ziihrs 2 days ago [-]
Yep. It supports Python 3.11 now.
v3ss0n 2 days ago [-]
PyPy existed since a decade ago.
pansa2 2 days ago [-]
PyPy deserves much more credit (and much wider use) than it gets. The underperformance of the Faster CPython project [0] shows how difficult it is to optimize a Python implementation, and highlights just how impressive PyPy really is.

[0] The article says "Python has gotten nearly 50% faster in less than four years", but the original goal was a 5x speedup in the same timeframe [https://github.com/markshannon/faster-cpython/blob/master/pl...].

Qem 2 days ago [-]
> The article says "Python has gotten nearly 50% faster in less than four years", but the original goal was a 5x speedup in the same timeframe

IIRC they originally expected the JIT to be the single focus on CPython performance improvement. But then another front was opened to tackle the GIL in parallel[1]. Perhaps the overhead of two major "surgeries" in the CPython codebase at the same time contributed to slower progress than originally predicted.

[1] https://peps.python.org/pep-0703/

pjmlp 2 days ago [-]
The main culprit is not wanting to change the C ABI of the VM.

Other equally dynamic languages have long shown the way.

incrudible 1 days ago [-]
But what do people actually use Python for the most, at least as far as industry is concerned? Interfacing with those C extensions.

PyPy does have an alternative ABI that integrates with the JIT and also works on CPython, so if people cared that much about those remaining bits of performance, they could support it.

pjmlp 3 hours ago [-]
That is the sad part of it all.

The culture that sees writing C as Python, and for them to care, Microsoft and Facebook had to step in.

Now with Microsoft out of the loop, lets see how much support the whole CPython JIT project will keep getting.

nromiun 2 days ago [-]
I really wish PSF would adopt PyPy as a separate project. It is so underrated. People still think it supports a subset of Python code and that it is slow with C ffi code

But the latest PyPy supports all of Python 3.12 and it is just as fast with C ffi code as JIT Python code. It is literally magic and if it was more popular Python would not have a reputation for being slow.

ziml77 2 days ago [-]
PyPy is amazing and it's actually a bit baffling that it's not the default that everyone is using in production. I've had Python jobs go from taking hours to run, down to minutes simply by switching to PyPy.
quibono 2 days ago [-]
Do you happen to know if Flask is supported by any chance?
Twirrim 2 days ago [-]
Yes. I've had a small webapp running under it quite happily (complete overkill, but it's a personal project and I was curious).

Very basic hello world app hosted under gunicorn (just returning the string "hello world", so hopefully this is measuring the framework time). Siege set to do 10k requests, 25 concurrency, running that twice so that they each have a chance to "warm up", the second round (warmed up) results give me:

    pypy   : 8127.44 trans/sec
    cpython: 4512.64 trans/sec
So it seems like there's definitely things that pypy's JIT can do to speed up the Flask underpinnings.
tgbugs 2 days ago [-]
Yes, have been using Flask on PyPy3 for years. I get about a 4x speedup.
nromiun 2 days ago [-]
I just tested it and it works perfectly.
pjmlp 2 days ago [-]
Unfortunately it keeps being the black swan in the Python community.

Python is probably the only programming language community that has been so much against JITs, and where folks routinely call C libraries bindings, "Python".

IshKebab 2 days ago [-]
It's not a black swan. The issue is that using Pypy means accepting some potential compatibility hassle, and in return you get a reasonable speedup in your Python code, from glacial to tolerable. But nobody who has accepted glacial speed really needs tolerable speed.

It's like... imagine you ride a bike to most places. But now you want to visit Australia. "No problem, here take this racing bike! It's only a little less comfortable!".

So really it's only of interest to people who have foolishly built their entire business on Python and don't have a choice. The only one I know of is Dropbox. I bet they use Pypy.

v3ss0n 4 hours ago [-]
By the time they switch ton pypy they already have too many C Extensions that is not compatible with pypy at that time and instead of improving pypy they try to develop their own llvm based jit python and they failed doing that. They should had ported those into CFFI or just help pypy improve context support. But NIH much and they built their own pypy alternative for years and failed
didip 2 days ago [-]
I don't get why PyPy and CPython don't simply merge. It will be difficult, organization wise... but not impossible.
pjmlp 2 days ago [-]
When people think of C library wrappers as Python is kind of an hard sell.
v3ss0n 4 hours ago [-]
HPY is new alternative, it works at same performance with cpyext and the same with pypy
orbisvicis 2 days ago [-]
If memory serves, PyPy supports a subset of Python and focused their optimizations on software transactional memory.
iberator 2 days ago [-]
Back in 2022 it worked fine with literally all modules except some ssh, ssl and C based modules.

With a little bit of tinkering (multiprocessing, choosing the right libraries written strictly in python, PyPy plus a lot of memory) I was able to optimize some workflows going from 24h to just 17 minutes :) Good times...

It felt like magic.

achierius 2 days ago [-]
The "C based modules" bit is the kicker. A significant chunk of Python users essentially use it as a friendly wrapper for more-powerful C/C++ libraries underneath the hood.
Twirrim 2 days ago [-]
They've long since fixed the C based modules interaction, unfortunately a lot of common knowledge is from when it couldn't interact with everything.

If you've written it off on that basis, I'd suggest it's worth giving it another shot at some stage. It might surprise you.

Last I saw there was still a little bit more overhead around the C interface, so hot loops that just call out to a C module in the loop can be just a smidgen slower, but I haven't seen it be appreciably slower in a fair while.

laurencerowe 2 days ago [-]
The FAQ states it is often much slower:

> We have support for c-extension modules (modules written using the C-API), so they run without modifications. This has been a part of PyPy since the 1.4 release, and support is almost complete. CPython extension modules in PyPy are often much slower than in CPython due to the need to emulate refcounting. It is often faster to take out your c-extension and replace it with a pure python or CFFI version that the JIT can optimize.

https://doc.pypy.org/en/latest/faq.html#do-c-extension-modul...

I have seen great success with cffi though.

orbisvicis 2 days ago [-]
I see, and it's a pretty short list:

https://doc.pypy.org/en/latest/cpython_differences.html#exte...

""" The extension modules (i.e. modules written in C, in the standard CPython) that are neither mentioned above nor in lib_pypy/ are not available in PyPy. """

The lifecycle of generators makes pypy code very verbose without refcounting. I've already been bitten with generator lifecycles and shared resources. PEP533 to fix this was deferred. Probably for the best as it seems a bit heavy-handed.

hnuser123456 2 days ago [-]
Yep, I had a script that was doing some dict mapping and re-indexing, wrote the high level code to be as optimal as possible, and switching from cpython to pypy brought the run time from 5 minutes to 15 seconds.
anthk 2 days ago [-]
If pypy worked with Retux the game would get a big boost. Altough the main issue is that it tried to redraw many object at one per frame.
v3ss0n 1 days ago [-]
Not a subset. It covers 100% of pure python. CPyExt are working fine , just need optimizations on some parts. The private CPyEXT calls that some libraries uses as Hacks are only things that PyPy do not support officially (PyO3 Rust-python bindings uses those) .
almostgotcaught 2 days ago [-]
Why do people feel the need to comment this on every single JIT post? Like imagine commenting on every post about Pepsi "Coca-cola exists since 1886".
a-french-anon 1 days ago [-]
Because Pypy wasn't even _mentioned_ in the JIT PEP (https://peps.python.org/pep-0744/), like it's the black sheep the family isn't supposed to talk about.
v3ss0n 1 days ago [-]
Because it is one of the most ambitious project in opensource world and very little is known about that. It is neglected by Python Contributor community for unknown reasons ( something political it seems) . It was developed as PHD Research project by really good researchers. PyPy had written python in Pure python and surpassed performance of Python written in C by 4-20x . They delivered Python with JIT and also Static RPython : which is subset of python which compiles directly to binary. I had also personally worked together with some of the lead PyPy developers on commercial projects and they are the best developers to work together with.
almostgotcaught 9 hours ago [-]
> PHD

Do you know that it's PhD because the h is part of word philosophy?

v3ss0n 4 hours ago [-]
Sorry I was on mobile
pjmlp 2 days ago [-]
Because as proven multiple times, the problem isn't Python, rather CPython, and many folks keep mixing languages with implementations.
bratao 2 days ago [-]
I feel sad and disappointed in Microsoft for letting the entire Faster CPython team go. I was a big supporter, always leaving positive comments and sharing news about their work. I'd figure the team paid for itself in goodwill alone. What a letdown, Microsoft. You ought to do better.
lynx97 2 days ago [-]
I wonder what is going on with the strange ""double quoting"".
setupminimal 2 days ago [-]
Hi — LWN editor here. We use <q> tags with some CSS to set off quotes from the main text. This _mostly_ works seamlessly, but a few browsers render "<q>something</q>" as ""something"". It's especially common if you copy/paste from the site.

I've considered dropping the outer quotes and using CSS before/after text to add them back in to the rendered page, but we have a huge back-catalog of articles doing it this way, and it's usually not much of an issue.

lynx97 1 days ago [-]
Thats interesting. I was using Lynx. A quick test with <q>hello</q> gives "hello" as expected. So whatever it is, you must be doing something else. Please dont use CSS to insert characters into normal text, that will fall down if CSS is not supported. I know, in this day and age, that sounds unusual. But abusing CSS to do something it wasnt ment to be doing is still no good idea IMO.
leephillips 18 hours ago [-]
How is using a feature of a language, that is part of the language’s standard, for its intended purpose “abusing” the language to “do something it wasn[’]t me[a]nt to be doing”?
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 13:14:35 GMT+0000 (UTC) with Wasmer Edge.