I feel like there are probably some ad based search engines which are privacy and service oriented, but in general even for those there remains a misalignment problem. Hence if I don’t want to be a product now or in the future, what good search engines are there that I can pay for?
ALL HAIL KAGI
https://marginalia-search.com/ is pretty great for a different approach to search. While not exactly what you were asking for since its free, you can definitely pay via donations if you find it valuable.
You can pay me and I’ll setup a searxng instance for you if you want.
Or just use the free ones, mine is here
Kagi is the only paid search ive heard of but its more of a AI company that just happened to have a search engine as their most successful product. Consider reading this before switching to them: https://d-shoot.net/kagi.html
yours seems to work pretty well, thanks
holy shit they fixed the issue which made it really difficult to use on mobile. I might actually consider switching off duckduckgo
if you searched something and pressed the images tab then back swiped it would go back to the homepage and not to the homepage of your search results
Kagi is my absolute favorite.
Ok cool I saw this one— is it really like the old internet?
Like others have said, it’s not “old internet” per se, but they are big proponents of the “small web”; that is, independently-run, focused sites with good content and a purpose other than ad clicks. In that sense, it reflects what made the old internet good.
I use Kagi. It’s not especially like the Internet at some long-ago date, other than maybe than in that they run a search engine for Usenet archives.
I use Kagi for privacy reasons. They run a no-log, no-profile, no-ads setup, which is what I’d been wanting for some time.
If you’re looking for some kind of retro Internet experience, maybe grab a gopher client and look at gopher servers, or an IRC client, telnet to a MUD, use an FTP client to connect to an FTP server. All of those still have active servers and haven’t changed much, so it’ll still give a comparable feel.
I also recommend Kagi. I wouldn’t say it’s like the “old internet” but it works and it has amazing features that every search engine should have.
There are only 2 relevant web crawlers; Google’s and Microsoft’s. All queries from every search engine goes through these two crawlers either directly or through a middle layer of obfuscation.
The issue is that the internet is too large to index. This has been a known emerging issue for a long time. This is the real reason search sucks. It is not deterministic because it cannot be, but therein lies the issue. Without deterministic unbiased information, democracy is dead. And so search sucks. No one has been able to find a solution for efficient access to enormous databases like this except through the methodologies behind AI. At least not for real time search queries.
The issue is that the internet is too large to index.
It’s really not. At least, not yet. It’s a large part of why it isn’t done, but it’s not the only one, and I’d argue, not even the main reason it isn’t really done.
A complete crawl with meta data of the internet in 2025 is only 424TiB. For comparison, my 1000$ home setup can handle about a tenth of that(in storage at least). The hardware to maintain a single database of the internet with metadata could cost under $100,000, easily.
Dave, your comment about it costing a billion to run Bing or Google might be true, but it is completely unrelated to the realities of running a small search engine and has everything to do with the fact that they are Google and Microsoft products respectively.
The real issue isn’t the physical size of the internet, it’s much more likely to be the complexity of making a search algorithm that can compete with the 75 billion seo market that wxists to break search engines.
Original comment said in good faith, but from sketchy long term memory of stuff I’ve come across. It seems like it was in a Lex Friedman or similar podcast at some point, but from some time in the last 3-10 years. I may have conflated or misunderstood, as I am not experienced with such complexity. I seem to recall it coming up around the time several astronomers were speaking publicly about issues with processing large amounts of data and soliciting solutions. I just recall wondering why search started to suck around 2017, and putting the pieces together when I heard this. Now, in retrospect, it seems much of the changes were also adversarial for rival AI training after the Transformers paper. At least, looking at how search results are salted now, and the way images are selected for search is absolutely adversarial for AI training datasets… but that is all I know, and should be taken as friendly neighborhood water cooler talk, always with the best of intentions.
I think most startup search engines use Google/bing because it’s free/way cheaper than running their own database, not because it’s impossible. It also likely sidesteps a lot of the seo bullshit simply because Google/bing have more experience working around it
So like, short term/small size its cheaper and straight up easier to piggyback off of the big two companies, rather than manage your own data set. Long term, if you get popular enough to be noticed, I expect that the seo business would wreck any selfhosting search engine startup company’s results pretty regularly.
I once read that running a search crawler costs upwards of a billion dollars a year. Anyone other than Microsoft or Google running their own search index are either not getting a wide spread of the internet or they are using their own index to supplement Google or Bing results.
That’s like saying that it’s impossible to run a car manufacturing company without 100 billion because that’s how much Ford spends on their car manufacturing processes. It makes no sense.
Yes, making an original search engine is hard, just like making trucks is. But that doesn’t mean that running either one requires billions of dollars to do.
Common crawl is a nonprofit that regularly shares free copies of every internet page with metadata, and it damn well doesn’t take billions to do it either. https://commoncrawl.org/
I didn’t know there were paid search engines. Could just list ANY of them for us?
I think I only know of Kagi. Are there any more?
I think qwant is one? Or maybe it’s just European and not paid.
It’s French and free.
It’s not paid but you could try self hosting searxng. Since it’s FOSS there isn’t the conflict of interest issue, although be aware it’s a meta search engine so relies on the quality of common search engines. And your IP could be traced if it’s not connected through a commercial vpn just like using the search engines directly.
I’m using MetaGer and it’s pretty cool. They also have a nice OpenStreetMap interface maps.metager.de.