• 0 Posts
  • 40 Comments
Joined 11 months ago
cake
Cake day: August 16th, 2023

help-circle



  • You think in Reddit’s 20 year history no one has thought of indexing comments for data science workloads?

    I’m sure they have, but an index doesn’t have anything to do with the python library you mentioned.

    Analytics workflows are never run on the production database, always on read replicas

    Sure, either that or aggregating live streams of data, but either way it doesn’t have anything to do with ElasticSearch.

    It’s still totally possible to sync things to ElasticSearch in a way that won’t affect performance on the production servers, but I’m just saying it’s not entirely trivial, especially at the scale reddit operates at, and there’s a cost for those extra servers and storage to consider as well.

    It’s hard for us to say if that math works out.

    It’s incredibly naive to think that they don’t have a vested interest in identifying organic engagement

    You would think, but you could say the same about Facebook and I know from experience that they don’t give a fuck about bots. If anything they actually like the bots because it looks like they have more users.


  • icydefiance@lemm.eetoADHD memes@lemmy.dbzer0.comNothing but truth
    link
    fedilink
    English
    arrow-up
    8
    ·
    edit-2
    2 months ago

    MAP (minimum advertised price) is often different from MSRP, but otherwise this comment is correct.

    In some industries, like RVs or auto parts, the vast majority of products have a MAP. The manufacturers also have bots that scan the internet for MAP violations, and they’ll blacklist a vendor if they don’t fix the price within a day or two. (Which is really annoying when there’s a false positive and I get blamed for it.)

    I think it’s partly so high volume vendors can’t put smaller vendors out of business by just reducing their margins as much as possible, and it’s partly because the manufacturer doesn’t want their products to look like they’re really cheap. Customers feel better about finding a “great deal” on an “expensive” product.


    1. To compare every comment on reddit to every other comment in reddit’s entire history would require an index, and if you want to find similar comments instead of exact matches, it becomes a lot harder to do that efficiently. ElasticSearch might be able to do it, but then you need to duplicate all of that data in a separate database and keep it in sync with your main database without affecting performance too much when people are leaving new comments, and that would probably be expensive.
    2. Comparing combinations of comments is probably impossible. Reddit has a massive number of comments to begin with, and the number of possible subtrees of those comments would just be absurd. If you only care about comparing entire threads and not subtrees, then this doesn’t apply, but I don’t know how useful that will be.
    3. Programmers just do what they’re told. If the managers don’t care about something, the programmers won’t work on it.







  • icydefiance@lemm.eetoScience Memes@mander.xyzBiology OP
    link
    fedilink
    English
    arrow-up
    4
    ·
    edit-2
    4 months ago

    The other replies are simplifying too much. Just adding or subtracting in a loop would be far too slow.

    A multiplier will find the partial products by using AND gates, and then sum them, which is very similar to long multiplication as they teach you in school. This article explains it pretty well.

    Division is more complicated. It’s sort of done like long division, but apparently that is slow and there’s some magic with two’s complements that can make it faster. Honestly I don’t fully understand it yet.


  • I haven’t said anything on the subject before, because I don’t care very much, but I don’t know what the alternative is supposed to be. It’s not like someone who is that famous can walk into a public airplane without putting her own safety at risk and causing trouble for other passengers.

    It’s also well known that conservatives really hate her because she told her fans to vote and these memes are part of their effort to discredit her. Whether the point has any merit or not, it’s obvious who started the trend and who it’s helping.




  • If your connection is stable, the latency will more or less be the same, but TCP will consume more bandwidth because of acknowledgement packets, making it harder to keep your connection stable.

    On an unstable connection, TCP latency will skyrocket as it resends packets, while UDP will just drop those packets unless the game engine has its own way of resending them. Most engines have that, but they only do it for data that is marked as “important”. For example using an item is important, but the position of your character probably isn’t, because it’ll be updated on the next tick anyway.


  • old webpages (like from the 2000s) are fast and snappy. new webpages take much longer to load.

    This part is true, especially on phones, but those old webpages were not fast at all when they were first published.

    I remember trying to watch videos on dialup Internet. I’d make it start buffering and then go do something else for half an hour before coming back to watch the video. I also remember avoiding certain websites even on DSL because they had 1 or 2 whole megabytes of JavaScript and it took forever to load.

    Increases in bandwidth and processing power has made those old websites seem a lot more performant than they were at the time.

    Today we can put a lot more stuff on our websites than we used to, which makes things slower, but we’re also much more aware of major performance issues. Google uses it as a factor in their ranking algorithm, and offers a pretty intelligent tool to help developers figure out where to optimize their websites, so it’s essential for most companies to optimize for that. Giant companies like Amazon and Facebook can ignore it because they’ll always be on top, but the rest of us are getting really excited about new frameworks like solid and qwik that will make it a lot easier to optimize our sites.