at://did:plc:44ybard66vv44zksje25o7dz/app.bsky.feed.post/3lxveszcqok2j
Back to Collection
Record JSON
{
"$type": "app.bsky.feed.post",
"createdAt": "2025-09-03T00:29:18.010Z",
"langs": [
"en"
],
"reply": {
"parent": {
"cid": "bafyreibubccaokuz6oh6edjpg5knadxxd6wizyu3awj2qilqvucdok6xhu",
"uri": "at://did:plc:7zwztfs4yywggefayx7oid35/app.bsky.feed.post/3lxvdfhy6222l"
},
"root": {
"cid": "bafyreif2gdxk4s4zjantsuxb5zg6qyh67t34anav6xxgor23mavej4qssm",
"uri": "at://did:plc:7zwztfs4yywggefayx7oid35/app.bsky.feed.post/3lxvd3snag22l"
}
},
"text": "pretty interesting!\n\nbootstrapping a large *and* quality URL crawl list is hard, and a barrier to entry. you can spider top stuff from, eg, Alexa top million, wikidata, and commoncrawl. but true long tail is important and hard: \"not linked-to but good\"\n\nsometimes via old tweet links, reddit, etc"
}