遗忘录

Kana

2023-05-17 18:14 (已更新 @ 2025-05-31 15:11)

有很多不记下来就会渐渐遗忘的东西，懒得一一整理，总之都记录在这里。

对 Unicode 的 Han Unification 的 Parody¶

gioele ( https://news.ycombinator.com/item?id=8041288 ):

Please understand that Han unification is _the_ problem. It is clean that Unicode needs to realize that the Han unification is wrong and accepts what the native writers of those languages think about their scripts.

To make the problem more understandable to the people that are used to alphabetic scripts, suppose that tomorrow an Asian committee starts creating Uniword, a repertoire that maps complete words to numerical IDs. At a certain point they get to “colour”.

Uniword committee: Well, that word shares meaning and origin with the other word “color”, for which we have already a codepoint, so we will encode them under the same codepoint.

GB, Australia and Canada: Ehi! No! To us those are different words; especially, we do not want Mr. Colours to appear as Mr. Color.

Uniword commitee: No problem, just add some out-of-band information like “nationality” or “<span lang=’en-GB’>”

“colour”-people: that will not work, there are so many cases in which this can go wrong. Whenever I copy a field from a DB I also have to extract this extra information?

Uniword: yes, that is the problem? C’mon!

“colour”-people: but do you need to do that in your applications?

Uniword: no, we have one code for every single word in our languages, including codes for very old languages that exist only in two palimpsests.

“colour”-people: and why cannot we have the same level of granularity?

Uniword: because you have too many words!!! And we started we had only 100k available integers.

“colour”-people: and now?

Uniword: now we have 2^32. But, yeah, that is not the point; just do how we suggest. This dialog is getting to long.

“colour”-people: “dialogue”, please.

Заезжий музыкант¶

一首俄文歌。虽然完完全全不懂俄语，但摘选网络上的只言片语：

Судьба, судьбы, судьбе, судьбою, о судьбе 一连用了俄语六个词格中的五个来表达 “命运”，不同的词格代表了不同语境不同场合下的 “命运”。

「死ね」はやめようよ！¶

出自《花开伊吕波》的松前緒花。

「死ね」はやめようよ！それでさ　あたしが本当に死んだらどうするのきっと後味が悪いよもっとちゃんと言葉を選んだほうがいいよ

之所以想起这一句是因为看到群聊里有人真的这么说了……

酒后胡言/酒后吐真言¶

在学姐学长的毕业聚餐之后发生了一些不太愉快的事。总之：（用了第二人称请不要介意）

就算你认为借酒可以让你有勇气说出一些事，但酒也仍然会使可信度/逻辑性降低。
酒后的逻辑性降低是极为明显的。
酒后易情绪失控。

一些网上常见论述的辩驳¶

对于某种现象的新闻等等，偏偏用“明明还有其它的什么什么严重得多”来反对当前议题的。

不知道怎么辩驳，不要去为难自己的血压吧。引用 Hacker News 的一句评论：

The good ol’ “there are many contributing factors, so how dare you try to solve one of them” argument.

总之需要意识到对方在回避论证。

编辑器九宫格¶

/	守序	中立	混乱
善良	`nano`	`code-oss`	`sed -e ...`
中立	`gedit`	双击	`wine notepad`
邪恶	“如何退出 vim”	“Emacs Evil”	`alias vim=emacs`

（顺便一提我现在是混乱邪恶方案（

Facebook Threads¶

最近各种社交媒体平台都出现了各种状况，例如：

Twitter 开始向 API 收费，推出付费蓝勾标签，并在最近限制了帐号一天内所能访问的推文数量：
Reddit 考虑 API 收费，基本上直接斩断了第三方客户端的活路：
- Reddit 将对利用其数据训练 AI 收费
- Reddit 准备杀死第三方客户端
Facebook 推出 Threads，该平台将会支持 ActivityPub 协议：
- Meta 发布其微博客应用 Threads

尤其是最后的 Threads，因为和联邦宇宙相关，在至少是我关注的圈子里引起了非常多的讨论。我个人虽然模模糊糊有些看法，但是基本都是些散乱的观点，也没什么逻辑支撑，总之汇总在这篇文章里。

深度学习与内容平台¶

前段时间讨论得很热烈（讨论现在其实也未尝停息）的就是深度学习的各种版权以及道德问题。下面是一些随机掉落的想法，之后的内容大概也就是这种形式。（或者说，就是想法太多我懒得把它们整理成逻辑连贯的形式了。）

反驳“画家和 AI 一样都是从其它画作中学习而来的，前者的是 fair use 所以后者也是”的观点

画师和 AI 有很大不同点，我惊讶于把这两者相提并论并以为自己逻辑完善的人的逻辑能力。
- 自由意志：人类的自由意志还可以讨论，但目前的 AI 根本没有讨论的意义。
- 人权：无许多说。
我个人的观点是：如果 AI 发展到了可以考虑它们是否具有人权的地步，那么上述的命题就还有讨论的空间（当然，那时我们就有更好的比喻对象了——奴隶制）；而现在的话，谁知道呢？
“深度学习是压缩”的一种学术观点：

https://zhuanlan.zhihu.com/p/634194171

图片格式方面，PNG 格式是无损压缩，JPG 格式是有一定损失的压缩，但无论如何，把一张图片改了格式再盈利发布该侵权还是会被判定为侵权的。问题就是，有损压缩损失到了什么程度才能被判别为侵权？深度学习的压缩肯定比所有其它专门用于储存图片的文件损失多得多，这方面如何判定？

ActivityPub¶

名称¶

最开始似乎大多数人把 Mastodon 等效于联邦宇宙了，这就让其它软件如 Misskey, Pleroma 的用户很尴尬（虽然我也常用“嘟文”这个词，也基本能理解“象友”这一词的指代范围）。最近似乎越来越多人开始使用联邦宇宙这一词，这是好趋势，但是大多数人还是不清楚 ActivityPub 协议的“宇宙”并不等于联邦宇宙。

“联邦宇宙”并没有指定什么协议，只要一周协议可以“联邦”——去中心化，那么它就可以被称为联邦宇宙的一部分。 ActivityPub 是，Mastodon 曾经支持的 OStatus 也是，diaspora* 也是，Matrix 等软件严格说来也是。

ActivityPub 其实还是比较受人诟病的一种协议，因为它非常宽松。放在人类世界里，都是中文，各种方言也许还能够相互听懂，但放到需要严丝合缝的计算机世界里，期望一个宽松的协议正常工作基本就是天方夜谭。所以你会看到各种 ActivityPub 软件都会面临各种兼容性的问题，不可能彻底解决。（目前一个公认的标准基本就是 Mastodon，毕竟它的的确确就是最流行的软件。然而它基本没有文档记载相关内容，而我也相信每一个 ActivityPub 开发者都会有被逼到要去读 Mastodon 源码的经历。）

一个玩笑¶

https://pod.geraspora.de/people/4ce5b01a9e3a1b18d700000a

Ubuntu禁用登陆时显示系统信息的banner¶

来源： https://ubuntuforums.org/showthread.php?t=1449020

That irritates the heck out of me too. It is controlled by the “pam_motd” module. You can disable the motd display (and potential delay) by editing these two files: /etc/pam.d/login, /etc/pam.d/sshd, and comment out the line that has “pam_motd” in it. I only tested the secure shell change since gdm is running, but it worked. Tested on Lucid Beta.

也就是说，修改 /etc/pam.d/login 和 /etc/pam.d/sshd ，把里面的含有 pam_motd 的行都注释掉就行了。