Internal Development Central
This page is for Langwiki developers and contributors all around the world. Of course, in this initial stage, we are a small group of creative and intelligent people.
Here you will find key engineering and editorial tasks that have been planned and worked on. You are welcome to make suggestions and comments. If you are interested in any task, you are welcome to work with us.
Areas under development [编辑 | 编辑源代码]
- Language entry-point design and construction
- Create attractive and easy-to-use entry points for studying a language
- Initial languages
- Mandarin
- Add English tutorials ⌛️
- Manchu
- Cantonese
- Minnan
- Middle Chinese
- Mandarin
- Viewing languages
- Voice recording
- To fulfill the goal of showing core vocabulary of languages and dialects, we will make voice recording easy to use, and encourage users to respond to pronunciation requests.
- Audio request browser ⌛️
- Introductory Chinese (Mandarin) tutorial
- English version (The goal is to really help foreigners study Chinese.)
- Chinese version (the tutorial serves as a grammar textbook and template for writing tutorials for Chinese dialects)
- Online dictionary
- Chinese character pronunciation dictionary (古今中外汉字读音查询)
- Add General Chinese Phonetics(新通用拼音)for dialects: Cantonese (粤), Minnan (闽南), Shanghainese (吴语上海话). ✅
- Add General Chinese Phonetics(新通用拼音)for Korean ✅
- Add case conversion tool (upper, lower, proper cases) Thanks to 叶剑飞. ✅
- Convert Japanese text to Hiragana, Kitakana and Romanji ✅
- Convert between the two forms of General Chinese Phonetics(新通用拼音). The two forms are Extended Latin and Basic Latin. ✅
- Convert pure Korean text to mixed Hangul-Hanja text. ✅
- Conversion between Manchu and latin ✅
- Reversed lookup (from phonetics to Chinese characters) ⌛️
- Add Mindong (闽东) dialect pronunciation ✅
- Add Suzhou (苏州) dialect pronunciation ✅
- Manchu word lookup ✅
- Revise Guangyun pronunciation v1.3
- Improve Ming scholar's pronunciation (韵略易通)
- Prepare data for Ming folk pronunciation (中原音韵)
- Provide GCP for Hakka
- User-specific memoir (生词本) and adaptive review scheme ✅
- Web-based version ✅
- User database ✅
- Review scheme ✅
- Document the phonetic system (General Chinese Phonetics)
- New dictionary framework
- Multiple languages ⌛️
- Design ⌛️
- Multiple dialects
- Editable
- Open API (provides query service for 3rd party applications and scripts) ✅
- Needed by the chatbot ✅
- Chinese character pronunciation dictionary (古今中外汉字读音查询)
- Chatbot ⌛️
- Core service
- Simple instructions and dictionary query ✅
- Push daily posts
- User categories
- QQ interface ✅
- Wechat interface
- Telegram support
- Features
- Traditional - Simplified conversion ⌛️
- Single char: search variants and combine
- Words: chars converted to the required form of each dictionary
- Traditional - Simplified conversion ⌛️
- Bot content
- Wiki access (@bot) ✅
- Grabbing Japanese lyrics ✅
- Grabbing Korean lyrics ✅
- Grabbing Cantonese lyrics ✅
- Core service
- Chat system
- Chat server ✅
- Chat client ✅
- Bot API
- Customized fonts
- Customized IME
- Voice synthesis
- Start with a dialect ✅
- Phoneme editing document ✅
- Middle Chinese
- Article on Middle Chinese pronunciation
- Pending final pronunciation
- 语言和方言信息采集
- 词汇采集(初始词典、用户新增及更新)
- 语音采集(页面基本词汇;词典词汇)
- 列表显示被请求语音,鼓励发音
- 语法信息采集
- 收录语法书籍
- 收录循序渐进教程
- 语言维基设计通用语言信息收集卡,每张卡片就一个语言用法提问,用户填写例句和简要说明。覆盖语法、语义两方面信息。
- 文学、语料收集(提供操作界面、索引、内容模版等)
- 双语语料收集(提供操作界面、奖励机制)
- 分区管理、团队运维
- Font and IME resources
- Popular fonts (Hanamin) ✅
- Improve open fonts (Flash-based font editor) ⌛️
- System improvement
- Allow visual editing for Blogs and Templates ✅
- Allow embedded videos ✅
- Fix audio player ✅
- Performance tuning ⌛️
- Cluster construction
- Mobile application
- Langwiki browsing and editing
- Online dictionary access
- Update Android app
- Add conversion functionalities
- Update memoir functionalities
Contact us: QQ group 562374735(语言维基 新通用拼音)
Detailed planning for Manchu chatbot [编辑 | 编辑源代码]
指令集模型 [编辑 | 编辑源代码]
基本想法是 设计一个指令集
使用者
- 在指令集范围内 每输入一个指令 就会输出相应的内容。
- 可以直接查询单词。单词也被看作是一种指令。
- 点歌功能。
- 如果使用者输入了指令以外的内容,等同于“invalid”指令。
- 开启报告模式后,输入的内容及相关信息(所在群以及报告者的QQ号码)会被编号(从1开始)后存档并传送到后台编辑者。(输入的具体内容直接传送,不存档。只对所在群以及报告者的QQ号码进行存档。)
编辑者
edit: 修改某一指令所输出的内容 若指令之前不存在 则创建此指令
delete: 删除指令 即移出指令集
broadcast: 对所有使用者广播
reply: 回复报告,其内容会传送到指定的使用者。
编辑者可以即时接收来自前台使用者的报告,这些报告的格式是:
【编号】xxx(显示编号而不是具体的群号码和QQ号码,以方便编辑者进行回复)
【内容】(报告的内容)
指令格式 | 指令权限 | 功能 | 指令举例 |
---|---|---|---|
menu | 显示菜单 | menu | |
满语单词 | 查询单词 | abka | |
%edit(某一指令)更改后的输出文字 | 认证的编辑者 | 更改某一指令所输出的文字 | %edit(menu)欢迎使用满语机器人! |
%delete(某一指令) | 认证的编辑者 | 删除指令 | %delete(xun) |
%broadcast[广播内容] | 认证的编辑者 | 对所有使用者广播 | %broadcast[我们推出了新的功能!] |
%reply(编号)[回复内容] | 认证的编辑者 | 回复某一编号的报告信息 根据编号档案在某群 @某个成员 |
%reply(3)[感谢建议] |
The commands that can only be used by 认证的编辑者 统一以“%”开头,它们对普通使用者是无效指令。
辞书计划 [编辑 | 编辑源代码]
辞书资料上传至一个特定的服务器,可有网页查询、Chatbot、(将来的)app查询等多个用户端(出口)。为了保护资料的准确性,只有已认证的编辑者才能编辑辞书资料。用户可以通过指定的反馈通道向编辑者报告错误,而编辑者以Chatbot编辑指令(%edit)的形式修正错误,修正完成后所有用户端(出口)都会自动更新。除此之外,还应该建立一个内部文件纪录所有的修改操作历史,这个修改纪录可以提交到东京大学。
Database [编辑 | 编辑源代码]
Content | Language Code | Maintainer | Note |
---|---|---|---|
Imperial Manchu Thesaurus | mnc | authorized | Must be kept identical to the original book |
Sin Man-Han | mnc | authorized | Must be kept identical to the original book |
Si-Han | sjo | no edit | Released by official organization |
New Manchu | mnc | anyone | A wiki-style online dictionary |
Standard Taiwanese Mandarin | goyu | no edit | Copyrighted by Department of Education, Taiwan |