Ruby Jam 2025/10 月場 (October 2025 Session)

(English Version Below)

Ruby Jam: 台灣 Ruby 愛好者的互動式聚會 💎

Ruby Jam 是一個每月舉辦的技術交流活動,為 Ruby 社群提供一個交流、學習和成長的平台。

在這裡你可以...

  1. 開源貢獻:和志同道合的夥伴一起貢獻開源專案
  2. 解決問題:帶上你在工作或 side project 中遇到的難題,讓我們一起集思廣益
  3. 經驗分享:我們提供 lightning talk 短講的舞台,歡迎你分享 Ruby 相關的技術心得和經驗
  4. 學習成長:如果你是 Ruby 初學者,歡迎來聚會向資深開發者學習
  5. 擴展人脈:歡迎來這裡認識新朋友,以及和舊朋友敘舊

分享主題:

🧐 像解析程式碼一樣處理台語:用Ruby實作台羅拼音的三階段分析法 🧐

講者:鄧慕凡,台灣的 Ruby 傳教士一枚。

簡介:

當慕凡的團隊投標教育部「臺灣台語語料庫應用檢索系統」時,竟然沒有其他廠商競標——因為台語漢字與台羅拼音的自動對齊被業界認為「技術難度太高」。

台語文本處理的核心挑戰在於雙語對齊。例如將溫度二十九至三十三度,落雨機率六十三%與對應的台羅拼音un-tōo lī-tsa̍p-káu tsì sann-tsa̍p-sann tōo, lo̍h-hōo ki-lu̍t la̍k-tsa̍p-sann %精確分割為10對對齊的詞組。這涉及複合詞邊界、數字變音、標點對應等複雜問題。

團隊採用三階段分析方法:詞法分析使用 40+ 條正規表達式規則處理tokenization、語法分析建立動態對齊演算法、語義分析實作平衡性檢查與錯誤修復。

這套系統目前服務提供詞彙檢索、語法點分析及教材資源功能。透過Ruby的表達力,將複雜的語言學規則轉化為可維護的程式碼,展現開源技術在本土文化保存上的價值。

 

到這裡報名 Ruby Jam 閃電講


期待在聚會中見到你!

Ruby Jam: An Interactive Gathering for Ruby Enthusiasts in Taiwan 💎

Ruby Jam is a monthly technical meetup that provides a platform for the Ruby community to connect, learn, and grow.

Here's what you can do at Ruby Jam:

  1. Open Source Contributions: Collaborate with like-minded individuals to contribute to open-source projects.
  2. Problem Solving: Bring your work or side project challenges, and let's brainstorm solutions together.
  3. Experience Sharing: We offer a stage for lightning talks! Feel free to share your Ruby-related technical insights and experiences.
  4. Learning & Growth: If you're a Ruby beginner, you're welcome to learn from experienced developers at our meetup.
  5. Networking: Meet new friends and catch up with old ones!

Featured Talk:

🧐 Parsing Taiwanese Like Code: A Three-Stage Analysis of Tâi-Lô Romanization with Ruby 🧐

Speaker: Mu-Fan Teng, a Ruby evangelist from Taiwan.

Abstract:

When Mu-Fan's team bid on the Ministry of Education's "Taiwanese Hokkien Corpus Application and Retrieval System," they faced no competition. Why? Because the automatic alignment of Taiwanese Han characters and Tâi-Lô Romanization was widely considered "too technically difficult" by the industry.

The core challenge in processing Taiwanese text lies in bilingual alignment. For example, the Han-character sentence below and its Tâi-Lô romanization must be segmented accurately into 10 aligned pairs:
"溫度二十九至三十三度,落雨機率六十三%"
"un-tōo lī-tsa̍p-káu tsì sann-tsa̍p-sann tōo, lo̍h-hōo ki-lu̍t la̍k-tsa̍p-sann %"
This involves complex issues such as compound word boundaries, sound changes in numerals (sandhi), and punctuation correspondence.
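To make the example above concrete, here is a minimal Ruby sketch that reproduces the 10 pairs. It is not the team's algorithm: it assumes each Han character corresponds to exactly one Tâi-Lô syllable and each punctuation mark to one character, and the helper names `tokenize_tailo` and `align_by_syllables` are hypothetical.

```ruby
# Sketch only: assumes one Han character per Tâi-Lô syllable and one
# character per punctuation mark. Helper names are hypothetical.

# Split a Tâi-Lô string into hyphenated words and single punctuation marks.
def tokenize_tailo(text)
  word = /[\p{L}\p{M}\d]+(?:-[\p{L}\p{M}\d]+)*/
  punctuation = /[^\s\p{L}\p{M}\d-]/
  text.scan(Regexp.union(word, punctuation))
end

# Pair each Tâi-Lô token with as many Han characters as it has syllables.
def align_by_syllables(han, tailo)
  chars = han.delete(" ").chars
  tokenize_tailo(tailo).map do |token|
    syllables = token.split("-").size
    [chars.shift(syllables).join, token]
  end
end

han   = "溫度二十九至三十三度,落雨機率六十三%"
tailo = "un-tōo lī-tsa̍p-káu tsì sann-tsa̍p-sann tōo, lo̍h-hōo ki-lu̍t la̍k-tsa̍p-sann %"

pairs = align_by_syllables(han, tailo)
pairs.each { |han_part, tailo_part| puts "#{han_part}\t#{tailo_part}" }
```

On this sentence the sketch yields 10 pairs. Real text breaks the one-character-per-syllable assumption often enough (contractions, loanwords, Arabic numerals) that a production system would need far more rules, which is presumably where the difficulty described in the abstract comes from.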

The team adopted a three-stage analysis method:
• Lexical Analysis: Using over 40 regular expression rules to handle tokenization.
• Syntactic Analysis: Developing a dynamic alignment algorithm.
• Semantic Analysis: Implementing balance checks and error correction.
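
As a rough illustration of how the first and last stages might look in Ruby, the sketch below drives a lexer from an ordered rule table and then runs a simple balance check. Everything here is an assumption for illustration: the two rules stand in for the 40+ mentioned above, `lex` and `balanced?` are hypothetical names, and the check only detects a mismatch without attempting any correction.

```ruby
require "strscan"

Token = Struct.new(:type, :text)

# Hypothetical rule table; rules are tried in order at each position.
RULES = [
  [:word,  /[\p{L}\p{M}]+(?:-[\p{L}\p{M}]+)*/], # hyphen-joined syllables
  [:punct, /[^\s\p{L}\p{M}]/]                    # any other single character
].freeze

# Lexical stage: turn a Tâi-Lô string into typed tokens.
def lex(text)
  scanner = StringScanner.new(text)
  tokens = []
  until scanner.eos?
    next if scanner.skip(/\s+/)
    rule = RULES.find { |_, pattern| scanner.match?(pattern) }
    raise ArgumentError, "no rule matches at offset #{scanner.pos}" unless rule
    tokens << Token.new(rule[0], scanner.scan(rule[1]))
  end
  tokens
end

# Balance check: syllables plus punctuation marks should equal the
# number of Han characters (assumes one character per syllable).
def balanced?(han, tokens)
  expected = tokens.sum do |token|
    token.type == :word ? token.text.split("-").size : token.text.size
  end
  expected == han.delete(" ").size
end

tokens = lex("tōo, lo̍h-hōo")
puts tokens.map(&:type).inspect      # [:word, :punct, :word]
puts balanced?("度,落雨", tokens)    # true
puts balanced?("度落雨", tokens)     # false: the comma has no counterpart
```

Keeping the rules in a table rather than in nested conditionals is one way to keep a large rule set readable, since adding a rule is a one-line change and the order of the table documents rule priority.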

The system currently provides vocabulary search, grammar-point analysis, and teaching materials. By leveraging Ruby's expressiveness, the team turned complex linguistic rules into maintainable code, demonstrating the value of open-source technology in preserving local culture.

 

Schedule

Time Content
6:30 - 7:00 Check-in
7:00 - 7:30 Open Discussion: Open Source Contribution, Problem Solving, Experience Sharing
7:30 - 8:30 Sharing
8:30 - 9:30 Open Discussion

Event Details

  • Time: Last Tuesday of every month, 6:30-9:30 PM
  • Location: PicCafe in PicCollage (3F, No. 102, Guangfu South Road, Da'an District, Taipei)

Do you have a topic to share? Sign up here for a Ruby Jam lightning talk


Looking forward to seeing you at the meetup!

 

特別感謝贊助 Special Thanks to Our Sponsors!

 

   

Ruby Central

PicCafe in PicCollage / 台北市大安區光復南路102號3樓

Event Tickets

Ticket type: 一般票 / Entrance
Sales period: until 2025/10/28 21:30 (+0800)
Price: Free