張耀仁: 9月 2025

星期日, 9月 28, 2025

River crossing puzzle 3

problem statement

17 step solution 黃品勛

17 step solution 王智弘

Why LLM performs poorly with reasoning and planning

Claude Sonnet 4.5 reasoning did not work (It worked very very hard.)

Claude Sonnet 4.5 gen BFS but did not work (misinterpretation of problem)

Code gen (Python)

code gen python A* search for ::River Crossing Puzzle ...

Python on colab

Use REACT to visualize the steps

change to js

run js

use React to visualize the steps

ChatGPT 5 create BFS to solve it. 17 step solution, visualize, visualizer

ChatGPT BFS

If BFS option is trurned off, ChatGPT has a hard time doing it manually.

ps:

I really like ChatGPT o1 reasoning performance.

Monologue is best.

reasoning mode in o1

Talking to itself

Running into difficulties? Cute

hours? Ha Ha Ha

Follow up to #3 Constrained Optimization

1. 繪製專業工程圖

為何你的AI得不到專業級的工程圖版型?

專業版型需要時間調整(熬煮)，你們第一次做出來通常很陽春。還有因為你們沒有付費，所以分配到很少的算力，只能做陽春版。所以初學者先用老師版型customize就好，要做出好的版型，要跟AI好好溝通很多回，而且要有足夠的算力($$$)。

如何套用

你需要套用自己的答案到版型裡面。這題要先求解，再把文字結果放進三種版型視覺化。套用時，先按下 customize鍵，等系統跑完版型，輸入你的文字資料，系統就會自動為你的資料上版型。

如果你要使用現成版型(artifact type) 做行程視覺化，步驟如下:

找到版型程式（作業工程圖下方連結）
按介面右上方 customize 鍵
輸入你的文字版時間排程
得到專業工程圖

2. 推理出正確時序

為什麼晚餐大作戰問題ＡＩ有時會出錯？

當AI面對包含多個限制條件的複雜問題時，確實存在難度。

在注意力機制(Attention mechanism)中，模型需要同時關注多個條件，並且每個注意力分配都是基於概率的。當條件數量增加時，要同時滿足所有條件的概率會顯著降低，這就像是連續投擲硬幣並期望全部正面朝上一樣——條件越多，全部滿足的概率越低。

這種情況下，模型可能會:

忽略某些條件
錯誤理解條件之間的關係
在滿足一些條件的同時違反其他條件

如果您有一個包含多個複雜條件的問題需要解決，我建議將問題分解成較小的部分，逐步處理每個條件，使用推理模式這樣會更容易得到準確的解答。

illustration

要避免錯誤需要打開推理模式（如何開啟參看附圖），在直覺模式下容易忽略題目給定的限制條件。

請打開推理模式

1. 推理是把一件原本複雜的任務，拆解成一連串小型的步驟，就像煮一道菜，食譜會列出一串的步驟，讓你按部就班煮出來。這一連串拆解後的步驟，我們稱為思維鍊(Chain of Thought)，因此當你使用推理模式的時候，會看到系統顯示每一個步驟在做的事。

2. 為什麼非推理型 LLM不擅長規劃(如搶救晚餐大作戰)或是處理複雜多重限制條件的難題(如渡河問題)？

一般來說，規劃是一項複雜的認知任務，需要幾個關鍵能力：

理解因果關係和時間關係
維持連貫的長期目標
適應不斷變化的環境
管理資源和限制

Remember to use reasoning mode to reduce such constraint violation errors. If you use Claude, turn it on in the menu by checking Extended Thinking. If you use ChatGPT, it will use Thinking mode by itself.

參考正確時序: Link 1, Link 2

3. 走迷宮

Good user interface that can generate a random maze, and allow users to add or delete walls.

建議顯示出動態找出路徑的過程，尤其是探索了哪些點。例如:

https://claude.ai/public/artifacts/502718bb-d96b-4211-b1d4-0671ae9892f0

4. ChatGPT 分享

Reminder: Cannot open your ChatGPT HTML or image files (jpg, gif, png). It is for your eyes only and not sharable.

How to share HTML from ChatGPT:

You can copy the HTML to Claude. Then prompt "render." After that, you can share the resulting link with me.

ChatGPT 只能分享一次，更新後的內容無法看見。必須要更新分享才能看到新的修改。

星期六, 9月 27, 2025

晚餐接機大作戰(半導體廠製程排程): Part II A*

Scheduling Parallelism in Plans problem (Eng) (from source)

Generate Python code base

using such as A*, BFS, Dijkstra, ...

or let AI make its decision

To run Python, use Google Colab

Visualize results as Python can generate complex states (步驟參考)

ChatGPT Python*, Colab, Visualization，Python*- Colab，Python** Colab，Python** Colab

Claude Python Colab Visualization，Python Colab＊ＭＩＬ Colab Viz

Gemini 2.5 Pro & Grok 4 Colab

最佳解最高的平行度，throughput 高，idle time 減少，critical path 短

次佳解因為James 似乎太奔波了，剛從機場回來，馬上得回去剛才的機場接人

在工廠等於也就是機器人來回運動，但是沒有產出

次佳解 Sarah 好忙，先烤火雞，換手給機場回來的 James，然後去機場接Emily，James 接手烤火雞，idle time 過長，而且造成沒有必要的 switchover

Merge diagrams in a synchronized dashboard

狀態圖(State Diagram)

(interactive timeline) artifact state diagram

(interactive timeline) artifact map version

state diagram with aligned timeline

State diagram with sync timeline (map version)

看板圖 (Kanban)

(interactive timeline)

流程圖(Flow chart)

(interactive timeline) artifact state diagram

flow

dashboard

星期五, 9月 26, 2025

Solving river crossing puzzles (半導體廠情境)

半導體廠情境

share artifact

Key Features Reproduced:

🔬 Five Fab Systems: Alice (photoresist), Bob (plasma etch), Carlos (CVD), Diana (metrology), Elena (ion beam)

🏭 Three Chambers: Prep staging, transfer chamber, and active processing

⚙️ Process Control: Systems marked with ⚙️ can control transfers between chambers

🚫 Critical Constraints:
- Photoresist + CVD incompatible (contamination)
- Plasma + Metrology stay coupled (real-time feedback)
- Ion beam + CVD interference (sensor conflicts)
其他運用領域：機器人，供應鏈管理，自駕車，半導體廠製程，機器人人機協作安全規劃
這個題目對AI 為什麼有點難? 多個限制條件的複雜問題
推理模式是甚麼? 推理 vs. 直覺

版型

星期三, 9月 24, 2025

EX#3 AI for Constrained Optimization

課堂練習

Deadline: This Saturday at 23:59

Send all the share links to me chang212@gmail.com by email with subject EX#3 [your id, your name]

搶救感恩節晚餐大作戰講義題目

Plan a feasible schedule. Then verify the schedule. Then visualize your results.

繳交範例

ChatGPT validation, A* Macro*, Visualization

Claude validation, reasoning*, Viz

檢查你的感恩節計畫是否正確

Use validator to check up the plan

validator

如果你希望驗證結果可輸出為app

1. 先進入 validator

2. 按下右上方 customize 鍵

3. 接下來的操作步驟

4. 結果輸出為app

可使用狀態圖(State Diagram)，看板圖 (Dashboard), 流程圖(Flow chart) 進行視覺化

(interactive timeline)

flow

Hint

優化解可以達成所有人四點前到家

Ｈint

如果你要使用現成版型(artifact type) 做行程視覺化，步驟如下

0 找到版型程式（工程圖下方連結）

1 按介面右上方 customize 鍵

2. 輸入你的文字版時間排程
3 得到專業工程圖

2. Try smart search. Prompt AI models to generate A* search Algorithm

A* search

A* synced with animation

星期四, 9月 18, 2025

推理模式研修題

Devise a strategy for the dog in the picture to retrieve its ball that fell into the pond without getting its body wet, with no humans nearby to help."

This is a problem-solving or creative thinking exercise asking someone to come up with a plan for how a dog could get its ball back from water without getting wet and without human assistance.

Animate your strategy in svg or in 3d.

1. 方法必須合乎常理(例如狗不能飛，不會使用打撈網)

2. 需排除幻覺，不可使用畫面中不存在的器物

3. 答案必須滿足題目中的規定，例如狗在完成目標取回球的過程中都不可弄濕自己

3. 答案除了文字敘述，必須搭配動畫呈現(漫畫，影片亦可)

LLM 有兩種模式推理 vs. 直覺，你用對了嗎?

research

星期三, 9月 17, 2025

AI半導體廠製程優化

半導體廠製程情境

晚餐接機大作戰Scheduling Parallelism in Plans problem (Eng) (from source)

SVG

dinner scheduling (artifact)

EUVL

semiconductor fab scheduling (artifact)

LLM 遭遇多個限制條件的複雜問題

專注力模型

Gemini 2.5 Pro 推理，Claude Sonnet 4 流程圖 feasible, not optimal，*

Gemini 3.0 Pro 推理，optimal

Claude Sonnet 4 直覺模式流程圖，no feasible

Claude Sonnet 4 推理模式 (extended thinking)，流程圖 no feasible，＊

Claude Opus 4.5 optimal

ChatGPT 5, 流程圖 feasible, not optimal，＊

Grok 4 ，流程圖＊

Decision Tree (share)

location-based time-evolving diagram (space-temporal transformation )

EX#2 Reasoning vs. non-reasoning AI

本次作業講義

建議工具

使用 Claude Sonnet 4

使用 ChatGPT 5

使用 Gemini 2.5 Pro 免費額度最高 1M tokens

使用 Grok 4

課堂練習

Deadline: This Saturday at 23:59

Send all the share links to me chang212@gmail.com by email with subject EX#2 [your id, your name]

How to share a Claude link

How to share a ChatGPT link

How to share a Grok link

How to share Gemini Link

1. Given 1,2,3,4,5,6 can you give an equation that equals -38? If you got a correct answer, ask how AI made it. If not, ask AI why it failed. 請分析ＡＩ解題策略（推理，直覺，蠻力，內建程式演算...)，如果有錯誤，原因是什麼？

2. 搶救感恩節晚餐大作戰講義。題目不用做

3. 計算面積