Review or Reviews
테크, 개발, AI, 하드웨어 — 실사용 기반 리뷰와 가이드
최신 글
llama.cpp --split-mode row vs layer on Multi-GPU — Old GPU Edition (1080 Ti, 2080, P40)
When llama.cpp's --split-mode row beats layer on dual-GPU inference, when layer is faster, and why the answer is different on Pascal/Turing without NVLink than on Ampere with NVLink. Real benchmarks on 2× GTX 1080 Ti for Mixtral, Yi-34B, Llama 3.1 13B, with PCIe lane and tensor split notes.
Ollama Dual GPU Without NVLink — Tensor Split on 2× GTX 1080 Ti (Actual Benchmarks)
How to make Ollama actually use both GTX 1080 Ti cards without NVLink — environment variables, tensor split configuration, and real tokens/sec benchmarks for 13B and 30B-class models. Where PCIe becomes the bottleneck, what works versus what just looks like it's working, and how the same setup compares to a single 3090.
Running Modern LLMs on GTX 1080 Ti in 2026 — What Still Works, What OOMs
A 2026 reality check for the GTX 1080 Ti: 11 GB VRAM, Pascal architecture, no FP16 tensor cores. Which modern LLMs (Llama 3.1, Qwen 3, Phi-4, Gemma 3) still load and run usefully, what hits OOM, real tokens/sec numbers from a 1080 Ti, and when it's time to retire the card.
더 보기
Ollama vs LM Studio vs llama.cpp: Honest 2026 Comparison for Local LLM
Definitive comparison of the three most popular local LLM inference engines in 2026. Real performance benchmarks on RTX 3090, feature-by-feature matrix, setup walkthroughs, and a decision framework for picking the right tool for your use case.
Best Ollama Models for RTX 3090 24GB in 2026: Real Benchmarks (Qwen3 vs DeepSeek vs Llama)
Real Ollama benchmarks on RTX 3090 24GB — tokens/sec, VRAM, quality scores for 12+ models. Qwen3-30B vs DeepSeek-Coder-V3 vs Llama 4 head-to-head. Plus RTX 4090 comparison, cloud API cost analysis, and which local LLM to pick for your use case in 2026.
Qwen3 vs DeepSeek R2 vs Llama 4 Local Performance — RTX 3090 24GB Benchmark 2026
Head-to-head Ollama benchmarks on RTX 3090 24GB — Qwen3, DeepSeek R2, and Llama 4 tested across reasoning, coding, creative writing, and multilingual tasks. Real tokens/sec, VRAM usage, quality scores. Which local LLM wins for your use case in 2026?
Home AI Server Build Guide 2026: RTX 4090 vs 3090 vs 5090, Real $4,000 Setup
How to build a home AI server for running LLMs locally in 2026 — RTX 4090 vs 3090 vs 5090 comparison, full hardware list, Ollama setup, $4,000+ real cost breakdown, electricity bills, and what I'd buy differently after 18 months running 24/7.
연구자를 위한 AI 어시스턴트 구축기: OpenClaw로 바이오인포매틱스 워크플로우 자동화하기
반복적인 프로테오믹스 분석 작업을 OpenClaw로 자동화하여 연구 효율성을 90% 이상 향상시킨 실제 경험담. DIA-NN 파이프라인 구축부터 바이오마커 데이터베이스 개발까지, 구체적인 구현 과정과 성과를 상세히 공개합니다.
WordPress에서 Next.js로 마이그레이션 — 속도 60배 개선
WordPress 블로그를 Next.js + MDX 기반으로 마이그레이션하면서 TTFB를 2.4초에서 40ms로 줄인 실전 경험을 공유합니다. 데이터 이전, SEO 유지, 리다이렉트 설정까지.