🔥 Discover this awesome post from Hacker News 📖 📂 Category: 💡 Main takeaway: For New York’s No 1 Socialite, Angela.1) Prioritize your ease of being over any other consideration: parties are like babies, if you’re stressed while holding them they’ll get stressed too. Every other decision is downstream of your serenity: e.g. it's better to have mediocre pizza from a happy host than fabulous hors d'oeuvres from a frazzled one.2) Advertise your start time as a quarter-to the hour. If you start an event at 2:00, people won't arrive till 2:30; if you make it 1:45, people will arrive…
✨ Discover this must-read post from Hacker News 📖 📂 Category: 📌 Main takeaway: Other vegetables have a very unique effect on our smell. The asparagus plant produces a compound called the asparagusic acid and, when it's digested by your body, it releases sulphur compounds too. It's these chemicals, such as methanethiol and dimethyl sulphide, that make your sweat and your pee smell a certain way. Sulphur compounds are very volatile, so they easily disperse in the air. That's why they're so easy to smell from the toilet bowl. This smell usually lasts more than five hours.Not everybody produces this…
🔥 Explore this must-read post from Hacker News 📖 📂 Category: 💡 Here’s what you’ll learn: FurtherAI (Series A, a16z + YC) is hiring Software Engineers, AI Engineers, and Forward-Deployed Engineers.We're building AI Agents for the insurance industry and are already post-PMF with strong enterprise adoption.Highlights:- $25M Series A led by Andreessen Horowitz (a16z) - > 10× revenue growth this year - Seed -> Series A in under a year - Small, talent-dense team - 6/15 are founders (incl. 4 YC founders) - Team backgrounds include staff engineers and researchers from Apple, Microsoft, AmazonLooking for strong engineers based in SF…
✨ Explore this insightful post from Hacker News 📖 📂 Category: ✅ Main takeaway: Chris BaraniukTechnology ReporterAFP via Getty ImagesDiesel locomotives are being replaced with electric modelsEvery day, thousands of passengers heading south west on trains leaving Aldershot station pass a cluster of solar panels nestled by the tracks. Few, if any, may notice the installation. But the train they are on is drawing power from it."On a sunny afternoon, if you are catching a train through Aldershot, a little bit of the energy for that train will come from those solar panels," says Leo Murray, co-founder and chief executive…
💥 Discover this must-read post from Hacker News 📖 📂 Category: 💡 Key idea: LISP-NOTES ON ITS PAST AND FUTURE-1980 Next: Introduction John McCarthy Computer Science Department Stanford University Stanford, CA 94305 jmc@cs.stanford.edu http://www-formal.stanford.edu/jmc/ JanFebMarAprMayJun JulAugSepOctNovDec , :< 10 0 Abstract: LISP has survived for 21 years because it is an approximate local optimum in the space of programming languages. However, it has accumulated some barnacles that should be scraped off, and some long-standing opportunities for improvement have been neglected. It would benefit from some co-operative maintenance especially in creating and maintaining program libraries. Computer checked proofs of program correctness…
🔥 Discover this awesome post from Hacker News 📖 📂 Category: ✅ Here’s what you’ll learn: Sign up for the daily CJR newsletter. In May 2024, Daniel Ojukwu, a twenty-six-year-old reporter for the Foundation for Investigative Journalism, a Nigerian nonprofit, was grabbed off the streets of Lagos by armed police and bundled into a vehicle. For the next several days, he was held in a cell incommunicado—first in Lagos, and later in the federal capital, Abuja—without being told exactly what he’d been arrested for. “It was more of an abduction,” Ojukwu recalled recently, via WhatsApp. Finally, on the fourth day,…
🔥 Discover this awesome post from Hacker News 📖 📂 Category: 💡 Key idea: People rant about having to learn algorithmic questions for interviews. I get it — interview system is broken, but you ought to learn binary search at least. Anyways, yet again I came across a real life application of Algorithms. This time in the OG tool git. git bisect - Use binary search to find the commit that introduced a bug ref. And Leetcode wanted you to know it First Bad Version We use a monorepo at work. And people tend to make hundreds, if not thousands,…
🔥 Explore this insightful post from Hacker News 📖 📂 Category: 📌 Key idea: Starting this month, parking lots in South Korea with more than 80 spaces will be required to install solar canopies and carports. But, unlike similar laws that have been proposed in the US, this new law doesn’t just apply to new construction – existing lots will have to comply as well! South Korea’s Ministry of Trade, Industry and Energy announced in August that it has prepared an amendment to the Enforcement Decree of the Act on the Promotion of the Development, Use, and Diffusion of New and…
🚀 Check out this awesome post from Hacker News 📖 📂 Category: 📌 Key idea: ACM Classic: Reflections on Trusting Trust Reflections on Trusting TrustKen Thompson Reprinted from Communication of the ACM, Vol. 27, No. 8, August 1984, pp. 761-763. Copyright © 1984, Association for Computing Machinery, Inc. Also appears in ACM Turing Award Lectures: The First Twenty Years 1965-1985 Copyright © 1987 by the ACM press and Computers Under Attack: Intruders, Worms, and Viruses Copyright © 1990 by the ACM press. I copied this page from the ACM, in fear that it would someday turn stale. Introduction I thank…
💥 Check out this insightful post from Hacker News 📖 📂 Category: ✅ Key idea: GITHUB HUGGINGFACE MODELSCOPE SHOWCASEFrom Chatbot to Autonomous Agent#We are proud to present Tongyi DeepResearch, the first fully open‑source Web Agent to achieve performance on par with OpenAI’s DeepResearch across a comprehensive suite of benchmarks. Tongyi DeepResearch demonstrates state‑of‑the‑art results, scoring 32.9 on the academic reasoning task Humanity’s Last Exam (HLE), 43.4 on BrowseComp and 46.7 on BrowseComp‑ZH in extremely complex information‑seeking tasks, and achieving a score of 75 on the user‑centric xbench‑DeepSearch benchmark, systematically outperforming all existing proprietary and open‑source Deep Research agents.Beyond the model, we share a complete and battle‑tested methodology for creating such advanced agents. Our contribution details a novel data synthesis solution applied across the entire training pipeline, from Agentic Continual Pre‑training (CPT) and Supervised Fine‑Tuning (SFT) for cold‑starting, to the final Reinforcement Learning (RL) stage. For RL, we provide a full‑stack solution, including algorithmic innovations, automated data curation, and robust infrastructure. For inference, the vanilla ReAct framework showcases the model’s powerful intrinsic capabilities without any prompt engineering, while the advanced Heavy Mode (test‑time‑scaling) demonstrates the upper limits of its complex reasoning and planning potential.Continual Pre‑training and Post‑training Empowered by Fully Synthetic Data#Continual Pre‑training Data#We introduce Agentic CPT to deep research agent training, creating powerful agentic foundation models for post‑training. We propose AgentFounder, a systematic and scalable solution for large‑scale data synthesis that creates a data flywheel with data from the post‑training pipeline.Data Reorganization and Question Construction. We continuously collect data from various sources, including documents, publicly available crawled data, knowledge graphs, and historical trajectories and tool invocation records (e.g., search results with links). As shown in the figure, these diverse data sources are restructured into an entity‑anchored open‑world knowledge memory. Based on randomly sampled entities and their corresponding knowledge, we generate multi‑style (question,answer) pairs.Action Synthesis. Based on diverse problems and historical trajectories, we construct first‑order action synthesis data and higher‑order action synthesis data. Our method enables large‑scale and comprehensive exploration of the potential reasoning‑action space within offline environments, thereby thereby eliminating the need for additional commercial tool API calls. Specifically, for the higher‑order action synthesis, we remodel trajectories as multi‑step decision‑making processes to enhance the model’s decision‑making capabilities.Post-training Data#High-quality synthetic QA pairsWe develop an end‑to‑end solution for synthetic data generation. This fully automated process requires no human intervention to construct super‑human quality datasets, designed to push the boundaries of AI agent performance. Through long‑term exploration and iteration‑from early methods like reverse‑engineering QA pairs from clickstreams (WebWalker) to the more systematic graph‑based synthesis (WebSailor and WebSailor‑V2), then the formalized task modeling (WebShaper)‑our approach ensures both exceptional data quality and massive scalability, breaking through the upper limits of model capabilities.To address complex, high‑uncertainty questions, we synthesize web‑based QA data through a novel pipeline. The process begins by constructing a highly interconnected knowledge graph via random walks and isomorphic tables towards tabular data fusion from real‑world websites , ensuring a realistic information structure. We then sample subgraphs and subtables to generate initial questions and answers. The crucial step involves intentionally increasing difficulty by strategically obfuscating or blurring information within the question. This practical approach is grounded in a complete theoretical framework, where we formally model QA difficulty as a series of controllable “atomic operations” (e.g., merging entities with similar attributes) on entity relationships, allowing us to systematically increase complexity.To further reduce inconsistencies between the organized information structure and the reasoning structure of QA, enable more controllable difficulty and structure scaling of reasoning, we proposed a formal modeling of the information‑seeking problem based on set theory. With this formalization, we developed agents that expands the problem in a controlled manner, and minimizes reasoning shortcuts and structural redundancy, leading to further improved QA quality. Moreover, this formal modeling also allows for efficient verification of QA correctness, effectively addressing the challenge of validating synthetic information‑seeking data for post‑training.Furthermore, we have developed an automated data engine to scale up the creation of PhD‑level research questions. This engine begins with a multi‑disciplinary knowledge base, generating “seed” QA pairs that require multi‑source reasoning. Each seed then enters a self‑guided loop of “iterative complexity upgrades”, where a question‑crafting agent is equipped with a powerful toolset including web search, academic retrieval, and a Python execution environment. In each iteration, the agent expands knowledge boundaries, deepens conceptual abstraction, and even constructs computational tasks, creating a virtuous cycle where the output of one round becomes the more complex input for the next, ensuring a controllable and systematic escalation of task difficulty.Unleashing Agent Capabilities with Diverse Reasoning PatternTo bootstrap the model’s initial capabilities, we constructed a set of trajectories via rejection sampling, based on the ReAct and IterResearch frameworks (for details, see below). On one hand, ReAct, as a classic and foundational multi-turn reasoning format, instills rich reasoning behaviors and reinforces the model’s ability to adhere to structured formats.On the other hand, we introduce IterResearch, an innovative agent paradigm (detailed below). It unleashes the model’s full reasoning potential by…
