We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
AI in EdTech Weekly
by avergin 92 sources
Weekly intelligence briefing on how artificial intelligence and technology are transforming education and learning - covering AI tutors, adaptive learning, online platforms, policy developments, and the researchers shaping how people learn.
liemandt
Justin Reich
MacKenzie Price
Grounded AI is becoming the product default
The biggest shift this week was structural, not model-related: new education tools are increasingly being tied to course materials, district resources, and specific workflows instead of handed over as open chatbots. Gemini’s new study notebooks are grounded in class materials, start with a diagnostic quiz, build bite-sized interactive lessons, update based on follow-up quiz results, and roll out globally on the web at no cost. They also sync sources and chats with NotebookLM.
NotebookLM, meanwhile, added fully customizable flashcards so learners can edit questions and answers, add new cards, and share sets with classmates.
At the institution level, the pattern is even clearer. MagicSchool says a Georgia state audit found 58% of teachers have used its product, and the company has now added district-level controls to ground AI in local resources. Brisk presented a similar idea through "Curriculum Intelligence" built from district guidance and curriculum libraries, while TrekAI positions itself as a supervised "learner’s permit" and Lightspeed gives districts visibility into what AI tools students are already using.
Higher ed is testing the same move with different infrastructure. Denison built a token-priced multi-model environment with 17 models for students, faculty, and staff, and requires incoming students to complete a foundational AI course focused on ethics and use.
These launches are promising, but they are still mostly product rollouts and operational examples rather than independent proof of better learning. Access is also uneven: Gemini says mobile and school-issued accounts are coming later this summer, and district or campus systems depend on local setup, policy, and budget.
The learning design principle is getting clearer: don’t remove the struggle
This week’s strongest research signal was practical: AI helps most when it guides thinking instead of collapsing it. In Ghana, the Rori tutor produced an effect size of 0.36 at about $5 per student by giving hints and guidance before solutions. A Carnegie Mellon study found answer-giving chatbot use tracked with worse exam performance, while answer-withholding proof review tracked with better outcomes. In a GPT-4 field experiment with nearly 1,000 high-school math students, the answer-giving version led to 17% worse later exam scores without AI than the control group, while a safeguarded version avoided that drop. A UK RCT from Google, LearnLM, and Eedi found AI support increased novel problem-solving by 5.5 percentage points over human tutors alone.
"Ask me one question at a time, waiting for my answer in between, to help me think through this problem, to help me discover angles I haven’t thought of."
That advice from Mindstone’s Joshua mirrors the research trend: use AI as a thinking coach, not a search replacement or answer machine.
The limitation matters just as much. One physics-feedback system was wrong in about one case in five, and even very strong students often failed to spot those errors. The broader evidence still supports a narrower claim than the market sometimes implies: AI reliably improves performance while learners have access to it, but durable unassisted retention and transfer remain unsettled. Rich scaffolding also helps weaker learners more than stronger ones, which means the same design will not fit every student.
Alpha is trying to turn AI-native schooling into a scalable schedule
New Alpha School interviews pushed the conversation beyond tutoring and toward full schedule redesign. The model uses AI to assess knowledge gaps, keep students working at roughly 80-85% difficulty, and compress core academics into a two-hour daily sprint with no homework. Alpha says that model has produced 2.6x faster learning, top 1-2% performance, and an average SAT of 1535 for 11th graders.
The more distinctive claim is about motivation. Alpha’s founders argue that motivation is "90% of the solution," and say "Time Back" lets students spend afternoons on workshops, sports, entrepreneurship, and life skills. They report that 96% of students say they love school, while guides focus on one-to-one motivation and support rather than lectures and grading.
The new scaling detail is access. Alpha says AI costs are falling from about $10,000 per student toward the hundreds, and it is using Texas vouchers to expand through sports academies and gifted programs while planning inner-city public-school pilots in Texas. Its longer-term goal is a tablet-based product priced below $1,000 that can teach a full curriculum in under two hours a day. Those public-school pilots are still upcoming, with the founder saying the data will be published when they open.
Trust, bias, and policy are moving to the center
The policy story this week was not faster adoption but slower, more contested adoption. New York City delayed final AI guidance until later this summer after its March draft drew nearly 6,500 comments and broader backlash. The draft used a traffic-light framework that banned AI for assessments and grading while allowing lower-risk uses such as brainstorming lesson plans, but it said little about student use. Officials are now considering age-based expectations as they try to prepare older students for an AI-present world without letting AI replace their thinking.
Bias concerns are getting harder to dismiss as abstract. Victoria Hedlund reported that AI explained a simple science concept with more scaffolding but less technical rigor to girls than boys, awarded lower marks to "Victoria Hedlund" than "Victor Hedlund" on the same GCSE history paper, and gave girls more discouraging physics career advice than boys with the same qualifications. She also argued that names, locations, and inferred socioeconomic context can lower the level of challenge an AI tutor offers, and that the frequency of AI interaction can expose students to bias far more often than a human teacher would.
The operational response is becoming clearer: minimize identifying inputs, avoid using AI for marking or detectors without strong evaluation, and treat outputs as results to inspect rather than conclusions to trust. That lines up with broader guidance from Tech & Learning, where Microsoft’s Matt Jubelirer argued that AI literacy now goes beyond prompting to judging capabilities, context, and ethical use, and that grading still requires human judgment.
"We don’t want humans in the loop. We want humans in the lead."
That same principle is showing up in product architecture. e-Literate argues that multi-agent AI can fit academic work because it mirrors teams of specialists, but each extra agent adds token costs and lossy context handoffs, making large-scale deployment harder to budget and audit.
The AI literacy debate is widening into a human-skills debate
A second strategic shift this week was philosophical. Justin Reich argued that education still lacks evidence on what effective AI literacy practice actually is, and that many early frameworks repeat the same mistake schools made with web literacy: packaging a lofty new skill bundle before expert practices are clear. In his view, domain expertise may matter more than generic knowledge about how large language models work, so schools should run local experiments and compare evidence of learning rather than assume a ready-made AI literacy playbook exists.
At the same time, industry voices are shifting from "teach AI" to "teach the human capabilities AI makes more valuable." Executives speaking to educators emphasized collaboration, resilience, communication, negotiation, leadership, critical inquiry, and ethical reasoning as future-proof skills, even as employers and coalitions like RAISE US push AI-enabled training and workforce transition support.
For lifelong learners, Andrew Ng’s advice was notably old-fashioned in the best way: start with efficient coursework, then build small projects, take handwritten notes to improve retention, and make learning a regular habit rather than a burst activity.
What This Means
- For K-12 systems: the near-term winners are likely to be grounded tools with clear guardrails, visibility, and age-appropriate rules—not open-ended chat alone. District leaders have more reason to prefer tools tied to curriculum, local resources, and data-minimization standards.
- For higher ed: AI literacy is looking less like a single standards document and more like a sequence of local experiments, explicit policies, and assessment redesign. The bar for using AI in grading or high-stakes judgment should stay high.
- For teachers and learning designers: this week’s evidence again favored AI as a critic, coach, or draft partner over AI as an answer source. Study notebooks, guided tutoring, and structured planning are moving faster than fully automated teaching.
- For learners and workforce teams: structured AI tools can make practice easier to start, but they do not remove the need for error-checking, domain knowledge, and regular study habits.
- For investors and product builders: demand is moving toward grounded, workflow-specific AI, but bias evaluation, cost control, and proof of learning impact are becoming as important as model quality.
Watch This Space
- Grounded consumer study stacks: Gemini’s study notebooks, NotebookLM’s editable flashcards, and free AI tools from Khan Academy, CK-12, and Pear Start all point to a more structured self-study layer emerging on top of general-purpose models.
- Public-school and lower-cost AI-native models: Alpha’s planned Texas public-school pilots and its tablet-based mass-market ambitions are worth tracking if they move from founder claims to published data.
- Bias audits and age-based governance: NYC’s delayed rewrite and Hedlund’s experiments suggest next-wave policy will focus less on blanket approval or bans and more on age, data minimization, and bias exposure.
- Agentic systems with humans in the lead: multi-agent course design and support tools are advancing, but whether institutions can afford them at scale remains open.
- Human-skill-first workforce learning: the tension between AI fluency and durable human skills is likely to shape both curriculum design and employer training over the next cycle.
Ethan Mollick
The big development: education is drawing a harder line between answer machines and learning tools
This week’s clearest signal was not a new model. It was a sharper distinction between AI that helps learners think and AI that helps them avoid thinking .
Ethan Mollick pointed to a recurring pattern: students naturally reach for AI on homework, but off-the-shelf chatbots act like assistants, not tutors, by providing answers that reduce mental effort and undermine learning . He also cited a large study in China showing that when AI shortened homework time by lowering effort, test scores fell too .
“Across studies, a theme: AI tutoring in support of classes is good, using AI to ‘help’ with homework is bad.”
The same tension is showing up in writing and assessment. One higher-ed analysis argued that AI has widened the college-readiness gap in writing by making it easier for students to produce polished text without doing the thinking writing is supposed to develop . A narrower use on the Packback platform looked more promising: AI handled grammar and style feedback so instructors could focus on ideas, and student writing improved modestly over a semester . EdSurge’s podcast reached the same practical test for K-12: whether students are learning to think with AI or using it to bypass productive struggle .
Sam Altman said he expected schools to redesign quickly after ChatGPT, with projects that require AI but still stretch thinking, yet he still sees no significant systemic change across education 3.5 years later and warned that, without redesign, critical thinking skills could atrophy .
Mollick’s practical response is more concrete: more in-class assignments, AI tutors that challenge rather than answer, and prompts that use AI as a critic during debate or argument rather than as a completion engine .
Even commentary on higher ed outcomes is moving in this direction. The AI in Education Podcast highlighted Berkeley research suggesting that A grades rose 30% in take-home writing- and coding-heavy courses after ChatGPT, while one computer science course’s failure rate reportedly rose from 7% to 35% when students later faced exams without AI . Researchers from Australia, New Zealand, and China are now explicitly arguing for Socratic AI companions that prompt reflection and understanding, rather than vanilla chatbots that act as a crutch .
The response is shifting from bans to explicit learning design
Higher ed and K-12 are starting to turn that insight into design choices instead of generic rules. Lance Eaton argues that year 5 of generative AI in higher education should be the year of program-level curriculum change, because students are still graduating after four years of scattershot experiences: one professor requires AI, another bans it, another treats it as misconduct, another never names it . His recommendation is not to simply teach the tool or ban it, but to map where students should first encounter AI, where they should use it, where they should work without it, and where they should learn refusal, verification, disclosure, and judgment .
A related UK discussion is pushing even further upstream. Rose Luckin argues that if AI can master knowledge-heavy curricula faster and more accurately than humans, schools need to shift toward richer human capabilities: creativity, problem-solving, metacognition, resilience, empathy, and better assessment of those skills .
K-12 policy is also getting more operational. Microsoft Teams now lets educators set assignment-level AI expectations: full AI use, editing only, brainstorming only, or no AI use, with customizable labels and defaults . Students see the guideline when they open the assignment and, if their school enables it, a direct button to open Copilot .
That product choice matches district-level policy thinking. Tech & Learning highlighted a three-part framework for districts starting from scratch: understand how students and teachers are already using AI, protect student data and personally identifiable information, and address academic integrity without taking students out of the driver’s seat of their own learning . It also draws a distinction between prohibition-heavy acceptable use policies and responsible use policies that explain reasoning and treat students as participants . In Massachusetts, Shrewsbury Public Schools built five pillars around student preparation, student learning tools, staff tools, guardrails, and academic integrity, aligned to the district’s Portrait of a Graduate rather than to any single product .
This is also showing up in professional development. Hillsborough County Public Schools, a district serving more than 200,000 students, put 1,000 educators through a summer week of training on responsible use of MagicSchool AI . The underlying message is similar to Monica Burns’ advice: start AI guidance with the kind of thinking you want students to do, not compliance alone .
Tools are getting more specific — and their limits are clearer
The most credible product activity this week was not “AI does everything.” It was AI being inserted into narrower learning workflows .
In Microsoft’s education stack, Learning Zone lets educators attach AI-built interactive lessons directly to Teams assignments, render them inside Teams for students, and provide built-in checks and feedback . Those lessons can also draw from partner content including NASA, Figma, and Minecraft . Rubric generation is becoming more constrained as well: when teachers create a rubric with AI, the standards already attached to the assignment are automatically carried into the rubric . The limitation matters: AI lesson generation requires a Copilot Plus PC, even though students can complete the lesson in Teams on their side .
At the school operations level, the near-term gains remain mostly administrative. One principal described using AI daily to turn state memos into slide decks, teacher texts into parent messages, long emotional emails into summaries and replies, contract PDFs into queryable answers, and scattered event details into calendar entries — saving “a few hours” a day and freeing more time for students and staff . Monica Burns describes the same division of labor more generally: AI can do the drafting, formatting, and structuring, but human review, edits, and knowledge of students still shape the final product .
For self-directed learning, platforms are getting more interactive. Copilot Notebooks is now available without paid student Copilot licenses, and one suggested use is uploading a curriculum to generate study guides, activities, and infographics . Google NotebookLM is already being used by students to turn class slides into self-quizzes and answer keys, reinforcing retrieval practice . Andrew Ng is pushing toward a more conversational model in CodeDream.ai, where learners interact through simulated video calls and embedded JavaScript demos instead of passively watching static videos . But Ng’s verdict is restrained: online learning tools are better than they were 10 years ago, not yet truly transformed .
Adoption, meanwhile, remains a constraint. Mollick says AI interfaces like chatbots, Codex, and NotebookLM are not intuitive in practice and contain “a dozen little tricks and traps” that block effective use . He also says many people never get past the difficult first hour, which keeps AI in the “kind of like Google” box . Chalkbeat’s reporting on a Stanford AI tutor study shows what that looks like in schools: human guidance increased use by only 1-4 minutes a week, many students never logged on, total time stayed far below the 30 minutes a week needed for reading gains, and there was no meaningful difference in reading scores .
“The challenge isn’t just building good AI tools. It’s really getting students to use them, and that seems to take the same type of intentional design that we’ve learned matters with other ed tech interventions and tutoring.”
AI-native models are expanding — but not all in the same direction
Some schools are no longer treating AI as an add-on. They are designing schedules, staffing, and pedagogy around it from the start.
At Alpha School, students spend two morning hours on personalized academic work with AI tutors or adaptive apps that give immediate feedback and let students advance on mastery . Afternoons shift to four hours of team-based life skills, projects, collaboration, and conversation . Alpha argues that this split lets AI handle individualized cognitive work while freeing more time for authentic social learning, and says its classes rank in the top 1-2% nationally . Reporting from Michael Horn’s microschool series reinforces the broader pattern: the most interesting schools are not simply maximizing AI use; they are being explicit about where AI belongs and what human capabilities — autonomy, entrepreneurship, deep research, feedback, and strong foundations — they still want school to build .
Alpha is also trying to rebut a common critique directly. Its leaders say AI-first schooling should produce more thinking, not less, and are pairing the model with explicit humanities work, including students reading Tocqueville’s Democracy in America and debating it for the age of AI as part of building “philosopher-builders” .
Higher education is seeing its own AI-native experiments. The AI in Education Podcast highlighted a new Italian online university built from scratch around AI optimization, serving 112,000 students with 400 academic staff . That is a radically different staffing model from legacy higher ed. But Andrew Ng’s comments are a useful counterweight: what people need to learn is changing quickly — coding agents, AI building blocks, and broader product skills — yet the delivery of training is still being reinvented in real time .
Economics are starting to shape product design
For edtech buyers and investors, one of the most practical notes this week came from e-Literate: current AI economics do not fit education’s usual software model .
Schools and colleges budget around fixed annual costs, while metered AI introduces variable usage that can spike unpredictably . e-Literate points to enterprise examples of blown token budgets, revoked licenses, and large unexpected bills as signs of what happens when usage limits are weak . The likely consequence is product design, not just procurement, changing: mainstream platforms are more likely to ship constrained AI actions, narrow buttons, predefined workflows, and usage caps than open-ended magic text boxes or expensive multi-agent systems . That slowdown may frustrate some vendors, but it could also reduce the odds that education scales the wrong tools too quickly .
What This Means
- For K-12 leaders: Redesigning homework and assessment is now harder to avoid. The evidence and commentary this week point toward more in-class checks, explicit AI-use expectations on assignments, and tutor-like AI that preserves effort instead of replacing it .
- For higher ed: Course-by-course AI rules are too inconsistent. Program-level maps of where students should use, refuse, verify, and disclose AI are becoming a more practical governance model .
- For teachers and L&D teams: The best short-term use cases remain bounded ones — interactive lesson building, standards-aligned rubrics, administrative drafting, and study supports — with human review still central .
- For edtech builders and investors: Capability is not enough. Products need engagement, usability, and cost discipline. Low-usage tutoring pilots and unsustainable token economics can kill otherwise promising ideas .
- For learners: AI is most useful when it behaves more like a critic, coach, or quiz-maker than an answer machine .
Watch This Space
- Socratic companions and public-interest tutors: Researchers are pushing companion-style AI that promotes reflection, while Mollick argues universal tutors are now technically plausible if built with public R&D, transparency, and the right scaffolding .
- Mainstream platforms embedding guardrails: Assignment-level AI labels in Teams suggest more classroom software will make AI expectations visible inside the workflow, not just in policy documents .
- Next-generation study platforms: Khan Academy says its next launch will combine trusted content with AI tools to help students persist through hard learning, while Copilot Notebooks and CodeDream point to more interactive self-study formats .
- AI-native school builders: Alpha’s summer internship and new engineering cohort show schools investing directly in building learning apps, not just buying them .
- Cost-shaped AI design: Expect more predefined AI actions, fewer unlimited chat interfaces, and closer scrutiny of whether usage actually translates into learning gains .
Google DeepMind
Justin Reich
The clearest signal this week: structured AI is outperforming generic AI use
In Sierra Leone, Google DeepMind positioned AI as a response to teacher shortages, describing it as a partner that can extend educators’ reach without replacing them . Over eight weeks, students increasingly used Gemini to understand concepts rather than just get answers, with problem-solving queries rising from 68% to 90% . EdSurge also pointed to a Sierra Leone study in which a one-day AI training for secondary teachers was followed by math gains equivalent to more than a year of additional schooling .
"AI can act as a partner to support educators in these environments – amplifying their reach without replacing their essential expertise and skills."
The contrast with generic chatbot use is getting sharper. Standard free LLMs can lower brain activity and retained learning by encouraging what one report called "cognitive surrender," while a carefully designed AI tutor in an undergraduate physics course produced twice the learning gains of active, in-person instruction . Estonia’s emerging policy follows the same logic: students build foundational knowledge first, then use AI later in the learning process for feedback and assisted learning; earlier grades are deliberately excluded for now .
Schools are building tighter AI workflows instead of relying on public chatbots
In Australia, some schools are already doing this themselves. In Broken Bay Diocese, a Year 6 student built a science agent inside a controlled "kids’ pool" environment; it checks a learner’s level, adapts explanations or tests, and can even add engagement cues like jokes. The class later adopted the agent because it worked across different learning needs .
Another school built a secure Gemini + Apps Script tool that combines GPA, testing, and timetable data so teachers can query class or student breakdowns and get differentiation suggestions without moving student data into public systems . At Cathedral College in Rockhampton, a lesson-starter agent was grounded in school teaching frameworks, curriculum documents, and teacher-contributed examples, but its creator emphasized that cultural preparation came first and that the tool was not meant to replace human mentoring .
"It can't exist on its own. It's not meant to be a standalone agent or replace a human coach or mentor."
District tutoring programs are moving in the same direction. Newark Public Schools received $400,000 from New Jersey to expand high-impact tutoring that uses AI with teacher oversight for math and reading . District leaders said they expanded Khanmigo after pilot users showed math-score improvement .
The next wave of tools is more lesson-native than chatbot-native
Microsoft’s new Learning Zone turns educator prompts, uploaded files, or vetted resources such as OpenStax into interactive lessons in minutes . The lessons combine bite-sized content slides with multiple exercise types, immediate feedback, retries, and conditional "nested" slides that give students extra practice when they miss a concept . Teachers can use it for topic introductions, wrap-ups, flipped learning, or live instruction with anonymous aggregated knowledge checks, then assign lessons through codes, links, Teams, or an LTI-compatible LMS and review performance reports afterward .
The capability is notable, but the limits matter too. Students can access lessons in a browser on any device, while lesson generation currently requires a Copilot+ PC, with a broader trial planned . Microsoft also says more generation languages and an in-class teaching mode are coming .
Assessment is still where AI most clearly hits its limits
The week’s most useful assessment research separates scoring from feedback. In a randomized trial across 178 schools in Brazil, AI essay scoring performed at the level of human review; students improved by about a tenth of a standard deviation whether essays were scored by AI alone or AI plus human graders . But feedback is a different task: it requires identifying what a specific student got wrong in the context of that student’s reasoning .
That gap shows up across multiple studies. In middle-school math, the model that scored best produced teacher-preferred feedback only 12% of the time, while GPT-4 produced the most trusted feedback and the worst scores . On science work, LLMs matched teachers on next-step "Feed Forward" guidance but lagged on "Feed Back" that diagnoses the student’s specific reasoning error, scoring 3.05 versus teachers’ 3.52 out of 5 . In a physics-feedback study, about 20% of AI responses were inaccurate, and students rated wrong feedback as just as accurate as correct feedback .
This helps explain why AI is reallocating teacher time rather than eliminating the teacher role. In Brazil, AI scoring gave teachers about 30% more one-on-one writing conferences without increasing workload . And it helps explain Justin Reich’s warning that AI is most useful when the user already has enough domain knowledge to separate strong output from confident nonsense .
Real classrooms are already adapting. Reich’s 120-interview project found widespread homework bypass, with some students deciding which assignments are important enough to do themselves and teachers responding with everything from rewrites and detectors to harder AI-required tasks . Teachers on Reddit describe moving essays and tests in-class or on-demand because "anything that goes home is AI’d now" . Another teacher working with younger students reported near-identical AI-generated responses on a research assignment .
Higher ed and policy are moving from experimentation to governance
In higher education, roughly three-quarters of faculty say students use GenAI to write essays and papers, and roughly the same share of faculty use GenAI themselves . But most institutions are still in a wait-and-see phase or running patchwork experiments, not seeing broad gains in learning or efficiency . That is pushing redesign in two directions: tougher assessments such as live oral defenses, presentations, and more rigorous feedback loops , and better student-support systems that connect academic, financial, and well-being data while reducing administrative overhead . It also aligns with the argument that higher education and workforce systems need continuous reskilling, more personalized learning, and more real-world experience as AI changes work .
Governance is becoming more explicit. EDUCAUSE described low-risk AI uses such as brainstorming, tutoring, translation, summarization, and simple coding; medium-risk uses such as course design, grading, student feedback, and administrative tasks; and high-risk uses involving student records, HR data, or financial information . For community colleges, the warning is that commercial tools may encode assumptions that do not fit part-time, working, or caregiving students, which can automate weak judgments at scale . Recommended safeguards include contextual auditing, vendor transparency, and collaborative governance with faculty and student advocates .
The same shift is happening at system level. England’s Department for Education has funded 16 edtech firms to build trustworthy AI tools for lesson planning and marking using a national curriculum data/content store prototype . A £1 million pilot has now expanded into a £23 million, four-year program recruiting 1,000 schools and colleges as test beds for AI and edtech . Google DeepMind and Edy have announced a randomized trial of Learn LM with 1,500 students across 10 schools in England , while OpenAI is testing a learning-outcomes measurement suite with 20,000 students in Estonia and Anthropic and CodePath are running a 15-month classroom study across thousands of students .
Scale, though, is not the same as settled evidence. Ben Williamson argues that these programs can privilege signals of impact and scalability over broader forms of evidence, while reshaping classrooms into measurable testing sites for experimental AI products . That warning matters because AI economics are not like normal software: every use carries inference costs, private or local deployments add storage, cybersecurity, hardware, networking, and technical expertise, and districts still have few examples of what universal access would actually cost . UNESCO’s Digital Transformation Collaborative is one example of the response, framing digital transformation around coordination, connectivity, cost, capacity, content, and data .
What This Means
- For school systems: Separate AI for tutoring and practice, AI for lesson creation, and AI for assessment. The same tool may score reliably and still fail at diagnostic feedback .
- For instructional leaders: Training and workflow design matter more than access alone. Sierra Leone’s gains followed teacher preparation, and Australian implementations started with secure environments and cultural work—not just tool rollout .
- For higher ed and L&D teams: Access without redesign leads to patchwork. The durable opportunities are stronger assessment, better student support, continuous reskilling, and more real-world AI use through projects, internships, co-ops, and apprenticeships .
- For buyers and investors: Ask what must be true locally—curriculum fit, teacher prep, infrastructure, language, inclusion, data protection, and affordability—before treating pilot results as scalable .
- For learners and families: In this week’s coverage, AI literacy was defined less as basic tool use and more as questioning outputs and evaluating reliability; foundational knowledge still determines whether AI helps or misleads .
Watch This Space
- Teacher-made microtools: "Vibe coding" is lowering the barrier for teachers to build task lists, translation workflows, dashboards, and practice games. One fourth-grade teacher reported an AI-built review game that led to students scoring five points higher on average with no retests .
- More frontier-lab trials inside education systems: The DfE test-bed expansion, Learn LM trial, OpenAI’s Estonian measurement suite, and Anthropic’s CodePath study will shape both evidence and market expectations .
- AI-native school models: Alpha World School’s launch points to a more ambitious version of AI-enabled schooling, pairing daily AI-driven academics with fieldwork in Kenya and Ecuador and research projects with university faculty .
- Agentic tools for self-directed learning: NotebookLM’s new research companion can build a source repository from loose questions, surface its reasoning process, and export outputs in formats from charts to spreadsheets and documents .
- Cost and governance as adoption bottlenecks: As pilots broaden, recurring inference costs, privacy demands, and local hosting decisions may matter as much as the model itself .
Andrew Curran
Andrew Ng
Luis von Ahn
AI becomes education infrastructure
The biggest shift this week is structural. AI in education is no longer mainly a classroom-level experiment; it is being treated as infrastructure by districts, universities, and governments. In K-12, 79% of districts now report having AI guidelines, up from 57% in 2025, and 70% are training staff on instruction-focused generative AI tools. But only 41% of initiatives focus directly on teaching and learning, while leaders still cite staffing, funding, and expertise gaps as major blockers. Cybersecurity is the dominant worry: 98% of respondents are concerned AI will enable new attacks, and many district vetting processes still skip basic checks on safety and accessibility .
Higher ed is moving in the same direction. Ethan Mollick notes that many schools, including U Penn, now offer school-wide AI access, arguing that safe and equitable access is a necessary foundation and that HIPAA- and FERPA-compliant systems lower risk for large numbers of students and researchers . But provision alone does not guarantee adoption. At Harvard, students were offered ChatGPT EDU, Gemini, and Claude, yet uptake lagged because some students distrusted university monitoring and saw centrally provided AI as a possible surveillance tool .
The governance gap remains wide. Only about 1 in 3 students say their school has a school-wide AI policy, many report teacher-by-teacher inconsistency, and 67% believe more AI use for schoolwork harms critical thinking. At the same time, roughly 85% of teachers and students report using AI for schoolwork, while 4 in 5 U.S. teachers receive no formal guidance on AI and 2 in 3 get none on one-to-one instruction or personalization .
National strategies are emerging faster than shared capacity
Outside individual institutions, governments are starting to treat AI education as a national capability. China has launched an AI Empowering Education action plan that incorporates AI into teacher qualifications and expands AI education for primary and secondary students . Malta’s AI for All program pairs free paid-tier ChatGPT access with a required responsible-use course designed by the University of Malta . Anthropic’s $200 million partnership with the Gates Foundation is aimed at evidence-based AI tutors for U.S. schools, career guidance tools, literacy and numeracy apps in Africa and India, and public goods such as benchmarks, datasets, and knowledge graphs .
But the global build-out is uneven. One Edtech Podcast discussion noted that 40% of ChatGPT web traffic comes from middle-income countries, while high-income countries host 77% of global data-center capacity and low-income countries less than 0.1%; 87% of AI models come from countries containing just 17% of the world’s population . In Jordan, sandboxes with education authorities and universities are being used to test what local systems can actually absorb before scaling solutions, with the broader argument that coordination and innovation governance may matter more than technology procurement alone .
Guardrails are tightening where stakes are highest
As access expands, restrictions are becoming more explicit — especially for young learners and high-stakes academic work. The AFT has called for no screens in pre-K through second grade except where needed, no student-facing AI in elementary schools, supervised AI for older students, and a ban on social companion chatbots for students under 16 . In New York, NYSUT passed a similar resolution and said AI use in any grade should be educator-led and designed to promote critical thinking, digital literacy, and civic readiness rather than replace human instruction or judgment .
The debate is not settled. A counterargument from Tech & Learning says the wrong target is the device itself, not passive learning. That critique argues for a purpose-first approach: explicit instruction on when AI supports learning, when it gets in the way, and why schools still need strong privacy, safety, accessibility, and evidence standards instead of blanket avoidance .
Higher ed is drawing its own lines. UC Berkeley Law has barred AI from conceptualizing, outlining, drafting, revising, translating, or editing work submitted for credit, limited AI use in research, and prohibited uploading course materials into generative AI systems .
“In the classroom, we don’t want students to write the best possible paper, but rather the best possible paper that the student is capable of.”
At the same time, some districts are choosing guidance over blocking. Dighton-Rehoboth Regional School District has explicitly avoided immediately blocking AI sites, instead focusing on responsible use, verification, and ongoing privacy education as AI features spread across already-approved tools . University of Central Florida professor Humberto López Castillo offers a similar higher-ed playbook: set expectations upfront, model acceptable use, learn from student experimentation, and use hallucinations as teachable moments to reinforce that humans remain responsible for verification .
One practical safeguard gaining traction is pre-deployment testing. Jason La Greca’s “How to Break Your Chatbot” framework is meant to stress-test student-facing bots against jailbreaks and manipulation attempts before schools release them, and one AI in Education Podcast discussion argued it is probably unethical to deploy a chatbot to students without testing it first .
The practical uses gaining traction are teacher-centered
The classroom uses with the clearest traction are not the most futuristic ones. An analysis of more than 13,000 teacher AI conversations found the dominant use cases in lesson planning, differentiation, assessment, and reflection on practice .
That matches what practitioners are describing. Dr. Sarah Thomas frames AI as a creativity amplifier: something that can automate routine work, such as turning a spoken presentation run-through into bullet-point speaker notes, while preserving teacher judgment for the parts that matter most . In the classroom, she points to students using AI as a writing tutor for formative feedback and brainstorming, with the expectation that teachers still review the work and students still verify the output . Her guardrails are practical: protect COPPA, FERPA, and student PII; read privacy policies; verify outputs; and teach students to spot errors through “find the lie” exercises across multiple models . She is equally direct that AI does not replace the relational core of teaching .
Project-based creative use is also proving more durable than simple output generation. In middle school, Jessica Pack describes using Adobe Express to teach AI citizenship alongside storytelling, moviemaking, and iterative prompting. In one example, creativity-based projects helped long-term English learners improve across reading, writing, listening, and speaking while strengthening confidence and self-concept .
At the school-model level, Primer says it is using AI first to remove administrative load from teachers, send real-time alerts when a student is struggling with a specific concept, and combine signals from direct instruction, exit tickets, virtual tutoring, and learning apps into a shared mastery dashboard for teachers, students, and families . But Primer also says it rejects a future in which AI replaces teachers and only wants student-facing tools that clear a high bar for academic outcomes and mapping to standards .
The evidence is getting clearer about both gains and distortions
Some of the week’s strongest research signals were positive. In England, a study of 259 science teachers found that ChatGPT users completed assigned tasks in 69% of the control group’s time, with equivalent output quality, and reallocated time toward more human work such as relationship-building . In a Stanford law study, professors blindly preferred Gemini 2.5 Pro responses to peer-written answers to office-hour questions 75% of the time and rated the model’s responses less harmful; Ethan Mollick added that newer models performed even better .
But the risks are now more specific, too. In one study, identical student writing received meaningfully different AI feedback depending on the demographic and motivational persona attached to the student, including less constructive criticism and more praise for some Black, Hispanic, Asian, female, and “unmotivated” profiles . A separate analysis of 370,000 college admissions essays found more stylistic diversity after ChatGPT’s arrival, but convergence around fewer original ideas, with human-written essays containing up to eight times more new ideas . Mollick made the same point more broadly: once many people use AI on the same prompt, similarities become obvious .
There are also mounting signs that overreliance can damage learning. In UC Berkeley computer science classes, failing grades rose sharply in spring 2026, and instructors pointed to increased AI reliance, weak math preparation, and understaffing as possible contributors . Teachers on Reddit describe blatant AI use as a weekly reality in some classes and say they are responding with oral assignments, in-class handwritten work, required revision histories, or assignments that require students to use AI and then critique or substantiate its output. In those threads, AI detectors were often viewed as unreliable or counterproductive .
The deeper higher-ed critique is shifting from cheating to task value. Lance Eaton argues that students’ turn to AI often reflects a widening gap between what institutions say assignments are for and what the work actually feels like, forcing a harder question about which learning experiences remain worth doing when basic production becomes cheap .
“What kinds of learning experiences are still worth showing up for when the baseline production of work is cheap, but the cost of accessing education is incredibly high?”
What This Means
- For district and school leaders: Reaching policy coverage is not the same as being operationally ready. The real work now is teacher guidance, cybersecurity capacity, vendor safety review, accessibility, and family trust .
- For universities: Expect expansion and restriction at the same time. School-wide AI access is spreading, but institutions are drawing harder lines around high-stakes writing, judgment formation, and source verification .
- For teachers and L&D teams: The strongest use cases remain planning, differentiation, formative feedback, administrative relief, and project-based creation. The weakest use case is still unsupervised substitution of student thinking .
- For edtech builders and investors: Capability is no longer enough. Products increasingly need privacy compliance, source transparency, standards mapping, accessibility information, and evidence that they can withstand misuse by students .
- For anyone thinking about the future of learning and work: The broader economic bargain is getting murkier. Chalkbeat notes that AI could reduce the performance edge of more-educated workers in some workplace-like tasks, leaving schools and learners preparing for a more uncertain payoff to traditional schooling alone .
- For self-directed and lifelong learners: AI keeps lowering the barrier to access and building. Duolingo says its model reaches learners across the socioeconomic spectrum and is expanding beyond languages into math and music, while Andrew Ng is teaching non-coders to build working web apps through iterative prompting. But both examples still point back to the same disciplines: specificity, feedback, iteration, and continued practice .
Watch This Space
- National AI capability agendas: China’s teacher-certification push, Malta’s training-gated public access model, and Jordan’s sandbox approach suggest that national AI education strategies are moving from rhetoric to implementation .
- A student-safety testing layer for AI: Common Sense Media’s new Youth AI Safety Institute aims to create safety standards and open evaluations for children’s AI products, while large partnerships are starting to publish benchmarks and districts are being urged to stress-test chatbots before deployment .
- A parallel human-skills strategy: Schools are increasingly asking not just for an AI strategy but for a human skills strategy, including authentic interaction, discussion skills, and clearer policies for what should remain face-to-face and student-generated .
- More transparent study tools: NotebookLM’s new Source Attribution shows the exact prompts and sources behind an artifact, and its mobile app now creates briefing docs and study guides on the go — a sign that study tools are moving toward auditability rather than opaque generation .
- Platform control over agentic AI: After Amazon’s win against Perplexity over AI agents logging in on behalf of users, education platforms could try to do the same with LMS access — a potentially important development for online courses and agentic study tools .
MacKenzie Price
Andrej Karpathy
liemandt
The big shift: structured AI is gaining ground over open chat
Two recent K-12 studies point in opposite directions. In Turkey, plain ChatGPT access helped high school math students with homework and made them feel they were learning more, but they underperformed classmates on tests because the system often supplied answers and reduced the mental effort learning requires . In Taipei, a five-month Python course using an AI tutor that assigned personalized problem sequences produced final exam scores 0.15 standard deviations higher than a control group on an exam taken without AI help . The lesson across both studies is that AI helps when it tailors practice and pushes students to solve problems themselves, not when it solves the task for them .
"To benefit from AI in learning you need to pivot from using AI to solve problems, to pushing you to solve problems yourself."
That design logic is now showing up in products. Microsoft’s new Learning Activities in Microsoft 365 centers on flashcards, fill-in-the-blanks, matching, and self-quizzes rather than a blank chat box . Educators can paste or upload Word, PDF, and PowerPoint materials, generate activities with controls for language, difficulty, hints, and images, preview them, edit them, and share them by link or join code . Students can retry missed items, see personal summaries, and get instant feedback and answer explanations in self-quizzes via Microsoft Forms . Microsoft also said matching games have led to better test performance in schools, though the demo described examples rather than a broader study . Access points include the Microsoft 365 app launcher, the Teach module on m365.com, and Teams Classwork with Copilot, and the app is now available globally in all languages .
Some of the strongest design advice now explicitly limits free-form AI. Alpha School co-founder MacKenzie Price said the schools do not use a chatbot interface for academics because students tend to copy questions into a bot and ask for the answer . Andrej Karpathy argued that educational AI works better when a teacher-facing app creates an auditable course artifact and a separate student app serves that course, keeping the system tied to a syllabus and project progression instead of sending learners into an unconstrained chat . Even in coding, Tech & Learning’s description of “vibe coding” frames the learning as the testing, questioning, fixing, and improving that happen after AI generates a first version .
Guardrails are tightening, especially for younger learners
New York City Schools Chancellor Kamar Samuels said the system’s draft AI guidance “missed the mark” and signaled stronger guardrails, including close consideration of restrictions for children ages 3 to 5 . He said feedback revealed not simple fear but anger, distrust in institutions, and deep skepticism toward security protections and edtech companies . The caution is not abstract: Bank Street College president Shael Polakow-Suransky described a Bronx classroom where an AI math tutor helped students reach correct answers on fractions and decimals while few showed conceptual understanding and the teacher spent time troubleshooting technology instead of teaching .
At the national level, AFT President Randi Weingarten called for a unified strategy that includes a ban on student-facing AI in elementary schools, a ban on screens in preK through grade 2 classrooms, a new safety and privacy standard for K-12 AI tools, and an independent research consortium on AI’s effects in education . She paired those limits with a push toward active, project-based, and career-oriented learning, arguing that schools need to “harness the benefits of technology while mitigating the harms” . At the same time, she said educators need enforceable guardrails and a say in how AI is used, even as the union continues to support teacher AI training .
A related debate is shifting from “critical thinking” in the abstract to what knowledge students need in order to judge AI output. Chalkbeat’s reporting on AFT-backed teacher training found that prompt-writing and accuracy checks were being emphasized, but the link between critical thinking and subject knowledge was largely missing . Daniel Willingham’s formulation remains a useful corrective:
"Domain knowledge is a crucial driver of thinking skill. Critical thinking for open-ended problems is enabled by extensive stores of knowledge."
A more constructive version of this reset is showing up in computing education. Eli Dvorkin argued that NYC needs a “CS4All 2.0” for the AI era, with less emphasis on isolated coding electives and more emphasis on computational fluency across the curriculum: helping students understand, question, and shape technology, including when and how to use AI tools . He pointed to CUNY’s CITE program, which prepares future teachers to integrate computational thinking into pedagogy, including approaches that do not depend on more screen time .
Practitioner anecdotes help explain why these debates are intensifying. In teacher forum posts, some K-12 educators said they are shifting more grades to lockdown-browser quizzes, paper assignments, or handwritten essays because projects and take-home work are becoming harder to assess meaningfully in an AI-saturated environment . Others described students delaying work so they could use AI at home or turning in what they called “AI generated slop” on digital assignments . These are anecdotal reports, not systemwide data, but they align with the policy backlash.
Higher ed is separating AI literacy from assessment integrity
In higher education, the debate is getting more specific. An EDUCAUSE survey found that most faculty, instructional designers, and professional development staff already use AI to create assessments, exam questions, and prompts, and most believe students use AI on assessments too, but only 28% said that use is always or often unauthorized . Respondents wanted flexibility from institutions: support for instructors who want to use AI and support for those who do not . They also reported sharp disagreements over practices such as AI-assisted grading, pointing to a need for shared best practices and multi-pronged support rather than a single policy memo .
The scale of the issue is clear in new faculty surveys. A College Board survey of more than 3,000 faculty found 74% report students using AI to write essays or papers, while 45% hold an overall negative view of AI in higher education . In separate AAC&U/Elon survey results cited in the same analysis, 95% of faculty said AI will make students over-reliant on technology and 78% said cheating has increased since AI became widely available . Student use is also outrunning faculty practice: one survey cited by Edtech Insiders found 92% of students versus 79% of faculty actively use AI, and 50% of students want AI-assisted feedback while only 19% of faculty currently provide it . Yet institution-wide strategy remains thin: EDUCAUSE found only 22% of institutions have an institution-wide AI strategy, and the AAUP reported only 20% of colleges and universities have published a formal AI policy .
One practical recommendation gaining traction is to stop treating AI literacy and assessment integrity as the same meeting. Mike Kentz argues institutions should run separate working groups: one focused on teaching critical, metacognitive AI use, and another focused on evaluating student thinking through portfolios, process-based assessment, project-based learning, and conversation as artifact . His broader point is that AI exposed pre-existing weaknesses in mass, standardized assessment rather than creating them from scratch .
The most actionable teaching advice in this week’s higher-ed coverage was to emphasize process over polished output. EDUCAUSE participants recommended reflections, peer review, staged submissions, and asking students to interrogate AI outputs rather than accept them, with the explicit goal of reintroducing challenge and metacognition into assignments . They also argued that humanities skills such as critical thinking, curiosity, imagination, ethical judgment, and communication matter because AI makes it more important to connect technical systems to human values and consequences .
Beyond classrooms, AI strategy is being judged by learner outcomes
Some of the more concrete AI deployments this week were outside day-to-day instruction. Brandeis University launched Faye, an AI-powered tool that calculates the precise tuition price a student would pay based on personal academic and financial information if admitted, bringing upfront price transparency to a typically opaque admissions process .
In continuing education and workforce learning, The EvoLLLution argued that buying AI chatbots or analytics tools may speed up workflows without improving completion or employment outcomes . A stronger strategy uses real-time labor-market data to update programs continuously, gives learners portable competency records, and makes non-credit and credit pathways visible to students, faculty, and employers . Cuyahoga Community College’s ASCEND initiative is piloting that model in nursing, STEM, and business by tying living competency records to labor outcomes and choosing tools only after defining student-readiness goals . This matters because job postings requiring skills such as AI collaboration and prompt fluency have surged 134% above 2020 levels, and adult learners increasingly need personalized support paired with human advising and coaching .
Across higher ed and lifelong learning, the same operating principle kept surfacing: start with the outcome and the experience, not the model. EDUCAUSE interviews with Cisco and Dell leaders stressed governance, training, trust, and human-in-the-loop design, with AI treated as a co-pilot rather than an autopilot .
What This Means
- For K-12 leaders: The strongest evidence and product signals both favor AI that is tied to teacher-selected materials, constrained activities, and visible student progress—not blank chat interfaces that make answer extraction easy .
- For early-childhood and elementary settings: Expect tighter limits on student-facing AI and screen time. District and union leaders are now openly discussing outright restrictions for the youngest learners .
- For higher ed: Treat “teach students to use AI well” and “assess student thinking well” as different operational problems. Mixing them may slow both .
- For workforce and adult learning teams: AI is starting to be judged against readiness, placement, and pathway clarity, not just internal efficiency gains .
- For edtech builders and investors: The market signal is moving toward auditability, privacy, safety, and designs that reduce cognitive offloading while preserving teacher oversight .
- For learners and families: AI can support explanation and practice, but using it in ways that let you “switch your brain off” remains the central risk .
Watch This Space
- AI-native schooling beyond the campus: Alpha Anywhere says it is now available worldwide, with high school access coming in the fall, extending Alpha School’s AI-powered academic model to homeschoolers and mobile families . The broader Alpha effort is also pushing TimeBack software for schools and a free-to-learn video game that it says will start shipping later this year .
- Computational thinking without more screen time: The CS4All reboot argument in NYC and programs like CUNY’s CITE suggest a coming wave of AI-era computing education that is less about more devices and more about judgment, pedagogy, and screen-light integration across subjects .
- New evaluation frameworks for AI in learning: Several sources are pushing beyond “did the tool give the right answer?” toward deeper measures such as understanding, question quality, evidence comparison, collaboration, creativity, agency, wellbeing, and ethical competence .
- AI as a creation tool for learners: “Vibe coding” is emerging as a way for students and teachers to build small apps and learning tools by describing ideas, then testing and debugging what AI produces. Karpathy described kids using it as a potential “gateway drug to software development” .
Andrej Karpathy
Justin Reich
Tutor-mode AI moves from principle to product
The most important development this week is that major education AI offerings are being designed less like answer engines and more like guided coaches.
Microsoft’s Study and Learn Agent is now generally available in US English for Microsoft Education license holders on A1, A3, and A5 plans at no extra cost . Its core design is explicit: it leads with questions, tracks what a student understands, gives feedback, and moves them toward an answer without simply giving it away . It supports concept help across K-12 and higher ed, builds flashcards, quizzes, matching, and fill-in-the-blanks, works from student-uploaded materials, and offers writing help without generating the essay itself . Microsoft is positioning it as a response to existing behavior: 76% of Gen Z already use AI, and more than half use it for homework . Within schools, the pitch is guardrails, visibility, and tenant-level data protection, though student access for ages 13-17 still depends on admins enabling Copilot chat .
Microsoft also says the agent is built on four learning-science pillars: scaffolded guidance from what the student already knows, productive struggle, practice through activities, and checkpoints for application and transfer .
Anthropic described a similar pattern in university settings. In Claude’s "learning mode," a student who asks for an essay does not just get one back; the system redirects toward discussion of the assignment, its themes, and what the student has not yet read or understood . Google is pushing in the same direction with NotebookLM: answers are based on uploaded sources, show citations back to exact locations, and can be paired with Learning Guide, customized quizzes and flashcards, interruptible audio overviews, custom chat instructions, multilingual outputs, and Classroom-attached notebooks . When NotebookLM is attached through Google Classroom, students can use chat and Studio, but they cannot add new sources themselves .
The limitation is still the same one educators keep surfacing: AI can support learning, but direct-answer use still invites cognitive offloading. This week’s guidance from one education Substack put it simply: use AI in tutor mode, not answer mode, and create the work first before asking AI for feedback . That advice aligns with research summaries pointing to weaker recall and reduced cognitive effort when AI does the thinking for the learner .
The teacher time dividend looks real — but uneven
The evidence on teacher productivity is getting stronger, and it is starting to look less like labor replacement than time reallocation.
In an Education Endowment Foundation trial across 68 schools, teachers using ChatGPT spent 69% of the control group’s time on lesson preparation — about 25 minutes saved per week — with no detectable quality difference in blind review . In Brazil, a randomized study across 178 public schools and roughly 19,000 students found that AI-supported essay feedback freed teachers for 35% more one-on-one writing conversations and 30% more essays written, with the biggest learning gains on the complex writing tasks that AI itself could not evaluate well .
But the gains were not uniform. AI-generated lesson conclusions were preferred 59.7% of the time, while human-designed materials were preferred at the elementary level roughly 65% of the time; AI became more competitive in middle school and outperformed human designers 59.2% of the time at high school . AI also appears especially helpful when teachers are working outside their strongest subject area .
The risks were practical, not theoretical. Teachers often did not use follow-up prompts to improve outputs, nearly half of educator-AI conversations in one analysis involved assessment-related tasks that could slip into ungrounded grading, and supports for multilingual learners, students with disabilities, and under-resourced teachers did not appear by default .
That pattern is showing up in product design. Microsoft’s Learning Zone can generate, edit, and share interactive lessons, then show educators which students struggled and which items need reinforcement . But lesson generation requires a Copilot Plus PC, even though editing, sharing, and management work on other Windows devices and student access works across devices . Google Forms’ AI add-on can build exit tickets from prompts plus source files and summarize short-answer themes for faster instructional adjustment, but that summary feature is specifically tied to short-answer responses. Edcafe AI packages lesson creation, quizzes, chatbots, auto-grading, and real-time tracking into one workflow, yet its own guidance is to start from teachers’ materials and refine outputs for tone, accuracy, and class context .
Adoption is shifting from tool choice to institutional design
This week’s strongest leadership lesson was that successful adoption now looks less like tool shopping and more like governance, co-creation, and shared norms.
Jean-Claude Brizard of Digital Promise argued that co-creation with teachers and principals helps mitigate bias, that AI literacy must include teachers, administrators, students, and parents, and that schools should decide what they want to teach before choosing certified, safe, equitable, research-based tools .
"Technology is not going to revolutionize education. People will. Teachers will. Principals will. The technology is going to be an enabler."
City Schools of Decatur offers a concrete district example. Rather than starting with curriculum purchases, the district chose a policy-first strategy centered on privacy safeguards, cybersecurity, and deliberate rollout before scaling tools . District leadership framed AI readiness as an equity issue because students still need to graduate ready for a world that includes AI, especially in a system with a large wealth achievement gap . Its professional-development model pairs newer, more AI-native teachers with veteran educators’ pedagogical expertise . The district also elevated student voice by putting students in school board roles and sending them into a national AI fellowship with Day of AI and MIT RAISE that will culminate in a student-developed national AI policy .
Justin Reich’s latest interview findings explain why shared governance matters. Drawing on 120 interviews with K-12 teachers and students, he said AI is a top-tier issue in some affluent districts but sits far lower on the list in schools dealing with chronic absenteeism, staffing shortages, and basic capacity problems . He also argues there are no established best practices yet, so schools need shared expectations and transparent local experiments rather than leaving every teacher to invent their own rules alone . Domain knowledge still matters because experts can spot weak or false output that novices miss . And the same trust logic applies to adults: transparent teacher use of AI can model honesty, while undisclosed AI use in grading risks eroding trust with students .
One more operational warning: heavily locked-down systems can drive use off-platform rather than stop it. In one school technology discussion this week, a leader described teachers turning to personal devices because sanctioned tools did not fit the work, creating a shadow-AI problem rather than solving one .
What This Means
- For K-12 leaders: The market is moving toward bounded, coach-like AI, not unrestricted chat. The more promising tools are grounded in student materials, explicit guardrails, and learning workflows that keep humans in charge .
- For higher ed and L&D teams: The better question is no longer whether to allow AI at all. It is whether the experience reinforces questioning, practice, and transfer instead of fast completion .
- For teachers: The clearest near-term payoff is time reallocation — prep, differentiation, faster formative feedback, and more student conversation — not blind grading or fully automated instruction .
- For learners: Foundational knowledge still matters because it is what lets you judge AI output. Reich made that point directly about domain expertise , and Andrej Karpathy offered the cleanest shorthand:
"you can outsource your thinking but you can't outsource your understanding"
Demis Hassabis made a similar case this week: become native with the tools, but keep building strong foundations in math, computer science, physics, biology, creativity, taste, and judgment .
- For district and institutional operators: Policy, privacy, and family communication are now product-adoption issues. If usage is invisible, it becomes hard to govern; if expectations are unclear, classrooms drift into inconsistency or shadow use .
Watch This Space
- Affective AI in classrooms: EDUCAUSE flagged early pilots of AI emotional-management systems that monitor student affect in real time. It is still a fringe signal, but it raises an immediate empowerment-versus-surveillance question .
- Trusted learning context and skills records: 1EdTech is testing principles for sharing "trusted learning context" with AI and advancing CASE and Learning and Employment Records, while University of Phoenix has already launched AI-focused badges .
- Conversational lifelong learning products: Andrew Ng’s new Code Dream preview replaces a standard course with an interruptible conversation and practice environment for modern coding agents using up-to-date documentation .
- Large-scale funding: Anthropic and the Gates Foundation launched a four-year, $200 million partnership to build AI tools across education and other social-impact areas .
Ethan Mollick
Luis von Ahn
MacKenzie Price
AI-native learning models are getting real
This week’s biggest shift is structural: AI is moving from add-on tool to the operating layer of some school models and learning platforms .
At Alpha School, leaders describe TimeBack as a background system rather than a classroom chatbot. It assesses what each student has mastered, keeps them in the zone of proximal development, and uses spaced repetition, rapid quizzing, and immediate feedback to adjust pace and difficulty . Alpha says this lets K-8 students finish core academics in about two hours a day, while guides focus on mentoring, motivation, and afternoon life-skills workshops; the school also reports double expected NWEA MAP growth .
The model is also moving beyond one flagship campus. Alpha plans Boston-area sites next year, and says a lower-cost Texas Sports Academy variant using the same two-hour academic model is showing similar two-times learning gains even though many students there come from the bottom 40% academically . But even Alpha does not present this as full automation: for kindergarten and first-grade reading, the school says students still get 30 minutes a day with a reading specialist rather than relying only on apps .
Duolingo shows the same pattern on the product side. The company says two non-engineers with no chess knowledge used AI tools to prototype a full chess course in about six months, and that course now has 7 million daily active users . More broadly, Duolingo says AI enabled 50 times more content production in the last two years than in its first 12 years, while product managers increasingly bring working prototypes instead of written documents .
Access is widening too. Duolingo says AI conversation practice is moving from its highest-priced tier toward cheaper tiers and likely free access . Google’s Gemini is pushing in a similar direction with free SAT practice tests built with Princeton Review resources and instant feedback, on top of its LearnLM tutoring system, with expansion planned to other exams such as India’s Joint Entrance Examination . A separate study at two Thai universities found Gemini ahead of ChatGPT for English learners’ academic writing on multimodal feedback and source integration .
The limitation is just as clear. Duolingo’s CEO says motivation remains the hardest part of learning, and teachers still outperform AI on inspiration and context . Tech & Learning also notes that subtle inaccuracies remain a real risk in AI-based test prep .
Assessment is being redesigned around evidence, not output
A second major theme is that institutions are treating AI not just as an integrity problem, but as a signal that many assignments no longer show what a learner actually knows.
The EvoLLLution argues that if faculty cannot tell whether an assignment reflects student understanding, the problem is the assignment design, not AI alone . Its proposed response is to move toward real-time interaction, dialogue, explanations, process evidence, and work valued for interpretation and judgment rather than word count or format .
Ethan Mollick’s practical playbook points in the same direction. He argues that the “homework apocalypse” is real, but manageable if schools rely more on in-class writing, frequent testing, and flipped classrooms . He also draws a sharper product distinction: AI that simply gives answers can make students think they learned when they did not, while AI that acts as a tutor — quizzing, personalizing, and withholding direct answers — shows strong gains in controlled studies, including work from Taiwan, Kenya, Harvard, and Stanford .
That is why interface design now matters. Mollick called the quiet removal of ChatGPT’s visible Study Mode a mistake because parents and teachers need an easy way to steer students toward tutor behavior rather than answer behavior; OpenAI later said the feature still exists through /study and /learn, but Mollick noted that slash commands are not intuitive for most users .
"We really need to embrace AI as a teammate, not in place of any human"
VLACS is translating that idea into assessment design. Leaders are exploring discussion-based assessments where AI can raise additional questions while instructors verify learning, and they argue that if AI can complete a course, the course itself may need to shift toward more authentic, project-based work .
In higher ed more broadly, EDUCAUSE says AI is no longer a future trend but an accepted part of teaching and learning that is actively reshaping the student-instructor relationship, especially as students question why instructors may use AI while students face more prohibitive guidance. The report frames the broader challenge as a push-pull between empowerment and surveillance .
The strongest deployments are controlled, narrow, and human-accountable
The most convincing operational examples this week were not open-ended chatbots. They were bounded systems with defined knowledge bases, staff workflows, and clear human responsibility.
At Lone Star College, an internal advising chatbot built from about 250 validated questions across advising, admissions, and financial aid has handled more than 70,000 student conversations in a year, saving nearly 10,000 advisor hours by taking repetitive informational questions off advisors’ plates . Advisors were told from the start that the bot was an assistant rather than a replacement, and the college says the system is holding at roughly 96% accuracy on its top questions .
Instructional design teams are also using AI to create richer course materials. At Belmont, designers are using AI to turn faculty experiences into interactive empathy interviews — scripts from AI, lifelike audio from ElevenLabs, and images from ChatGPT or Claude — so activities that once took days or weeks to stage can be produced and refined quickly .
In the Netherlands, the EduGenAI pilot is giving higher education and vocational institutions a local AI environment — mostly open-source models hosted locally, plus some commercial options — to experiment safely while retaining control over data storage, model use, and allowed use cases . The project is also shaped by European rules that classify some educational uses, such as checking student work, as high risk and subject to certification . Just as important, the pilot is framed as institutional learning, not just tool testing: participants get AI-literacy training, policy support, and space to work through tensions between privacy, transparency, and student autonomy .
That same need for institutional capacity is showing up in faculty practice. A new OER+AI framework offers six entry points — Curate, Contextualize, Co-create, Cultivate, Amplify, and Sustain — to help faculty use generative AI in open educational resources while staying focused on pedagogy, copyright, and human-centric design . The authors also warn that institutions without real GenAI capacity risk creating a split between a small group of pioneers and a much larger sidelined faculty majority .
What This Means
- For K-12 leaders: The boldest AI stories are no longer just about a classroom app. They are about redesigned schedules, staffing, and school models. But even the most AI-forward examples still keep humans central for motivation, mentoring, and early literacy .
- For higher ed: If AI can produce acceptable assignments, assessment has to move toward explanation, dialogue, process, and authentic application. That is a design challenge more than a detection challenge .
- For student support teams: Narrowly scoped assistants can create real capacity when the knowledge base is validated and staff remain accountable. Lone Star’s advising bot is a strong example of AI taking repetitive work so humans can spend more time on guidance and relationships .
- For self-directed learners: Access is improving fast — free SAT practice in Gemini, cheaper AI conversation practice in Duolingo — but reliability and learning mode still matter. A tutor that questions you is safer than a chatbot that finishes the task for you .
- For institutional buyers and regulators: Controlled environments are becoming a competitive advantage. The Dutch EduGenAI pilot shows why institutions want sovereignty over data, clearer permitted use cases, and room to test tools before broad rollout .
- For L&D teams and lifelong learners: AI is lowering the cost of building custom tools and learning products. Andrew Ng argues this means more people should learn to prompt and use AI to code, not fewer, including people in nontechnical roles .
Watch This Space
- Platform consolidation around continuous learning: Andrew Ng says Coursera and Udemy have merged to build a broader skills platform as AI changes work and increases demand for continuous, job-relevant learning .
- State-scale deployments: Dan Hart described Educhat at the New South Wales Department of Education as serving half a million students, a reminder that some of the largest AI rollouts are happening inside public systems, not just consumer apps .
- Non-engineer builders and AI-native subject expansion: Duolingo’s chess launch and Andrew Ng’s examples of marketers and recruiters building custom tools both point to a future where smaller teams can create learning products and internal workflows faster .
- New delivery formats: Alpha says a Fortnite-style game built on top of its learning platform is already in pilot, with broader rollout expected to surface in August .
- Sector-specific AI infrastructure: The Dutch EduGenAI pilots are explicitly being used to decide what the final ecosystem should look like, making them a useful model to watch for other system-level deployments .
OpenAI
Sal Khan
MacKenzie Price
Brain-first AI becomes the default design rule
The most important development this week is a sharper distinction between AI that supports learning and AI that substitutes for it. Stanford’s SCALE Initiative reviewed 20 causal studies on AI in K-12 and found that AI improves performance while students have access to it, but those gains weaken or disappear on independent assessment . In one randomized study, AI users led by 22 points on immediate recall and comprehension, but the gap fell to 6 points after three weeks and was no longer significant; on synthesis, evaluation, and application, the no-AI control group led at every timepoint . Other studies in the same review found better essay scores without knowledge transfer , weaker memory-related brain activity and recall among ChatGPT users , and a shift from “write, reread, evaluate, revise” toward “ask AI, accept output, ask again” .
“Brain first. AI second.”
That phrase matched a broader classroom consensus. The MIT Media Lab writing study discussed by Philip Seyfried and Vicki Davis found an advantage for students who started with their own thinking and brought in AI later, leading to a “yes-and” approach: keep writers’ notebooks, paper books, and partner talk, then use AI to accelerate thinking after the early work is done .
The strongest pro-AI evidence points in the same direction. Sal Khan described Khanmigo as a tutor that gives hints, examples, and nudges rather than direct answers . He also said only about 10-15% of students can use AI constructively on their own; many need help learning what to ask and how to engage . In an independent study in India, Khan Academy plus a human “lab in charge” produced roughly a half-standard-deviation gain in seven months versus Khan Academy alone . Ethan Mollick noted that peer-reviewed meta-analyses still find positive effects of GenAI on learning, with the strongest evidence coming from randomized AI tutor interventions .
The practical design rules are getting clearer: protect the first attempt, force self-assessment before AI access, use AI to reduce extraneous load rather than remove productive struggle, and give teachers visibility into prompts so they can spot substitution early . In AI-rich classrooms, that raises rather than lowers the value of teacher content knowledge, because someone still has to validate accuracy, sequence knowledge, interpret nuance, and design assessments that require judgment instead of surface correctness .
Schools are moving from generic chatbots to grounded workflows
Some of the clearest implementations this week came from schools that constrained AI tightly; even strong AI-school advocates are now separating structured mastery systems from open-ended chatbots . At Oran Park Anglican College, teachers built NotebookLM “brains” around curriculum documents, syllabi, universal design for learning principles, and writing scaffolds so outputs stay aligned with pedagogy and inclusion goals . The school says AI use is saving about 52 hours per week on average while improving pedagogy, and every tool goes through staff training and a six-month pilot before wider rollout .
Student use is being structured just as deliberately. Oran Park launched a simple assessment scale — no AI, AI as assistant, AI as collaborator — so teachers can decide when and how AI is allowed . At Roseville College, surveys found that 90% of students had used large language models and 56% were using them at least weekly, but only 23% had seen teachers model strong use . That gap is pushing schools toward process-based assessment: one geography task now locks students’ research folders before class, lets them use AI to prepare materials, and then requires in-class work that reveals whether they actually engaged with the content . A similar framing is emerging in policy design: once AI is allowed, the more useful question becomes whether a student engaged with it maturely — with enough effort, thinking, and persistence — not whether every imperfect interaction is an ethical failing .
Other classroom practices follow the same logic. Teacher Nathan Jones encourages AI for scaffolds, checklists, and task breakdowns, asks students to attach prompt appendices, and explicitly teaches verification because hallucinations still happen . That is one reason AI detectors are losing ground: teachers are shifting toward monitored class-time use and process conversations rather than trying to infer authorship after the fact . Wayground’s new accommodations feature lets one staff member enter IEP-based accommodations once and apply them automatically across all of a student’s activities, reducing teacher overhead while keeping support consistent .
On the product side, the most useful capabilities are increasingly source-grounded and editable rather than one-shot magic. NotebookLM’s new auto-labeling organizes large notebooks into overlapping categories and lets users focus chats or generated artifacts on selected source groups . Gemini Canvas lets teachers build custom interactive tools without manual coding, but the educator stays “in the lead” and iterates the result . Microsoft’s Teach module can generate standards-aligned Minecraft Education lesson plans, yet the output is still a draft that educators edit, enhance, and save into their own workflow . Ethan Mollick’s warning on an AI-built physics simulation captures the limitation: even when something looks good after cursory checks, it still needs deeper verification before being used to teach students .
AI fluency is becoming a core literacy
Higher education and K-12 are both moving beyond “can students use the tool?” toward “do they understand what the tool is doing?” Agnes Scott College will embed a three-part AI curriculum in the first-year experience starting in fall 2026, treating AI fluency as a core literacy alongside writing and quantitative reasoning . Its distinction is useful: competency is basic operation; fluency means understanding limits, bias, ethics, and who benefits or bears the costs when AI is deployed . The school argues that practical skills matter, but not at the expense of judgment, critical analysis, and accountability .
That broader literacy need is also showing up in K-12. At Philadelphia’s Marian Anderson Neighborhood Academy, middle schoolers researched AI’s effects on education, government, creativity, and the environment. Students described both upside — such as writing help or creative experimentation in Roblox and video editing — and downside, including cheating and the sense that AI is now hard to avoid because search engines surface AI overviews by default . School leaders framed the work as an ongoing dialogue with families, educators, and public officials, not a one-off tech lesson .
Institutions are starting to back that shift with money and policy. The U.S. Department of Education is giving preference in discretionary grants to proposals that expand AI literacy, support ethical use, and improve student outcomes . Google is putting $10 million into AI skills across Asia-Pacific, explicitly centering teachers as the people who translate tools into classroom impact .
Adoption is colliding with privacy, trust, and procurement
The strongest policy signal came from New York City. The city recently canceled plans for a selective AI-focused high school after community criticism of both its admissions process and AI use in the classroom . After that proposal was nixed, more than 100 New Yorkers demanded an AI moratorium at a marathon board meeting, echoing broader concerns about transparency, safety, and oversight .
Those concerns are not happening in a vacuum. NYC’s new AI framework could expand AI use for tasks like lesson planning and translating materials for bilingual learners, but a state audit found the district lacks a complete inventory of third-party software and has already faced multiple breaches, including PowerSchool and Illuminate incidents . Critics argue the framework raises privacy risks further and that the city should strengthen in-house infrastructure and oversight before accelerating AI adoption .
At the state level, the conversation is also getting more concrete. Vermont’s proposed H.650 would require edtech providers to register and be reviewed on design features including AI, geotracking, and targeted advertising . Rhode Island’s Safe School Technology Act would restrict providers from activating audio or video functions outside school activities and ban use of location data . As Andrew Marcinek argued, the answer is not simply to ban technology, but to build more intentional programs and communicate more clearly with parents . Employers including Microsoft and Anthropic are already signaling demand for workers who understand AI alongside strong soft skills .
What This Means
- For classroom design: Build assignments so students think before AI enters the room. Independent brainstorming, first drafts, or problem attempts followed by AI critique fits the strongest evidence better than AI-first generation .
- For school leaders: The safer bet is not “allow AI” or “ban AI,” but structured use: assessment scales, prompt visibility, teacher modeling, and pilot-based rollout .
- For higher ed and L&D teams: AI training should move beyond prompt tips toward fluency — limits, bias, evaluation, and when not to delegate the work .
- For product teams and buyers: Grounding matters. Tools tied to curriculum docs, standards, or user-provided sources are showing clearer value than open-ended chat, but they still need expert review for accuracy and alignment .
- For policymakers and families: Privacy and trust are now adoption constraints, not side issues. If districts cannot account for third-party tools or data exposure, AI expansion will keep meeting resistance .
- For self-directed learners: AI can help organize complex projects and materials, but tools that remove all friction may also remove some of the struggle that produces learning .
Watch This Space
- Source-grounded personal study spaces: Gemini Notebooks are being positioned as a place to gather drafts, requirements, and deadlines in one AI-assisted workspace . Lance Eaton describes a similar agentic pattern: semantically analyzing 200+ course materials, tagging them, mapping connections, and building a custom reader with recommendations — useful for reducing organizational friction, but still in tension with learning’s need for deliberate friction .
- Young builders using AI for real work: OpenAI’s ChatGPT Futures highlighted students using AI to map 1.5 million previously unknown objects in space, make 100 million galaxy images searchable, detect disaster survivors through walls and debris, preserve endangered languages, and reroute more than 5 million pounds of unsold inventory from landfills . In Philly, middle schoolers are already using AI for Roblox game coding and video editing outside school .
- Rules for AI in schools: Watch for more states to move beyond generic AI statements and toward product-level requirements covering design features, data access, and device permissions .
Andrew Ng
Andrej Karpathy
Sal Khan
AI tutors are separating from AI shortcuts
The clearest signal this week is that education is getting more specific about how AI should be used. The strongest evidence does not support generic "use AI to study" behavior. Ethan Mollick pointed to randomized evidence showing that broad AI-assisted studying hurts learning and retention, while AI prompted to act as a tutor — especially with teacher support — can produce large positive learning effects .
Syracuse University saw the same pattern in a real deployment. Claude initially generated multiple-choice practice from lecture recordings, but students who used it heavily were not outperforming peers. After faculty changed the prompt so Claude asked short-answer questions and gave feedback on what students got wrong, average exam scores jumped by 12 points .
That distinction matters beyond one campus. In the OpenAI podcast, researchers described ChatGPT as unusually good at tailoring explanations to a learner's background, generating follow-up questions, and making solitary STEM study feel more interactive — but they also warned that expertise and hard work still matter if learners want deep understanding rather than a polished surface answer .
"You can outsource your thinking but you can’t outsource your understanding."
That line from Andrej Karpathy captures the week well. AI is increasingly useful as a coach, explainer, and practice partner — but the evidence is getting sharper that it works best when it keeps the learner cognitively engaged, not when it removes the struggle entirely .
AI-era degrees and skilling pathways are moving into the mainstream
The biggest higher-ed announcement came from Khan Academy. Sal Khan said the new Khan TED Institute will offer an online bachelor's degree in AI for under $10,000 total, developed with input from employers including Google, Microsoft, McKinsey, Bain, Accenture, and Replit . The program combines core academic subjects with AI-era portfolio work such as vibe coding, research, analysis, and AI art, while using Schoolhouse.world for peer tutoring, dialogue, and small-group collaboration .
Khan also put an important limit on the idea: the program is meant to complement traditional higher education, not replace it, and he said it will not be the right fit for everyone .
The broader market is moving in the same direction. Coursera said it is seeing one AI course enrollment every four seconds globally, while enrollments in critical thinking courses are up 184% year over year — faster growth than AI courses themselves . Its AI tutor, Coach, is being used for questions, roleplay, and quiz prep, and the company says engagement is especially high among women and first-generation students . Coursera also says lifelong learning is replacing the old degree-then-work model, with enterprises asking for ongoing skill development and one global professional services firm upskilling 5,000+ employees into specialized AI-centered roles .
Andrew Ng's new AI Prompting for Everyone course fits the same shift: AI use is no longer being framed as a niche technical skill, but as a general capability that includes deep research, giving models more context, asking them to think through important decisions, and knowing when not to trust an answer .
K-12 adoption is getting more practical, accessible, and teacher-shaped
In K-12, the strongest stories were not about fully autonomous classrooms. They were about teachers using AI for specific tasks that save time, widen access, or make learning more interactive.
A Stanford analysis of 150,000+ prompts from 4,400+ K-12 teachers offers a useful baseline. More than 40% of prompts focused on curriculum and personalizing learning, more than 50% asked AI to generate materials such as lesson plans or assessments, and about 1 in 7 used AI as a sounding board for reflection or working through instructional problems . Half of teacher-AI conversations were under 10 prompts, suggesting short, practical interactions rather than long dependency loops .
At Leo Academy Trust in the UK, Google's AI Works training was rolled out to all staff, and teachers reported savings of 2.9 hours per week. The examples were concrete: bespoke poems and reading materials, math challenge questions, simplified texts, storybooks for children transitioning back to school, and easier data analysis across schools . The same school described live translated captions in class, safer research via controlled tabs, and SEND supports such as screen masks, voice notes, and accessibility tools that increased student independence and confidence .
Google is also pushing AI deeper into classroom workflows. Gemini Canvas is positioned as an active-learning tool that can generate flawed writing for whole-class editing, interactive quizzes with hints and feedback, classroom simulations, and simple coded tools like random group generators . On the admin side, Google Workspace Studio lets staff build no-code flows across Gmail, Drive, Docs, Sheets, and Calendar — but only inside managed school accounts, with admin enablement, and for adults rather than students .
The practical-accessibility layer is expanding too. Google Meet now supports translated captions in 70 languages, while Gemini notes can transcribe and summarize sessions so students can review what they missed later . DeepMind's Experience AI program, built with the Raspberry Pi Foundation, says it has already trained 30,000+ teachers and reached 2.9 million students across 180 countries in 19 languages, with a Latin America expansion planned through 2028 .
The important caveat: more powerful tools also create new quality risks. EDUCAUSE warned that AI-generated slide decks can break reading order, color contrast, and alt-text expectations, while non-experts using AI to build tools can ship inaccessible products if accessibility is treated as an afterthought . Their advice was blunt: AI can help with accommodations and universal design, but there is no magic wand for accessibility compliance .
Governance, accessibility, and market skepticism are becoming adoption bottlenecks
This week's policy story was New York City. More than 100 New Yorkers testified at a marathon school board meeting to demand a moratorium on AI use in public schools, even though AI was not formally on the agenda . Their concerns centered on unclear rules, limited transparency, surveillance, and the sense that AI was being rolled out before the system understood its implications .
That pressure is already changing policy. NYC withdrew its proposal for an AI-focused high school after opposition over screened admissions, equity, corporate influence, and the rushed process . The city's full AI policy is expected at the end of June, with public input on the early framework open through May 8 .
At the same time, institutions are being asked to make higher-quality buying decisions in a noisier market. Chalkbeat found that five superintendents shared 90 marketing messages from 79 companies in one day, including 17 pitches tied to generative AI . Leaders described the energy drain, weak product fit, and real integration and training costs when new systems are introduced badly .
That helps explain why some education leaders say the hype cycle is cooling. Carl Hooker said educator enthusiasm has moved into the Gartner "zone of disillusionment," with leaders becoming more cautious about AI spending, more aware of limitations, and more concerned about environmental impact and post-ESSER device constraints .
Accessibility rules are tightening in parallel. EDUCAUSE said higher-ed AI tools must meet WCAG 2.1 AA expectations under impending ADA Title II requirements, and should be reviewed like other third-party vendors, especially when they have broad reach across campus . In other words: governance is no longer a side conversation. It is becoming part of product design, procurement, and deployment.
What This Means
- For learning designers: The biggest practical takeaway is simple: tutor mode beats shortcut mode. The best results this week came from AI that asked better questions, gave feedback, and kept learners engaged in the work .
- For higher ed and L&D teams: AI skill demand is rising, but so is demand for critical thinking, collaboration, and self-direction. The growth in critical-thinking enrollments alongside AI course demand is a strong signal that employers and learners want both .
- For K-12 leaders: Current wins are concentrated in resource creation, differentiation, translation, accessibility, and admin automation — especially when tools are implemented inside managed systems with training and oversight .
- For edtech buyers and investors: Feature velocity is no longer enough. Schools are asking harder questions about evidence, accessibility, implementation cost, and whether a tool fits real workflows rather than adding another layer of noise .
- For policymakers and compliance teams: Community trust can slow or stop adoption. Equity, privacy, accessibility, and transparency are now core product risks, not peripheral objections .
Watch This Space
- AI-native credentials: Khan TED Institute is the highest-profile example this week, but similar models are emerging, including the UK's AI-native London School of Innovation for postgraduate reskilling .
- Source-grounded study spaces: NotebookLM's deeper integration with the Gemini app points toward more study workflows built around personal notebooks and user-provided materials rather than open-ended chat .
- Learners as builders: More examples are surfacing of students using AI to make things — from a 5-year-old prompting a typing game to student-built AP prep resources and Oak Ridge students using AI in advanced manufacturing and AR projects .
- Governance as a growth market: NYC's coming AI policy, the spread of state AI bills, and new policy training like the free edX MOOC on governing education in the age of AI all point to rising demand for implementation playbooks, not just tools .
- AI workflow layers for educators: Tools like Workspace Studio suggest the next step is not just generating content, but automating recurring school workflows — with access still constrained by admin settings, account type, and governance rules .
liemandt
MacKenzie Price
AI-native learning models are moving from experiment to expansion
The clearest development this week is that AI-native schooling is being described less as a pilot and more as a scalable model. In public interviews, Alpha School leaders said their approach uses AI-delivered one-to-one tutoring, mastery thresholds around 80-85%, spaced repetition, and short work blocks to help students learn roughly twice as much academic material in about two hours a day, then spend the rest of the day in workshops, projects, sports, and social learning .
Alpha also attached the model to outcomes and expansion. Its leaders said students score in the top 1% on standardized tests, freshmen average above 1400 on the SAT, more than 90% say they love school, and 43% said they would rather go to school than go on vacation . They also said the model is widening access through Texas education savings accounts and related programs, with thousands of students from families under $65,000 expected to join through Texas Sports Academy, and that 50+ Alpha schools are set to open .
A separate first-hand account from a public-school teacher who shadowed an Alpha campus adds texture to those claims. She reported seeing 3rd-6th graders collaborate, troubleshoot, track their own progress, schedule coaching when stuck, and present with unusual confidence and independence . She also noted that the two hours of screen-based academics were less device time than many traditional schools use, with later screen use reserved for purposeful tasks like design and research .
The important caveat is that Alpha's own leaders and outside commentators framed this as a school-model redesign, not a chatbot rollout. Alpha leaders called most chatbot use in schools "cheat bots," warned about cognitive offloading, and argued that layering AI onto a conventional classroom is likely to repeat the broader failure pattern of edtech . Michael Horn made the same point more broadly: AI can raise the floor and ceiling when the school model is rethought, but it can also lower the floor when used without deliberate design .
"A good AI tool ... is one that actually increases human connectivity, not decreases it."
This time-compression logic is showing up beyond K-12. Austen Allred said Gauntlet AI rewrites its 10-week curriculum from cohort to cohort because AI changes that quickly, spends up to an hour a day absorbing what changed in the prior 24 hours, and is seeing employers hire graduates directly .
The most useful classroom AI is narrow, grounded, and supervised
The strongest practical deployments this week were not generic assistants. They were tightly scoped systems attached to specific school problems.
In Northern Ireland, a six-month C2k/CDK pilot found teachers saved an average of about 35 hours a month with Gemini, freeing more time for direct student interaction and improving well-being . The reported gains came from bounded tasks: updating planners across primary grades in minutes instead of weeks, drafting policy-grounded parent emails, and reducing exam-analysis work from 5-7 hours to under 90 minutes . The same pilot surfaced student-facing benefits, including a teacher using NotebookLM-generated mind maps, audio, FAQs, and revision materials to help a student with ASD pass a GCSE, and stronger reported impact in Irish-medium settings because Gemini enabled work teachers previously could not do another way .
At a UK further-education college, Vice Principal Chris Love Day described a similar design philosophy taken further. His team built 21 bespoke agents, including a school-owned LLM that blocks harmful topics, logs both prompts and outputs, and flags serious safeguarding issues to the designated safeguarding lead . The college made that system free for 5,000 students to address digital-equity gaps, and built "Barton Buddy" as a student assistant that answers questions from school systems and policies rather than the open internet . That assistant is used heavily on evenings and weekends, effectively extending access to information and well-being guidance when staff are offline .
What is striking is how often the successful uses were small and operational. Educators in the same discussion described building self-marking recall quizzes in Canva, an interactive parents' evening map in Claude, and policy or options-process chatbots grounded in school documents . Their rollout advice was consistent: start with policy, solve a real problem, keep a human in the loop, red-team the tool before release, and train both staff and students iteratively rather than once .
Accessibility was another concrete theme. At Epsom & Ewell High School, support assistant Dawn Knight described using Microsoft Teams captions and AI-generated transcripts, word fills, simplified texts, and checklists to support deaf and SEN learners, while accessibility exceptions allow deaf students to use phones or iPads with the Roger On app to route a teacher's voice directly to hearing aids or cochlear implants . Her framing was pragmatic: AI saves time, but that time is then reinvested in more student interaction and differentiated support .
The major platforms are moving in the same direction. Google presenters emphasized NotebookLM as a source-grounded tool that creates audio and video overviews, flashcards, infographics, and mind maps from uploaded materials, while Classroom now lets teachers assign Gems and NotebookLM resources directly to students . Recent NotebookLM updates added saved quiz progress, mastery tracking, and automatic source categorization, all of which push it further toward structured study rather than generic chat .
Assessment is shifting from polished output to visible thinking
The pressure on traditional academic work keeps rising. Ethan Mollick reported that GPT-5.5-powered Codex processed hundreds of research files, generated a new hypothesis, ran sophisticated statistical tests, built a real literature review, and produced what he described as a near-PhD-quality academic paper from four prompts without manual editing . His conclusion elsewhere was blunt: systems regulated by the fact that they were effortful for humans — including essays and letters of recommendation — will break .
Higher education is responding by moving attention away from finished artifacts and toward process, judgment, reflection, and applied thinking . EDUCAUSE speakers described asking students to document which prompts they used, what AI output they accepted or rejected, and how the interaction changed their thinking; they also highlighted verbal explanations and student-made videos as ways to surface process rather than just output .
"AI is an interface that produces plausible text, not truth."
That framing is becoming more important because the practical risks are now clearer. Researchers discussing Microsoft guidance on AI-supported research workflows warned about "appropriate reliance," miscalibration when AI output looks smarter than it is, the need to test prompt robustness, and the importance of documenting model, date, prompts, and custom instructions as if they were lab methods . They also warned about "cognitive debt" when users offload too much of the thinking itself .
Several educators are answering that problem by adding friction back in. One essay this week argued that the default "Helpful Assistant" is the worst possible mode for thinking because it rewards passivity and produces polished output without requiring cognitive work . Others described using AI to ask harder questions, challenge assumptions, and push clarification instead of merely generating answers . That aligns with a broader view from Khan Academy: practice drives learning, and AI should support it rather than replace it .
There is also growing skepticism about AI as a writing partner. One educator argued that while grammar has improved, student writing has become more boring, less individualized, and less meaningful over the last few years, leading him to conclude that AI's writing role has been oversold .
Governance is becoming operational, not just rhetorical
Another theme this week: AI governance in education is getting more technical. AffectLog proposes federated risk analytics with differential privacy, local computation of risk metrics, and auditing across 300+ constraints drawn from frameworks including the EU AI Act, NIST RMF, ISO/IEC 42001, and GDPR Article 22 . Its core argument is that pre-deployment audits are not enough because the most consequential failure modes often emerge only when systems meet real users, real behavior, and real environments at scale .
That argument is easy to understand when paired with a classroom example. An independent audit of Wayground's AI quiz generator found that the biology questions were accurate, but the NGSS alignment metadata was fabricated: the tool generated standard descriptions that sounded plausible but did not match the actual standards . The risk is not just bad pedagogy; it is bad compliance information entering lesson plans, curriculum documents, and procurement decisions with the same confidence as correct content .
The wider field seems to be converging on the same middle ground. Edtech Insiders described a sector where product security, privacy work, and pedagogy are improving even as public patience thins, and argued for an AI "harness" that provides enough steering, safety, and learning friction to make the tools genuinely useful . New America, as cited in the same roundup, warned that blanket restrictions on classroom technology can backfire by limiting access and widening inequities, making nuanced policy a practical necessity rather than a nice-to-have .
What This Means
- For school leaders: The strongest implementations are redesigns, not overlays. Alpha, Northern Ireland's Gemini pilot, and UK school-built agents all point to the same pattern: bounded tools tied to a clear model or workflow outperform generic "AI for everything" deployments .
- For higher ed and assessment teams: The artifact economy is under real pressure. If AI can now produce credible research papers and polished prose at low cost, the defensible move is to collect evidence of process, reflection, judgment, and revision — not just the final file .
- For inclusion, SEN, and accessibility leaders: AI is already improving access when it is grounded in actual learner needs — captions, transcripts, differentiated texts, mind maps, audio study aids, and policy- or curriculum-specific supports are doing more than generic chatbots for many students .
- For product buyers and investors: The evidence bar is rising. Look for measured time savings, real outcome signals, source grounding, safety instrumentation, and documented human review. Also inspect hidden layers such as standards alignment and metadata, not just the surface quality of the output .
- For learners and L&D teams: AI seems most valuable when it accelerates practice, feedback, and iteration without outsourcing judgment. Knowledge still matters, and so does the ability to notice when a system is plausible, polished, and wrong .
Watch This Space
- Shared institutional agents: OpenAI's workspace agents are now in research preview for ChatGPT Edu and Teachers plans, signaling a move from one-off prompts toward shared agents that coordinate across tools and keep work moving over time .
- Evidence-backed classroom AI: Watch for more tools that anchor themselves in specific parts of teaching and publish clearer impact claims. This week's examples included Coursemojo's focus on the hardest-thinking part of ELA lessons and Nectir's reported 7.5% campuswide GPA lift in a peer-reviewed study .
- Faster, shorter workforce learning cycles: Gauntlet's constant curriculum rewrites and next cohort of about 100 learners suggest that AI-era workforce programs may keep compressing time-to-skill while updating far more frequently than conventional courses .
- Procurement-grade AI governance: Continuous monitoring, federated auditing, and routine verification of standards claims may become part of edtech buying, especially as more tools influence grading, support, and compliance workflows .