“Gen AI: Too Much Spend, Too Little Benefit” What To Fix Now

Goldman Sachs’ newsletter titled “Gen AI: Too Much Spend, Too Little Benefit” exposes a costly truth. Discover simple ways to turn it around.


Is your Gen AI budget growing faster than your results? Are you spending more each quarter but seeing little to show for it? The latest Goldman Sachs research notes that many companies have increased Gen AI investment, yet most still report unclear or limited ROI. Spending keeps rising, but benefits remain uneven.

This blog explores why the pattern of too much spend and too little benefit is now so common in Gen AI. You will see what teams are missing, why workflows break, and how small gaps compound into high costs. The goal is to give you clear fixes that any developer, creator or PM can apply.

What is causing the gap between investment and outcome? More importantly, how can you fix it now? Let’s break it down further.

If you only have 30 seconds, start here:

  • Most Gen AI waste begins early. Teams start projects without clear goals. This leads to uncontrolled experimentation and rising costs.
  • Model choices shape your spend. Large models get picked for simple tasks. This increases compute use and creates extra work.
  • Workflow fragmentation creates hidden expenses. Switching tools forces repeated generation. It also causes inconsistent outputs and longer cleanup time.
  • Lack of visibility slows you down. You cannot track output quality or run time. This makes it hard to correct problems quickly.
  • Structure prevents the waste cycle. Unified tools and defined quality targets keep your workflow predictable. This reduces repeat runs and stabilizes cost.

Why Gen AI: too much spend, too little benefit remains a growing challenge

Gen AI budgets keep rising across industries. Yet many teams still struggle to see clear results. The Goldman Sachs report highlights that spending continues to climb, but value remains hard to measure for most firms. This challenge affects developers, creators and PMs across sectors.

Global economic conditions add to the push. Real GDP growth is expected to reach 2.7 percent in 2024. Higher household income and better manufacturing cycles encourage more Gen AI experimentation. The pressure begins when adoption grows faster than the systems needed to manage it.

A key insight from the report is direct. Your problem is rarely the Gen AI models. The real strain comes from weak processes, poor tracking and workflows that waste compute and time. Many companies in the survey admit they still cannot measure ROI with confidence.

To make the challenge clear, here are the main friction points you encounter:

Where the pressure builds:

  • Gen AI projects scale before you define goals.
  • Teams run costly experiments without guardrails.
  • Model outputs vary, which leads to repeat runs.
  • Tooling stays scattered across different platforms.
  • Tracking performance, cost and output consistency remains difficult.

Why this turns into high spend and low benefit:

  • Workflows rely on manual steps and repeated model calls.
  • There is no unified structure to compare model performance.
  • Assets are not reused, so teams generate the same content multiple times.
  • Decision-making becomes slower because no one has clear metrics.

The challenge grows when your processes and workflows lag behind your ambition. The result is predictable. You spend more. You gain less.

You can trace the pressure points through three areas.

1. Rising experimentation costs
Teams test multiple models without cost controls. Each model run increases spend, especially when teams repeat tasks instead of reusing assets. This behavior compounds during periods of economic uncertainty, when businesses hesitate to impose strict guardrails.

2. Limited performance visibility
You may not have a clear method to compare model quality, speed or reliability. Without this visibility, you continue using expensive models even when simpler options work.

3. Workflow fragmentation
Developers and creators switch between tools, scripts and APIs. Each step adds latency, manual effort and repeat runs. These workflow gaps amplify the pattern of high spend and low benefit.

Gen AI pushes organizations forward, but without the right structure, it also introduces waste. Your goal is to identify where this waste begins and prevent the same issues from repeating.

Try a PixelFlow chain and see your workflow speed up instantly.

The root causes of Gen AI: too much spend, too little benefit for your team

You often feel the gap between high Gen AI spend and low returns because the foundations are weak. The Goldman Sachs report highlights that most companies accelerate Gen AI adoption before building the structures needed to support it. When expectations rise faster than readiness, your team ends up with high cost and low payoff.

1. Lack of clear business use cases
You move into Gen AI projects without a focused problem to solve. This leads to broad experimentation, unclear goals and oversized workloads. Each new experiment adds compute cost and increases the chance of rework because no one agrees on the intended outcome.

2. Models chosen without cost or performance trade-offs
You may select advanced models because they appear powerful, even when you do not need that level of complexity. Larger models demand more compute, run slower and cost more per output. Using them for simple tasks raises spend without improving results.

3. Disconnected tool chains and workflows
Your team jumps between notebooks, APIs, scripts and design tools. Each switch adds friction. Outputs vary, so you rerun models to match formatting or quality. This creates repeat fees and slows progress because no single workflow keeps your tasks consistent.

4. Skills gaps and unclear governance of Gen AI spend
Teams often lack the expertise to judge model quality or cost efficiency. Without governance, no one sets usage limits or monitors spending patterns. As a result, you see surprise bills, inconsistent decisions and repeated trial runs that inflate costs.

Also Read: The Most Common Types of Generative AI

Common mistakes that inflate Gen AI costs

You tend to see these mistakes when Gen AI programs scale without structure. Each one adds to the pattern of high spend and low benefit.

Cost drivers you should watch out for:

  • Using large models for tasks that simpler models handle well and at far lower cost.
  • Regenerating outputs instead of storing and reusing assets from earlier runs.
  • Jumping between tools that do not connect or share context across steps.
  • Ignoring model invocation patterns and allowing API calls to run unchecked.
  • Building complex workflows for niche tasks instead of optimizing common use cases.
  • Running multiple model tests without comparing performance or tracking cost per result.

This list gives you a clear view of the habits that drain resources before you notice the impact.
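The "API calls run unchecked" habit in particular is cheap to prevent. As a sketch of a minimal guardrail, a small wrapper can count invocations and accumulate estimated cost before a run blows through its budget. The per-call prices and the `BudgetGuard` class here are illustrative assumptions, not a real SDK.

```python
# Minimal cost guardrail: counts model calls and stops a run that
# exceeds its budget. Prices per call are illustrative placeholders.
PRICE_PER_CALL = {"large-model": 0.040, "small-model": 0.004}  # assumed rates

class BudgetGuard:
    def __init__(self, budget_usd):
        self.budget_usd = budget_usd
        self.spent = 0.0
        self.calls = 0

    def charge(self, model_name):
        cost = PRICE_PER_CALL[model_name]
        if self.spent + cost > self.budget_usd:
            raise RuntimeError(
                f"Budget exceeded after {self.calls} calls (${self.spent:.3f} spent)"
            )
        self.spent += cost
        self.calls += 1
        return cost

guard = BudgetGuard(budget_usd=0.10)
for _ in range(20):
    try:
        guard.charge("large-model")  # each large-model call costs an assumed $0.04
    except RuntimeError:
        break

print(guard.calls, round(guard.spent, 3))  # only 2 calls fit in a $0.10 budget
```

Attaching a guard like this to every experiment turns a surprise bill into an early, visible stop.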

Also Read: How to Build Your Own Gen AI Workflow and Convert it into an API

How you see Gen AI: too much spend, too little benefit in daily workflows

You notice this pattern inside your team long before the monthly bill arrives. Delays appear in simple tasks. Outputs vary from run to run. Rework becomes routine. Budget overruns start to feel normal instead of exceptional.

Developers face it when they run the same prompt across different models and still end up debugging inconsistent outputs. Creators see it when an image to video workflow looks good once, then fails two more times because each frame changes quality. PMs notice it when teams miss production schedules because each asset takes longer than planned.

These patterns show up in predictable ways.

Common workflow issues that signal the problem:

  • Task delays because model output varies and needs additional cleanup.
  • High rework levels because assets do not match expected quality.
  • Multiple tool switches across steps that break consistency.
  • Frequent regeneration of the same asset instead of using stored versions.
  • Rising spend on repeated tasks that should have been automated.

You see the cost grow because each broken workflow forces your team to start again instead of building on previous work.

The productivity signals you must watch out for

You can spot the issue early by watching these signals. Each one points to unnecessary cost and wasted effort.

Key red flags in your daily work:

  • Generating the same asset multiple times because outputs do not match earlier runs.
  • Long wait times between prompt and usable output that slow the entire task.
  • Manually stitching results from different tools to complete a single asset.
  • No system to store and reuse earlier outputs, which leads to repeated spending.
  • Sudden spikes in model usage cost that do not match the amount of work done.

These signals help you catch the waste before it grows into a larger budget problem.
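The last red flag, sudden cost spikes, is easy to automate away with a trailing-average check on daily spend: any day that exceeds, say, twice the recent baseline gets flagged for review. The window, factor and spend figures below are assumptions for illustration.

```python
from statistics import mean

def flag_spend_spikes(daily_spend, window=7, factor=2.0):
    """Flag days whose spend exceeds `factor` times the trailing average."""
    flagged = []
    for i in range(window, len(daily_spend)):
        baseline = mean(daily_spend[i - window:i])
        if daily_spend[i] > factor * baseline:
            flagged.append(i)
    return flagged

# Illustrative daily API spend in dollars; day 9 is an obvious spike.
spend = [12, 11, 13, 12, 14, 12, 13, 12, 11, 55, 13]
print(flag_spend_spikes(spend))  # → [9]
```

Wiring this into a daily report means a runaway workflow surfaces in a day, not at month end.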

Browse Segmind’s model library and pick the right tool without inflating your Gen AI budget.

What you should fix first to stop Gen AI: too much spend, too little benefit

You can reduce waste quickly by strengthening the basics of your Gen AI setup. This starts with clear objectives. Without those, you run experiments that keep adding cost without improving results. Your goal is to define what “good” output looks like before you generate anything.

Model selection comes next. You do not need the most advanced model for every task. For example, instead of using the top model for each image, pick the one that reaches your quality threshold at a lower cost. This alone can cut a large portion of your spend.

Standardizing workflows also helps. When your team uses the same steps and tools, you avoid repeated runs and mismatched outputs. You also gain visibility into what parts of the workflow cost the most.

Performance measurement completes the foundation. Tracking cost per result, run time and model success rate gives you a clear view of what to adjust. Each insight helps you direct effort where it matters most.

Here is where you can begin.

Early fixes that improve your cost to value ratio:

  • Set one clear objective per Gen AI task to avoid repeated output attempts.
  • Use models that meet your required quality instead of defaulting to the highest tier.
  • Build a simple workflow template so your team follows the same steps each time.
  • Track output quality and cost to identify where most of your resources go.
  • Store reusable assets so you do not regenerate work that already exists.
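The asset-reuse fix above can be as simple as keying stored outputs by a hash of the prompt plus generation settings, so an identical request returns the cached asset instead of triggering a second paid run. The `fake_generate` function is a placeholder standing in for any real model call.

```python
import hashlib
import json

class AssetStore:
    """Cache generated assets by prompt + settings so identical
    requests never trigger a second paid model run."""
    def __init__(self):
        self._assets = {}
        self.hits = 0
        self.misses = 0

    def _key(self, prompt, **settings):
        payload = json.dumps({"prompt": prompt, **settings}, sort_keys=True)
        return hashlib.sha256(payload.encode()).hexdigest()

    def get_or_generate(self, prompt, generate, **settings):
        key = self._key(prompt, **settings)
        if key in self._assets:
            self.hits += 1
        else:
            self.misses += 1
            self._assets[key] = generate(prompt, **settings)  # the only paid call
        return self._assets[key]

# Placeholder generator standing in for a real image or video model call.
def fake_generate(prompt, **settings):
    return f"asset-for:{prompt}"

store = AssetStore()
store.get_or_generate("product banner", fake_generate, size="1024x1024")
store.get_or_generate("product banner", fake_generate, size="1024x1024")  # cached
print(store.hits, store.misses)  # → 1 1
```

Even this small layer changes team behavior: regenerating becomes the exception instead of the default.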

Also Read: Build Your Generative AI Toolkit For Interior Design and Architecture

How Segmind helps stop Gen AI: too much spend, too little benefit in its tracks

You reduce waste instantly when you centralize your Gen AI work in one place. Segmind gives you that structure. It brings more than 500 models into a single platform so you do not jump between tools or run repeat tasks without knowing which model performs best. This alone helps you control cost and improve output consistency.

Segmind uses VoltaML for fast and efficient inference. You complete tasks with fewer model calls and shorter wait times. This makes your cost per asset more predictable. You also gain PixelFlow, which lets you connect models, automate chains and remove manual steps that often lead to repeated runs.

Segmind supports fine-tuning and dedicated deployment when your workload grows. This helps you scale without switching to multiple vendors or rebuilding the same workflows.

Here are examples of how you benefit.

Where Segmind cuts waste and improves results:

  • Creators run image to video tasks once instead of regenerating outputs that vary across tools.
  • Developers build custom chains inside PixelFlow instead of writing scattered ad hoc scripts.
  • Marketing teams reduce cost per asset by choosing the right model for each task and reusing outputs through stored versions.

Segmind fits the exact gaps highlighted in the Goldman Sachs research. It gives you the structure you need to avoid the pattern of too much spend and too little benefit.

A simple PixelFlow workflow example to prevent Gen AI: too much spend, too little benefit

You can prevent repeated runs by building a simple chain inside PixelFlow. A common setup is Text to Image to Video. You select your models, connect them in sequence and run the chain once. PixelFlow handles the transitions so you do not switch tools or reprocess assets manually.

This workflow reduces model calls because the output from the first step moves into the next step automatically. You standardize the format, quality and resolution across the chain. Your team spends less time fixing mismatches and more time producing usable content.

You can track these checkpoints while running the chain.

Key metrics to monitor inside PixelFlow:

  • Cost per asset to ensure each generation stays within limit.
  • Time per asset to measure how long each step takes.
  • Output reuse to avoid regenerating the same work.

This simple structure removes the friction that usually drives high spend and gives both developers and creators a consistent way to produce assets.
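As a rough sketch of the chain idea (not the actual PixelFlow API, whose calls differ), a text-to-image-to-video pipeline simply passes each step's output straight into the next, with per-step timing recorded to feed the metrics above. The step functions here are placeholders for real model calls.

```python
import time

def run_chain(prompt, steps):
    """Run a linear chain: each step consumes the previous step's output.
    Records time per step so latency and cost stay visible."""
    output, timings = prompt, []
    for name, step in steps:
        start = time.perf_counter()
        output = step(output)
        timings.append((name, time.perf_counter() - start))
    return output, timings

# Placeholder steps standing in for real text-to-image and image-to-video models.
steps = [
    ("text_to_image", lambda text: f"image({text})"),
    ("image_to_video", lambda image: f"video({image})"),
]
asset, timings = run_chain("sunset over a harbor", steps)
print(asset)  # → video(image(sunset over a harbor))
```

The point of the linear structure is that there is exactly one model call per step per asset, which is what makes cost per asset predictable.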

Conclusion

You now see that higher Gen AI spending does not guarantee better outcomes. Your results improve only when you refine your processes, choose the right models and build predictable workflows. Each fix you apply cuts waste and brings you closer to consistent output.

You also have clear steps to prevent the pattern of too much spend and too little benefit. These steps help you control cost, reduce repeat work and improve asset quality. Segmind gives you the structure to apply these changes with fewer tools and fewer repeat runs.

Your next step is simple. Review your current workflows, identify gaps and apply the practices that bring your Gen AI program back under control.

Sign up to Segmind and stop losing budget to scattered Gen AI workflows.

FAQs

Q: How can I decide if a Gen AI task deserves automation or manual execution?

A: Review task frequency and the value of consistent results before deciding. Use automation for repetitive tasks that cause delays or quality swings. Keep manual control for work that needs careful judgment or creative nuance.

Q: What should my team measure first when checking Gen AI efficiency?

A: Start with output stability and task completion time because these reveal early gaps. Add a cost per usable asset metric for clarity. Keeping these three indicators visible helps you act quickly when results slip.

Q: How do I know when to introduce workflow orchestration into my Gen AI process?

A: Introduce orchestration when your team repeats steps or switches tools often. Frequent handoffs usually signal weak structure. Centralizing these steps improves consistency and reduces wasted effort.

Q: What is the best way to test model suitability before committing to a budget?

A: Run small controlled batches instead of full production tests. Compare quality, run time and stability across a few outputs. Select the model that reaches your minimum quality requirement at the lowest predictable cost.
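The selection rule in this answer, cheapest model that clears your quality bar, can be written down directly once you have small-batch scores. The model names, quality scores and costs below are made up for illustration.

```python
def pick_model(batch_results, min_quality):
    """From small-batch test results, pick the cheapest model that
    meets the minimum quality requirement. Returns None if none qualify."""
    qualified = [r for r in batch_results if r["avg_quality"] >= min_quality]
    if not qualified:
        return None
    return min(qualified, key=lambda r: r["cost_per_asset"])["model"]

# Illustrative averages from a 10-output test batch per model.
results = [
    {"model": "large-v2", "avg_quality": 0.92, "cost_per_asset": 0.080},
    {"model": "base-v2",  "avg_quality": 0.86, "cost_per_asset": 0.020},
    {"model": "tiny-v1",  "avg_quality": 0.71, "cost_per_asset": 0.004},
]
print(pick_model(results, min_quality=0.85))  # → base-v2
```

Making the rule explicit like this also exposes when no model qualifies, which is a signal to revisit the quality target rather than overspend.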

Q: How do I reduce prompt variation across team members who work on the same project?

A: Build a shared prompt library with examples that meet your quality standards. Encourage teams to refine prompts together and document effective ones. This helps prevent random attempts that waste time and compute.
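A shared prompt library does not need tooling to start; a versioned file of named templates with documented placeholders is enough. The template names and wording below are hypothetical examples, not recommended prompts.

```python
# Minimal shared prompt library: named templates with explicit placeholders,
# so teammates reuse vetted prompts instead of improvising new ones.
PROMPT_LIBRARY = {
    "product_banner": (
        "Studio photo of {product} on a plain {color} background, "
        "soft lighting, 16:9, no text overlay"
    ),
    "thumbnail": "Close-up of {subject}, high contrast, centered composition",
}

def render_prompt(name, **fields):
    return PROMPT_LIBRARY[name].format(**fields)

print(render_prompt("product_banner", product="headphones", color="white"))
```

Keeping the library in version control gives you the documentation and review loop this answer describes for free.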

Q: How can I avoid unnecessary rework when using Gen AI for multi-step creative tasks?

A: Lock the format, size and quality requirements before the first generation. Store outputs immediately so your team does not recreate them. Clear specifications early in the flow remove most rework and cut project delays.