Critical Review · Arts & Design · HCI & Taxonomy

Generative AI and Art: Mapping the Human-Machine Creative Spectrum

Where does the human end and the machine begin in AI-assisted art? Recent research maps the spectrum of human-machine creative interaction, revealing that the level of automation profoundly affects creative experience, and that the "agency gap" between user intent and machine output is the central design challenge.

By Sean K.S. Shin
This blog summarizes research trends based on published paper abstracts. Specific numbers or findings may contain inaccuracies. For scholarly rigor, always consult the original papers cited in each post.

The question "is AI art real art?" may be the wrong question. A more productive framing, one that the recent literature increasingly adopts, is: what is the human contribution, and how does the level of AI automation affect the creative experience? The answer turns out to be nuanced: more automation does not simply mean less creativity. The relationship between human agency, machine capability, and artistic outcome is more complex than either AI enthusiasts or skeptics acknowledge.

The Research Landscape

Automation Level and Creative Experience

Qiao, Gao, and Wang (2025), with 5 citations, provide the most rigorous empirical study in this cohort. They investigate how different levels of AI automation affect human creative experience and efficiency in design tasks. The study uses a controlled experimental design in which participants complete design tasks with varying degrees of AI assistance, from no AI support to fully automated generation.

Key findings:

Moderate automation produces the best creative experience. Participants reported the highest levels of creative satisfaction and perceived autonomy when AI provided suggestions that they could modify, rather than either working entirely alone (low efficiency) or accepting fully generated outputs (low agency).

Full automation reduces perceived ownership. When AI generated the complete output, participants rated their own creative contribution as low even when the output quality was high. This "ownership gap", where the product is good but the creator does not feel like a creator, is a significant finding for the design of creative AI tools.

Efficiency and creativity trade off at high automation levels. Fully automated generation was the fastest but produced the least creative diversity (participants accepted the first good-enough output rather than exploring alternatives). Moderate automation, by requiring human evaluation and modification, produced more varied and original outputs.

The Agency Gap

Troussas, Papakostas, and Krouska (2026) introduce the concept of the "Agency Gap": the disconnect between what a user intends and what a generative AI produces. Current generative AI systems use a prompt-response interaction model where the user provides a text description and the AI generates a complete output. The user's control is limited to accepting, rejecting, or reprompting; there is no mechanism for granular intervention during the generation process.

This, the authors argue, is a fundamental HCI problem. They propose an "Agency-First Framework" that prioritizes user control at every stage:

  • Pre-generation agency: The ability to specify not just what but how: style, composition, color palette, focal point.
  • In-generation agency: The ability to intervene during the generation process: pausing, redirecting, modifying partial outputs.
  • Post-generation agency: The ability to selectively modify parts of the output without regenerating the whole thing.

The framework is primarily a design proposal (not yet implemented as a complete system), but it articulates a direction that many creative professionals have expressed informally: the desire for AI tools that feel more like collaborators and less like vending machines.
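To make the three agency stages concrete, here is a minimal sketch. All class and method names below are my own illustration of the idea, not an API from Troussas et al.'s paper: pre-generation agency as a constraint spec, in-generation agency as stepwise generation with redirection, and post-generation agency as local edits.

```python
# Hypothetical sketch of the three agency stages; names are illustrative,
# not from Troussas et al.'s Agency-First Framework.
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

@dataclass
class GenerationSpec:
    """Pre-generation agency: constrain *how*, not just *what*."""
    prompt: str
    style: Optional[str] = None
    palette: List[str] = field(default_factory=list)
    focal_point: Optional[Tuple[float, float]] = None

class AgencyFirstSession:
    """In- and post-generation agency: stepwise generation with hooks."""

    def __init__(self, spec: GenerationSpec):
        self.spec = spec
        self.step = 0
        self.partial: List[str] = []  # stand-in for partial output regions

    def advance(self) -> str:
        """Run one generation step; the user can inspect between steps."""
        self.step += 1
        fragment = f"step{self.step}:{self.spec.prompt}"
        self.partial.append(fragment)
        return fragment

    def redirect(self, new_prompt: str) -> None:
        """In-generation agency: change direction without restarting."""
        self.spec.prompt = new_prompt

    def edit_region(self, index: int, replacement: str) -> None:
        """Post-generation agency: modify one part, keep the rest."""
        self.partial[index] = replacement

# Usage: two steps with a mid-course redirect, then a local edit.
session = AgencyFirstSession(GenerationSpec(prompt="seascape"))
session.advance()                  # "step1:seascape"
session.redirect("stormy seascape")
session.advance()                  # "step2:stormy seascape"
session.edit_region(0, "step1:stormy seascape")
print(session.partial)
```

The design point the sketch captures is that interaction happens between steps and on parts, rather than only through whole-output regeneration.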

Algorithmic Aesthetics: A Critical Analysis

Khadake (2024), with 3 citations, provides a critical analysis of AI-generated art from the perspective of aesthetic theory. The paper examines how technologies like GANs and diffusion models produce outputs that are visually compelling but that raise questions about what "aesthetics" means when the creator has no intention, no emotional state, and no cultural context.

The analysis identifies several distinctive features of algorithmic aesthetics:

  • Statistical beauty: AI outputs tend to converge on visual patterns that are statistically average across the training data: a form of beauty that is "safe" but rarely surprising.
  • Stylistic fluency without semantic depth: AI can reproduce the surface characteristics of artistic styles (Impressionist brushwork, Cubist fragmentation) but does not engage with the conceptual motivations that produced those styles.
  • Uncanny familiarity: AI art often elicits a response Khadake describes as "uncanny familiarity": the viewer recognizes the style but senses that something is off, without being able to articulate what.
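The "statistical beauty" point can be restated as closeness to the training-data mean. The toy score below is my own illustration, not a metric Khadake proposes: outputs whose feature vectors sit near the dataset mean score as maximally "typical" (safe), while outliers score low.

```python
# Toy "statistical typicality" score: distance of an output feature vector
# from the training-data mean. Illustrative only, not from Khadake (2024).
import math
from typing import List

def mean_vector(vectors: List[List[float]]) -> List[float]:
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def typicality(output: List[float], training: List[List[float]]) -> float:
    """1.0 at the training mean, approaching 0.0 far from it."""
    mu = mean_vector(training)
    dist = math.sqrt(sum((o - m) ** 2 for o, m in zip(output, mu)))
    return 1.0 / (1.0 + dist)

training = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]  # toy feature vectors
safe = [1.0, 1.0]         # exactly at the mean: typicality 1.0
surprising = [5.0, -3.0]  # far from the mean: typicality near 0
print(typicality(safe, training), typicality(surprising, training))
```

Under this framing, a model that maximizes typicality would reproduce exactly the "safe but rarely surprising" convergence the analysis describes.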

Deconstructing AI Authorship

Putri, Susilo, and Munandar (2025) bring critical theory to the analysis, using the concept of the "algorithmic gaze" to examine how generative AI constructs visual culture. Drawing on Benjamin, Berger, and poststructuralist aesthetics, they argue that AI-generated images are not neutral reflections of their training data but actively construct particular ways of seeing.

Their specific claim is that generative AI systems encode the biases, preferences, and representational patterns of their training data, which is predominantly Western, predominantly digital, and predominantly recent, and that this creates a form of visual hegemony: AI art looks a certain way because the training data looks a certain way, and the more AI art is produced, the more the training data for future models will look like AI art. This feedback loop risks narrowing aesthetic diversity rather than expanding it.
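The feedback-loop risk can be sketched numerically. In the toy simulation below (my own illustration, not an experiment from the paper), a stand-in "model" produces outputs clustered around its training mean with reduced spread, and each generation is retrained on the previous generation's outputs. The spread, a crude proxy for aesthetic diversity, shrinks geometrically.

```python
# Toy simulation of the training-data feedback loop: models trained on
# prior model outputs regress toward the mean. Illustrative numbers only.
import random
import statistics
from typing import List

def generate(training: List[float], n: int, rng: random.Random) -> List[float]:
    """A stand-in model: outputs cluster around the training mean with
    spread equal to a fraction (0.7) of the training spread."""
    mu = statistics.mean(training)
    sigma = statistics.stdev(training)
    return [rng.gauss(mu, 0.7 * sigma) for _ in range(n)]

rng = random.Random(0)
data = [rng.uniform(-10, 10) for _ in range(500)]  # diverse "human" corpus
spreads = [statistics.stdev(data)]
for _ in range(5):  # retrain on AI outputs five times
    data = generate(data, 500, rng)
    spreads.append(statistics.stdev(data))

print([round(s, 2) for s in spreads])  # diversity shrinks every round
```

The shrink factor (0.7) is an assumption; the qualitative point is just that any systematic regression toward the mean compounds across generations once model outputs re-enter the training pool.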

Critical Analysis: Claims and Evidence

| Claim | Evidence | Verdict |
| --- | --- | --- |
| Moderate AI automation produces the best creative experience | Qiao et al.'s controlled experiment | ✅ Supported; replicated finding |
| Full automation reduces creative ownership | Qiao et al.'s ownership measures | ✅ Supported |
| The "agency gap" is a central HCI challenge for creative AI | Troussas et al.'s framework analysis | ✅ Supported; well-articulated problem, solution not yet tested |
| AI art exhibits "statistical beauty" that converges on safe patterns | Khadake's aesthetic analysis | ⚠️ Uncertain; plausible critical analysis, not empirically quantified |
| Generative AI creates visual hegemony through training data feedback | Putri et al.'s critical theory analysis | ⚠️ Uncertain; theoretically coherent but empirically undemonstrated |

Open Questions and Future Directions

  • Agency-preserving tools: Can we build generative AI tools that maintain human agency without sacrificing the efficiency gains of automation? Troussas et al.'s framework points the direction, but implementation is needed.
  • Aesthetic evaluation criteria: How should AI art be evaluatedโ€”by the same criteria as human art, or by new criteria that account for the algorithmic process?
  • Training data diversity: If AI aesthetics converge toward the statistical center of the training data, diversifying the training data should diversify the outputs. Is this empirically true?
  • Cultural context: Most research on AI art uses Western aesthetic frameworks. How do non-Western aesthetic traditions (Chinese ink painting principles, Islamic geometric aesthetics, African mask-making traditions) interact with AI generation?
  • The ownership question: If moderate automation produces better creative experiences but full automation produces faster outputs, which will the market select? The answer may determine whether AI tools empower or replace creative professionals.
What This Means for Your Research

For artists working with AI, Qiao et al.'s finding about moderate automation is practically important: tools that require your active engagement produce better experiences and more diverse outputs than tools that generate everything for you.

For tool designers, the agency gap is the core challenge. Closing it requires giving users granular control without overwhelming them with options.


References

[1] Qiao, Y., Gao, Y., & Wang, Y. (2025). Integrating Generative Artificial Intelligence and Human Design: The Impact of Automation Level on Human Creative Experience and Efficiency. International Journal of Human-Computer Interaction.
[2] Troussas, C., Papakostas, C., & Krouska, A. (2026). The Agency-First Framework: Operationalizing Human-Centric Interaction and Evaluation Heuristics for Generative AI. Electronics, 15(4), 877.
[3] Khadake, V. (2024). Algorithmic Aesthetics: A Critical Analysis of AI-Generated Art in the Digital Age. International Journal For Multidisciplinary Research, 6(6).
[4] Putri, H., Susilo, D., & Munandar, E. (2025). The Algorithmic Gaze: Deconstructing Authorship and Aesthetics in Generative Artificial Intelligence (AI) Art. Cultural, 3(1).
