Earticle

현재 위치 Home

Culture Convergence (CC)

Image Similarity Analysis in Generative AI

첫 페이지 보기
  • 발행기관
    국제문화기술진흥원 바로가기
  • 간행물
    International Journal of Advanced Culture Technology(IJACT) KCI 등재 바로가기
  • 통권
    Volume 12 Number 4 (2024.12)바로가기
  • 페이지
    pp.208-214
  • 저자
    Choi Haerin, Lee Hyunseok
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A462182

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

원문정보

초록

영어
In Consciousness Explained, Daniel Dennett argued that consciousness is a phenomenon emerging from the complex flow of information in the brain, and to understand it, an objective approach is necessary. While AI is increasingly mimicking human functions, it is difficult to say that AI possesses consciousness similar to humans. However, consciousness is an essential factor for perception, but perception does not necessarily require consciousness. Therefore, this study aims to analyze how similar the way AI, particularly the DALL-E model developed by OpenAI, processes visual information is to the structure of human perception. In the study, new images were generated using the GPT-4 DALL-E model based on five sets of reference images, and the structural similarity between the generated images and the reference images was analyzed using SSIM (Structural Similarity Index Measure). The SSIM scores of the images generated by DALL-E based on the reference images ranged between 0.131 and 0.63. This confirmed that AI learned some degree of the visual patterns from the reference images. However, AI did not generate images that perfectly aligned with human perception, and images that contained complex shapes or fine textures recorded lower SSIM scores. Notably, the AI showed limitations in depicting human portraits, suggesting that AI’s perception system is simplified compared to the complexity of human perception structures. This study demonstrated that while the DALL-E model has potential in processing visual information, there remains a clear difference from the complex human perception system. These results suggest that AI still has limitations in mimicking the way humans process visual information, indicating a need for further in-depth research into the independent characteristics of AI perception in the future

목차

Abstract
1. INTRODUCTION
2. THEORETICAL CONSIDERATIONS
2.1 Perception, Consciousness and AI
2.2 Generative AI
2.3 SSIM (Structural Similarity Index Measure)
3. RESEARCH METHOD
4. DATA ANALYSIS RESULTS
4.1 GPT-4, DALL-E Generated Images
4.2 SSIM Evaluation
5. RESULT
REFERENCES

키워드

AI DALL-E Visual Information Processing SSIM Structural Similarity Human Perception

저자

  • Choi Haerin [ Master Student, Department of Design, Pusan National Univ., South Korea ]
  • Lee Hyunseok [ Professor, Department of Design, Pusan National Univ., South Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    국제문화기술진흥원 [The International Promotion Agency of Culture Technology]
  • 설립연도
    2009
  • 분야
    공학>공학일반
  • 소개
    본 진흥원은 문화기술(Culture Technology) 관련 산·학·연·관으로 구성된 비영리 단체이다. 문화기술(CT)은 정보통신기술(ICT), 문화적 사고 기반의 예술, 인문학, 디자인, 사회과학기술이 접목된 신융합기술(New Convergence Technology, NCT)로 정의한다. 인간의 삶의 질을 향상시키고, 진보된 방향으로 변화시키고, 문화기술 관련 분야의 학술 및 기술의 발전과 진흥에 공헌하기 위하여, 제3조의 필요한 사업을 행함을 그 목적으로 한다.

간행물

  • 간행물명
    International Journal of Advanced Culture Technology(IJACT)
  • 간기
    계간
  • pISSN
    2288-7202
  • eISSN
    2288-7318
  • 수록기간
    2013~2025
  • 등재여부
    KCI 등재
  • 십진분류
    KDC 600 DDC 700

이 권호 내 다른 논문 / International Journal of Advanced Culture Technology(IJACT) Volume 12 Number 4

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장