Auto-Lama combines object detection and image inpainting to automate object removals

Last update: Dec 09, 2022

Overview

Auto-Lama

Auto-Lama combines object detection and image inpainting to automate object removal. It is built on top of DE:TR from Facebook Research and Lama from Samsung Research. The entire process is simple:

Objects are detected using the detector.
Masks are generated based on the bounding boxes drawn by the detector.
The original image is sent to the inpainter along with the masks.
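
The mask-generation step can be illustrated with a minimal sketch. This is not the project's own code: the box layout `(x_min, y_min, x_max, y_max)`, the function name `boxes_to_mask`, and the nested-list mask are assumptions made for illustration. A real pipeline would more likely use an image array, but the idea is the same: every pixel inside a bounding box is marked for inpainting.

```python
from typing import List, Sequence, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels, max edges exclusive.
Box = Tuple[int, int, int, int]


def boxes_to_mask(width: int, height: int, boxes: Sequence[Box]) -> List[List[int]]:
    """Return a height x width binary mask: 1 inside any box, 0 elsewhere."""
    mask = [[0] * width for _ in range(height)]
    for x_min, y_min, x_max, y_max in boxes:
        # Clamp each box to the image bounds so boxes that spill over the edge still work.
        x0, x1 = max(0, x_min), min(width, x_max)
        y0, y1 = max(0, y_min), min(height, y_max)
        for y in range(y0, y1):
            for x in range(x0, x1):
                mask[y][x] = 1
    return mask


# A 6x4 image with one detected box covering columns 1-3 and rows 1-2.
mask = boxes_to_mask(6, 4, [(1, 1, 4, 3)])
for row in mask:
    print("".join(str(v) for v in row))
```

The resulting mask would then be passed to the inpainter together with the original image.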

Demo

Masking

There are currently a few ways of generating masks:

Masking objects with specified indices.
Masking one main object at a time.
Masking all objects other than the main object.
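
The three modes amount to three ways of choosing which detected boxes end up in the mask. The sketch below is illustrative only: the mode names, the `select_boxes` function, and the rule that the "main" object is the box with the largest area are all assumptions, since the project may define the main object differently (for example by detection score).

```python
from typing import List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # assumed layout: (x_min, y_min, x_max, y_max)


def box_area(box: Box) -> int:
    x_min, y_min, x_max, y_max = box
    return max(0, x_max - x_min) * max(0, y_max - y_min)


def select_boxes(boxes: Sequence[Box], mode: str, indices: Sequence[int] = ()) -> List[int]:
    """Return the indices of the boxes to mask for a given (hypothetical) mode."""
    if not boxes:
        return []
    if mode == "indices":
        # Mask only the objects the caller listed, ignoring out-of-range indices.
        return [i for i in indices if 0 <= i < len(boxes)]
    # Assumption: the main object is the one with the largest bounding box.
    main = max(range(len(boxes)), key=lambda i: box_area(boxes[i]))
    if mode == "main":
        return [main]
    if mode == "others":
        return [i for i in range(len(boxes)) if i != main]
    raise ValueError(f"unknown mode: {mode}")


boxes = [(0, 0, 10, 10), (0, 0, 50, 40), (5, 5, 8, 8)]
print(select_boxes(boxes, "indices", [2, 7]))
print(select_boxes(boxes, "main"))
print(select_boxes(boxes, "others"))
```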

Future Goals

Use a segmentation method that is more precise than bounding boxes
Implementing a detector that has more

Environment Setup

Prerequisites

docker
make
conda

Building Environment

make build-conda-env
conda activate auto-lama
make build-env

Cleaning Directory

make clean

Detect and Inpaint

Setup

The default config for the detector is:

PARAMETERS = {
    "model_name": "facebook/detr-resnet-50",
    "threshold": 0.9,
    "max_items": 10,
    "save_destination": "./test_images",
    "output_destination": "./output_images",
    "max_width": 2000,
    "max_height": 2000,
    "resize": True,
    "resize_scale": 0.75,
    "excluded_objects": [91],
    "image_format": "PNG",
    "mask_target_items": [],
}

Refer to the COCO label list when choosing the target items that you want to mask, since the default DE:TR model uses the COCO dataset.
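
As a rough guide to how several of these settings could interact, here is a minimal sketch of filtering detections with `threshold`, `excluded_objects`, `mask_target_items`, and `max_items`. The `Detection` record, the `filter_detections` function, and the exact semantics (score at or above the threshold, an empty target list meaning "no restriction", highest scores kept first) are assumptions for illustration, not the project's confirmed behavior.

```python
from typing import List, NamedTuple, Sequence


class Detection(NamedTuple):
    label: int    # class index from the detector (hypothetical record type)
    score: float  # detection confidence in [0, 1]


# Subset of the default config that affects which detections are kept.
PARAMETERS = {
    "threshold": 0.9,
    "max_items": 10,
    "excluded_objects": [91],
    "mask_target_items": [],
}


def filter_detections(detections: Sequence[Detection], params: dict) -> List[Detection]:
    # Drop low-confidence detections and explicitly excluded classes.
    kept = [
        d for d in detections
        if d.score >= params["threshold"] and d.label not in params["excluded_objects"]
    ]
    # Assumption: an empty target list means every remaining class is eligible.
    targets = params["mask_target_items"]
    if targets:
        kept = [d for d in kept if d.label in targets]
    # Keep at most max_items detections, most confident first.
    kept.sort(key=lambda d: d.score, reverse=True)
    return kept[: params["max_items"]]


detections = [Detection(1, 0.99), Detection(18, 0.95), Detection(91, 0.97), Detection(3, 0.50)]
selected = filter_detections(detections, PARAMETERS)
print([d.label for d in selected])
```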

Run

make detect_and_inpaint IMAGE_PATH=path/to/image

or

make detect_and_inpaint IMAGE_PATH={image_url}
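
Because `IMAGE_PATH` accepts either a local path or a URL, a script behind this target has to tell the two apart. One way to do that, shown here as a sketch with a hypothetical helper `is_url` that is not taken from the project, is to check for an HTTP(S) scheme using the standard library:

```python
from urllib.parse import urlparse


def is_url(image_path: str) -> bool:
    """Return True when the argument looks like an http(s) URL rather than a file path."""
    parsed = urlparse(image_path)
    # Require both a web scheme and a host so plain paths are never mistaken for URLs.
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


print(is_url("https://example.com/photo.png"))
print(is_url("path/to/image"))
```

A caller could then download the image when `is_url` returns True and open it from disk otherwise.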
