Оптический поток от Nerf [закрыто]

Оптический поток от Nerf [закрыто] ⇐ Python

1 сообщение • Страница 1 из 1

Anonymous

Цитата

Сообщение Anonymous » 20 июл 2025, 12:36

Я работаю с мгновенным NGP TL. Я пытаюсь рассчитать оптический поток, используя выходной рендеринг объема. Мой подход к этому следуют: < /p>

Принимайте выборки вдоль каждого луча, рассчитанное по маркеру Ray. /> Умножьте веса на отобранные точки и суммируйте все точки вдоль каждого луча < /li>
Рассчитать acc_map = веса. ACC_MAP представляет собой суммирование всех весов вдоль каждого луча < /li>
Умножение (1 - ACC_MAP) * (rays_o + rays_d), где rays_o - это Ray Origins, а Rays_d - это направления < /li>
Сумма с весовыми точками I, так что я -то 1 -й, так что я -то 1 -й кадр. Возьмите позу Frame I+1 и преобразуйте результат суммы в мировое пространство < /li>
Использование обратной позы кадры I+1 Я преобразую обратно в пространство камеры < /li>
Использование внутренней матрицы я рассчитываю положения пикселя < /li>
< /ul>
На следующее подход. Оптический поток, рассчитанная по плоту. < /p>
Это код, который я использовал для вычисления: < /p>
H => image height
W => image width
focal => the focal length
c2w_current => the pose which convert from the camera space to the world space for the current image
c2w_shifted => the pose of the neighbour image (i+1) next to the current image (i) from camera space to world space
weights => the weights calculated by the volumen render
xyzs => the sampled points along each ray calculated by ray marching algorithm
rays => [#num_of_image, 6] the first three elements contain the ray origin and the last three elements contain the ray direction
rays_a => contain three values the ray_idx, start_idx, num_samples . the ray_idx gives the origin and direction in rays array, start_idx gives the starting index of the sampled points and weights arrays to get the last element sum start_idx with num_samples
grid => is 2d meshgrid which starts from 0 to H-1 and from 0 to W-1 and is used to calculate the optical flow
< /code>
def calculate_flow_2(H, W, focal, c2w_current, c2w_shifted, weights, xyzs, rays, rays_a, grid, scale=1.0):
rays_d = rearrange(rays[:, 3:], 'n c -> n 1 c') @ rearrange(torch.inverse(c2w_current[..., :3]), 'n a b -> n b a')
rays_d = rearrange(rays_d, 'n 1 c -> n c')
rays_with_shifted_origins = rays[:, :3] - c2w_current[:, :, 3]
camera_space_rays = torch.cat([rays_with_shifted_origins, rays_d], -1).view(-1, 6)

c2w_current_4x4 = torch.zeros((c2w_current.shape[0], 4, 4), device=c2w_current.device)
c2w_current_4x4[: , 3, 3] = 1
c2w_current_4x4[:, :3, :4] = c2w_current[:, :3, :4]
w2c_current = torch.inverse(c2w_current_4x4)

idx_arr = torch.arange(0, xyzs.shape[0], device=rays.device)
acc_weights = torch.zeros((rays.shape[0]), device=xyzs.device)
for ray_idx, start_idx, num_samples in rays_a:
acc_weights[ray_idx] = torch.sum(weights[start_idx:start_idx + num_samples])
idx_arr[start_idx:start_idx + num_samples] = ray_idx

rotated_points = torch.matmul(w2c_current[idx_arr, :3, :3], xyzs[..., None])
shifted_points = torch.squeeze(rotated_points) + w2c_current[idx_arr, :3, 3]

weighted_points_no_sum = weights[..., None] * shifted_points
weighted_points = torch.zeros((rays.shape[0], 3), device=xyzs.device)
for ray_idx, start_idx, num_samples in rays_a:
weighted_points[ray_idx] = torch.sum(weighted_points_no_sum[start_idx:start_idx + num_samples], dim=0)

weighted_ray = (1 - acc_weights[..., None]) * (camera_space_rays[:, :3] + camera_space_rays[:, 3:])

camera_space_point = weighted_points + weighted_ray

c2w_shifted_4x4 = torch.zeros((c2w_shifted.shape[0], 4, 4), device=c2w_shifted.device)
c2w_shifted_4x4[:, 3, 3] = 1
c2w_shifted_4x4[:, :3, :4] = c2w_shifted[:, :3, :4]
w2c_shifted = torch.inverse(c2w_shifted_4x4)

rotated_points = torch.matmul(c2w_shifted_4x4[rays_a[..., 0], :3, :3], camera_space_point[..., None])
shifted_points = torch.squeeze(rotated_points) + c2w_shifted_4x4[rays_a[..., 0], :3, 3]
rotated_points = torch.matmul(w2c_shifted[rays_a[..., 0], :3, :3], shifted_points[..., None])
shifted_points = torch.squeeze(rotated_points) + w2c_shifted[rays_a[..., 0], :3, 3]

point_map = torch.zeros((rays.shape[0], 2), device=xyzs.device)
point_map[:, 0] = (shifted_points[:, 0] / shifted_points[:, 2]) * focal + W * 0.5
point_map[:, 1] = (shifted_points[:, 1] / shifted_points[:, 2]) * focal + H * 0.5

return point_map - grid
< /code>
The values of the optical flow function should be close to the values estimated by RAFT

Подробнее здесь: https://stackoverflow.com/questions/776 ... -from-nerf

1753004193

Anonymous

 Я работаю с мгновенным NGP TL. Я пытаюсь рассчитать оптический поток, используя выходной рендеринг объема. Мой подход к этому следуют: < /p>

 Принимайте выборки вдоль каждого луча, рассчитанное по маркеру Ray. />  Умножьте веса на отобранные точки и суммируйте все точки вдоль каждого луча < /li>
 Рассчитать acc_map = веса. ACC_MAP представляет собой суммирование всех весов вдоль каждого луча < /li>
 Умножение (1 - ACC_MAP) * (rays_o + rays_d), где rays_o - это Ray Origins, а Rays_d - это направления < /li>
 Сумма с весовыми точками I, так что я -то 1 -й, так что я -то 1 -й кадр. Возьмите позу Frame I+1 и преобразуйте результат суммы в мировое пространство < /li>
 Использование обратной позы кадры I+1 Я преобразую обратно в пространство камеры < /li>
 Использование внутренней матрицы я рассчитываю положения пикселя < /li>
< /ul>
 На следующее подход. Оптический поток, рассчитанная по плоту. < /p>
Это код, который я использовал для вычисления: < /p>
H => image height
W => image width
focal => the focal length
c2w_current => the pose which convert from the camera space to the world space for the current image
c2w_shifted => the pose of the neighbour image (i+1) next to the current image (i) from camera space to world space
weights => the weights calculated by the volumen render
xyzs => the sampled points along each ray calculated by ray marching algorithm
rays => [#num_of_image, 6] the first three elements contain the ray origin and the last three elements contain the ray direction
rays_a => contain three values the ray_idx, start_idx, num_samples .  the ray_idx gives the origin and direction in rays array, start_idx gives the starting index of the sampled points and weights arrays to get the last element sum start_idx with num_samples
grid => is 2d meshgrid which starts from 0 to H-1 and from 0 to W-1 and is used to calculate the optical flow
< /code>
def calculate_flow_2(H, W, focal, c2w_current, c2w_shifted, weights, xyzs, rays, rays_a, grid, scale=1.0):
rays_d = rearrange(rays[:, 3:], 'n c -> n 1 c') @ rearrange(torch.inverse(c2w_current[..., :3]), 'n a b ->   n b a')
rays_d = rearrange(rays_d, 'n 1 c -> n c')
rays_with_shifted_origins = rays[:, :3] - c2w_current[:, :, 3]
camera_space_rays = torch.cat([rays_with_shifted_origins, rays_d], -1).view(-1, 6)

c2w_current_4x4 = torch.zeros((c2w_current.shape[0], 4, 4), device=c2w_current.device)
c2w_current_4x4[: , 3, 3] = 1
c2w_current_4x4[:, :3, :4] = c2w_current[:, :3, :4]
w2c_current = torch.inverse(c2w_current_4x4)

idx_arr = torch.arange(0, xyzs.shape[0], device=rays.device)
acc_weights = torch.zeros((rays.shape[0]), device=xyzs.device)
for ray_idx, start_idx, num_samples in rays_a:
acc_weights[ray_idx] = torch.sum(weights[start_idx:start_idx + num_samples])
idx_arr[start_idx:start_idx + num_samples] = ray_idx

rotated_points = torch.matmul(w2c_current[idx_arr, :3, :3], xyzs[..., None])
shifted_points = torch.squeeze(rotated_points) + w2c_current[idx_arr, :3, 3]

weighted_points_no_sum = weights[..., None] * shifted_points
weighted_points = torch.zeros((rays.shape[0], 3), device=xyzs.device)
for ray_idx, start_idx, num_samples in rays_a:
weighted_points[ray_idx] = torch.sum(weighted_points_no_sum[start_idx:start_idx + num_samples], dim=0)

weighted_ray = (1 - acc_weights[..., None]) * (camera_space_rays[:, :3] + camera_space_rays[:, 3:])

camera_space_point = weighted_points + weighted_ray

c2w_shifted_4x4 = torch.zeros((c2w_shifted.shape[0], 4, 4), device=c2w_shifted.device)
c2w_shifted_4x4[:, 3, 3] = 1
c2w_shifted_4x4[:, :3, :4] = c2w_shifted[:, :3, :4]
w2c_shifted = torch.inverse(c2w_shifted_4x4)

rotated_points = torch.matmul(c2w_shifted_4x4[rays_a[..., 0], :3, :3], camera_space_point[..., None])
shifted_points = torch.squeeze(rotated_points) + c2w_shifted_4x4[rays_a[..., 0], :3, 3]
rotated_points = torch.matmul(w2c_shifted[rays_a[..., 0], :3, :3], shifted_points[..., None])
shifted_points = torch.squeeze(rotated_points) + w2c_shifted[rays_a[..., 0], :3, 3]

point_map = torch.zeros((rays.shape[0], 2), device=xyzs.device)
point_map[:, 0] = (shifted_points[:, 0] / shifted_points[:, 2]) * focal + W * 0.5
point_map[:, 1] = (shifted_points[:, 1] / shifted_points[:, 2]) * focal + H * 0.5

return point_map   - grid
< /code>
The values of the optical flow function should be close to the values estimated by RAFT 

Подробнее здесь: [url]https://stackoverflow.com/questions/77666652/optical-flow-from-nerf[/url]

Ответить Пред. тема След. тема

1 сообщение • Страница 1 из 1

Быстрый ответ

Заголовок:

Имя пользователя:

Изменение регистра текста:

Смайлики

Ещё смайлики…

К этому ответу прикреплено по крайней мере одно вложение.

Если вы не хотите добавлять вложения, оставьте поля пустыми. Можно прикреплять файлы, перетаскивая их в окно сообщения.

Максимально разрешённый размер вложения: 15 МБ.

Имя файла:

Комментарий к файлу:

Имя файла	Комментарий к файлу	Размер	Статус

Похожие темы

Ответы

Просмотры

Последнее сообщение

Оптический поток Лукаса-Канаде — расчет градиента

Последнее сообщение Anonymous « 28 июн 2024, 09:33
Добавлено в форуме C++

Anonymous » 28 июн 2024, 09:33 » в форуме C++

Я реализовал метод Лукаса-Канаде (версия для каждого пикселя, а не для функций, которые есть в OpenCV). Однако у меня есть вопрос относительно расчета градиентов (dx, dy, dt). В нескольких реализациях я видел это:
for (int y = 0; y <...

0 Ответы

11 Просмотры

Последнее сообщение Anonymous
28 июн 2024, 09:33
Применение ускоренного реймарча к реализации NeRF

Последнее сообщение Anonymous « 13 дек 2024, 18:46
Добавлено в форуме Python

Anonymous » 13 дек 2024, 18:46 » в форуме Python

Я пытаюсь добавить ускоренный маршинг лучей в HashNeRF, реализацию NeRF в PyTorch ( с целью сократить время рендеринга.
В частности, я пытаюсь реализовать метод раннего завершения лучей, чтобы прекратить добавление сэмплов вдоль любого луча, который...

0 Ответы

22 Просмотры

Последнее сообщение Anonymous
13 дек 2024, 18:46
Применение ускоренного реймарча к реализации NeRF

Последнее сообщение Anonymous « 13 дек 2024, 19:33
Добавлено в форуме Python

Anonymous » 13 дек 2024, 19:33 » в форуме Python

Я пытаюсь добавить ускоренный маршинг лучей в HashNeRF, реализацию NeRF в PyTorch ( с целью сократить время рендеринга.
В частности, я пытаюсь реализовать метод раннего завершения лучей, чтобы прекратить добавление сэмплов вдоль любого луча, который...

0 Ответы

11 Просмотры

Последнее сообщение Anonymous
13 дек 2024, 19:33
Как правильно получить происхождение лучей в NeRF?

Последнее сообщение Anonymous « 30 дек 2024, 16:25
Добавлено в форуме Python

Anonymous » 30 дек 2024, 16:25 » в форуме Python

Я изучил две разные реализации NeRF в pytorch

def get_rays(H, W, focal, c2w):
'''
c2w =
'''
i, j = torch.meshgrid(torch.linspace(0, W-1, W), torch.linspace(0, H-1, H)) # pytorch's meshgrid has indexing='ij'
i = i.t()
j = j.t()

dirs =...

0 Ответы

20 Просмотры

Последнее сообщение Anonymous
30 дек 2024, 16:25
Поток не запущен, как запустить поток, чтобы он запускался каждые 300 мс [закрыто]

Последнее сообщение Anonymous « 08 июл 2024, 03:01
Добавлено в форуме C#

Anonymous » 08 июл 2024, 03:01 » в форуме C#

Я пытаюсь заставить это запускаться каждые 300 мс, но оно вообще не запускается
var gametick = new System.Threading.Timer((e) =>
{
//stuff
}, null, 0, TimeSpan.FromMinutes(Convert.ToDouble(300).miliseconds);

Я не знаю, что попробовать, но я...

0 Ответы

25 Просмотры

Последнее сообщение Anonymous
08 июл 2024, 03:01

Вернуться в «Python»