Theta X for mapping. Can it do better in positioning quality?

JoscTr · October 21, 2024, 7:43pm

Copy from my post in the Facebook group
(10) Ricoh THETA 360° Camera User Group | # Theta X for mapping | Facebook

[updated 2023-09-06]

I would like to draw attention to this subject and offer some thoughts that may help to improve position accuracy. I have addressed image quality before. Disclaimer: I am not a GNSS specialist, an informed user only.

Mapping with 360 imagery, Google Street View and Mapillary that is, aids: e.g., map creation; digital twin creation; travel destination promotion; city and infrastructure planning; disaster recovery; access, value, risk, damage assessment. It can be regarded as part of the Architecture, Engineering and Construction field of profession, as well as marketing, insurance, rescue services. Position precision and image quality are both key in this. Contrary to what Ricoh states in its blog Reach the full potential of RICOH THETA X with built-in GPS l RICOH360 Blog, position accuracy should be of navigation quality, which is not within reach of the current GNSS module.

First off, I think for this mapping use case we would better have a dedicated plug-in to experiment and develop proper dedicated functionality without overloading the general settings with options. Professor Taichi Furuhashi suggests the same in Ricoh360 blog The Potential of RICOH THETA X Built-in GPS Function - Interview with Professor Furuhashi, Aoyama Gakuin University l RICOH360 Blog.

Currently, the most convenient way to capture for mapping are the 8K 2 and 10 fps video modes, which includes the Google defined Camera Motion Metadata, CaMM track in the video file; Camera Motion Metadata Spec | Street View Publish API | Google for Developers. Theta X writes the case 2, 3 and 5 type data packets. I think it may be useful to write a case 6 type GPS information data packet instead of the case 5 type and add a case 7 type with the compass data.

In the blogs Ricoh expresses ‘The ability to acquire location information at short intervals during low frame rate video is one of the THETA X abilities that we hope will be utilized by construction and other industries.’. I think it is too much of an expectation that 3rd parties and users will each write code to extract video frames and calculate proper position for each from the CaMM. Dean Zwikel’s toolset helps in this. He can be contacted through Facebook Messenger.

Inspection of the Theta X CaMM track has shown that relative within the file, the first position data is timed about 0.8 seconds after the first video frame. As far as I know, video frame track and position data track are best be synchronized in time, which doesn’t seem to be the case here. Video frames in time before the first position data package will have to be dropped in the processing.

Theta X uses 1 Hz frequency only in the GNSS module to acquire position. However, it offers video frame rates of 2 and 10 fps.

Google guidelines for capturing are as follows.

Under 5 mph or 8 km/h for 1 FPS; walking.
Under 30 mph or 45 km/h for 5 FPS; bicycle, light motorcycle.
Under 45 mph or 70 km/h for 7 FPS; car, motorcycle.

Walking 2 fps, say I take 1 step of 1 meter each second, then we have a GNSS position for every meter covered and every other frame.

80 km/h 10 fps = 22 meters/sec 10 fps, then we have a GNSS position for every 22 meters and for 1 out of the 10 frames only. I think that in curves or with quick speed changes this leads to a position deviation that can be resolved by setting the GNSS module to a higher position acquisition frequency. That requires switching the GNSS module from the low power Super-E mode to the regular Continuous mode. The Continuous mode consumes more power, but I consider that no longer of the essence now that we can power the Theta with a powerbank.

ZOE-M8B_ProductSummary_(UBX-17012173).pdf (u-blox.com)

ZOE-M8B (u-blox.com)

The GNSS module needs to retain locks on the moving satellites while moving itself. The camera is not just moving forward, but swaying as well, depending on the transport mode. The GNSS module has settings to take this into account for performance and accuracy. Relevant settings are Pedestrian, Automotive, Bike. Refer paragraph 8.1 Platform settings and paragraph 32.10.19.1 Navigation engine settings in u-blox8-M8_ReceiverDescrProtSpec_UBX-13003221.pdf.

So, the user interface needed would have the following modes, avoiding separate settings.

pedestrian: video 8K 2fps, GNSS platform set to Pedestrian, Continuous mode (refer below on Galileo) 1 Hz refresh.
bicycle, light motorcycle: 8K 5fps, GNSS platform set to Bike, Continuous mode 3 Hz refresh.
motorcycle: 8K 10fps, GNSS platform set to Bike, Continuous mode 5 Hz.
car: 8K 10fps, GNSS platform set to Automotive, Continuous mode 5 Hz.

The frequencies are my educated guess, but I gladly leave that to the specialist to determine.

Last, but not least, the number of GNSS’ used simultaneously. It should be noted that now both BeiDou and Galileo are fully operational and generally deliver a higher precision than GPS and GLONASS. General rule of thumb is, that the more GNSS’ are used simultaneously, the better the positioning and less risk of having insufficient clear line of sight satellite locks. This is important especially in nature or city canyons. A test posted earlier by Sam Rohn in New York demonstrates serious position issues with current settings. The GNSS module is likely running in a manufacturer default mode with Super-E power saving and GPS, GLONASS only, while it can use 3 GNSS’ simultaneously. As power consumption while using a power bank is no longer an issue, I suggest improving by using Continuous mode and 3 of the GNSS’ always. Which 3 can automatically be determined by where on Earth the capture is and which 3 GNSS’ are most effective there, taking into account availability of support systems like SBAS too. I would definitely appreciate the use of Galileo.

That is it for now and I hope this helps to improve Theta X’s effectiveness and usability.

craig · October 21, 2024, 11:27pm

@jcasman if you want to summarize the main points and put into the agenda, you may want to reference the Camera Motion Metadata Spec for this section:

Theta X writes the case 2, 3 and 5 type data packets. I think it may be useful to write a case 6 type GPS information data packet instead of the case 5 type and add a case 7 type with the compass data.

To @jcasman I have not checked this

Inspection of the Theta X CaMM track has shown that relative within the file, the first position data is timed about 0.8 seconds after the first video frame. As far as I know, video frame track and position data track are best be synchronized in time, which doesn’t seem to be the case here.

You could attempt to verify and then put information in this document. I’ll change the document title from GPS to GNSS.

Google guidelines for capturing are as follows.

Under 5 mph or 8 km/h for 1 FPS; walking.

Under 30 mph or 45 km/h for 5 FPS; bicycle, light motorcycle.

Under 45 mph or 70 km/h for 7 FPS; car, motorcycle.

framerates are here theta-api-specs/theta-web-api-v2.1/options/file_format.md at main · ricohapi/theta-api-specs · GitHub

The 5fps may be newer?

@jcasman you’ll need to summarize the 2nd half of the article.

JoscTr · October 22, 2024, 2:18pm

@craig, @jcasman

I have that 0.8 sec time lag from an inspection by @Dean_Z.
Z1 used to have an issue with timing too. I do not not know if it ever got resolved. @Shotaro_Igami knows.

Those Google frame rate recommendations have been there all along in their Help system. The lower frame rates just avoid an excess of imagery, taking the camera movement speed into account. They save on needed stitching, battery, storage, upload and Google processing.

That year ago @Sam_Rohn position quality test was walking along a sidewalk in New York. The trace showed many anomalies. Likely caused by buildings on one hand blocking signals and signal reflections from buildings too. The usual city canyon issue. Of course those situations exist in mountainous nature too.

I feel camera manufacturers should get more serious on selection of antenna and fused GNSS / Inertial Measurement Unit module, as well as the tuning thereof, exposing usefull settings too.

jcasman · October 22, 2024, 8:36pm

@JoscTr Thanks for posting. I’m summarizing the details and will provide the information to a RICOH manager today. Just to be clear, I’m not suggesting it’s a solution. But hopefully we’re improving communications.

We work with RICOH to help build the community, we can:

Summarize your points (from multiple posts over time) to make it easier for RICOH to understand
Organize information into central document. We’ve already started improving it based on your information.
Discuss with product manager
Include the information in a written report to a wider group of managers, including engineering managers

To be clear:

We are unable to discuss directly with the engineer in charge of GNSS development
At this time, we will not communicate directly with the engineering team, only through the managers

craig · October 22, 2024, 8:39pm

@JoscTr first, thank you for all your help. The GNSS information is new to us.

I have made changes to this document, which is freely available to developers:

link to your article on this forum
changed title to IMU and GNSS Sensors (was GPS). I am going through the document to find other uses of GPS and will change to GNSS
reference to u-blox ZOE-M88 GNSS module based on u-blox press release

I will include the changes in the meeting agenda with RICOH and monthly community summary report to management.

I believe that @jcasman is working on a more detailed summary of your article.

In addition to your valuable feedback, our overall strategy is to get more feedback from companies that want to use the GNSS technology in the THETA X. We are using the document linked above to send information to developers in companies.

The combination of community and industry feedback may help RICOH in future product decisions.

JoscTr · October 23, 2024, 2:11pm

Sounds good @craig . I think Sam ones told me that even if we have some developers with whom we can talk directly, they cannot do anything without orders from managers.

It seems to me that a first step to improve on this is that developers create the fundamental api’s. Then Ricoh or someone with the skills could build a dedicated plug-in with easy settings combining api calls. There are more requirements for such a plug-in than the current topic.

I understand that Ricoh has a renewed focus on Construction, Architecture, Engineering and so. I think it helps to realize that “(3D) mapping” and creating “digital twins” is part of this and that position quality is a key quality aspect. Refer CupixVista | AI, 360° Photogrammetry, 3D Map.

My comment about a fused GNSS / IMU (Inertial Measurement Unit) is because such a fused uBlox module will use both technologies to determine position. If GNSS signal is lost, then it will be able to provide correct position for some covered distance. This is relevant for passing under bridges, tunnels etc. Even if there is sufficient GNSS signal, it will help to improve position. Of course this is a matter for future hardware.

(As for image quality, 8K is fine for now and there are sufficient other usual camera settings. I would appreciate an improvement in dynamic range. 8K 60 fps video capability would be nice for non mapping use. However, looking at how Insta360 is moving, I imagine Ricoh might want to stretch specs. I do not feel it to be necessary to stitch in real-time or on device at all if that would help realizing 11K video for mapping.)

JoscTr · November 5, 2024, 8:42pm

@craig, here a screen shot of an old Facebook post of mine. It shows positioning anomalies with X. The capture was on foot, mounted on helmet. The anomalies occur when I had to slow down or stop and wait.

JoscTr · November 5, 2024, 8:46pm

Following screen shot speaks for itself.

craig · November 5, 2024, 10:58pm

@JoscTr thanks for this information. I had not seen the interview before.

The Potential of RICOH THETA X Built-in GPS Function - Interview with Professor Furuhashi, Aoyama Gakuin University

Hopefully, as the THETA X is used more widely in the GIS and related areas, software fixes can be applied.

I’ll summarize the points as follows:

erroneous data sometimes occurs in Mapillary when the there are pauses or changes to a straight line, constant pace
request for option to prevent recording 8K or 5.7K low fps video (with camm GNSS data) unless the GNSS chip has a more stable satellite lock.

Another thing that I didn’t fully appreciate is that for A-GPS to work, the THETA X must first be connected to a WiFi router in client mode.

Thanks.

JoscTr · November 6, 2024, 12:14am

About your A-GPS, Assisted GPS, remark. Refer here Assisted GNSS - Wikipedia

On 1. The erroneous data does not occur in Mapillary. It is erroneous positioning by X. In this case Google Street View rejecting it (left pic). Mapillary accepts as recorded (right) pic, which explains the issue, showing unstable positioning at standstill.

craig · November 7, 2024, 6:53pm

updated points as follows:

Google Street View sometimes rejects the THETA X video file. We can use the Mapillary service to test the accuracy of the GNSS data recorded in the THETA images and get a better idea of why the rejection occurred. In this example, the GNSS data in the image appears to be inaccurate sometimes when there are pauses or changes to a straight line, constant pace. in this example, the inaccuracies appear to be when the THETA X is at a still position.
request for option to prevent recording 8K or 5.7K low fps video (with camm GNSS data) unless the GNSS chip has a more stable satellite lock.

JoscTr · November 8, 2024, 11:20pm

Hello @craig,

In the screenshot left is Google Street View Studio, right is Mapillary Uploader. Google Street View Studio is a web tool that does some local processing to trap first errors. The position path in the preview is not detailed enough to spot issues. Mapillary’s Windows/MAC uploader is an on-device app that extracts the frames from the video and gives a clearer plot of the positions. They have recently provided an update which allows for manual position correction of the selected video frames. Easiest to extract the GNSS data in standard format is one of the tools in the UL2GSV toolset of @Dean_Z (Dean Zwikel can best be contacted through Facebook Messenger). The extracted.gpx file can be read, plotted on map and edited with several gpx/gps tools. (GPS is most of the time still used as the general term, while that should be GNSS.) As Google Street View Studio does accept separate .gpx files too, this may help to overcome positioning issues. I have never tried with a video that has included position data to add an additional corrected .gpx file.
I am not sure if I understand your request properly. Please note the following. If one wants all of the video frames to be used in the processing, then the camera should start recording the CAMM track before the first video frame and stop after the last video frame. However, if one doesn’t want hand or face in the image while starting or stopping the camera, then the opposite is handy. It may be that Google processing itself skips start and ending frames when detecting hand or face. With Mapillary one can delete frames even after processing. Camera should not start recording at all if position is not yet stable after powering on the camera or awakening it from sleep. The indicator shows what is the GNSS reception state. While capturing, recording of position can never be stopped, as Google requires position data every second, even at standstill. If position data is lacking for 5 seconds or more at any point of the recording, then the video will be rejected. That is why the cameras position determination at standstill or very slow movement should not jump around too much, as otherwise the video gets rejected too, as shown in the screenshot. I do not know if the X used GNSS module with better settings can prevent this. A fused GNSS IMU module definitely can.

Hope this clarifies the matter.

Topic		Replies	Views
Using THETA 360 Video from a Drone 360 Media video	19	17964	August 1, 2019
Theta Z1 GPS track in VIDEO files 360 Media gps , video	17	4524	September 25, 2023
GPS Coordinates in Theta App too coarse and not enough decimal places in the Lat Long. Theta V 360 Media thetav , gps	25	5574	August 5, 2019
Ricoh Theta X, A-GPS working but is SBAS accessible ? Camera Hardware and Software gps	18	443	October 22, 2024
Introducing Street View Studio: The easiest workflow is coming to RICOH THETA users 360 Media thetaz1 , thetax , thetav , video , plugin	44	7721	November 19, 2022

Theta X for mapping. Can it do better in positioning quality?

Related topics