What Is Speaker Tracking And Auto Shooting?
Whether working at home or holding business meetings with people from other countries during the period of novel coronavirus, distance is no longer a limit. Thanks to advanced video conferencing equipment. Easy connection through video conference camera to improve communication efficiency.
With the video conference camera using AI technology, you can clearly see and hear every participant in the conference room. You don’t have to worry about having too many or too few people in an inappropriate framework. This is because the AI video conference equipment we use has speaker tracking and automatic shooting functions.
The video conference camera can accurately capture and track conversations, and these two functions work together. This document details what is a speaker tracking and automatic photography video conferencing camera.
What is Speaker Tracking?
Tracking a speaker means that the camera identifies the person who is speaking in the room and focuses until he stops. The speaker tracking function makes the video conference more smooth, and all participants can clearly see the speaker’s expression and body language, just as he said in front of you.
The tracking of the speaker plays the role of focusing the camera on the speaker for several seconds when he stops speaking until he catches a new speaker, and the camera switches the focus to the new speaker. If no one speaks for a period of time, the camera will automatically shrink and set the screen in the whole team or meeting room.
What do I need for speaker tracking? Beam forming microphones and cameras must work together for best results. The microphone first detects the source of the sound and guides the camera to focus there. The camera will then pan, tilt, and zoom physically or digitally depending on the location of the sound source. No matter how far or near you are from the camera, you can get all the focus in the room when you speak.
The video conference camera we used to use can also take pictures of the entire conference room and all participants, but the participants in the distance look very small on the screen, and can neither be seen nor heard on the other side of the screen. This is the problem that the speaker tracking of AI video function solves in the current technology. This is very common, especially in medium-sized and large conference rooms. The speaker tracking video conference camera can solve this problem, so that all participants can experience a smooth cinema level video conference, improve work efficiency and reduce communication costs.
If you can’t imagine how this function works on the basis of text description, let me take the product with high price in the market as an example. The Nexvoo N110 is an integrated video bar with a 4K ultra-high definition camera, a 120 degree field of view, and a 6 meter pickup distance.
N110 uses advanced AI algorithms to achieve intelligent interlocutor tracking, ensuring that everyone in the conference room can be captured and included. More refined details and texture images allow you to have a more enjoyable time in the conference.
What is automatic shooting?
Automatic shooting is face recognition, Combining the composition algorithm based on the trichotomy principle and pixel level super resolution, create a portrait with the best composition in the detected face image. The automatic positioning function can automatically identify all participants in the conference room, use real-time face detection and positioning, adjust the camera according to the number and position of participants, and Overwrite each participant.
For example, When using the Nexvoo N109, participants can detect when entering or leaving the judgment area. If a person is in the conference room, the camera will focus on itself through the automatic photography function, and follow when changing the position. When moving from the front of the camera to other corners of the conference room, the camera will adjust the focus. You can change, capture you in the corner, and place you in the center of the screen.
After adding new participants, the camera will be reduced to include new participants. This way, the view of all participants will be more complete, rather than moving the camera. The autofocus function will improve the autofocus level. After participating in a video conference, you do not need to adjust the camera angle to concentrate on communication.
The automatic frame video conference equipment function is applicable to conference rooms of various sizes. In medium and large conference rooms, when the number of participants reaches a certain level, the automatic positioning function will expand the recognition range to capture more participants.
Since it is impossible to force all participants to be completely fixed in one position and maintain a fixed distance from the camera, it is necessary for the speaker to track the automatic shooting function of the video conference equipment to avoid this embarrassing situation. In this case, the camera can take pictures for all positions of the participants in as many fields of vision as possible. Longer focal length and wider wide-angle are required. For example, the Nexvoo N120 is a dual camera video strip with a 6-meter focal length and a 120 degree field of view, which can seamlessly conduct video conferencing in medium or large conference rooms.
Intelligent tracking through interlocutor tracking and automatic regional cooperation
although the principles of speaker tracking and automatic camera are different, they are not completely independent and can work together. The automatic positioning function can immediately capture the people in the conference room and detect the real-time position. On this basis, the speaker tracking function can identify the speakers in real time, enlarge the frame, and center the speakers in the view. Like the Nexvoo N110 and N120, it provides speaker tracking and automatic positioning functions.
The N120 offers dual cameras, long focal length and wide viewing angle. In addition, you can automatically optimize the brightness according to the lighting conditions to effectively prevent the meeting room from being too dark. The resulting negative impact makes the other party see more clearly the detailed actions of all participants in the conference room.