Search results for "AUDIO"
09:47

OVERTAKE partners with Walrus Protocol to advance on-chain game asset ownership

BlockBeats news, on August 14, according to official news, OVERTAKE announced a partnership with the Web3 Data Layer project Walrus Protocol to bring players complete ownership of in-game assets. With on-chain custody and Decentralization data management, the visual, audio, and Metadata within the game will achieve permanent storage, verifiability, and ownership by players. OVERTAKE is a peer-to-peer gaming asset trading market based on the Sui Network, enabling secure transactions through smart contracts custody.
More
WAL-2.69%
01:05

GARI (Gari Network) rose 272.26% in 24 hours.

Gate News Bot news, on August 14, according to CoinMarketCap, as of the time of writing, GARI (Gari Network) is currently reported at 0.00908468 USD, with a rise of 272.26% in the last 24 hours, reaching a maximum of 0.00968867 USD and a minimum of 0.0019731 USD. The current market capitalization is approximately 5.1 million USD, an increase of 3.73 million USD compared to yesterday, currently ranked 1434. Gari Network is the world's largest Web3 audio and video live streaming platform, ranked as the 8th most popular app in the Google Play store. The platform has over 7 million downloads and nearly 9900 daily on-chain active users. Chingari is a thriving community where users can engage in real-time conversations and receive expert consultations.
More
GARI287.63%
  • 1
  • 1
21:00

Thai Prime Minister Prayuth Chan-o-cha applied to the Constitutional Court for an extension to submit defense materials for the "audio recordings case."

On July 15, Thailand's Prime Minister's Secretary-General, Poonmin Lesuriad, revealed that Prime Minister Prayuth Chan-o-cha has applied to the Constitutional Court for a 15-day extension to submit defense materials for the "Audio Recording Case," citing the inability to complete all defense material preparations within the previously stipulated 15-day deadline by the court. Poonmin stated that such applications for extension are rights legally enjoyed by the respondent and are normal procedures in legal processes, with the decision to approve the extension needing to be reviewed and decided by the Constitutional Court.
More
04:12

Meta acquisition of voice startup PlayAI to enhance audio technology

Meta Platforms acquisition of voice artificial intelligence startup PlayAI to enhance its audio technology and AI initiatives. The PlayAI team will join Meta to support its AI roles and audio content development, aligning with the technology industry's trend towards conversational interfaces.
More
06:28

Elon Musk's xAI releases Grok 4: Breakthroughs and Challenges of the New Generation of Artificial Intelligence

Elon Musk's xAI recently released its latest artificial intelligence model—Grok 4, which serves as a direct competitor to OpenAI's GPT-5. Despite experiencing latency in the release process, Musk talked about Grok 4's significant advancements in multimodal capabilities during a live broadcast, enabling reasoning and responses across text, images, and audio. However, the launch of Grok 4 has not been without controversy. Recently, Grok has faced widespread criticism for generating inappropriate content (such as "MechaHitler"), raising ethical questions about AI outputs. Additionally, X's CEO Linda Yaccarino resigned due to the negative impact associated with Grok, further intensifying concerns over the regulation and ethical framework of xAI. Nevertheless, xAI continues to roll out new features, including a high-performance subscription tier named SuperGrok Heavy, which offers advanced reasoning, coding tools, and priority support. While it has not yet been confirmed whether full API access will be provided, some endpoints are already live, and broader access is expected soon.
More
ELON-2.68%
XAI-4.28%
GROK-3.3%
  • 1
09:23

Juchip Technology: Achievements in the Promotion of New Edge AI Audio Chip Products

Jin10 data reported on June 18th that Juchip Technology announced the launch of its new edge AI audio chip series based on in-storage computing technology, including the ATS323X, ATS286X, and ATS362X product lines. Among them, the ATS323X series chips achieved rapid volume after the client's first terminal product went into mass production in a short period, marking a phased success in the promotion of edge AI new products.
More
ATS7.24%
11:45

The Indian CBI has cracked a transnational Crypto Assets scam, seizing $327,000 in virtual assets.

Gate News bot news, the Central Bureau of Investigation (CBI) in India has arrested suspect Rahul Arora in New Delhi. Law enforcement officers seized approximately $327,000 in Crypto Assets and $26,400 in cash during the operation. According to Decrypt, this case involves an online scam targeting users in the United States and Canada. The suspect carried out the scam by disguising as government officials and technical support personnel. During the investigation, law enforcement also seized multiple tools used in the crime, including caller ID spoofing software, social engineering tools, and related audio recordings. The CBI has established a dedicated system to manage this batch of seized virtual digital assets.
More
06:18

Kimi releases a brand new universal audio foundation model Kimi-Audio

Jin10 data reported on April 26th, today, Kimi released a new Open Source project - the brand new general audio foundation model Kimi-Audio. According to the introduction, this model supports various tasks such as speech recognition, audio understanding, audio to text, and voice dialogue.
More
AUDIO-1.46%
01:31

Canalys: It is expected that the global shipments of personal smart audio devices will reach 500 million units by 2025.

On March 4th, Jinshi Data News, Canalys' latest data shows that the global shipment of personal smart audio devices (including TWS, wireless headphones, and wireless neck-hanging headphones) reached 455 million units in 2024, a rise of 11.2% year-on-year. Canalys holds a cautiously optimistic attitude towards the rise in the market in 2025, expecting the global shipment of personal smart audio devices to reach 500 million units in 2025.
More
14:18

Rockchip: SoC chips have been applied to robots in various forms

Jinshi data news on December 18th, Rui Xingwei said on the interactive platform that the company has SoC chips applied in a variety of forms of robots, and has a certain market share in the field of robots. The company's high-performance general-purpose processor chip can handle data processing functions in robots, and has the ability to run AI models at the edge; the company's products in the field of machine vision can undertake visual perception processing functions in robots; the company's audio products can provide audio interaction capabilities for robots.
More
09:13

泰凌微: Release machine learning and artificial intelligence development platform TLEdgeAI-DK

Jinshi Data News on December 17th, Terminus Micro announced that the company recently released the TLEdgeAI-DK, a machine learning and artificial intelligence development platform based on the TL721x and TL751x chips. The company has successfully integrated edge AI machine learning models into smart home and smart audio products using the TLEdgeAI-DK platform, achieving close integration with practical applications. Additionally, the company is collaborating with more users and strategic partners to develop innovative products with edge AI capabilities suitable for different application fields. The release of the TLEdgeAI-DK platform will enhance the company's competitiveness in related fields and further open up the huge market that requires both wireless connectivity and edge AI computing capabilities, foreseeing a positive impact on the company's future market expansion and performance growth.
More
X-4.19%
11:00

AI voice company WaveForms completes $40 million seed round of financing, led by a16z

WaveForms is an AI voice start-up founded by former OpenAI researcher Alexis Conneau. The company has raised $40 million in seed funding, with a valuation of $200 million, and aims to develop AI audio software that can capture emotional cues and achieve more natural speech interactions, enhancing the experience of voice conversations between humans and machines.
More
  • 1
06:20

Canalys: In the third quarter, the global shipments of personal smart audio devices rose 15% year-on-year.

On November 18th, Jinshi data, Canalys report showed that in the third quarter of 2024, the global personal smart audio device market experienced a strong Rebound, with a total shipment of nearly 126 million units, a rise of 15% year-on-year. This marks the third consecutive quarter of rise in the market, indicating that it has emerged from the plight encountered in 2023 and achieved sustained recovery.
23:38

Audio streaming platform Spotify resumes service

Jinshi data news on September 30th, according to the network condition monitoring website DownDetector, the audio streaming media platform Spotify has returned to normal after experiencing a failure for about three hours on Sunday, affecting more than 40,000 users in the United States at its peak.
06:44

Canalys: Q2 2024 intelligent personal audio market rose 10.6% year-on-year

According to the latest research from Canalys, the smart personal audio market (including TWS, wireless in-ear headphones, and wireless over-ear headphones) rebounded strongly in the second quarter of 2024, with significant rises in multiple segments. The total shipment reached 106 million units, a rise of 10.6% year-on-year, setting a record for the highest shipment in the history of the second quarter. TWS and wireless headphones are the driving forces behind the rise, reaching 77 million units and 15 million units respectively.
03:22

Alitongyi Open Source audio language model Qwen2-Audio, related paper selected for top conference ACL 2024

Jinshi data, August 13 news, Ali Tongyi's large model continues to be Open Source, and the Qwen2 series Open Source family has added the audio language model Qwen2-Audio. Qwen2-Audio can directly perform voice Q&A without the need for text input, understand and analyze the audio signals input by users, including human voice, natural sound, music, etc. The model has significantly surpassed the previous best models in multiple authoritative evaluations. Tongyi team also simultaneously released a new trap audio understanding model evaluation Benchmark, and the related paper has been selected for the international top conference ACL2024 being held this week.
AUDIO-1.46%
  • 3
18:14
Odaily Planet Daily News OpenAI has released its latest flagship model, GPT-4o, which can perform real-time inference on audio, visual, and text data. It is designed to be a personalized voice interactive assistant with human-like, supernatural, and ultra-low latency capabilities. According to the official website of OpenAI and the official account of the X platform, the "o" in GPT-4o stands for Omni, representing a step towards more natural human-machine interaction. It accepts arbitrary combinations of text, audio, and images as input, and supports generating arbitrary combinations of text, audio, and image outputs. It can respond to audio input within 232 milliseconds on average, which is similar to human reaction time in conversations. In terms of English and code, its performance is comparable to GPT-4 Turbo, with significant improvements in non-English language texts. Additionally, the API speed is faster and costs 50% less. Compared to existing models, GPT-4o excels in visual and audio understanding. Text and image input will be launched today in API and ChatGPT, while voice and video input will be launched in the coming weeks.
TURBO-4.94%
OMNI-4.05%
GPT-1.12%
  • 1
08:44
According to the report, this month, with the decrease in the heat of the Spring Festival, the number and intensity of activities of some top products on mobile and client platforms have decreased, and the performance of new products has been unable to support the increase. As a result, the domestic mobile client game market has shown a month-on-month decline. In March 2024, the revenue of the Chinese game market was 23.417 billion yuan, a month-on-month decrease of 5.86% and a year-on-year increase of 7.18%. Among them, the actual sales revenue of the Chinese mobile game market was 16.953 billion yuan, a month-on-month decrease of 7.13% and a year-on-year increase of 8.75%. The actual sales revenue of the Chinese client game market was 5.528 billion yuan, a month-on-month decrease of 2.02% and a year-on-year decrease of 0.39%. The actual sales revenue of Chinese self-developed games in the overseas market was 1.427 billion yuan, a month-on-month increase of 5.98% and a year-on-year increase of 11.34%.
  • 1
04:17
1. The United States and China announced that they will hold the first meeting of the U.S.-China Intergovernmental Dialogue on Artificial Intelligence. 2. The world's first full-scale humanoid robot "Tiangong" with pure electric drive for anthropomorphic running was released3. Apple has restarted negotiations with OpenAI to add new AI features to new products. 4. Musk: TSL will invest about $10 billion this year in AI training and inference in the automotive field. 5. The U.S. government reportedly set up an AI Security Committee, whose members include executives from tech giants such as Jensen Huang and Sam Altman. 6. China's version of Sora-level video model Vidu released: it can generate up to 16 seconds and up to 1080p video. 7. Nvidia Jensen Huang says AI won't completely replace human jobs. 8. The EMO model will be fully launched on the Tongyi App for free, and the cooperation with enterprise customers will be opened as soon as possible. 9. Tsinghua University established the School of Artificial Intelligence, and the first dean was Academician Yao Chizhi, a Turing Award winner. 10. Danghong Technology released the longest audio-visual model BlackEye. 11. General k question Open Source k billion-level parameter model.
APP3.02%
04:41
Golden Ten Data on April 28, according to Danghong Technology, at the "2024 Zhongguancun Forum - UHD Audiovisual Technology Innovation and Development Forum" held on the afternoon of April 27, Danghong Technology and Beijing Economic and Technological Development Zone jointly released the BlackEye large model base and application scenarios. BlackEye integrates long kinds of Depth neural network components, including Transformer, Diffusion and other components, through text, image, video and audio, 3D model and other long modal encoding, decoding, long modal latent short alignment, long modal language reasoning and generation and other technologies, to achieve inference and prediction generation between different modal information.
12:17
On April 27th, Jin10 News learned that the "Beijing Ultra HD Audiovisual Pioneer Action Plan (2024-2026)" was officially released at the Zhongguancun Forum's "Ultra HD Audiovisual Technology Innovation and Development Forum". The plan, jointly compiled by the Beijing Municipal Radio and Television Bureau and relevant departments, is committed to providing strong policy support for the development of the ultra HD audiovisual industry, and helping Beijing to continue leading the country in the research and application of ultra HD audiovisual industry. The action plan proposes ten major support directions: supporting the construction of ultra HD content zones, supporting ultra HD technology filming, supporting lightweight production of ultra HD content, supporting the construction of ultra HD channels, supporting the ultra HD household penetration initiative, supporting innovation in ultra HD content distribution models, supporting the construction of a digital asset sharing platform, supporting post-production of ultra HD audiovisual content, supporting the development of ultra HD audiovisual industry parks, and supporting the cultivation of ultra HD application scenarios.
07:36

Perfect World Games and NVIDIA continue to explore the application of AI in gaming scenarios

According to the latest news from Perfect World Games' official WeChat, in the early morning of March 19, Beijing time, the NVIDIA AI Conference (NVIDIA GTC 2024) was held at the SAP Center in San Jose, California, USA. NVIDIA CEO Jensen Huang spoke on the topic of "Witnessing AI's Transformative Moment" and shared how NVIDIA's accelerated computing platform is driving the next wave of AI, digital twins, cloud technologies, and sustainable computing. GTC also announced that Perfect World Games' Xianxia MMORPG terminal game "Zhuxian World" has officially connected to NVIDIA's Audio2Face technology (generative AI easily converts audio into animation technology), and used this conference to show the global audience the results of the combination with "Zhuxian World", and the two sides will continue to maintain close exchanges and cooperation in multiple fields and scenarios of AI in the future.
More
GTC-1.67%
AUDIO-1.46%
03:00

The new model Sora exploded, and the industry has two hidden dangers in addition to shock

Recently, OpenAI's new model "Sora" has attracted attention, and the New York Times reported that OpenAI's valuation may now reach about $80 billion. On the one hand, the new model Sora has shocked the content production industry, and on the other hand, the market is also paying attention to its two hidden dangers. First of all, an executive of an advertising company said that there are still some questions about content copyright. At present, OpenAI does not disclose the number of videos involved in the training of the model and its specific sources, but only said that all training materials are from public sources or authorized content. Another concern is Depth Fake. This year is a big year for elections around the world, which will affect more than 4 billion people, including Long Long countries. AI Depth Fake technology may generate a large number of fake audio, video, and images to influence elections.
More
  • 1
06:41

Gate Learn Bitcoin Inscription Discovery Journey to win great Chinese New Year gifts

Gate Learn has recently added a new label for "Inscriptions". Inscription refers to any data written on the Blockchain, a kind of text that can write text, pictures, videos and audio into BTC for inscription, it is a creative and unknown field. Do you want to learn about this technology that is leading the future of digital art? Come and join us for learning about Bitcoin inscriptions and great prizes! Event Period: January 29, 2024 - February 12, 2024 (14 days) Activity 1: Invite friends to participate in the weekly inscription quiz An article related to the inscription will be updated every day, and points can be earned by learning to answer questions and invite friends, and different tasks can get different point values. At the end of each period, you will be able to share $4,000 in rewards according to the proportion of your accumulated total points! The top 10 points will receive an additional 10 points. The prize pool grows according to the number of participants, and the more participants there are, the more chances each user has to win higher prizes Activity 2: Inscription Knowledge Sharing Session In order to let users have a deeper understanding of inscription technology, we will share the basics, technical principles and application scenarios of Bitcoin inscriptions with you on our official Twitter and dynamic circles. Follow and leave a message, ask questions and discuss in the comment area of the inscription sharing post, and we will draw 50 lucky users in the comment area to give away 5 point cards as rewards. For details, please visit Gate.io's official website.
More
BTC0.05%
09:25
According to upstream news reports, as AI has gradually become a hot topic in the industry, Ximalaya is also actively embracing the AI trend, helping platform anchors reduce costs and increase efficiency through AI technology, and improve content production capacity. Recently, Ximalaya held the "2024 Drama Elite Festival" audio anchor annual ceremony in Chengdu, and Ximalaya presented awards to outstanding creators on the platform, among which the "Best AI Producer" and "Best AI Production Team" awards were awarded to outstanding creators who embrace AI with the platform. In 2023, Ximalaya will use the "human-machine combination" model of AI producers to double the output of high-quality content, and AI will continue to iterate to achieve "hyper-realism", which will also promote the rapid improvement of the quality and quantity of free content. In 2024, in addition to continuing to use AI technology to help audio generation, Ximalaya will also use AI to improve the production process, help anchors greatly drop production costs, and make every anchor a drama club.
  • 1
01:32

Researchers say AI can mimic human handwriting

With the help of artificial intelligence (AI) tools, people have been able to produce audio and video that is realistic enough. And soon, AI could also be used to mimic a person's handwriting. Researchers at a university in Abu Dhabi say they have developed technology that mimics someone's handwriting based on several paragraphs of handwritten material. To do this, researchers at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) used a transformer model, a neural network that traces the relationships between series of data and connects them to context. MBZUAI, which claims to be the world's first AI university, has received a patent from the U.S. Patent and Trademark Office for the AI system.
More
04:22
According to IT Home on January 8, antivirus software company McAfee recently launched a new project "Project Mockingbird" (robinbird) to detect and block AI-generated voice scams, and the official claim that the success rate of the project is more than 90%. It is reported that "Mockingbird" has AI-driven "Depth Fake" audio detection technology, and McAfee CTO Steve Grobman introduced that this technology will be officially announced during CES 2024.
03:34
NVIDIA NeMo, an open source conversational AI toolkit, announced the Parakeet ASR model family, a series of state-of-the-art automatic speech recognition (ASR) models capable of transcribing spoken English with outstanding accuracy, as reported by Webmaster Home on January 8. Nvidia announced four Parakeet models that are based on the RNN Transducer/Connectionist Temporal Classification decoder and have 0.6-110 million parameters. They are able to handle a wide range of audio environments and, after training on only 64,000 hours of datasets, achieve excellent word error rate (WER) performance on the Benchmark dataset, outperforming previous models. According to the developers, the models are robust to non-speech segments such as music and mute, and outperform OpenAI's Whisper v3 in benchmark testing. They also provide user-friendly integration into the project with pre-trained control points.
ASR-5.75%
  • 1
07:57
According to IT House, the U.S. Federal Trade Commission (FTC) recently announced a bounty order to find a way to distinguish whether a sound is made by a real human or generated by AI. According to the FTC, participants can revolve around the following three points: prevention or authentication: a way to restrict the use or application of voice cloning software by unauthorized users, real-time detection or monitoring: a way to detect cloned voices or use voice cloning technology must be provided, and follow-up evaluation: a way to check whether audio clips contain cloned voices. The overall winner of the competition will receive $25,000 and the runner-up will receive $4,000, with up to three honorable mentions (one for each intervention point) each awarded $2,000.
  • 1
02:56
According to a report by the webmaster's home on December 22, Meta has recently released a series of AI translation models, which achieve real-time voice conversion latency of no more than 2 seconds, support multiple language translations, and have the ability to imitate characteristics such as tone, speech speed, and emotion. This family of models, called Seamless Communication, includes SeamlessExpressive, SeamlessStreaming, SeamlessM4 T v2, and Seamless, the first three of which have been open-sourced on GitHub. To ensure translation accuracy and avoid abuse, Meta employs toxicity mitigation technology that filters out "toxic content" before training and automatically detects and adjusts the generated toxic words during translation generation, while watermarking the audio to trace the source. To prevent the risk of abuse, Meta has also added a watermark to the audio, which allows you to accurately trace the source of the audio and combat various attack vectors by embedding an imperceptible signal in the audio.
06:20

The China Audio-video and Digital Publishing Association released the group standard of "Guidelines for the Application of Generative AI Technology in the Publishing Industry".

The China Audio-Video and Digital Publishing Association issued an announcement on the group standard "Guidelines for the Application of Generative AI Technology in the Publishing Industry". In accordance with the relevant requirements of the "Provisions on the Management of Group Standards of the China Audio-video and Digital Publishing Association", the group standard "Guidelines for the Application of Generative Artificial Intelligence Technology in the Publishing Industry" is hereby approved for release after procedures such as project review, standard drafting, solicitation of opinions, and review by the expert group, and has passed the review of the Youth League Standards Committee. It will be implemented from January 20, 2024.
More
08:55
On December 20, the China Audio-video and Digital Publishing Association issued an announcement on the group standard "Guidelines for the Application of Generative Artificial Intelligence Technology in the Publishing Industry". In accordance with the relevant requirements of the "Provisions on the Management of Group Standards of the China Audio-video and Digital Publishing Association", the group standard "Guidelines for the Application of Generative Artificial Intelligence Technology in the Publishing Industry" is hereby approved for release after procedures such as project review, standard drafting, solicitation of opinions, and review by the expert group, and has passed the review of the Youth League Standards Committee. It will be implemented from January 20, 2024.
  • 1
Load More
Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate app
Community
English
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)