Gerald Friedland

Gerald Friedland is a German-American computer scientist and author specializing in multimedia computing, machine learning, and artificial intelligence. He is a principal scientist at Amazon Web Services and a professor at the Electrical Engineering and Computer Science Department of the University of California, Berkeley. He focuses on AutoML and generative AI. His work has advanced large-scale multimedia analysis, privacy-aware AI, and explainable machine learning.^[1]^[2]

Education

Friedland completed his education in Germany, earning his Abitur in 1998.^[3] He received a Master of Science in Computer Science with a minor in Linguistics from Freie Universität Berlin in 2002.^[4] His master’s thesis, "Towards a Generic Cross Platform Media Editor: An Editing Tool for E-Chalk," was recognized as the best computer science master’s thesis in German-speaking countries by the German Association for Computer Science.^[5]

In 2006, Friedland earned his Ph.D. in Computer Science from Freie Universität Berlin, graduating summa cum laude. His dissertation, "Adaptive Audio and Video Processing for Electronic Chalkboard Lectures," was nominated for the university's Ernst-Reuter Award.^[6]^[7]

Career

Friedland began his career in academia as a research associate in the AI (Machine Learning) group at Freie Universität Berlin from 2002 to 2006. During this time, he developed the "Simple Interactive Object Extraction (SIOX)" algorithm^[8], now widely used in open-source tools like GIMP and Blender and conducted research on lecture webcasting technologies.^[9]

From 2006 to 2021, Friedland was affiliated with the International Computer Science Institute (ICSI) in Berkeley, California. He held various roles, including Senior Research Scientist and principal investigator. As a Principal Data Scientist at Lawrence Livermore National Laboratory (2016–2019), Friedland led a team addressing machine learning challenges for multimedia and simulation data.^[10]

In 2014, he founded Audeme, a company developing cloud-independent speech recognition hardware.^[11] He also co-founded Brainome, Inc., where he led a team to develop no-code machine learning solutions, leveraging tools like PyTorch and NumPy.^[4]^[12]

Friedland served as director of conferences for ACM SIGMM (2017–2021), program co-chair for ACM Multimedia (2017), and associate editor for IEEE Multimedia Magazine and ACM Transactions on Multimedia Computing.^[13]^[14]

Research

Friedland is a computer scientist specializing in the processing and analysis of multimedia data and machine learning.^[15] He is mostly known as the original author of the widely used "Simple Interactive Object Extraction" image and video segmentation algorithm,^[16]^[8]^[17]^[18]^[19]^[9]^[20]^[21] created as part of his PhD thesis,^[22]^[23] and as the co-author of a textbook on Multimedia Computing.^[24] He also led the initiative to create and release the YFCC100M corpus (see also: List of datasets for machine learning research),^[25]^[26]^[27] the largest freely available research corpus of consumer-produced videos and images. He co-founded the field of geolocation estimation for images and videos, sometimes also referred to as placing.^[28]^[29]^[30] Friedland also frequently uncovers privacy risks in multimedia publishing practice^[31]^[32]^[33]^[34]^[35]^[36]^[37]^[38] and heads the development of the teachingprivacy.org^[39] portal which provides educational materials for use in US high-schools as part of the AP Computer Science Principles and the Code.org initiative. Friedland is also the co-creator of MOVI, an open-source speech recognition board that allows the creation of cloudless voice interfaces^[40] for Internet of things devices.

Awards

UNESCO IRCAI Global Top-100 AI Project (2021) for his measurement-based approach to AI
AI2000 Most Influential Scholar of the Decade (2009–2019)
ACM Multimedia Grand Challenge Winner (2009)
Best Paper Award at the IEEE International Conference on Multimedia Big Data (2019)
Make Magazine Editor’s Choice Award (2015)

Publications

Friedland has authored six books, including:

Information-Based Machine Learning: Data Science as an Engineering Discipline (Springer-Nature, 2023).
Introduction to Multimedia Computing (Cambridge University Press, 2014).
Beginning Programming Using Retro Computing (Apress, 2018).

He has also published over 100 peer-reviewed journal and conference articles on topics ranging from machine learning to multimedia computing.^[15]

References

^ "Gerald Friedland | EECS at UC Berkeley".
^ "Gerald Friedland".
^ "Refubium - Suche".
^ ^a ^b "Brainome launches product to optimize machine learning development process". ZDNet.
^ "Error".
^ "Entropy discussion group". 23 August 2019.
^ Friedland, Gerald "Information-Driven Machine Learning: Data Science as an Engineering Discipline", Springer-Nature, January 2024.
^ ^a ^b "SIOX".
^ ^a ^b "Fiji plugin based on the SIOX project to segment color images: Fiji/Siox_Segmentation". GitHub. June 2019.
^ "Gerald Friedland | ICSI". www.icsi.berkeley.edu. Retrieved 2024-12-19.
^ "An interview with Bertrand and Gerald of Audeme | The Amp Hour Electronics Podcast". theamphour.com. 2015-07-16. Retrieved 2024-12-19.
^ Woodie, Alex (2020-11-04). "Brainome Right-Sizes Your Data Before ML Training". BigDATAwire. Retrieved 2024-12-19.
^ "New SIGMM Leadership Announced | ACM SIGMM - the Special Interest Group on Multimedia". www.sigmm.org. Retrieved 2024-12-19.
^ "Gerald Friedland - Home". Author DO Series. Retrieved 2024-12-19.
^ ^a ^b Google Scholar list of publications: https://scholar.google.com/citations?user=iBl-QgEAAAAJ
^ "Algorithm - What are the standard techniques for removing a segmentation (Such as a human or bird) from a video?".
^ "Using GIMP's Foreground select tool". 31 August 2013.
^ "Paintshopprotutorials.co.uk".
^ "Kutout - an application for cutting out images | Hook - Labs". Archived from the original on 2017-07-24. Retrieved 2017-07-16.
^ "SIOX: Simple Interactive Object Extraction".
^ Shoou Jiah Yiu, Gerald Friedland: "Method and system for identifying objects in images" US Patent Application US20170132469A1
^ Gerald Friedland: "Adaptive Audio- und Videoverarbeitung für elektronische Kreidetafelvorlesungen", Freie Universitaet Berlin, October 2006. http://www.diss.fu-berlin.de/diss/receive/FUDISS_thesis_000000002354
^ Gerald Friedland: "Adaptive Audio and Video Processing for Electronic Chalkboard Lectures", Lulu Publishing, ISBN 978-1430303886, December 2006. 2016 reprint: ISBN 978-3-659-97771-8, Lambert Publishing, November 2016.
^ Friedland, Gerald and Jain, Ramesh "Multimedia Computing", Cambridge University Press, October 2014.
^ Bart Thomee, David A. Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, Li-Jia Li. "YFCC100M: The New Data in Multimedia Research". Communications of the ACM, Vol. 59 No. 2, Pages 64-73
^ YFCC100M: YFCC100M
^ The Multimedia Commons
^ Gerald Friedland, Oriol Vinyals, and Trevor Darrell: "Multimodal Location Estimation", in Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, October 2010, pp. 1245-1251.
^ Choi, Jaeyoung, Friedland, Gerald "Multimodal Location Estimation of Videos and Images", Springer Publishing October 2014
^ Nils Peters, Howard Lei, Gerald Friedland: "Room identification using acoustic features in a recording", US Patent US20140161270A1
^ Web Photos That Reveal Secrets, Like Where you Live (New York Times, Aug 11, 2010)
^ Tips to Turn Off Geo-Tagging on Your Cell Phone (ABC News, Aug 20, 2010)
^ Could you fall victim to crime simply by geotagging location info to your photos? (Digital Trends, Jul 22, 2013)
^ Ways to Avoid Email Tracking (New York Times, Dec 25, 2014)
^ BodyWorn, the police-worn camera that aims to reduce crime (Fox News, May 19, 2015)
^ Paris ISIS Attacks: Tech Industry Says 'Anti-Terror' Back Doors Would Make US Less Safe (International Business Times, Nov 18, 2015)
^ Why our Crazy Smart AI still sucks at Transcribing our Speech (Wired Magazine, Apr 8, 2016)
^ Transcribing Audio Sucks—So Make Machines Like Trint Do It (Wired Magazine, Apr 26, 2017)
^ "Teaching Privacy".
^ Gerald Friedland Bertrand Irissou: Method of facilitating construction of a voice dialog interface for an electronic system, US Patent Application US15382163.

[1] "Gerald Friedland | EECS at UC Berkeley".

[2] "Gerald Friedland".

[3] "Refubium - Suche".

[:0-4] "Brainome launches product to optimize machine learning development process". ZDNet.

[5] "Error".

[6] "Entropy discussion group". 23 August 2019.

[7] Friedland, Gerald "Information-Driven Machine Learning: Data Science as an Engineering Discipline", Springer-Nature, January 2024.

[:1-8] "SIOX".

[:2-9] "Fiji plugin based on the SIOX project to segment color images: Fiji/Siox_Segmentation". GitHub. June 2019.

[10] "Gerald Friedland | ICSI". www.icsi.berkeley.edu. Retrieved 2024-12-19.

[11] "An interview with Bertrand and Gerald of Audeme | The Amp Hour Electronics Podcast". theamphour.com. 2015-07-16. Retrieved 2024-12-19.

[12] Woodie, Alex (2020-11-04). "Brainome Right-Sizes Your Data Before ML Training". BigDATAwire. Retrieved 2024-12-19.

[13] "New SIGMM Leadership Announced | ACM SIGMM - the Special Interest Group on Multimedia". www.sigmm.org. Retrieved 2024-12-19.

[14] "Gerald Friedland - Home". Author DO Series. Retrieved 2024-12-19.

[:3-15] Google Scholar list of publications: https://scholar.google.com/citations?user=iBl-QgEAAAAJ

[16] "Algorithm - What are the standard techniques for removing a segmentation (Such as a human or bird) from a video?".

[17] "Using GIMP's Foreground select tool". 31 August 2013.

[18] "Paintshopprotutorials.co.uk".

[19] "Kutout - an application for cutting out images | Hook - Labs". Archived from the original on 2017-07-24. Retrieved 2017-07-16.

[20] "SIOX: Simple Interactive Object Extraction".

[21] Shoou Jiah Yiu, Gerald Friedland: "Method and system for identifying objects in images" US Patent Application US20170132469A1

[22] Gerald Friedland: "Adaptive Audio- und Videoverarbeitung für elektronische Kreidetafelvorlesungen", Freie Universitaet Berlin, October 2006. http://www.diss.fu-berlin.de/diss/receive/FUDISS_thesis_000000002354

[23] Gerald Friedland: "Adaptive Audio and Video Processing for Electronic Chalkboard Lectures", Lulu Publishing, ISBN 978-1430303886, December 2006. 2016 reprint: ISBN 978-3-659-97771-8, Lambert Publishing, November 2016.

[24] Friedland, Gerald and Jain, Ramesh "Multimedia Computing", Cambridge University Press, October 2014.

[25] Bart Thomee, David A. Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, Li-Jia Li. "YFCC100M: The New Data in Multimedia Research". Communications of the ACM, Vol. 59 No. 2, Pages 64-73

[26] YFCC100M: YFCC100M

[27] The Multimedia Commons

[28] Gerald Friedland, Oriol Vinyals, and Trevor Darrell: "Multimodal Location Estimation", in Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, October 2010, pp. 1245-1251.

[29] Choi, Jaeyoung, Friedland, Gerald "Multimodal Location Estimation of Videos and Images", Springer Publishing October 2014

[30] Nils Peters, Howard Lei, Gerald Friedland: "Room identification using acoustic features in a recording", US Patent US20140161270A1

[31] Web Photos That Reveal Secrets, Like Where you Live (New York Times, Aug 11, 2010)

[32] Tips to Turn Off Geo-Tagging on Your Cell Phone (ABC News, Aug 20, 2010)

[33] Could you fall victim to crime simply by geotagging location info to your photos? (Digital Trends, Jul 22, 2013)

[34] Ways to Avoid Email Tracking (New York Times, Dec 25, 2014)

[35] BodyWorn, the police-worn camera that aims to reduce crime (Fox News, May 19, 2015)

[36] Paris ISIS Attacks: Tech Industry Says 'Anti-Terror' Back Doors Would Make US Less Safe (International Business Times, Nov 18, 2015)

[37] Why our Crazy Smart AI still sucks at Transcribing our Speech (Wired Magazine, Apr 8, 2016)

[38] Transcribing Audio Sucks—So Make Machines Like Trint Do It (Wired Magazine, Apr 26, 2017)

[39] "Teaching Privacy".

[40] Gerald Friedland Bertrand Irissou: Method of facilitating construction of a voice dialog interface for an electronic system, US Patent Application US15382163.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]