论文标题
编程非线性传播,用于有效的光学学习机器
Programming Nonlinear Propagation for Efficient Optical Learning Machines
论文作者
论文摘要
由于限制诸如耗电量和可扩展性之类的限制,对使用较大的机器学习模型处理数据的需求不断增加。光学是提供较低功率计算的有前途的竞争者,因为通过非吸收介质的光传播是无损操作。但是,要用光进行有用且有效的计算,在光学上产生和控制非线性是一种仍然难以捉摸的必要性。多模纤维(MMF)已显示,它们可以提供平均功率的微量流量,同时保持平行性和低损失。在这项工作中,我们提出了一种光学神经网络体系结构,该体系结构通过通过波前塑形控制MMF中超短脉冲的传播来执行非线性光学计算。借助替代模型,发现最佳参数集可以用电子计算机最少利用来为不同的任务编程。与同等执行的数字神经网络相比,我们显示模型参数数量的显着降低了97%,这导致总体上99%的数字操作减少。我们进一步证明,还可以通过竞争精确执行完全的光学实现。
The ever-increasing demand for processing data with larger machine learning models requires more efficient hardware solutions due to limitations such as power dissipation and scalability. Optics is a promising contender for providing lower power computation since light propagation through a non-absorbing medium is a lossless operation. However, to carry out useful and efficient computations with light, generating and controlling nonlinearity optically is a necessity that is still elusive. Multimode fibers (MMF) have been shown that they can provide nonlinear effects with microwatts of average power while maintaining parallelism and low loss. In this work, we propose an optical neural network architecture, which performs nonlinear optical computation by controlling the propagation of ultrashort pulses in MMF by wavefront shaping. With a surrogate model, optimal sets of parameters are found to program this optical computer for different tasks with minimal utilization of an electronic computer. We show a remarkable decrease of 97% in the number of model parameters, which leads to an overall 99% digital operation reduction compared to an equivalently performing digital neural network. We further demonstrate that a fully optical implementation can also be performed with competitive accuracies.