A 201.4 GOPS 496 mW Real-Time Multi-Object Recognition Proce(7)

2020-11-29 00:07

A 201.4 GOPS real-time multi-object recognitionprocessor is presented with a three-stage pipelined architecture.Visual perception based multi-object recognition algorithm isapplied to give multiple attentions to multiple objects in the inputimage. For human-like multi-object perception, a neural perceptionengine is proposed with biologically inspired neural networksand fuzzy logic circ

38IEEEJOURNALOFSOLID-STATECIRCUITS,VOL.45,NO.1,JANUARY

2010

Fig.8.Blockdiagram,circuits,andmeasuredwaveformsofobjectdetectionengine.

workforsimilaritymeasureanddecisionmaking,respectively.Incircuitdesign,theODEexploitsanalog-basedmixed-modecircuitstoreduceareaandpoweroverheadofGaussianfunc-tioncircuitsandneuralsynapticmultipliers.Exceptthedigitallyimplementedlearningpart,dataprocessingpartsoftheODEareimplementedbyanalogcircuits.Toexploittheanalogdataprocessing,8-bitintensity,saliency,andlocationvaluesofthetargetandseedpixelareconvertedtoanalogsignalsbyDACs.Afterthat,threeGaussianfunctioncircuitsmeasurethesimi-laritiesbetweenthetwopixelsforthreemetrics.AGaussianfunctioncircuitisrealizedbythecombinationofMOSdiffer-entialpairandminimumfollowercircuitincurrentmodecon- guration.Thedifferentialpaircircuitoutputsthesymmetricdifferentialsignals,eachofwhichhasexponentialnon-linearitycharacteristics.AndtheminimumfollowercircuitgeneratestheGaussian-likeoutputbyfollowingtheminimumbetweenthesymmetricdifferentialsignals.A2-DGaussianfunctioncircuitcanbeimplementedbytwoconsecutiveGaussianfunctioncir-cuitsbyconnectingtheoutputofaGaussianfunctioncircuittothebiascurrentinputtailofthenextGaussianfunctioncircuit.Finally,current-modeneuralsynapticcircuitsmergethethreemeasuredsimilaritieswithmultiplyingtheirweightvalues,andcomparatorcircuitmakethe naldecisionthroughthresholding.WithaHebbianlearning[14],theweightvaluesoftheneuralsynapticcircuits,whichplayaroleinclassi cationcriteria,areupdatedeverycycle.Asaresult,theODEcompletestheROIdetectionfor1objectwithin

7sat200MHzoperatingfrequency.Anditsanalog-basedmixed-modeimplementationreducestheareaandpowerconsumptionby59%and44%,respectively,comparedwiththoseofdigitalimplementation.

Fig.8alsoshowsthemeasurementwaveformsofmixed-modeODE.TheyincludeDACoutputsignal,Gaussianfunctioncir-cuitoutputsignal,and nalclassi cationsignal.Asshownintheenlargedwaveforms,theGaussianoutputsignalvarieswiththesimilarityoftwoanaloginputsignals,andthe nalclassi -cationsignalismadebasedonit.B.SIMDProcessorUnit

TheSPUisdesignedtoaccelerateparallelimageprocessingtasksofthedescriptorgenerationstage.AsshowninFig.9,theSPUconsistofaSPUcontroller,eightSIMDcontrolleddual-issuedverylonginstructionword(VLIW)PEs,128-bit-widedatamemory,and2-DDMA.TheeightPEsperformpixelparallelimageprocessingoperationsuchasGaussian ltering,localmaximumsearch,andhistogramoperation.TheSPUcon-trollercontrolstheoverallprogram owoftheSPU,decodestheinstructionfortheeightPEs,andperformsdatatransferbetweentheeightPEsanddatamemory.ForthedatamemoryoftheeightPEs,a128-bituni edmemoryisusedratherthaneight16-bitmemoriestoreducetheareaandpowerconsumptionby30.4%and36.4%,respectively.ThetwodataalignersbetweenthedatamemoryandeightPEsfacilitatethedatamovementbyrotatingtheuni ed128-bitdatain16-bitunit.The2-DDMAperformsthedatatransferbetweentheexternalmemoryandinternaldatamemoryinparallelwiththePEoperation.Itautomaticallygen-eratestheaddressesfor2-Ddataaccessforthedatatransactionsofvisionapplications.

Thedetailedblockdiagramofeachdual-issuedVLIWPEisalsoshowninFig.9.Itconsistsoftwoindependentdata


A 201.4 GOPS 496 mW Real-Time Multi-Object Recognition Proce(7).doc 将本文的Word文档下载到电脑 下载失败或者文档不完整,请联系客服人员解决!

下一篇:左传外交辞令特色

相关阅读
本类排行
× 注册会员免费下载(下载后可以自由复制和排版)

马上注册会员

注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
微信: QQ: