A 201.4 GOPS 496 mW Real-Time Multi-Object Recognition Proce(3)

2020-11-29 00:07

A 201.4 GOPS real-time multi-object recognitionprocessor is presented with a three-stage pipelined architecture.Visual perception based multi-object recognition algorithm isapplied to give multiple attentions to multiple objects in the inputimage. For human-like multi-object perception, a neural perceptionengine is proposed with biologically inspired neural networksand fuzzy logic circ

34IEEEJOURNALOFSOLID-STATECIRCUITS,VOL.45,NO.1,JANUARY

2010

Fig.2.Three-stagemulti-objectrecognitionalgorithm.

dedicatedvectordistancecalculationunitsareemployedintheobjectdecisionstage.

Overall,theproposedalgorithmemploysgrid-basedROIpro-cessingthatdividestheinputimageintoanumberoftwo-di-mensional(2-D)grid-tilesandperformstheprocessingbasedonthem.Itenables ne-grainedROIextractionofmultipleob-jectsandreducestheeffectiveprocessingareaofinputimages.Toevaluatetheproposedalgorithm,weperformexperimentswith50of ceobjectsoutofColumbiaobjectimagelibrary(Coil-100)[10].Itisappliedto2400sampleimagesthatincluderandomobjectsinnaturalbackgroundscenes,witha16384-entrydatabasemadebytheSIFT.Asaresult,overallrecog-nitionratebytheproposedalgorithmismeasuredas92%.ForevaluationsoftheROIdetectionbyvisualperception,truepos-itiveratethatrepresentstheratioofcorrectlydetectedregionoutofgroundtruthROIandfalsepositiveratethatrepresentstheratioofincorrectlydetectedregionoutofnotinterestedre-gion[11]aremeasuredas70%and5%,respectively.Thevisualperceptionbarelyaffectstheoverallrecognitionratewhilere-ducingtheprocessingareaoftheimagesto32.8%onaverage.

III.SYSTEMARCHITECTURE

Fig.3showstheoverallblockdiagramoftheproposedpro-cessor.Itconsistsof21IPblocks:aneuralperceptionengine(NPE),aSPUtask/powermanager(STM),16SIMDprocessorunits(SPUs),adecisionprocessor(DP),andtwoexternalmemoryinterfaces.TheNPEisresponsibleforthe rstvisualperceptionstage.ItextractstheROIgrid-tilesforeachobjectandsendsthemto16SPUsfordetailedimageprocessing.The16SPUs,whosepowerdomainisseparatedintofour,areresponsiblefortheseconddescriptorgenerationstage.TheyextractobjectfeaturesoutoftheselectedROIsandconvertthemtodescriptorvectors.Thedescriptorvectorsoutofthe16SPUsaregatheredattheDP.TheDPacceleratesthevectormatchingprocessofdescriptorvectorsforthethirdobjectdecisionstage.TheSTMisspeciallydevisedtodistributethetasksoftheROIgrid-tilesfromtheNPEtothe16SPUsandtomanagethem.Italsocontrolsthepipelinestagesoftheoverallprocessorandmanagesfourpowerdomainsof16SPUs.Theoverall21IPblocksareinterconnectedthroughtheproposedmulti-castingNoC.

Toincreaseparallelismandhardwareutilizationofthepro-posedprocessor,theproposedthreestagesareexecutedinthepipelineinframelevelasshowninFig.4.ThepipelineddataareROIgrid-tilesanddescriptorvectorsbetweenthe rst–secondstageandsecond–thirdstage,respectively.Unliketheexecu-tiontimeofthe rstvisualperceptionstageisconstantduetoits xedcomputationamount,theexecutiontimeofthesecondde-scriptorgenerationandthirdobjectdecisionarevaryingwiththenumberofROIgrid-tilesanddescriptorvectors.Inordertobal-ancetheexecutiontimesofthreestages,theSTMestimatestheworkloadofthefollowingdescriptorvectorandobjectdecisionstagebasedonthenumberofextractedROIgrid-tilesandde-scriptorvectors,respectively,andcontrolstheirexecutiontimesusingtwopipelinetimebalancingschemes.

Tocontroltheexecutiontimeofthedescriptorgenerationstage,theSTMperformsworkload-awaretaskscheduling(WATS)thatdiffersthenumberofschedulingSPUsaccordingtothestage’sinputworkload.Fig.5(a)showsthe owchartoftheWATS.First,theSTMmeasuresthenumberofROIgrid-tilesfromtheNPEandclassi esittooneofNwork-loadlevelsdividedbyN-1thresholdvalues.Andthen,theSTMdeterminesthenumberofoperatingSPUsaccordingtotheclassi edworkloadlevel.SinceitallocatestheSPUsin


A 201.4 GOPS 496 mW Real-Time Multi-Object Recognition Proce(3).doc 将本文的Word文档下载到电脑 下载失败或者文档不完整,请联系客服人员解决!

下一篇:左传外交辞令特色

相关阅读
本类排行
× 注册会员免费下载(下载后可以自由复制和排版)

马上注册会员

注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
微信: QQ: