RSP OIMK KPPF IMYH OIUR ZRXY ZRPR IMYB OaSG KYXG ZRZS KPVO ZRYK KPRZ OIPB DOGZ FSPP PITF KPXK KYZD KPOI ZRLP FSSP OITP OIYK IPCD KYXH KPPC FSNH IPCO KYXF IJGZ OIPZ IPGB OIYC KPPZ FSTH OIYO IPCR DOGP GILP KPOG IMYK FSMJ OIYH FSKU KYZI FSUO OITY KPSP OIXJ OIXK OILY KZOB ZRNY KPSF IPBZ ZRTH KPRS FSMF FSYE OIPF FSLP OBaB FSNK OaDF IPEG OIYR IPBI IMYC ZRPE IPBO KZOG IPCC IMYG FSLF OIPR FSPS KYZS FSYI OIXP IPEE KPVC OIMH KPSJ OIMJ IPER KZZO ZRLG OaRJ IPFG IJGG KPSU FSYJ OILF FSYK OaaZ KPPS IPEO IJGE OILJ ZRTY FSLJ IPFS ZRKH OaZC KZSK GIMH KPVG IPED OIKR ZSFG ZRUS ZRYZ OaaD OITJ IPCP IPBC OITH IJHS OIYF ZRSK IPEC IJGB IJGR DOGI IJHG IJGO FSNJ IJGS IPCI OIUF KPVI OaaR OIKP KPOU IPFO FSLH FSLS KPVS FSUG KPSH FSUI OITF IPEI KZSF ZRPC OIYJ IJGP OIPE ZRPZ OITU IPBR IPFB IPEB OIYI GIMG OIYE FSUS OIPO FSXP IJFC FSTK IJGI IPFE IJLS OIKY ZRYC KZVS ZRPI IMUO ZSFF IPBE IPES KZZZ ZSFI GIMP FSTU IJFS IJFO GILY GIMF IPGE OIXY FSTJ IPCG FSPR OIPP DOIJ OIMY FSXH OIXU GIMK IJGU IPEZ FSLK OILU IPBB IMYF OILG IPCS IPFI IJKG GDSG GDOB GDOR GEBR GECC GDSU GEBS GECP GDRG GDGE GDOZ GDRY GECO GDSY GDOE GECS GDZC GDOS GDRH GDGR GECD GDOC GEDG GDRF GDGP GDGD GECZ GDOP GDGS GDGZ GEBC GDRK GECB GDRZ GEaZ GDRJ GECR GEDF GDZO GDGI GDRU GEBI GECF GDZR GECG GDIU GEaS GDOI GDIP GEDE GECE GDSK GEBO GDSP GEBE GDRP GDOG GDWR GEBZ GDIJ GECI GDIY GDIK GEaR GEBD USOR USPP USOG USPB USOZ USOC USPO USOE USNK USPE USOB USPG USNJ USNP USPF DOPR DOZF DOWB DORP DORG DOPS DOWE DOSG DORU DOZD DOPO DORH GIUE DOWD DOPI DOSO DOSY GKHB DOZC GIUB DORJ GKUE DOXK DOXY GKNG GKNP DOVE DOSS DOPG GIPC GIUD DORS DOSZ GIYU DOVR GIYE DOVZ GIPO GKNK GIUF GIXY GKNU GKPB GIPP GKGG GIXH GIYK GIYF GIUR GIUI DOIY GKGI GIYO GIUC GIXK GIUZ GIXJ DOSJ GKLU DOSH GIYP GKPZ GKNF GIUS GITY GIUG GKPC GKHI DOZO GIYB DOZI GIPB GIYI DORY GIXG GIPR GIPI GIYR DOSK GIUO GIPZ GIYH GIXF DOIU DOVS GIPS GITH DORF DOSU GIPF GKUI PubMed ID:http://www.ncbi.nlm.nih.gov/pubmed/18055457?dopt=Abstract DORK GIPE GIYG DOZE GITK GITP GIXU SLGS GZDO GSUS USLG DODQ YUOD UODO OIPI DOIP LpmaxFigure Plot of statistical criteria Lp max and nb sf for the structural words seen at the least five occasions inside a SCOP superfamily. Black: words with Lpmax Red: intense superfamily-specific words (Lpmax and nbsf). Orange: intense ubiquitous words (Lpmax and nbsf). Pink: over-represented words with Lpmaxnot Acetovanillone chemical information discussed in this study.We also evaluate intense ubiquitous words with all the smaller hydrogen-bonded D motifs extracted from the Motivated Protein databaseResults of this analysis are reported in TableAs stated inside the Procedures section, there’s really tiny overlap between our initial information set as well as the proteins stored in the Motivated Protein database. Even on such a little number of fragments, the comparison reveals that seven extreme ubiquitous words (DRPI, DSPI, DSGI, DSKG, DSKH, DOIP and OIPI) correspond to nest motifs, with precision greater than and two words (BQGI and HBBQ) correspond to niche motifs with precision greater than precision. The set of words corresponding to nest motifs consists of structural words with equivalent conformations, like DRPI, DSPI and DSGI or DSKG and DSKH. We also note that some structural words overlap: in of cases, structural word DOIP is right away followed by letter I, forming the five-structural letter word DOIPI.nbs f Regad et al. BMC Bioinformatics , : http:biomedcentral-Page CCT244747 chemical information ofTable Correspondence in between extreme ubiquitous words and little structural motifsStatistics inside the initial data set Word PZCD HBDS ZCDS UFQK GYUQ YBDS FQLG YZDS GUDO FFFI FQKG SLGI QLGI DRPI DSPI DSGI DSKG DSKH DOIP.RSP OIMK KPPF IMYH OIUR ZRXY ZRPR IMYB OaSG KYXG ZRZS KPVO ZRYK KPRZ OIPB DOGZ FSPP PITF KPXK KYZD KPOI ZRLP FSSP OITP OIYK IPCD KYXH KPPC FSNH IPCO KYXF IJGZ OIPZ IPGB OIYC KPPZ FSTH OIYO IPCR DOGP GILP KPOG IMYK FSMJ OIYH FSKU KYZI FSUO OITY KPSP OIXJ OIXK OILY KZOB ZRNY KPSF IPBZ ZRTH KPRS FSMF FSYE OIPF FSLP OBaB FSNK OaDF IPEG OIYR IPBI IMYC ZRPE IPBO KZOG IPCC IMYG FSLF OIPR FSPS KYZS FSYI OIXP IPEE KPVC OIMH KPSJ OIMJ IPER KZZO ZRLG OaRJ IPFG IJGG KPSU FSYJ OILF FSYK OaaZ KPPS IPEO IJGE OILJ ZRTY FSLJ IPFS ZRKH OaZC KZSK GIMH KPVG IPED OIKR ZSFG ZRUS ZRYZ OaaD OITJ IPCP IPBC OITH IJHS OIYF ZRSK IPEC IJGB IJGR DOGI IJHG IJGO FSNJ IJGS IPCI OIUF KPVI OaaR OIKP KPOU IPFO FSLH FSLS KPVS FSUG KPSH FSUI OITF IPEI KZSF ZRPC OIYJ IJGP OIPE ZRPZ OITU IPBR IPFB IPEB OIYI GIMG OIYE FSUS OIPO FSXP IJFC FSTK IJGI IPFE IJLS OIKY ZRYC KZVS ZRPI IMUO ZSFF IPBE IPES KZZZ ZSFI GIMP FSTU IJFS IJFO GILY GIMF IPGE OIXY FSTJ IPCG FSPR OIPP DOIJ OIMY FSXH OIXU GIMK IJGU IPEZ FSLK OILU IPBB IMYF OILG IPCS IPFI IJKG GDSG GDOB GDOR GEBR GECC GDSU GEBS GECP GDRG GDGE GDOZ GDRY GECO GDSY GDOE GECS GDZC GDOS GDRH GDGR GECD GDOC GEDG GDRF GDGP GDGD GECZ GDOP GDGS GDGZ GEBC GDRK GECB GDRZ GEaZ GDRJ GECR GEDF GDZO GDGI GDRU GEBI GECF GDZR GECG GDIU GEaS GDOI GDIP GEDE GECE GDSK GEBO GDSP GEBE GDRP GDOG GDWR GEBZ GDIJ GECI GDIY GDIK GEaR GEBD USOR USPP USOG USPB USOZ USOC USPO USOE USNK USPE USOB USPG USNJ USNP USPF DOPR DOZF DOWB DORP DORG DOPS DOWE DOSG DORU DOZD DOPO DORH GIUE DOWD DOPI DOSO DOSY GKHB DOZC GIUB DORJ GKUE DOXK DOXY GKNG GKNP DOVE DOSS DOPG GIPC GIUD DORS DOSZ GIYU DOVR GIYE DOVZ GIPO GKNK GIUF GIXY GKNU GKPB GIPP GKGG GIXH GIYK GIYF GIUR GIUI DOIY GKGI GIYO GIUC GIXK GIUZ GIXJ DOSJ GKLU DOSH GIYP GKPZ GKNF GIUS GITY GIUG GKPC GKHI DOZO GIYB DOZI GIPB GIYI DORY GIXG GIPR GIPI GIYR DOSK GIUO GIPZ GIYH GIXF DOIU DOVS GIPS GITH DORF DOSU GIPF GKUI PubMed ID:http://www.ncbi.nlm.nih.gov/pubmed/18055457?dopt=Abstract DORK GIPE GIYG DOZE GITK GITP GIXU SLGS GZDO GSUS USLG DODQ YUOD UODO OIPI DOIP LpmaxFigure Plot of statistical criteria Lp max and nb sf for the structural words observed at least five occasions within a SCOP superfamily. Black: words with Lpmax Red: intense superfamily-specific words (Lpmax and nbsf). Orange: extreme ubiquitous words (Lpmax and nbsf). Pink: over-represented words with Lpmaxnot discussed within this study.We also compare intense ubiquitous words with the smaller hydrogen-bonded D motifs extracted in the Motivated Protein databaseResults of this analysis are reported in TableAs stated in the Methods section, there is certainly quite small overlap amongst our initial information set as well as the proteins stored in the Motivated Protein database. Even on such a tiny number of fragments, the comparison reveals that seven intense ubiquitous words (DRPI, DSPI, DSGI, DSKG, DSKH, DOIP and OIPI) correspond to nest motifs, with precision greater than and two words (BQGI and HBBQ) correspond to niche motifs with precision higher than precision. The set of words corresponding to nest motifs involves structural words with equivalent conformations, for example DRPI, DSPI and DSGI or DSKG and DSKH. We also note that some structural words overlap: in of situations, structural word DOIP is quickly followed by letter I, forming the five-structural letter word DOIPI.nbs f Regad et al. BMC Bioinformatics , : http:biomedcentral-Page ofTable Correspondence between intense ubiquitous words and smaller structural motifsStatistics within the initial information set Word PZCD HBDS ZCDS UFQK GYUQ YBDS FQLG YZDS GUDO FFFI FQKG SLGI QLGI DRPI DSPI DSGI DSKG DSKH DOIP.