At2g04039.1/MSA

>gi|18395648|ref|NP_565308.1|	 unknown protein [Arabidopsis thaliana] MATIAGGSFG-VPSSR ISITTPTLSSSS---LLPPLTLQ--SGTRKDNLLRC-ALQ ESST-SAVAT-EKKNKEEGEE--- --STVAVPA--KKPKPAAKAA--AVAKPLRQMMEEDVIPPLQAILESQDDIS DIDLSFQDDK--LEGFFL-KK-SIPYSFWAFFPTGNLTG-AKGFSISS HGSGPSTVEPFLVDER---KPTANHVVFWVEKRL-AAQGIIPVW--NQ -- >gi|113475477|ref|YP_721538.1|	 hypothetical protein Tery_1808 [Trichodesmium erythraeum IMS101]

MPEEQQK- AKKTETKPAGGKVKPE-KPP--AVESKPFAEFVQQDYLPALQKALLEQG-ID DLDLSLVKEKLPMIGFSSSEECWQVVGKFK-KG---QRQFNIYFPKKDIKG-PRAFSCAD NGSKPATLEPFLIDER---KITLKLLVFGVTQRL-NAQKWLSLN -- >gi|23125299|ref|ZP_00107239.1|	 hypothetical protein Npun02006772 [Nostoc punctiforme PCC 73102] MADQ-TNHNQAGEVAP ---STVDKQAPSVAEEHAPST--- DSPEATD-LPTANTPDPKAANPETNPNAAKTTATPPKREKPAAKAAVG ---E-KP-AAATEEKPAAKAAKKE-KAP--AVEDKPFVEFIEQDYLPALQKAIAQQG-VQ DLQVSFAKQKVPITGFESAEECWQIIGSWKETG---ARQFNLYFPEEDIQG-KKGFSCN- EGKKPSTLESFLIDER---KITLDLLVFGLVQRL-DGQKWLGIN -- >gi|119485084|ref|ZP_01619469.1|	 hypothetical protein L8106_06539 [Lyngbya sp. PCC 8106] MSEE

--TEAKKPAA---KKE-KPP--AVEDKPFAEFIEQDYIPALQDALTSQG-LE DLELKFAKEKFPASV-GTSEPCWQVMGQWK-GG---KRQFNVYFPNEDIKA-QKVFSLAD NGSKPSGIEPFLGDER---RINLSLLVFGVVQRL-NAQKWLNRN -- >gi|119510377|ref|ZP_01629512.1|	 hypothetical protein N9414_15977 [Nodularia spumigena CCY9414] MAEE-KNHNQTGEAAP ---SNVDKQVPST--- NEPVATN-LPTANTPDPKAANPEVNPNAATTKTAPPKAEKPPA- AAKAAKKE-KPP--AVEDKPFVEFMEQHYLPTLQNAIAQEG-VE DLQLAFAKQKLPIAGFQSAEECWQVIGSWQ-NG---QRQFNVYFPEESITG-KKGFSCN- EGKRPSTLESFLIDER---KITLDLLVFGLVQRL-NGQKWLGRN -- >gi|17230789|ref|NP_487337.1|	 hypothetical protein alr3297 [Nostoc sp. PCC 7120] MADP-TNQNQPGDTVP ---STVDQKAPTVAEENAPST--- SEPVATN-LPTANTPDPEAANPKTNPNTAKPTAAAPKEEKPAA- AAKAAKKE-KAP--SVEDKPFVEFMEQDYLPALQKAITAEG-VK DLQLSFAKQKLPIAGLSSAEEYWQVIGSWQ-NG---QRQFNVYFPDEDIQG-KKGFSCN- EGRRPSTLESFLIDER---KITLDMLVSRLVYRL-NGQKWLGRN -- >gi|75911152|ref|YP_325448.1|	 hypothetical protein Ava_4956 [Anabaena variabilis ATCC 29413] MADP-TNQNQPGDAVP ---STVDQKAPTVAEENAPST--- SEPVATN-LPTANTPDPQAANPKTNPNAAAPPKEEKPAA- AAKAAKKE-KAP--SVEDKPFVEFMEQDYLPALQKAITAEG-VK DLQLSFAKQKLPIAGLSSAEEYWQVIGSWQ-NG---QRQFNVYFPDEDIQG-KKGFSCN- EGRRPSTLESFLIDER---KITLDMLVSRLVYRL-NGQKWLGRN -- >gi|126658804|ref|ZP_01729948.1|	 hypothetical protein CY0110_08131 [Cyanothece sp. CCY0110]

--M- --AEETKPKA---KKE-KPP--AIEEKPFTEFMEEHFTPTLKESLINQG-LD DIELSFTKADISIAGATSDEPCWQVIGTWN-QG---QRQFKLYFLEEDIKG-QKAFSYAV NGKPPSTIESFMIDER---KINLDLMVLYTLQRL-NGQKWLTRN -- >gi|67920789|ref|ZP_00514308.1|	 conserved hypothetical protein [Crocosphaera watsonii WH 8501]

MKPYIFDNN--- KITGTIM- --AEETKPVA---KKG-KPP--AIEKKPFTEFMEEHFTPTLKESLTKEG-LN DIELSFTKAPVSIPGAISDEPCWQVIGTWD-KG---KRQFNIYFPGEDIKG-QKAFSYAV NGKPPSTIESFMIDER---KVTLDLMVLYTLQRL-NGQKWLTRN -- >gi|116070611|ref|ZP_01467880.1|	 hypothetical protein BL107_13235 [Synechococcus sp. BL107]

MSEAP--- ---AN-KPAAKP-KPP--KPEDKPFPEFIDTLFLPAVAKQLLENG-IT ADRLERIDGDRPVVGGRCPMVVGDLP--G---GRRFWLCFAKEDISS-GKVIALAD PGSEPTLLESFLIDEK---RMSLPLLVSRLLQRL-NGQKWLGGN -- >gi|33865733|ref|NP_897292.1|	 hypothetical protein SYNW1199 [Synechococcus sp. WH 8102]

MSDTP--- --DT-AS-AAAQEDTKA-APKPKP-KPP--KPEDKPFPEFIDTLFIPAVSKQLEDNG-IQ ADRLERVEGERPVVGGSCPMVIGELP--G---GRRFWLCFGSADITS-PKLIALAE AGSEPTLLESFLIDEK---RMSLPLLVSRLLQRL-NGQKWLGRN -- >gi|78184737|ref|YP_377172.1|	 hypothetical protein Syncc9902_1164 [Synechococcus sp. CC9902]

MSEAP--- ---AN-KPAAKP-KPP--KPEDKPFPEFIDTLFLPAVAKQLLENG-IT ADRLERVEGDRPVVGGRCPMVVGDLP--G---GRRFWLCFAKEDINS-GKVIALAD PGSDPTLLESFLIDEK---RMSLPLLVSRLLQRL-NGQKWLGSN -- >gi|78212841|ref|YP_381620.1|	 hypothetical protein Syncc9605_1311 [Synechococcus sp. CC9605]

MSETP--- --VE-KQ-SSGQE-APA-KPAPKA-KPP--KPEDKPFPEFIDTLFLPAVAKQLAEHD-IT ADRLERIEGQRPVVGGECPMVVGELP--G---GRRFWVCFSKADINS-SKVIALAD AGSEPTLLESFLIDEK---RMSLPLLVSRLLQRL-NGQKWLGGN -- >gi|148242576|ref|YP_001227733.1|	 hypothetical protein SynRCC307_1477 [Synechococcus sp. RCC307]

MSDSA--- --AT-PDKPTDQPQAAK-APPAKA-KPP--KPEDKPFAEFVPQLLMPALAKEIESYG-GP APALELEQAAMPVVGETCWQIRGELP--G---DRRFWLCFTKDDIQA-PKTFAIAE SGAPPSLLESFLIDEK---RITLALLVSRTVQRL-NGQKWLGPN -- >gi|116074905|ref|ZP_01472166.1|	 hypothetical protein RS9916_30264 [Synechococcus sp. RS9916]

-MPRADYSSASE-GNLVSETP--- --AK-A---TPTPA---EKPAKPP--KPEDKPFDAFIQEDLLPAVRKGIVDRG-IT PSVLDLRQGERPVVGGSCWMLYGELP--P---GRRFWLCFSEPSIGA-DKTIALAD PGTDASLLESFLIDEK---RMSLALLQSRLLQRL-NGQKWLGGN -- >gi|148239439|ref|YP_001224826.1|	 hypothetical protein SynWH7803_1103 [Synechococcus sp. WH 7803]

MSETP--- --AK-PK-GETKPAGEG---KAKPAAKP--KLEDKPFASFMQEDFLPSLTKALDDRG-QR AVSLSLIEGERPVVGGSCWMVKGELS--G---ERRFWLCFESDAITS-GKTIALAE SGTAPSLLESFLIDEK---RITLALLQSRLLQRL-NGQKWLGGN -- >gi|88808724|ref|ZP_01124234.1|	 hypothetical protein WH7805_03502 [Synechococcus sp. WH 7805]

MSETP--- --AK-PK-GTTKPAGEG---KAKPAAKP--KPEDKAFASFIQEDFLPSLSKALADRG-HA PVSLSLSEGERPVVGGLCWMVKGELS--S---ERRFWLCFESDAITS-GKTIALAE SAAEPSLLESFLIDEK---RMTLALLQSRLLQRL-NGQKWLGGN -- >gi|124023148|ref|YP_001017455.1|	 hypothetical protein P9303_14441 [Prochlorococcus marinus str. MIT 9303] MN-- DNDNRV-- -NPNADAEKPAE-QKPTLENP--- --VK-TN-P-TEDQQEG--AKPQSAKPPAAKVEDKPFETFIRDDFLPNIKQALTERG-MP PSTLELIQGNRPVVGDPCWMVCGEIP--L---GRRFWLCFASDSIAS-KKTISLAE TGTEPSLLEPFLIDEK---KMTLILLRSRLLQRL-NGQKWLTAN -- >gi|33863041|ref|NP_894601.1|	 hypothetical protein PMT0769 [Prochlorococcus marinus str. MIT 9313] MN-- DNDNRV-- -NPNADGQKPAE-QKPTLENP--- --VK-TN-P-TEDQQEG--AKPQTAKPPAAKVEDKPFETFIRDDFLPNIKQALTERG-MP PSTLELIQGDRPVVGDPCWMVCGEIP--L---GRRFWLCFASDSIAS-KKTISLAE TGTEPSLIEPFLIDEK---KMTLTLLRSRLLQRL-NGQKWLTAN -- >gi|87125806|ref|ZP_01081649.1|	 hypothetical protein RS9917_00280 [Synechococcus sp. RS9917] MS-- DT-- --PAA--- ---ATPKP-KPP--KLEDKPFAAFIAEDFLPGLRKGLADHG-LT PTSLDLIEGERPVVGGSCWMVCGELP--P---GRRFWLCFNEAAIQS-GKTIALAD PGTEPSLIESFLIDEK---RITLPLLLSRLLQRL-NGQKWLGGN -- >gi|37520443|ref|NP_923820.1|	 hypothetical protein glr0874 [Gloeobacter violaceus PCC 7421] MPE--ENQAPPPAEAA ---ATQTP ESPTDTKAAD--ATPAPDGEAPAPKPRPPRPRPAAASAEG DATEAAPAAAEGEAAPARPP--RP-KPG--DAPAKPLPQYIQEDILPLLEKRMKAEG-AA DVALTAGEADFTATW---DNG---NKTFTIYFDEGNLEG-RKTIAYNE VKSPGRVLQMFMPPERGFKGVDAKQIVVMILQQFTTTLTWIKKQPAAAGGGAAAKGKPPR PERPAKSVAS >gi|16331085|ref|NP_441813.1|	 hypothetical protein sll0272 [Synechocystis sp. PCC 6803] -MAL-IPFYVYGDRLP

--K-LMTEAKAPAE---KKA-KPP--AVEEKPFTEFINQDFLPALQSALGKIG-LG PVALDFSKKPIAIPGADN-TPYWQVQGTWS-GDRQISKQFNLYFFQEDIKG-PKGFAYSV DNRPPSTLESFMIDER---KVTLDLMVLYTLQRL-NGQKWLGGN -- >gi|22298015|ref|NP_681262.1|	 hypothetical protein tlr0472 [Thermosynechococcus elongatus BP-1]

--MAVACKGECYHAEEMVPCR--- --SAMAEETPT- --QAPKKE-KPP--AIEDKPFAEFINEAFLPALKNALSAK--VG DVTLRLEGN--TVMGEWG-KG---MYQFRLYFLEGNIQG-PKVFVCSS GGIAPSTLEPFLGDER---KVTLDLLVFGVMQRL-NGQKWLGGN -- >gi|33240441|ref|NP_875383.1|	 hypothetical protein Pro0991 [Prochlorococcus marinus subsp. marinus str. CCMP1375]

---MENNQDLS- --ANNPPLSPEARKS- SQLEGEKSLIKK-KPP--KLEEKPFEEFVNKHLIPEISNSLSSKG-IS LESIILKKDQRPVVGGECWIVYGELL--N---GKRFWITFNSDDIKS-TKNICLAE SSSEAALLESFLIDEK---KITLQLLTSRFMQRL-NGQKWLGDN -- >gi|56752196|ref|YP_172897.1|	 hypothetical protein syc2187_d [Synechococcus elongatus PCC 6301] MS-- -EEQ VVPTPE-- ---AAAKPAA---KKE-KPP--ALEDKPFKEFVQTDLLPAVQQALSDRG-LA DLDLQFVEAPLPVTGDRCWQLQGSWA-KG---QRRFLLGFSEESLTA-PKVFSLAD GKATPATLEAFLGDER---KITLPLLLNRILSRL-NGQKWLEQN -- >gi|84518016|ref|ZP_01005365.1|	 hypothetical protein P9211_04167 [Prochlorococcus marinus str. MIT 9211] ME-- ESHQDQIKGDNSQALS ESSS NVV---TQEVKE-KPL--KIEEKPFDVFILDHFIPGLKSSLHKFG-IS APVITLKEDERPVTGGKCWMVYAKLP--K---DRKFWICFSTNQISS-LKNFAISE SGAEPSLLESFLIDER---KTTLPLLISRTLQRL-NGQKWLGRN -- >gi|86606710|ref|YP_475473.1|	 hypothetical protein CYA_2070 [Synechococcus sp. JA-3-3Ab]

--MSGEPTPAQETAVEKPARVE---AANPAP- -KE-TDTAQASPKAAA-KKP--AKEEKPFEQLIAEDVIPATIAAFQKRG-VT DLQLRLEGKT--LIGSFA-GG---KKQFSVLFAEGSLNG-PKFFRCAI EGSPASTIESFMIDER---RVDVPLFVFYLLQRL-YAQQWY--- -- >gi|86608133|ref|YP_476895.1|	 hypothetical protein CYB_0645 [Synechococcus sp. JA-2-3B'a(2-13)]

--MSGEPTPE-AAAAEKPAE-AAKPA-- ---AEDSPKAAA-KKV--AKEEKPFEQQITEEVIPAAIAAFQKRG-VS DLELRLEGKT--LVGAFA-GG---KKQFSVLFAEGSLNG-RKFFRCTT EGSPASTIESFMIDER---RVDVNLFVFYLVQRL-YAQQWY--- -- >gi|87302400|ref|ZP_01085225.1|	 hypothetical protein WH5701_09364 [Synechococcus sp. WH 5701] -MQC FELR LQPVSDSASPAPNP-- -VKV-KPP--APEDKPFAEFIPQLFLPALLKEIEAFG-GA TPQLSFEQGAMPVVGSPCWLVMGSFP--G---DRRFWLCFTEASISS-AKTIALAE AGSEPSLLESFLIDEK---KTTLALLISRVVSRL-NSQKWLGPN -- >gi|123966154|ref|YP_001011235.1|	 hypothetical protein P9515_09211 [Prochlorococcus marinus str. MIT 9515] --ME-ENVDSIGDSYINEKDTFKKDNKNTNKEKVAKEKSNEVNKEINEEKVAKENSNEVN IEINEEKISEENSKEVNKDINKEKVAKEKSNEVNKEINEEKVAKENSNEVNIEINEEKVA KEKSNEVNKEINEEKISEENSKEVNKDINKEKVSKENSKEVNKDINKEKVAKENSKEVNK DINK-EK-VAEENLKP--LKIKPK-KEI--PIEKKPFKEFINEHLLPSIIQEFKLRG-FE VKEINLKNTSRPIAGDKCWVVFCEIK--D---ICNFWLSFEKEDISS-SKSISLCK SNQKPSVIESFLIDEK---RITLKLIISRILQRL-NGQKLIGIN -- >gi|126696361|ref|YP_001091247.1|	 hypothetical protein P9301_10231 [Prochlorococcus marinus str. MIT 9301] --ME-ENLDINNKVNN ---EKSDNITKSNSEEI--- REPRSEG---DTNTVSN--NSNPQRN-SDSKDNLKN DIDTPVKP---VIKPK-KEL--PIEKKPFQEFINIHLIPALVEEINIRG-LK VNNINLTNTNRPIAGDKCWVINCEIK--D---TCNFWLSFEKEDISS-LKSISLSK PNQKPSIIESFLIDEK---RITLKLIISRVLQRL-NGQKLIGVN -- >gi|33861402|ref|NP_892963.1|	 hypothetical protein PMM0845 [Prochlorococcus marinus subsp. pastoris str. CCMP1986] --ME-ENVESIEESIKREDDPLKK DIIDI--- --DSPKETSTLI---NANSQ DSNK-QK-VGGENSIP--LKIKPK-KEL--PIEKKPFNEFINDHLLPSIIQEFKVRG-LE VADINLKNTSRPIAGDKCWVIFCEIK--D---ICNFWLSFEKDDISS-LKSISLCK SDQKPSVIESFLIDEK---RITLKLIISRILQRL-NGQKLIGIN -- >gi|123968557|ref|YP_001009415.1|	 hypothetical protein A9601_10241 [Prochlorococcus marinus str. AS9601] --ME-ENLDKNNEVNK ---EIPDKTTKSNSEEI--- KEPKSEK---AIILDKNGDSATKIAIKN EINTPEKP---ITKPK-KEL--PVEKKPFQEFINLHLIPSLTEEINQRG-LE INNINLTNTNRPIAGDKCWVINCEIK--D---TCNFWLSFEKDDISS-LKSISLSK PNQKPSIIESFLIDEK---RITLKLIISRVLQRL-NGQKLIGVN -- >gi|78779339|ref|YP_397451.1|	 hypothetical protein PMT9312_0955 [Prochlorococcus marinus str. MIT 9312] --ME-ENLEPNSEVNN ---ETTNIPNKSNTEET--- KEPKSEK---VLNMSEN--NANSPNN-SVQKVDMKK ENVIPAKS---ISKPQ-KEL--PIEKKPFQEFINIHLIPELIDEINQRG-LE IKNINLKKTTRPIAGDKCWVINCEIK--D---TCDFWLSFEKEDISS-LKSISLSK PKQKPSIIESFLIDEK---RITLKLIISRLLQRL-NGQKLLGVN -- >gi|124025752|ref|YP_001014868.1|	 hypothetical protein NATL1_10451 [Prochlorococcus marinus str. NATL1A] ---M-ENQNPSNEIDS ---SKKVTRSQSDSLDK--- NEPASEG-KKDLNTLDKP-- --EK-SS-LLNSNAPA--IAKKPV-KPP--KLEDKPFKEFISNFLIPGLKASIEDKG-TV VCEIKLIEGQRPVVGGNCWMVFCELS--E---QRKFWLCFSKDIITS-DKTILLAE SNSDPSIVESFLIDEK---KTTLPLLISRVLQRL-NGQKWIGVN -- >gi|72382203|ref|YP_291558.1|	 hypothetical protein PMN2A_0363 [Prochlorococcus marinus str. NATL2A] ---M-ENQNPSNEIDS ---SKKVTRSQSDSLDK--- NEPAFEG-KKDLNTLDKP-- --EK-SS-LLNSNAPA--IPKKPV-KPP--KLEDKPFKEFISNFLIPGLKASIEDKG-TV VCEIKLIEGQRPVVGGNCWMVFCELS--E---QKKFWLCFSKDIITS-DKTILLAE SNSDPSIVESFLIDEK---KTTLPLLISRVLQRL-NGQKWIGAN -- >gi|125559127|gb|EAZ04663.1|	 hypothetical protein OsI_025895 [Oryza sativa (indica cultivar-group)] MASQPLRLVRPSP-LAGRHAAACK CSAAIPLVFGRQRLPLLVAFPRG--SGSGSGSGASCSAVQ ESSS-AAAATT-V-SEKKDAADAKK--- --EATAEAK--PAAKPAAKPK-KPPVKPLPEMMQEEIIPPLKAALEAEDDVS QVELSFEDNR--LEGSFI-KD-EVPYYFWAFFPNGDLTG-PKGFALSS YGTEVSTIEPFLIDEK---RANAKYVVFWVYKRL-AGQGILPVW--KEEEGE GEGEGESS-A >gi|125601029|gb|EAZ40605.1|	 hypothetical protein OsJ_024088 [Oryza sativa (japonica cultivar-group)] MASQPLRLVRPSP-LAGRHAAACK CSAAIPLVFGRQRLPLLVAFPRG--SGSGSGSGASCSAVQ ESSS-AAAATT-VPPPNFSEKKDAADAKK--- --EATAEAK--PAAKPAAKPK-KPPVKPLPEMMQEEIIPPLKAALEAEDDVS QVELSFEDNR--LEGSFI-KD-EVPYYFWAFFPNGDLTG-PKGFALSS YSTEVSTIEPFLIDEK---RANAKYVVFWVYKRL-AGQGILPVW--KEEEGE GEGEGESS-A >gi|34395135|dbj|BAC84849.1|	 unknown protein [Oryza sativa (japonica cultivar-group)]

---MMQEEIIPPLKAALEAEDDVS QVELSFEDNR--LEGSFI-KD-EVPYYFWAFFPNGDLTG-PKGFALSS YSTEVSTIEPFLIDEK---RANAKYVVFWVYKRL-AGQGILPVW--KEEEGE GEGEGESS-A >gi|147819041|emb|CAN71628.1|	 hypothetical protein [Vitis vinifera] MAIRAIGVSSFPS-SSYFTRKSEA TSSTLPLCLRHGQCMHQMA---G--KPVTSRRIIACSAVQ ESST-PTDEWK-GLDWSSMYSTCFYLVAAETKE--- --VKPAQEK--GPAKPKP-PA-KAPVKPLPQMMEEDVIPSLKSILEAQDDLS EIELSFQDNR--LEGSFQ-KK-GIPYSFWAFFXNGVLTG-PKGFSLSS YGSGSSTVEPFLIDEK---RITAKHVVFWVEKRL-AAQGIIPVW--KE -- >gi|145328262|ref|NP_001077877.1|	 unknown protein [Arabidopsis thaliana] MATIAGGSFG-VPSSR ISITTPTLSSSS---LLPPLTLQ--SGTRKDNLLRC-ALQ ESST-SAVAT-EKKNKEEGEE--- --STVAVPA--KKPKPAAKAA--AVAKPLRQMMEEDVIPPLQAILESQDDIS DIDLSFQDDK--LEGFFL-KK-SIPYSFWAFFPTGNLTGEQKDFQFPH TGQVRAPWNHFLSTRG---NQLRTTLCFGSRSVL-LHKGSSPFG--TN ---EVFVH-L >gi|30678093|ref|NP_849933.1|	 unknown protein [Arabidopsis thaliana]

---MHKTECLA--QKFEKSNFLVL-FL- ---VAT-EKKNKEEGEE--- --STVAVPA--KKPKPAAKAA--AVAKPLRQMMEEDVIPPLQAILESQDDIS DIDLSFQDDK--LEGFFL-KK-SIPYSFWAFFPTGNLTG-AKGFSISS HGSGPSTVEPFLVDER---KPTANHVVFWVEKRL-AAQGIIPVW--NQ --