At2g04039.2/MSA

>gi|30678093|ref|NP_849933.1|	  165 residues, 50 /line  unknown protein [Arabidopsis thaliana] M--- HKTECLAQKFEKSNFLV-L-FL-- --VATEKKN--- ---KEEGEESTVAVPAKKPKP-AAKAA---AVAKPLRQMMEEDVIPPLQAILESQDDISD IDLSFQDDK--LEGFFLK-KSIPYSFWAFFPTGNLTG-AKGFSISSHG SGPSTVEPFLVDER---KPTANHVVFWVEKRL-AAQGIIPVW--NQ--

>gi|113475477|ref|YP_721538.1|	  150 residues, 50 /line  hypothetical protein Tery_1808 [Trichodesmium erythraeum IMS101]

---MPEEQQKAKK--- -T-ETKPAGGKVKPE-KPP--AVESKPFAEFVQQDYLPALQKALLEQG-IDD LDLSLVKEKLPMIGFSSSEECWQVVGKFK---KGQRQFNIYFPKKDIKG-PRAFSCADNG SKPATLEPFLIDER---KITLKLLVFGVTQRL-NAQKWLSLN--

>gi|23125299|ref|ZP_00107239.1|	  226 residues, 50 /line  hypothetical protein Npun02006772 [Nostoc punctiforme PCC 73102] MADQTNHNQAGEVAP- --STVDKQAPSVAEEHAPST ---DSPEATD-LPTANTPDPKAANPETNPNAAKTTATPPKREKPAAKAA--- -VGEKPAAAT-EEKPAAKAAKKE-KAP--AVEDKPFVEFIEQDYLPALQKAIAQQG-VQD LQVSFAKQKVPITGFESAEECWQIIGSWKE--TGARQFNLYFPEEDIQG-KKGFSCN-EG KKPSTLESFLIDER---KITLDLLVFGLVQRL-DGQKWLGIN--

>gi|119485084|ref|ZP_01619469.1|	  141 residues, 50 /line  hypothetical protein L8106_06539 [Lyngbya sp. PCC 8106] MSE-

-ETEAK-- -KPAA---KKE-KPP--AVEDKPFAEFIEQDYIPALQDALTSQG-LED LELKFAKEKFPASV-GTSEPCWQVMGQWK---GGKRQFNVYFPNEDIKA-QKVFSLADNG SKPSGIEPFLGDER---RINLSLLVFGVVQRL-NAQKWLNRN--

>gi|119510377|ref|ZP_01629512.1|	  201 residues, 50 /line  hypothetical protein N9414_15977 [Nodularia spumigena CCY9414] MAEEKNHNQTGEAAP- --SNVDKQVPST ---NEPVATN-LPTANTPDPKAANPEVNPNAATTKTAPPKA--- ---EKPPA---AAKAAKKE-KPP--AVEDKPFVEFMEQHYLPTLQNAIAQEG-VED LQLAFAKQKLPIAGFQSAEECWQVIGSWQ---NGQRQFNVYFPEESITG-KKGFSCN-EG KRPSTLESFLIDER---KITLDLLVFGLVQRL-NGQKWLGRN--

>gi|17230789|ref|NP_487337.1|	  209 residues, 50 /line  hypothetical protein alr3297 [Nostoc sp. PCC 7120] MADPTNQNQPGDTVP- --STVDQKAPTVAEENAPST ---SEPVATN-LPTANTPDPEAANPKTNPNTAKPTAAAPKE--- ---EKPAA---AAKAAKKE-KAP--SVEDKPFVEFMEQDYLPALQKAITAEG-VKD LQLSFAKQKLPIAGLSSAEEYWQVIGSWQ---NGQRQFNVYFPDEDIQG-KKGFSCN-EG RRPSTLESFLIDER---KITLDMLVSRLVYRL-NGQKWLGRN--

>gi|126658804|ref|ZP_01729948.1|	  139 residues, 50 /line  hypothetical protein CY0110_08131 [Cyanothece sp. CCY0110]

-M-- AE-ETKPKA---KKE-KPP--AIEEKPFTEFMEEHFTPTLKESLINQG-LDD IELSFTKADISIAGATSDEPCWQVIGTWN---QGQRQFKLYFLEEDIKG-QKAFSYAVNG KPPSTIESFMIDER---KINLDLMVLYTLQRL-NGQKWLTRN--

>gi|67920789|ref|ZP_00514308.1|	  154 residues, 50 /line  conserved hypothetical protein [Crocosphaera watsonii WH 8501]

--MKPYIFDNNKITGTIM-- AE-ETKPVA---KKG-KPP--AIEKKPFTEFMEEHFTPTLKESLTKEG-LND IELSFTKAPVSIPGAISDEPCWQVIGTWD---KGKRQFNIYFPGEDIKG-QKAFSYAVNG KPPSTIESFMIDER---KVTLDLMVLYTLQRL-NGQKWLTRN--

>gi|116070611|ref|ZP_01467880.1|	  135 residues, 50 /line  hypothetical protein BL107_13235 [Synechococcus sp. BL107]

MSEAPA-- NKPAAKP-KPP--KPEDKPFPEFIDTLFLPAVAKQLLENG-ITA DRLERIDGDRPVVGGRCPMVVGDLPGGRRFWLCFAKEDISS-GKVIALADPG SEPTLLESFLIDEK---RMSLPLLVSRLLQRL-NGQKWLGGN--

>gi|33865733|ref|NP_897292.1|	  146 residues, 50 /line  hypothetical protein SYNW1199 [Synechococcus sp. WH 8102]

MSDTPDTASAA- ---AQEDTK---AAPKPKP-KPP--KPEDKPFPEFIDTLFIPAVSKQLEDNG-IQA DRLERVEGERPVVGGSCPMVIGELPGGRRFWLCFGSADITS-PKLIALAEAG SEPTLLESFLIDEK---RMSLPLLVSRLLQRL-NGQKWLGRN--

>gi|78184737|ref|YP_377172.1|	  135 residues, 50 /line  hypothetical protein Syncc9902_1164 [Synechococcus sp. CC9902]

MSEAPA-- NKPAAKP-KPP--KPEDKPFPEFIDTLFLPAVAKQLLENG-ITA DRLERVEGDRPVVGGRCPMVVGDLPGGRRFWLCFAKEDINS-GKVIALADPG SDPTLLESFLIDEK---RMSLPLLVSRLLQRL-NGQKWLGSN--

>gi|78212841|ref|YP_381620.1|	  145 residues, 50 /line  hypothetical protein Syncc9605_1311 [Synechococcus sp. CC9605]

MSETPVEKQSS- ---GQE-AP---AKPAPKA-KPP--KPEDKPFPEFIDTLFLPAVAKQLAEHD-ITA DRLERIEGQRPVVGGECPMVVGELPGGRRFWVCFSKADINS-SKVIALADAG SEPTLLESFLIDEK---RMSLPLLVSRLLQRL-NGQKWLGGN--

>gi|148242576|ref|YP_001227733.1|	  147 residues, 50 /line  hypothetical protein SynRCC307_1477 [Synechococcus sp. RCC307]

MSDSAATPD---KPT- ---DQPQAA---KAPPAKA-KPP--KPEDKPFAEFVPQLLMPALAKEIESYG-GPA PALELEQAAMPVVGETCWQIRGELPGDRRFWLCFTKDDIQA-PKTFAIAESG APPSLLESFLIDEK---RITLALLVSRTVQRL-NGQKWLGPN--

>gi|116074905|ref|ZP_01472166.1|	  153 residues, 50 /line  hypothetical protein RS9916_30264 [Synechococcus sp. RS9916]

---M -PRADY--SSASEGNLVSETPAKA--T- ---PTPA-EKPAKPP--KPEDKPFDAFIQEDLLPAVRKGIVDRG-ITP SVLDLRQGERPVVGGSCWMLYGELPPGRRFWLCFSEPSIGA-DKTIALADPG TDASLLESFLIDEK---RMSLALLQSRLLQRL-NGQKWLGGN--

>gi|148239439|ref|YP_001224826.1|	  145 residues, 50 /line  hypothetical protein SynWH7803_1103 [Synechococcus sp. WH 7803]

MSETPAKPKGE- ---TKPAGE---G--KAKPAAKP--KLEDKPFASFMQEDFLPSLTKALDDRG-QRA VSLSLIEGERPVVGGSCWMVKGELSGERRFWLCFESDAITS-GKTIALAESG TAPSLLESFLIDEK---RITLALLQSRLLQRL-NGQKWLGGN--

>gi|88808724|ref|ZP_01124234.1|	  145 residues, 50 /line  hypothetical protein WH7805_03502 [Synechococcus sp. WH 7805]

MSETPAKPKGT- ---TKPAGE---G--KAKPAAKP--KPEDKAFASFIQEDFLPSLSKALADRG-HAP VSLSLSEGERPVVGGLCWMVKGELSSERRFWLCFESDAITS-GKTIALAESA AEPSLLESFLIDEK---RMTLALLQSRLLQRL-NGQKWLGGN--

>gi|124023148|ref|YP_001017455.1|	  169 residues, 50 /line  hypothetical protein P9303_14441 [Prochlorococcus marinus str. MIT 9303] MN-- -DNDNRVN -PNADA--EKPAEQKPTLENPVKTNP-- ---TEDQQE---G-AKPQSAKPPAAKVEDKPFETFIRDDFLPNIKQALTERG-MPP STLELIQGNRPVVGDPCWMVCGEIPLGRRFWLCFASDSIAS-KKTISLAETG TEPSLLEPFLIDEK---KMTLILLRSRLLQRL-NGQKWLTAN--

>gi|33863041|ref|NP_894601.1|	  169 residues, 50 /line  hypothetical protein PMT0769 [Prochlorococcus marinus str. MIT 9313] MN-- -DNDNRVN -PNADG--QKPAEQKPTLENPVKTNP-- ---TEDQQE---G-AKPQTAKPPAAKVEDKPFETFIRDDFLPNIKQALTERG-MPP STLELIQGDRPVVGDPCWMVCGEIPLGRRFWLCFASDSIAS-KKTISLAETG TEPSLIEPFLIDEK---KMTLTLLRSRLLQRL-NGQKWLTAN--

>gi|87125806|ref|ZP_01081649.1|	  134 residues, 50 /line  hypothetical protein RS9917_00280 [Synechococcus sp. RS9917] MS-- -DT- -PAA --ATPKP-KPP--KLEDKPFAAFIAEDFLPGLRKGLADHG-LTP TSLDLIEGERPVVGGSCWMVCGELPPGRRFWLCFNEAAIQS-GKTIALADPG TEPSLIESFLIDEK---RITLPLLLSRLLQRL-NGQKWLGGN--

>gi|37520443|ref|NP_923820.1|	  222 residues, 50 /line  hypothetical protein glr0874 [Gloeobacter violaceus PCC 7421] MP-EENQAPPPAEAAA ---TQTPE SPTDTKAADA--TPAPDGEAPAPKPRPPRPRPAAASAEGD ATEAAPAAAEGEAAPARPP--RP-KPG--DAPAKPLPQYIQEDILPLLEKRMKAEG-AAD VALTAGEADFTATW--D---NGNKTFTIYFDEGNLEG-RKTIAYNEVK SPGRVLQMFMPPERGFKGVDAKQIVVMILQQFTTTLTWIKKQPAAAGGGAAAKGKPPRPE RPAKSVAS

>gi|16331085|ref|NP_441813.1|	  157 residues, 50 /line  hypothetical protein sll0272 [Synechocystis sp. PCC 6803] MALIP---FYVY GDRLP---

-KLMTE-AKAPAE---KKA-KPP--AVEEKPFTEFINQDFLPALQSALGKIG-LGP VALDFSKKPIAIPGAD-NTPYWQVQGTWSGDRQISKQFNLYFFQEDIKG-PKGFAYSVDN RPPSTLESFMIDER---KVTLDLMVLYTLQRL-NGQKWLGGN--

>gi|22298015|ref|NP_681262.1|	  146 residues, 50 /line  hypothetical protein tlr0472 [Thermosynechococcus elongatus BP-1]

-MAVACKGECYHAEEMVPC- -RSAMA-- ---EETPT-QAPKKE-KPP--AIEDKPFAEFINEAFLPALKNALSAK--VGD VTLRLEGN--TVMGEWG---KGMYQFRLYFLEGNIQG-PKVFVCSSGG IAPSTLEPFLGDER---KVTLDLLVFGVMQRL-NGQKWLGGN--

>gi|33240441|ref|NP_875383.1|	  155 residues, 50 /line  hypothetical protein Pro0991 [Prochlorococcus marinus subsp. marinus str. CCMP1375] --MENNQ---DL SAN- ---NPPLSPEARK--- -SSQLE-GEKSLIKK-KPP--KLEEKPFEEFVNKHLIPEISNSLSSKG-ISL ESIILKKDQRPVVGGECWIVYGELLNGKRFWITFNSDDIKS-TKNICLAESS SEAALLESFLIDEK---KITLQLLTSRFMQRL-NGQKWLGDN--

>gi|56752196|ref|YP_172897.1|	  144 residues, 50 /line  hypothetical protein syc2187_d [Synechococcus elongatus PCC 6301] MS-- EEQV VPTP EA-AAKPAA---KKE-KPP--ALEDKPFKEFVQTDLLPAVQQALSDRG-LAD LDLQFVEAPLPVTGDRCWQLQGSWA---KGQRRFLLGFSEESLTA-PKVFSLADGK ATPATLEAFLGDER---KITLPLLLNRILSRL-NGQKWLEQN--

>gi|84518016|ref|ZP_01005365.1|	  153 residues, 50 /line  hypothetical protein P9211_04167 [Prochlorococcus marinus str. MIT 9211] --MEESH- --QDQIKGDNSQALSE SSS- ---NVV---TQEVKE-KPL--KIEEKPFDVFILDHFIPGLKSSLHKFG-ISA PVITLKEDERPVTGGKCWMVYAKLPKDRKFWICFSTNQISS-LKNFAISESG AEPSLLESFLIDER---KTTLPLLISRTLQRL-NGQKWLGRN--

>gi|86606710|ref|YP_475473.1|	  150 residues, 50 /line  hypothetical protein CYA_2070 [Synechococcus sp. JA-3-3Ab] MSG-

EPTPA---QETAVEKPARVEAANP --APKETDTA-QASPKAAA-KKP--AKEEKPFEQLIAEDVIPATIAAFQKRG-VTD LQLRLEGKT--LIGSFA---GGKKQFSVLFAEGSLNG-PKFFRCAIEG SPASTIESFMIDER---RVDVPLFVFYLLQRL-YAQQWY-

>gi|86608133|ref|YP_476895.1|	  141 residues, 50 /line  hypothetical protein CYB_0645 [Synechococcus sp. JA-2-3B'a(2-13)] MSG-

EPTPEAAAAEKPAE--AAKP --A--A-EDSPKAAA-KKV--AKEEKPFEQQITEEVIPAAIAAFQKRG-VSD LELRLEGKT--LVGAFA---GGKKQFSVLFAEGSLNG-RKFFRCTTEG SPASTIESFMIDER---RVDVNLFVFYLVQRL-YAQQWY-

>gi|87302400|ref|ZP_01085225.1|	  146 residues, 50 /line  hypothetical protein WH5701_09364 [Synechococcus sp. WH 5701] MQCF ELR- ---LQPVSD--- ---SASPAPN-PVKV-KPP--APEDKPFAEFIPQLFLPALLKEIEAFG-GAT PQLSFEQGAMPVVGSPCWLVMGSFPGDRRFWLCFTEASISS-AKTIALAEAG SEPSLLESFLIDEK---KTTLALLISRVVSRL-NSQKWLGPN--

>gi|123966154|ref|YP_001011235.1|	  319 residues, 50 /line  hypothetical protein P9515_09211 [Prochlorococcus marinus str. MIT 9515] --MEENVDSIGDSYINEKDTFKKDNKNTNKEKVAKEKSNEVNKEINEEKVAKENSNEVNI EINEEKISEENSKEVNKDINKEKVAKEKSNEVNKEINEEKVAKENSNEVNIEINEEKVAK EKSNEVNKEINEEKISEENSKEVNKDINKEKVSKENSKEVNKDINKEKVAKENSKEVNKD -INKEKVAEE-NLKPL--KIKPK-KEI--PIEKKPFKEFINEHLLPSIIQEFKLRG-FEV KEINLKNTSRPIAGDKCWVVFCEIKDICNFWLSFEKEDISS-SKSISLCKSN QKPSVIESFLIDEK---RITLKLIISRILQRL-NGQKLIGIN--

>gi|126696361|ref|YP_001091247.1|	  192 residues, 50 /line  hypothetical protein P9301_10231 [Prochlorococcus marinus str. MIT 9301] --MEENLDINNKVNN- --EKSDNITKSNSEEI ---REPRSEG---DTNTVSN--NSNPQRNSDSKD-- --NLKNDIDT-PVKP---VIKPK-KEL--PIEKKPFQEFINIHLIPALVEEINIRG-LKV NNINLTNTNRPIAGDKCWVINCEIKDTCNFWLSFEKEDISS-LKSISLSKPN QKPSIIESFLIDEK---RITLKLIISRVLQRL-NGQKLIGVN--

>gi|33861402|ref|NP_892963.1|	  183 residues, 50 /line  hypothetical protein PMM0845 [Prochlorococcus marinus subsp. pastoris str. CCMP1986] --MEENVESIEESIKREDDPLKK- ---DIIDI -DSPKETSTLINAN---SQD -SNKQKVGGE-NSIPL--KIKPK-KEL--PIEKKPFNEFINDHLLPSIIQEFKVRG-LEV ADINLKNTSRPIAGDKCWVIFCEIKDICNFWLSFEKDDISS-LKSISLCKSD QKPSVIESFLIDEK---RITLKLIISRILQRL-NGQKLIGIN--

>gi|123968557|ref|YP_001009415.1|	  187 residues, 50 /line  hypothetical protein A9601_10241 [Prochlorococcus marinus str. AS9601] --MEENLDKNNEVNK- --EIPDKTTKSNSEEI ---KEPKSEK---AIILDKN---GDSATKI-- --AIKNEINT-PEKP---ITKPK-KEL--PVEKKPFQEFINLHLIPSLTEEINQRG-LEI NNINLTNTNRPIAGDKCWVINCEIKDTCNFWLSFEKDDISS-LKSISLSKPN QKPSIIESFLIDEK---RITLKLIISRVLQRL-NGQKLIGVN--

>gi|78779339|ref|YP_397451.1|	  192 residues, 50 /line  hypothetical protein PMT9312_0955 [Prochlorococcus marinus str. MIT 9312] --MEENLEPNSEVNN- --ETTNIPNKSNTEET ---KEPKSEK---VLNMSEN--NANSPNNSVQKV-- --DMKKENVI-PAKS---ISKPQ-KEL--PIEKKPFQEFINIHLIPELIDEINQRG-LEI KNINLKKTTRPIAGDKCWVINCEIKDTCDFWLSFEKEDISS-LKSISLSKPK QKPSIIESFLIDEK---RITLKLIISRLLQRL-NGQKLLGVN--

>gi|124025752|ref|YP_001014868.1|	  183 residues, 50 /line  hypothetical protein NATL1_10451 [Prochlorococcus marinus str. NATL1A] ---MENQNPSNEIDS- --SKKVTRSQSDSLDK ---NEPASEG-KKDLNTLDKPEKSSLLN -S-NAPAI--AKKPV-KPP--KLEDKPFKEFISNFLIPGLKASIEDKG-TVV CEIKLIEGQRPVVGGNCWMVFCELSEQRKFWLCFSKDIITS-DKTILLAESN SDPSIVESFLIDEK---KTTLPLLISRVLQRL-NGQKWIGVN--

>gi|72382203|ref|YP_291558.1|	  183 residues, 50 /line  hypothetical protein PMN2A_0363 [Prochlorococcus marinus str. NATL2A] ---MENQNPSNEIDS- --SKKVTRSQSDSLDK ---NEPAFEG-KKDLNTLDKPEKSSLLN -S-NAPAI--PKKPV-KPP--KLEDKPFKEFISNFLIPGLKASIEDKG-TVV CEIKLIEGQRPVVGGNCWMVFCELSEQKKFWLCFSKDIITS-DKTILLAESN SDPSIVESFLIDEK---KTTLPLLISRVLQRL-NGQKWIGAN--

>gi|125559127|gb|EAZ04663.1|	  228 residues, 50 /line  hypothetical protein OsI_025895 [Oryza sativa (indica cultivar-group)] MASQ-PLRLVRPSPLAGRHAAACKCSAAIP-- --LVFGRQRLPLLVAFPRGSGSGSGSGAS-CSAVQE SSS---AA-AATTVSEKKDAADAKK--- ---EATAEAKPAAKPAAKPKKPP-VKPLPEMMQEEIIPPLKAALEAEDDVSQ VELSFEDNR--LEGSFIK-DEVPYYFWAFFPNGDLTG-PKGFALSSYG TEVSTIEPFLIDEK---RANAKYVVFWVYKRL-AGQGILPVW--KEEEGEGE GEGESS-A

>gi|147819041|emb|CAN71628.1|	  220 residues, 50 /line  hypothetical protein [Vitis vinifera] MAIR-AIGVSSFPSSSYFTRKSEATSSTLP-- --LCLRHGQCMHQMA---GKPVTSRRIIA-CSAVQE SST---PTDEWKGLDWSSMYSTCFYLVAAETKE--- ---VKPAQEKGPAKPKP-PAKAP-VKPLPQMMEEDVIPSLKSILEAQDDLSE IELSFQDNR--LEGSFQK-KGIPYSFWAFFXNGVLTG-PKGFSLSSYG SGSSTVEPFLIDEK---RITAKHVVFWVEKRL-AAQGIIPVW--KE--

>gi|145328262|ref|NP_001077877.1|	  206 residues, 50 /line  unknown protein [Arabidopsis thaliana] MATI-AGGSFGVPSSRISITTPTLSSSSL--- LPPLTLQSGTRKDNLLR-C-ALQE SST---SA--VATEKKN--- ---KEEGEESTVAVPAKKPKP-AAKAA---AVAKPLRQMMEEDVIPPLQAILESQDDISD IDLSFQDDK--LEGFFLK-KSIPYSFWAFFPTGNLTGEQKDFQFPHTG QVRAPWNHFLSTRG---NQLRTTLCFGSRSVL-LHKGSSPFG--TN-- -EVFVH-L

>gi|18395648|ref|NP_565308.1|	  199 residues, 50 /line  unknown protein [Arabidopsis thaliana] MATI-AGGSFGVPSSRISITTPTLSSSSL--- LPPLTLQSGTRKDNLLR-C-ALQE SST---SA--VATEKKN--- ---KEEGEESTVAVPAKKPKP-AAKAA---AVAKPLRQMMEEDVIPPLQAILESQDDISD IDLSFQDDK--LEGFFLK-KSIPYSFWAFFPTGNLTG-AKGFSISSHG SGPSTVEPFLVDER---KPTANHVVFWVEKRL-AAQGIIPVW--NQ--