Evolutional: Difference between revisions
No edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
Now 47 Human Blast sequence were used to run Clustal X and they are as follows, | |||
Highlighted organisms come under same Lineage as our protein. | |||
>gi|9294283|dbj|BAB02185.1| unnamed protein product [Arabidopsis thaliana] | |||
MEEVLTNPKAGFYMNRDVFGAQGDFITSPEVSQMFGEMIGVWTVCLWEQMGRPERVNLVELGPGRGTLMA | |||
DLLRGTSKFKNFTESLHIHLVECSPALQKLQHQNLKCTDESSSEKKAVSSLAGTPVHWHATLQEVPSGVP | |||
TLIIAHEFYDALPVHQFQTQYLQKSTRGWCEKMVDVGEDSKFRFVLSPQPTPAALYLMKRCTWATPEERE | |||
KMEHVEISPKSMDLTQEMAKRIGSDGGGALIIDYGMNAIISDSLQAIRKHKFVNILDDPGSADLSAYVDF | |||
PSIKHSAEEASENVSVHGPMTQSQFLGSLGINFRVDALLQNCNDEQAESLRAGYWQLVGDGEAPFWEGPN | |||
EQTPIGMGTRYLAMSIVNKNQGIPAPFQ | |||
>gi| | >gi|50550583|ref|XP_502764.1| hypothetical protein [Yarrowia lipolytica] | ||
MLRTIRPARTTLVRAVRPVRPVSGRVGRLGRHVTTGTTSSTTSASSPDLSTTLAMAIEQQGPMSVATFMK | |||
HCLTNPSGGYYIDKDPLGAKGDFTTSPEISQMFGELVGLWLAAQWLYYGQKQPFRVIEYGPGRGTLMDDS | |||
LRALVSAKSTGAKEALKEVLLVEASPVLRDAQRKKLCGAESQFKTEEDGSITCVTKYGVPIRWYEDSKML | |||
DKLASSNDPLHNYIVAHEFFDALPIYQFEKTDKGWRELMVNYGVENKTKESSILLPGQTHIKSSDLDKDK | |||
KKTFHLVTAPTWTVASKVIPQSHKRYRDLPEWSKIEVCPDAWDVANQMGRLVAKGGAAFIVDYAVKPGVP | |||
VNTLRGIRDHKICSPFEEPGKVDLSADVDFTAIGIASRSKNKENVSAFGPINQATWLKNMGIEMRTEKLM | |||
EGKEEYIKKRIESQYKRLVDIGINGMGKIYKAFFLTHSSHGYPVGFPIPEPKDLKEPHQKPEKDPKDTEP | |||
KVVEV | |||
''' | |||
>gi|27382556|ref|NP_774085.1| hypothetical protein blr7445 [Bradyrhizobium japonicum USDA 110]''' | |||
MTEQPLLNEIKALIKSSGPMPVWRYMELCLMHPRYGYYVSRDPLGREGDFTTAPEVSQMFGELLGLWTAS | |||
VWKQMGSPQSLRLIELGPGRGTMMADALRALRVLPPLYQALQIHLVEVNPVLRERQSATLSGARNVAWHD | |||
SIDDVPEGPSIILANEYFDVLPIHQMVKRENGWHERVIEIDPNGKLQFGAASEPTPRFDVLLPPLVRAAP | |||
VGAVFEWRPDGEVMKLATRVRDQDGAALIIDYGHLRSDAGDTFQAIARHTFTDPLKAPGQADVTAHVDFQ | |||
ALARAAEDVGARVHGPVTQGDFLKRVGIDTRAAALMQKATPEVATDISVALKRLTDTGRSGMGSMFKVLG | |||
ISEPRLTGLAGLSDLEHAGGN | |||
>gi|119177909|ref|XP_001240685.1| hypothetical protein CIMG_07848 [Coccidioides immitis RS] | |||
MSNGATQIVNRLARASRCSRLATPSAAKRYLSSAPQRRWSTPLAKTIADVINTAGPISIAAYMRQCLTSP | |||
EGGYYTSRGSTGVEVFGRKGDFVTSPEISQMFGELLGVWMVTEWMAQGRRSRGVQLIEVGPGRGTLMADM | |||
LRSVRNFKSFSSSIEAVYLVEASPTLRDIQKQMLCGDAPMEEIEVGYRSTSKHLGVPVVWTEHIRSLPQG | |||
DNDVPFIIAHEFFDALPIHAFQCVASPPSETIITPTGPTTLRQPLSSSPTQWRELVVSVNPASQMHAENR | |||
LEFRLSLAKTSTPASMVMPEMSERYKALKSTRGSTIEISPESQGYVQEFARRIGGHSNSKIPTTRKPAGA | |||
ALILDYGPSHSIPVNSLRGIKDHKLVSPFTSPGQVDLSADVDFIALADSAISASPGVEVHGPTEQGSFLH | |||
SLGISERAAQLMKRAEDETKRKNIEAGWKRLVERGGGGMGRIYKAMAIIPEAGGMRRPVGFGGQVPA | |||
>gi|91978339|ref|YP_570998.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisB5] | >gi|91978339|ref|YP_570998.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisB5] | ||
Line 18: | Line 45: | ||
GVSDPNITSLVALSDDAERAAEGQPA | GVSDPNITSLVALSDDAERAAEGQPA | ||
>gi| | >gi|46200845|ref|ZP_00207869.1| COG1565: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] | ||
MSLSALLSERIKATGPIPVSEFMAEALGHPEYGYYRGRDPFGMAGDFTTAPEISQMFGELIGLWCALVWQ | |||
SMGSPERVVLAEIGPGRGTLMADLLRAAKALAPFARALDVHLIETSPSLRNRQAQALADQSVTWHERFED | |||
LPDGPLLLVANELFDALPIRQLEKVGGVWHERVVGLDDQGALVLALGPVVADPPLAPAVLNAPDGSLAEV | |||
CPQGRVLAEAVARRLAHQGGAALIIDYGYETSAAGDSLQAVKSHRHHPVLSAPGTADITAHVDFQALAEA | |||
ASGLARVYGPVPQGRFLARLGLEERVRMLMQHASVEQAAHLASGARRLIDPAEMGTLFKVLALANPLLPA | |||
PPGLELA | |||
>gi| | >gi|125526627|gb|EAY74741.1| hypothetical protein OsI_002588 [Oryza sativa (indica cultivar-group)] | ||
MEEVLTNPQSGFYINRDVFGTSGDFITSPEVSQMFGEMTGVWAMCLWEQMGQPEKVNLIELGPGRGTLLA | |||
DLLRGSSKFVNFTKALNINLVECSPTLQKVQYNTLKCEDEPIGDKTRTVSKLCGAPVHWHASLEQVPSGL | |||
PTIIIAHEFYDALPIHQFQPTASLLFLSKRCGWASSEELEKVEHIEVCPKAMEITEQIADRISSDGGGAL | |||
IIDYGKDGIVSDSLQAIRKHKFVHILDNPGSADLSAYVDFASIRHSAKEASDDISVHGPMTQSQFLGSLG | |||
INFRVEALLQNCATDEQAESLRTGYWRLVGDGEAPFWEGPDDQTPIGMGTRYLAMAIVNKKQGTPVPFE | |||
>gi| | >gi|119480871|ref|XP_001260464.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181] | ||
MMNSATKRALTRHFRTYQCRNLQIGSHRCSSTFDQRQWSTPLAKTLANAIKVTGPIPIAAFMRQVLTSPE | |||
GGYYTTRPEGGGEVFGKKGDFVTSPEISQVFGELVGIWTITEWMAQGLKRSGVQLIEVGPGKGTLMDDML | |||
RTFRNFKSFASSLEAIYLVEASPTLREVQKQRLCGDAAMEETDIGHKSISKYFNVPVIWVEDIRLLPHEE | |||
DKTPFIFAHEFFDALPIHAFESIPPAPENQSEQKEIMTPTGPAKLHQPMKPANTPQWREIMVTLNPKAVE | |||
ENIEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSTPGSTIEVSPESRIYASDFARRIGGSSQPPRTVG | |||
SRNAPAAQPKKVPSGAALIMDYGTMSTIPINSLRGIQHHRTVPALSSPGQVDVSADVDFMALAEAAIEAS | |||
EGVEVHGPVEQGDFLQVMGIAERMQQLLKGVQDEEKRKTLESGWKRLIERGGGGMGKIYKFMAIIPENGG | |||
RRRPVGFGGSVQM | |||
>gi|92118562|ref|YP_578291.1| protein of unknown function DUF185 [Nitrobacter hamburgensis X14] | >gi|92118562|ref|YP_578291.1| protein of unknown function DUF185 [Nitrobacter hamburgensis X14] | ||
Line 74: | Line 78: | ||
GVSDSSLTELAGLSDRKRRGGIRAP | GVSDSSLTELAGLSDRKRRGGIRAP | ||
>gi|85716018|ref|ZP_01046995.1| hypothetical protein NB311A_14415 [Nitrobacter sp. Nb-311A] | >gi|146342824|ref|YP_001207872.1| hypothetical protein BRADO6003 [Bradyrhizobium sp. ORS278] | ||
MIETSPLQPEIKRLIKASGPMPVWRYMELCLMHPEHGYYISRDPLGREGDFTTAPEVSQMFGELLGLWAA | |||
SIWKAAGSPQQFRLIELGPGRGTMMSDALRALRVLPPLYQTISVHLVEINPVLREKQKATLTGLRNVTWH | |||
DSFDEVPEGPSVIFANEYFDVLPVHQMVRRETGWHERVVELDDDENFVYGTAADPTPGFELLLSPLVRAA | |||
PAGAIFEWRPDTQMMAIARRLREQRGAAVIIDYGHVRSDVGDTFQAIARHSFADPLKTPGLADITAHVDF | |||
DALSRTAEAVGARVHGPITQGEFLQRLGIETRALTLMQKASPEVSEDIASGLKRLTSGGRGGMGSLFKVL | |||
GVSDPSIPVLAGISDEHTSEKTGGA | |||
>gi|146322884|ref|XP_755307.2| DUF185 domain protein [Aspergillus fumigatus Af293] | |||
MNSATKSAWTRHFRTYQYRNLRIGSHRCSSTFEKRQWSTPLAKTLANAIKVTGPIPIAAFMRQVLTSPEG | |||
GYYTTRPEGGGEVFGKKGDFVTSPEISQVFGELVGIWTITEWMAQGSKRSGVQLIEVGPGKGTLMDDMLR | |||
TFRNFKSFASSLEAIYLVEASPTLREVQKQRLCGDAAMEETDIGHKSISKYFNVPVLWVEDIRLLPHEED | |||
KTPFIFAHEFFDALPIHAFESIPPAPENSPEQKEIITPTGPAKLHQPMKPANTPQWREIMVTLNPKAVED | |||
NIEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSTPGSTIEVSPESRIYASDFARRIGGSSQPPRTVGS | |||
RNSPAAQPKKIPSGAALIMDYGTMSTIPINSLRGIQHHRTVPALSSPGQVDVSADVDFMALAEAAIEASE | |||
GVEVHGPVEQGDFLQVMGIAERMQQLLRGVQDEEKRKTLESGWKRLIERGGGGMGKIYKFMAIIPENGGR | |||
RRPVGFGGTVQM | |||
''' | |||
>gi|78693301|ref|ZP_00857815.1| conserved hypothetical protein [Bradyrhizobium sp. BTAi1]''' | |||
MIELSPLHSEIKRVIKASGPMPVWRYMELCLMHPEHGYYISRDPLGREGDFTTAPEVSQMFGELLGLWAA | |||
SVWKASGSPQQFRLIELGPGRGTMMSDALRALRVLPPLYQTISVHMVEINPVLREKQKATLTGLRNITWH | |||
ESFDDVPEGPSVIFANEYFDVLPIHQMLKRETGWHERVVELDAEENFAYGTAAEPTPGFELLLPPLVRAA | |||
PLGAIFEWRPNNEIMAIAKRIREQRGAAVIIDYGHVRSDVGDTFQAIARHSFADPLKTPGLADITAHVDF | |||
EALARAADAVGARVHGPITQSEFLRRLGIETRALTLMQKASPDISRDIASGLKRLIEGGRGGMGSLFKVL | |||
GLSDASIPVLAGISDEHTGGKPGGA | |||
>gi|84499690|ref|ZP_00997978.1| hypothetical protein OB2597_07165 [Oceanicola batsensis HTCC2597] | |||
MAVTPLLDRIRHRIGAQGPMTLAEYMQIALLDPDHGYYATRDPFGTAGDFITAPETSQMFGELVGLALAQ | |||
SWIDQGRPAPFILAEPGPGRGTLMADILRATRSVPGFHDGLSLVLIEASPVLRDIQARTLSGYRAEWIDD | |||
LGALPEAPLFLVANEFFDALPIRQFRRRGDGWAEVMVTVSGSGLATALAAPVPLPELAHRLGDTREDDVV | |||
ELCPAAARAAAHIGARIADQGGAAVIVDYGDWRSLGDTFQALKGHAPVDPLAAPGTADLTAHVDFERLAK | |||
AATPAWASGMIPQGVFLERLGITARAQALATRLQGPDLDAHVAAHRRLTHPEEMGTLFKVLALSPPDAPP | |||
VPGTTDPEWPTE | |||
>gi|67539742|ref|XP_663645.1| hypothetical protein AN6041.2 [Aspergillus nidulans FGSC A4] | |||
MNCSTQRIVNQFSRQTARRRFNIRSRRWNSTFETREWSTPLARTLANVIKTTGPVPIAAFMRQVLTSPEG | |||
GYYTTKPGGGGEVFGKKGDFVTSPEISQVFGELVGIWTIAEWMAQGGKKSGVQLMEIGPGKGTLMDDMLR | |||
TFRNFKPFTSSLEAIYLVEASPTLREVQKQLLCGNAVMEETDIGHRCTSKYFNVPVIWVEDIRLLPHEED | |||
KTPFIFAHEFFDALPIHAFESVPPSPENEQQEQEIMTPTGRTKLQRPPKAANTPQWRELMVTLNPKAVDE | |||
NIKDEPEFKLTLAKASTPSSLVIPEISERYRALKSQPGSTIEVSPESRIYASDIARRIGGSSQPPRTAAG | |||
RNASAPSAIAKRIPSGAALIMDYGTMSTVPINSLRGIQNHKIVPALSSPGRVDVSADVDFTSLAEAALEA | |||
SEGVEVHGPVEQGHFLQAMGIAERMQQLLSTVKDEKKRKILETGWQRLVERGGGGMGKLYKVMTIIPENG | |||
GRRRPVGFGGGVPL''' | |||
>gi|85716018|ref|ZP_01046995.1| hypothetical protein NB311A_14415 [Nitrobacter sp. Nb-311A]''' | |||
MTEPAPLLADIKRLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFITSPEVSQMFGELLGLWGA | MTEPAPLLADIKRLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFITSPEVSQMFGELLGLWGA | ||
SVWRTIGSPLTLRLIELGPGRGTMMADALRALRVLPPMYESLSVHMVEINPVLREKQMAALSDAPNIQWH | SVWRTIGSPLTLRLIELGPGRGTMMADALRALRVLPPMYESLSVHMVEINPVLREKQMAALSDAPNIQWH | ||
Line 82: | Line 130: | ||
VSSPGLTELAGLSDRERRGGIKAP | VSSPGLTELAGLSDRERRGGIKAP | ||
>gi| | >gi|145252682|ref|XP_001397854.1| hypothetical protein An16g05460 [Aspergillus niger] | ||
MNQATRRAVRQLLRKHPNQTLFLKSQRWSSTTPASSTSTSETRKWSTPLAQTLANAIKVTGPVPIAAFMR | |||
QVLTNPEGGYYTTRPEGHGAVFGKKGDFVTSPEISQVFGELVGIWTIAEWMAQGRKRSGVQLMEVGPGKG | |||
TLMDDMLRTFRNFKMFSSSMEAIYLVEASATLREVQKKLLCGDAVMEATDIGHKSTCKYFDVPIVWVEDI | |||
RLLPHEEEKTPFIFAHEFFDALPIHAFESIPPSPENQPEQKEIMTPTGPAKLHQPLKPANTPQWREIMVT | |||
LNPKAVEENIEGEPEFKLTLAKASTPSSLVIPEISPRYRALKSQPGSTIEVSPESRIYAADFARRIGGAS | |||
EPPRTATKGAAASAPAPAKRVSSGAALIMDYGTLNTIPINSLRGIQEHKNVPPLSSPGQVDVSADVDFTA | |||
LAEAAIEASEGVEVHGPVEQGDFLQAMGIEERMQQLLKKVEDEEKRKTLETGWKRLVEKGGGSMGKIYKV | |||
MAIVPENDGKRRPIGFGGGLVM | |||
>gi|116058192|emb|CAL53381.1| ATP synthase beta subunit/transcription termination factor rho-like (ISS) [Ostreococcus tauri] | |||
MGGGEHGKGERTGMIGHLKRAMAFAGGSIPVSEYVRECLTNPEHGYYMRGDVFGRDGDFVTSPEISQVFG | |||
EVLGVWAALQHEALGSPGTLRVVEFGPGRGTLMADLLRGTSKFEKFRSAVSVHLIEVSPALREVQARTLR | |||
CVDVETTSAAADDGGARVRVPKNALEAEEGEVDKRSAADGPSGEAHTRGTSEISGAKVFWHDGLESVPNG | |||
PTLVICHEFFDALPVRQFQRTDRGWCEKLVTIDAELASTAETVEETTPRRELAMVLSPGPTPASHMLVPR | |||
RLKGLPKEQVDSLRLLELSPPSMTLWDKLADRIEKNSGAVLAIDYGEEGPLGNTLEAIKDHKFVHVLDSP | |||
GEADLSAYVDFGALRQIVEEKPQRGVTCYGPVTQQQLLLSLGLVARLEQLVENAASEDQANALVKGCERL | |||
VGDGAGNAETGEPPGMGVRYKAMCMVSRGLPKPVGFS | |||
>gi|83309693|ref|YP_419957.1| hypothetical protein amb0594 [Magnetospirillum magneticum AMB-1] | |||
MTLADILADRIRATGPIPVSEFMAEALGHPEYGYYMGRDPFGMAGDFITAPEISQMFGELIGLWCALVWQ | |||
SMGAPKRVVLAEIGPGRGTLMADLLRAAQALPPFALALNVHLIETSPSLRNRQAQALTDRSVEWHERFED | |||
LPDGPLLLVANELFDALPIRQLEKAGGVWRERVVALDEAGAFAFAQGPVVAEPPLAPAVLGAADGAVAEL | |||
CPQGRALAGTIARRLAHQGGAALIIDYGYGKSAAGDSLQALKSHKRHPVLSGPGTADITAHVDFQALAEA | |||
ASGLARAHGPVPQGSFLARLGLEERVRMLMQNATPEQAAHLASGARRLIDPGEMGTLFKVLALAAPLLPV | |||
PPGLEHD | |||
>gi|121525100|ref|ZP_01658036.1| conserved hypothetical protein [Parvibaculum lavamentivorans DS-1] | >gi|121525100|ref|ZP_01658036.1| conserved hypothetical protein [Parvibaculum lavamentivorans DS-1] | ||
Line 98: | Line 165: | ||
PSPAGF | PSPAGF | ||
>gi| | >gi|75676687|ref|YP_319108.1| Protein of unknown function DUF185 [Nitrobacter winogradskyi Nb-255] | ||
MRRRARIRALQPRSIVTEPAPLLTYIKKLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFVTSP | |||
EVSQMFGELLGLWAASVWRMMGSPDPLRLIELGPGRGTLMADALRALRVLPPMYESLSVHMVEINPVLVE | |||
KQMAALSDAPNIEWHTSLDQVPQGPAIILANEYFDVLPVHQMVRRDGGWHERVVDIDGSGQLVFGVSAEP | |||
TPRFDVLLPPLVRAAPVGAIFEWRPDAEMMSIATRVRDNGGAALIIDYGHVRSDAGDTFQAISRHSFADP | |||
LKYPGRVDVTAHVDFEALARAAEDVGARVHGPVPQGEFLRRLGIEARAVNLMAKATPELSDDIATALKRL | |||
TEGGRGGMGSMFKVIGVSDPSLSVLVGLSDQAHGGGIRTP | |||
>gi| | >gi|89359500|ref|ZP_01197321.1| conserved hypothetical protein [Xanthobacter autotrophicus Py2] | ||
MAVGAAGRRRRRPSALRPEAQAGRVTTPLSKEISALIAAEGPMPLSRYMALCLGHPRHGYYMTRDPLGAR | |||
GDFTTAPEISQMFGELLGLWAVAQWQAMGSPPAFRLVELGPGRGTLMADALRAARLVPDFGAAARIHLVE | |||
TSPVLRAAQARTLAAHADRVSWHDRVEEVPDGPALVLANEFFDALPIDQYVFHAGHWHERRVGLDDGGRL | |||
VLGLDPAPSRAAPAFAAHLPPPAEGVVLEHLESGPARALSERLKTQGGAALIIDYGHAGGYGDTFQALEQ | |||
HRFADPLAAPGNADLTAHVDFSALARIGRAAGLRAFGPLEQGAFLARLGLAQRAERLKRDATDELRAGVD | |||
AAARRLAGDGAGEMGRLFKVLVLAHPEIGLPPAFDSTEEWVR | |||
>gi| | >gi|115523453|ref|YP_780364.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisA53] | ||
MTDQPLHDTIKKLIRSAGPMPVWRYMELCLTHPEHGYYVSRDPLGREGDFITSPEVSQMFGELLGLWAAS | |||
VWKAIGSPQQVRLIELGPGRGTLMADAMRALRVLPPLYQAISVHLVEINPVLRDKQRDTLANLSNVAWHA | |||
DLDEVPGGTSIIFANEYFDVLPVHQAVRGEHGWHERVIEIDAEGDLTFGAAAEPIPQFEVLLPPLVRAAP | |||
PGAVFEWRADSEIMKIASRVRDEGGAALIIDYGHLRSDAGDTFQAIAKHSFADPLANPGQADVTAHVDFQ | |||
ALAQAAEAVGARVHGPVTQGEFLRRLGIETRALALMAKASHEISEDVANALKRLTGGGRGGMGSMFKVIG | |||
VSEADLTEVTGLSDAVRAEVGA | |||
>gi| | >gi|121715340|ref|XP_001275279.1| DUF185 domain protein [Aspergillus clavatus NRRL 1] | ||
MNNARRAWTRHVRACQPRNLRLGSHRCSSTFDQRQWSTPLAKTLANAIKITGPIPISAFMRQVLTSPEGG | |||
YYTTRPEGGGEVFGKKGDFVTSPEISQVFGELVAVWTITEWMAQGRKRSGVQLIEVGPGKGTLMDDMLRT | |||
FQNFKSFSSSIEAIYLVEASPTLREVQKQRLCGDAPMEETDIGHRSTSKYFNVPVIWVEDIRLLPHEEGT | |||
TPFIFAHEFFDALPIHAFESVPPAPESQTEQSEIMTPTGPAKLHQPMKPANTPQWREIMVTLNPEAVEEN | |||
KEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSQPGSTIEISPESRVYASDFARRIGGSSQPPRTVNRQ | |||
DAGPVQPKRVPSGAALIMDYGTMSTIPVNSLRGIQNHRNVPALSSPGQVDVSADVDFIALAEAAIDASEG | |||
VEVHGPVEQGDFLQVMGIAERMQQLLKGIKDEEKRKTLESGWKRLVERGGGGMGKIYKFMAIIPENGGQR | |||
RPVGFGGTVQM | |||
>gi| | >gi|86751272|ref|YP_487768.1| Protein of unknown function DUF185 [Rhodopseudomonas palustris HaA2] | ||
MNDDSPLLAEIKRLIETAGPMPVWRYMELCLAHPEYGYYVSRDPLGREGDFTTSPEISQMFGELIGLWTA | |||
SVWKAVGEPGVLRLIEIGPGRGTMIADALRALRVLPPLYQSLSVHLVEINPVLRAKQQATLAGIRNVHWH | |||
EDFAEVPEGPAVVLANEYFDVLPIHQAVKRDGGWHERVIEISASGDLVFGVADDPIPRFEVLLPPLVQMA | |||
PAGTVFEWRPDNEIMAIAARLRDQGGAALIIDYGHVRSDVGDTFQAIARHSFADPLQHPGGADLTAHVDF | |||
QALGRAAETIGARIHGPVTQGEFLKRLGIETRALSLMAKASAQVSEDIAGALKRLTGEGRGGMGAMFKVI | |||
GVSDPSITSLVALSDDAERAAEGQKA | |||
>gi| | >gi|56695809|ref|YP_166160.1| hypothetical protein SPO0907 [Silicibacter pomeroyi DSS-3] | ||
MSLTGLLLERIAQQGPLSLADYMAECLLHPEYGYYTTRDPLGVAGDFTTAPEISQMFGELIGLALAQAWM | |||
DQGRPAPFTLVELGPGRGTLMADALRATRAVPGFHEAARLWLVEASPVLRATQAQALAGHDPQWCDTVSD | |||
LPAGPLFGVANEFFDALPVRQFQRAGAVWRERLVGARDGALCWGLGAEALQPALAHRLEDTREGDLVELC | |||
PAAGLILSELASRIAADGGAALIVDYGDWRSLGDTVQALRNHAPADPLADPGQADLTAHVDFEVLAMTAR | |||
AAGCAHSRLSTQGVFLERLGIAQRAQALARHADEAALDRLITAHRRLTHPEEMGNLFKVLGLYPSDATPP | |||
PGLEP | |||
>gi| | >gi|90422923|ref|YP_531293.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisB18] | ||
MTEPFSLQDVIKKLIKSAGPMPVWRYMELCLTHPEFGYYVSRDPLGREGDFTTAPEVSQMFGELLGLWAA | |||
SVWRSIGSPQLVRLIEFGPGRGTMMADALRALRVVPPLFQALHVHLIEINPVLREKQKATLAGAQNLHWH | |||
ASLDEVPGGSTIIFANEYFDVLPIHQMVRGEHGWHERTVEIDAAERLVFGVAPEPVPHFEQLLPPLVRAA | |||
PQGAVFEWRPDAEIMKIASRVRDEGGAALIIDYGHPRSDAGDTFQAIARHSFADPLQNPGRADVTAHVDF | |||
QALARGAQDVGARVHGPVTQGEFLKRLGIENRAVALMAKASLEVSEDVASALKRLVEGGRGGMGSMFKVM | |||
AVSEPEIEQIAGLSDQPEAARTAAQ | |||
>gi| | >gi|111069170|gb|EAT90290.1| hypothetical protein SNOG_02078 [Phaeosphaeria nodorum SN15] | ||
MRATSLSLLRTATRCSQPQANDIRRRSAQCAIRYTSSLTSTSSGAERKWSTPLAKMLGEAITTTGPISVA | |||
AYMRQCLTAPEGGYYTRQTSAGQDQFGTTGDFVTSPEISQVFGELIGIWIYAEWLAQGKKEKVQIIEVGP | |||
GRGTLMDDVLRTISSFKAFANSIEAIYLVEASPHLQKQQGKLLAGTEELQKSDIGLTAPLKYIPGVNIQW | |||
CEDIRNIEETSSPFILAHEFFDALPIHVFQNIAQSSIPASSMIMTPTGPIKPKHGATVPKNTWHELVVSP | |||
TNPYSSAGTITTTSSTSTKQEEKPDFELTVSKTPTPHSLYLPKKSDRYKKLEDTNDAIIEISPESMAYIS | |||
DFAVRIGGDNPPSKAEPTSSSAKLQRTEPAPFSKPQPAGAALILDYGPANTIPANTFRGIRGHQTVSPFT | |||
SPGLVDLSADVDFLALAESALDASPGVEVHGPVEQSFFLSTMGIKERAERLLKGAKDEATRQRLETGWKR | |||
LIDRGPNGMGKTYKAMALLPYIKGSKVRRPVGFGGDIAA | |||
>gi| | >gi|110680664|ref|YP_683671.1| hypothetical membrane protein [Roseobacter denitrificans OCh 114] | ||
MSLKDQLIARIKAHGPMSVAEYMGDCLLHPTLGYYTTQHPFGGSGDFITAPETSQMFGELIGLCLVQAWV | |||
DQGRPSPFALVELGPGRGVLMADILRAAAQVPDFARAAEVILVEASPKLQEIQRDTLKAHAVTFVKDVAS | |||
LPQCPLFVVANEFFDALPIRQFVRSGPHWRERQVGCDAEQLIFGMGAQTPQPALNARLSDTKEHDLVEYA | |||
PAAAPIMSELGSRIDTHGGAGLIIDYGDWRSLGDTLQAVRQHEYTGVLDHPGESDLTAHVDFEALAQAVP | |||
CAFSRLTPQGVFLERLGITQRAQRLAQNLPKDLLEQHIKAHRRLTHPEEMGNLFKVLGLFPHGKAPPAGL | |||
EI | |||
>gi|85704768|ref|ZP_01035869.1| hypothetical protein ROS217_06800 [Roseovarius sp. 217] | >gi|85704768|ref|ZP_01035869.1| hypothetical protein ROS217_06800 [Roseovarius sp. 217] | ||
Line 193: | Line 249: | ||
LDP | LDP | ||
>gi| | >gi|116502505|gb|EAU85400.1| hypothetical protein CC1G_07094 [Coprinopsis cinerea okayama7#130] | ||
MQLCLSHPTHGYYMNPNNAVFGTSGDFITSPEISQVFGELVGVWLVSQWADAGTPPAIRLVELGPGRGTL | |||
MDDILRIVKKFLPEKALTGVHLVETSEALRSVQKAKLGEKCDLHFHNGIHEIPRNPSVYTMLVAHEFFDA | |||
LPVHVVQKTEAGWNEVMIASNDSLSSSESSPSQPQTQKQGVLRRVLNPLPSPASTLLGNSSLRFRNLPIG | |||
STIEVSPTSFRIAHQIGRLLSARGLEEKPLDLVEASQQETGVGGCGLVIDYGADHAFGDSFRAFKEHKIV | |||
DVFHRPGECDITANVDFAYLKEAMTVEPHGPITQADFLERMALQTRVEALVRNASSEERKKVILDAANRL | |||
VDRSGMGTQYKVLGITSSPKSSSTGVWPFDVQNQALES | |||
>gi| | >gi|126739895|ref|ZP_01755586.1| hypothetical protein RSK20926_14444 [Roseobacter sp. SK209-2-6] | ||
MSSLEQQLVARIQENGPISLAEYMSECLLHPEFGYYSTRDPFGQSGDFVTAPEISQMFGELLGLCLAQCW | |||
LDQGAPSPFALVELGPGRGTLMRDLLRATAGVTGFHQAMQVFLVEASPKLQREQAKALEEYDVSWVAEPM | |||
ELPNLPVFLVANEFFDALPARQFVRDSDGWRERLIGLEEGKLGFGLGSATDQPALAYRLEDTRPGDLVEL | |||
CSPAATLLEPIADRISTFGGAALIIDYGDWRSLGDTLQALRAHKSTPPLENPGKADLTLHVDFEFLAQST | |||
KSTGCAHSRVTPQGVFLERLGITERAQALSRALQGEALDTLIAAHRRLTHPEEMGNLFKVLGLYPAQFSP | |||
PPGLLS | |||
>gi| | >gi|114767217|ref|ZP_01446082.1| hypothetical protein R2601_09240 [Roseovarius sp. HTCC2601] | ||
MSPLEEILHRRIAAEGPMTIAEYMATCLGHPRYGYYPTRDPLGAAGDFTTAPEISQMFGELLGLCLAQCW | |||
LEQGRPSSFVLAELGPGRGTLMADATRAMRGVPGMLEAARLHLVETSPRLRDEQHRRLAPLMPVWHDSVA | |||
NLPEAPLYLLANEFFDALPIRQFLRSGEGWCERVVGLSEGRLAFGLTEPAPHGELEHRLADTREGDLVET | |||
CAPATGIAEDIGRRIASQGGAALIVDYGSARSLGDTFQAVRRHDKVSPLDAPGTADLTAHVDFGALATAM | |||
PCATTTLTPQGVFLERLGITDRARALAARLSGAQLDSHVAAHRRLTHPEEMGSLFKTLGAFPEGAAPPPG | |||
LLDT | |||
>gi|86136505|ref|ZP_01055084.1| hypothetical protein MED193_20319 [Roseobacter sp. MED193] | >gi|86136505|ref|ZP_01055084.1| hypothetical protein MED193_20319 [Roseobacter sp. MED193] | ||
Line 239: | Line 280: | ||
AGAGCAHSKVTPQGVFLERLGITDRARNLAAGLEGEALESLIAAHRRLTHPSEMGNLFKVLGLTPADIAP | AGAGCAHSKVTPQGVFLERLGITDRARNLAAGLEGEALESLIAAHRRLTHPSEMGNLFKVLGLTPADIAP | ||
PPGLNA | PPGLNA | ||
>gi|39937419|ref|NP_949695.1| DUF185 [Rhodopseudomonas palustris CGA009] | |||
MIDQTALATEIKRLIKAAGPMPVWRYMELCLGHPEHGYYVTRDPLGREGDFTTSPEISQMFGELLGLWSA | |||
SVWKAADEPQTLRLIEIGPGRGTMMADALRALRVLPILYQSLSVHLVEINPVLRQKQQTLLAGIRNIHWH | |||
DSFEDVPEGPAVILANEYFDVLPIHQAIKRETGWHERVIEIGASGELVFGVAADPIPGFEALLPPLARLS | |||
PPGAVFEWRPDTEILKIASRVRDQGGAALIIDYGHLRSDVGDTFQAIASHSYADPLQHPGRADLTAHVDF | |||
DALGRAAESIGARAHGPVTQGAFLKRLGIETRALSLMAKATPQVSEDIAGALQRLTGEGRGAMGSMFKVI | |||
GVSDPKIETLVALSDDTDREAERRQGTHG | |||
>gi|118591907|ref|ZP_01549302.1| hypothetical protein SIAM614_20945 [Stappia aggregata IAM 12614] | |||
MTGLKDRIKARIATEGPLSVAQYMSVCLGDPDAGYYMTREPFGSEGDFITAPEVSQMFGELIGAACLSAW | |||
QALGEPAEFQLVELGPGRGTLMADLLRMASLRPAFIKAARLNMVETSPRLREIQTATLSRGPLTPHFRNR | |||
FQDVPGGPLILVANEFFDALPIHQFVKTARGWQERQIGLSQDGELMFGVGTARLPDDAIPADLSSAPEGA | |||
IFETQPAANAIAEEIGHRIAGNGGAAILIDYGYLNTAAGDTLQALYKHAYDDVLAHPGEADLTAHVNFEA | |||
LAAATVRAGAQALAPLTQGEFLLRSGLLERAGALGAGKSHSEQEAIRDAVERLAAPGQMGDLFKVLAVTN | |||
SGISFPPFDSAS | |||
>gi|15889490|ref|NP_355171.1| hypothetical protein AGR_C_4024 [Agrobacterium tumefaciens str. C58] | |||
MTTPLAQRIKSLIRLNGPLSVTDFFSLCLADPEHGYYKSREPFGRSGDFITAPEVSQLFGEMLGVFVVHA | |||
WQRHGAPAQTQLVEIGPGRGTMMSDMLRVIRRIAPPLYETMRVHLVETSPRLSAIQKETLTAHADRLTWH | |||
DSFDDVPEGFLLLVANELFDAIPIRQFVRTPQGFRERVVSLDANGELVFSTGLAGIDPTLLPPQPERQQL | |||
GTVFEVSPAREAVMTAICQRLSVHGGTALAIDYGHLVAGYGDTLQAMRNHAFDPPLAHPGEADLTSHVDF | |||
ESLVKTAQATGVHVNGALRQGDFLHGLGLKERASALAAKATPDQTLEIAEAVNRLAGEGAGKMGELFKVI | |||
AVSSPALHLLPFRAVD | |||
>gi|84683545|ref|ZP_01011448.1| hypothetical protein RB2654_19268 [Rhodobacterales bacterium HTCC2654] | >gi|84683545|ref|ZP_01011448.1| hypothetical protein RB2654_19268 [Rhodobacterales bacterium HTCC2654] | ||
Line 247: | Line 312: | ||
LTHSRLTPQGVFLERLGITARAQALAARLEGEPLTRHVAAHRRLTHPDEMGTLFKTLAFVPEGAAMLPGL | LTHSRLTPQGVFLERLGITARAQALAARLEGEPLTRHVAAHRRLTHPDEMGTLFKTLAFVPEGAAMLPGL | ||
ET | ET | ||
>gi|83854800|ref|ZP_00948330.1| hypothetical protein NAS141_08731 [Sulfitobacter sp. NAS-14.1] | >gi|83854800|ref|ZP_00948330.1| hypothetical protein NAS141_08731 [Sulfitobacter sp. NAS-14.1] | ||
Line 331: | Line 320: | ||
APASYTRITPQGVFLERLGITQRAQTLAKGMTEDALNAHVAAHRRLTHPSEMGNLFKVMGIYPPHHSPPP | APASYTRITPQGVFLERLGITQRAQTLAKGMTEDALNAHVAAHRRLTHPSEMGNLFKVMGIYPPHHSPPP | ||
GLEP | GLEP | ||
>gi|118735910|ref|ZP_01584383.1| protein of unknown function DUF185 [Dinoroseobacter shibae DFL 12] | >gi|118735910|ref|ZP_01584383.1| protein of unknown function DUF185 [Dinoroseobacter shibae DFL 12] | ||
Line 380: | Line 329: | ||
VSPPSAPPVPGSIAPPNPGKPDAP | VSPPSAPPVPGSIAPPNPGKPDAP | ||
>gi| | >gi|21434891|gb|AAM53573.1| Aby [Azospirillum brasilense] | ||
MSDAAPDSLAHHLARRILMDGPLSVAAFMAEALGHPRFGYYMRQDPFGVSGDFTTAPEISQMFGELAGLW | |||
CVDTWARLGGPAPVHLVELGPGRGTLMQDALRAAALVPAFREATRVHLVETSPTLRARQKETLAGIPVAW | |||
HDRLEDVPEGPTLILANEFFDALPIRQVQKTNHGWFERLIDIDNTESMDTPRFRFVLEAFGSAGARLIPP | |||
ALRDAPEGSVVEVSPASQPVARLIGERLAAHPGAALVIDYGYRGGPAVGDTLQALRRHAYAPVLDAPGEA | |||
DLTAHVDFAAIAAAAREGGAESFGPVDQGDWLVRLGIQPRATALKRSATTKQAADIDSALARLIHRCRPA | |||
>gi| | >gi|99080460|ref|YP_612614.1| protein of unknown function DUF185 [Silicibacter sp. TM1040] | ||
MSLMQSLRRRIELDGPMTVADYMSECLLHPDYGYYTTAPAIGAEGDFITAPEISQMFGELLGLVLVQSWL | |||
DQGRPQPFTLAELGPGRGTLMADMLRATRAVPGFHEAMELLLIEASPRLRDLQRQALAPYAPRWVPSVED | |||
LPQHPLFLVANEFFDALPIRQFQREGNQWRERRVGLAEDASGLTLGLGAPAPQPALAHRLEDTKDGDLVE | |||
HCEVAAVVTEAIAQRIGDHGGVALLVDYGDWRSLGDTLQALRAHAPTDPLAEPGQADLTAHVDFEAICTA | |||
ASATGCAHTRLTPQGVFLERLGITDRANALASGAAGEPLAQIIAAHRRLTHPEEMGNLFKVLGLYPAKFA | |||
PPAGLEK | |||
>gi| | >gi|126733062|ref|ZP_01748818.1| hypothetical protein SSE37_14379 [Sagittula stellata E-37] | ||
MSALKDIITRQISRTGPLTLADYMALCLSHPEHGYYATRDPLGAEGDFTTAPEISQMFGELIGLALAQSW | |||
MDQGAPTRFVLSELGPGRGTLMADALRATTRVPGFHDALELHLVETSPALRAEQAARLPDATWHESVASL | |||
PEAPLFLIANEFFDALPIRQFLRHAQGWQERVVGLKDGQPTLGLTDPAPHDALDHRLADTEPGQIVENCA | |||
PAQAIVQETGRRIASHGGTALIVDYGDWRSRGDTFQALYRHKPAEPFARPGEADLTAHVDFEALAKAAHP | |||
AAHSALTPQGVFLEHLGITARAQALARRLGGAALESHVAAHRRLTHPGEMGSLFKVLALFPHDAPPPPGT | |||
GLPADPS | |||
>gi|84515351|ref|ZP_01002713.1| hypothetical protein SKA53_01796 [Loktanella vestfoldensis SKA53] | >gi|84515351|ref|ZP_01002713.1| hypothetical protein SKA53_01796 [Loktanella vestfoldensis SKA53] | ||
Line 428: | Line 360: | ||
DP | DP | ||
>gi| | >gi|83941322|ref|ZP_00953784.1| hypothetical protein EE36_03798 [Sulfitobacter sp. EE-36] | ||
MTTLRDILHSRIASNGPMRIDEYMATCLLHPTQGYYTTRDPFGTQGDFTTAPEISQMFGELLGLCLAQSW | |||
LAQDAPSAFTLAELGPGRGTLMADILRATRNVPGFIEAARITLVEASPTLRDVQAKTLAGHQVIWADGTD | |||
ALPDQPLFLVANEFFDALPIRQFVRGETSWRERQIGLADGALSFGLGPELPQPALADRLADTKPGDLVED | |||
CTQLAPILHPVSERIATHGGAALIVDYGDWHSLGDTLQALQGHEKADPLAAPGQADLTAHVDFEKLALAA | |||
APASHTRITPQGVFLERLGITQRAQTLAKGMTEDALNAHVAAHRRLTHPSEMGNLFKVMGIYPPHHSPPP | |||
GLEP | |||
>gi| | >gi|114770207|ref|ZP_01447745.1| hypothetical protein OM2255_11240 [alpha proteobacterium HTCC2255] | ||
MTALSNIIKKQIKRFGPMPVSEYMTLCLLHPEHGYYTNRDALGATGDFTTAPEISQMFGELIGLSIAQSW | |||
IDQEMPTPFILAELGPGNGTLMADILRATKSVPNFHASMDLHLIEASPEMRKRQETALNGFNVTWLNYFS | |||
ELPQKPLFLIANEFFDCLPIKQYRRTDEGWQEQMIAVENEQLHFILGTATSEEVFSKTNDVPSADMLEVS | |||
PPTVAFASAIGEHIQGNGGCAIIVDYGEWDSDGDSLQALKDHRKIDPLTHCGTADLTAHVSFKDLTNAAS | |||
KYAKVSSTIPQGILLERLGITQRAQTLAKNMSGKKLENHISAHKRLTHPDEMGSLFKAIAIIPENTDLPA | |||
GFNE | |||
>gi| | >gi|67464609|pdb|1ZKD|A Chain A, X-Ray Structure Of The Putative Protein Q6n1p6 From Rhodopseudomonas Palustris At The Resolution 2.1 A , Northeast Structural Genomics Consortium Target Rpr58 | ||
MIDQTALATEIKRLIKAAGPXPVWRYXELCLGHPEHGYYVTRDPLGREGDFTTSPEISQXFGELLGLWSA | |||
SVWKAADEPQTLRLIEIGPGRGTXXADALRALRVLPILYQSLSVHLVEINPVLRQKQQTLLAGIRNIHWH | |||
DSFEDVPEGPAVILANEYFDVLPIHQAIKRETGWHERVIEIGASGELVFGVAADPIPGFEALLPPLARLS | |||
PPGAVFEWRPDTEILKIASRVRDQGGAALIIDYGHLRSDVGDTFQAIASHSYADPLQHPGRADLTAHVDF | |||
DALGRAAESIGARAHGPVTQGAFLKRLGIETRALSLXAKATPQVSEDIAGALQRLTGEGRGAXGSXFKVI | |||
GVSDPKIETLVALSDDTDREAERRQGTHGLEHHHHHH | |||
>gi| | >gi|83859400|ref|ZP_00952921.1| hypothetical protein OA2633_13385 [Oceanicaulis alexandrii HTCC2633] | ||
MSDDEFRPAQMISERLAERIRTEGSLSVAAFMAEALFHPMAGFYATKDPLGAANDFITAPEISQMFGELL | |||
GLWAAECWMQMGAPSRFELIELGPGTGRMMSDMLRAGRAAPGFLDAVHVTLIEASPALKMVQGQTLASAS | |||
VPINWAKDFDKAPSGPAVVIGNEFLDCLPIRQAIRHKGQWRERVVTLHPEDEARFVYGLGPVLGEADVAF | |||
IAPGLREADDGTLVELRPGDQQQIDQLAARFDRDPGYALFVDYGSAKPETGDTLQAIRAHQKVDPLDAPG | |||
TADLTAWVDFDRLLRLGEDAGLSAFGPMTQGDFLTELGIEQRAAVLSRSVDEAGQAKLKRQMHRLVSPED | |||
MGTLFKLAAFSSEGLPPAPGIAPFKRSR | |||
Revision as of 06:43, 29 May 2007
Now 47 Human Blast sequence were used to run Clustal X and they are as follows, Highlighted organisms come under same Lineage as our protein.
>gi|9294283|dbj|BAB02185.1| unnamed protein product [Arabidopsis thaliana] MEEVLTNPKAGFYMNRDVFGAQGDFITSPEVSQMFGEMIGVWTVCLWEQMGRPERVNLVELGPGRGTLMA DLLRGTSKFKNFTESLHIHLVECSPALQKLQHQNLKCTDESSSEKKAVSSLAGTPVHWHATLQEVPSGVP TLIIAHEFYDALPVHQFQTQYLQKSTRGWCEKMVDVGEDSKFRFVLSPQPTPAALYLMKRCTWATPEERE KMEHVEISPKSMDLTQEMAKRIGSDGGGALIIDYGMNAIISDSLQAIRKHKFVNILDDPGSADLSAYVDF PSIKHSAEEASENVSVHGPMTQSQFLGSLGINFRVDALLQNCNDEQAESLRAGYWQLVGDGEAPFWEGPN EQTPIGMGTRYLAMSIVNKNQGIPAPFQ
>gi|50550583|ref|XP_502764.1| hypothetical protein [Yarrowia lipolytica] MLRTIRPARTTLVRAVRPVRPVSGRVGRLGRHVTTGTTSSTTSASSPDLSTTLAMAIEQQGPMSVATFMK HCLTNPSGGYYIDKDPLGAKGDFTTSPEISQMFGELVGLWLAAQWLYYGQKQPFRVIEYGPGRGTLMDDS LRALVSAKSTGAKEALKEVLLVEASPVLRDAQRKKLCGAESQFKTEEDGSITCVTKYGVPIRWYEDSKML DKLASSNDPLHNYIVAHEFFDALPIYQFEKTDKGWRELMVNYGVENKTKESSILLPGQTHIKSSDLDKDK KKTFHLVTAPTWTVASKVIPQSHKRYRDLPEWSKIEVCPDAWDVANQMGRLVAKGGAAFIVDYAVKPGVP VNTLRGIRDHKICSPFEEPGKVDLSADVDFTAIGIASRSKNKENVSAFGPINQATWLKNMGIEMRTEKLM EGKEEYIKKRIESQYKRLVDIGINGMGKIYKAFFLTHSSHGYPVGFPIPEPKDLKEPHQKPEKDPKDTEP KVVEV >gi|27382556|ref|NP_774085.1| hypothetical protein blr7445 [Bradyrhizobium japonicum USDA 110] MTEQPLLNEIKALIKSSGPMPVWRYMELCLMHPRYGYYVSRDPLGREGDFTTAPEVSQMFGELLGLWTAS VWKQMGSPQSLRLIELGPGRGTMMADALRALRVLPPLYQALQIHLVEVNPVLRERQSATLSGARNVAWHD SIDDVPEGPSIILANEYFDVLPIHQMVKRENGWHERVIEIDPNGKLQFGAASEPTPRFDVLLPPLVRAAP VGAVFEWRPDGEVMKLATRVRDQDGAALIIDYGHLRSDAGDTFQAIARHTFTDPLKAPGQADVTAHVDFQ ALARAAEDVGARVHGPVTQGDFLKRVGIDTRAAALMQKATPEVATDISVALKRLTDTGRSGMGSMFKVLG ISEPRLTGLAGLSDLEHAGGN
>gi|119177909|ref|XP_001240685.1| hypothetical protein CIMG_07848 [Coccidioides immitis RS] MSNGATQIVNRLARASRCSRLATPSAAKRYLSSAPQRRWSTPLAKTIADVINTAGPISIAAYMRQCLTSP EGGYYTSRGSTGVEVFGRKGDFVTSPEISQMFGELLGVWMVTEWMAQGRRSRGVQLIEVGPGRGTLMADM LRSVRNFKSFSSSIEAVYLVEASPTLRDIQKQMLCGDAPMEEIEVGYRSTSKHLGVPVVWTEHIRSLPQG DNDVPFIIAHEFFDALPIHAFQCVASPPSETIITPTGPTTLRQPLSSSPTQWRELVVSVNPASQMHAENR LEFRLSLAKTSTPASMVMPEMSERYKALKSTRGSTIEISPESQGYVQEFARRIGGHSNSKIPTTRKPAGA ALILDYGPSHSIPVNSLRGIKDHKLVSPFTSPGQVDLSADVDFIALADSAISASPGVEVHGPTEQGSFLH SLGISERAAQLMKRAEDETKRKNIEAGWKRLVERGGGGMGRIYKAMAIIPEAGGMRRPVGFGGQVPA
>gi|91978339|ref|YP_570998.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisB5] MTDNSPLLAEIKRLIKSTGPMPVWRYMELCLNHPLYGYYVSRDPLGREGDFTTSPEISQMFGELIGLWAA SVWKATGEPDVLRLIEIGPGRGTMIADALRALRVLPPLYQSLSVHLVEINPVLREKQKATLAGIRNIHWH DTFADVPDGPAVILANEYFDVLPIHQAVKRDGGWHERVIEISASGELVFGVAPDPIPRFDILLPHLVRMA PAGAVFEWRSDAEIMAIATRLRDQGGAALIIDYGHIRSDVGDTFQAIARHSFADPLQNPGRADLTAHVDF QALGRAAEDVGARLHGPVTQGEFLKRLGIETRALSLMAKASPQVSEDISGALRRLTGEGRGAMGSMFKVI GVSDPNITSLVALSDDAERAAEGQPA
>gi|46200845|ref|ZP_00207869.1| COG1565: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] MSLSALLSERIKATGPIPVSEFMAEALGHPEYGYYRGRDPFGMAGDFTTAPEISQMFGELIGLWCALVWQ SMGSPERVVLAEIGPGRGTLMADLLRAAKALAPFARALDVHLIETSPSLRNRQAQALADQSVTWHERFED LPDGPLLLVANELFDALPIRQLEKVGGVWHERVVGLDDQGALVLALGPVVADPPLAPAVLNAPDGSLAEV CPQGRVLAEAVARRLAHQGGAALIIDYGYETSAAGDSLQAVKSHRHHPVLSAPGTADITAHVDFQALAEA ASGLARVYGPVPQGRFLARLGLEERVRMLMQHASVEQAAHLASGARRLIDPAEMGTLFKVLALANPLLPA PPGLELA
>gi|125526627|gb|EAY74741.1| hypothetical protein OsI_002588 [Oryza sativa (indica cultivar-group)] MEEVLTNPQSGFYINRDVFGTSGDFITSPEVSQMFGEMTGVWAMCLWEQMGQPEKVNLIELGPGRGTLLA DLLRGSSKFVNFTKALNINLVECSPTLQKVQYNTLKCEDEPIGDKTRTVSKLCGAPVHWHASLEQVPSGL PTIIIAHEFYDALPIHQFQPTASLLFLSKRCGWASSEELEKVEHIEVCPKAMEITEQIADRISSDGGGAL IIDYGKDGIVSDSLQAIRKHKFVHILDNPGSADLSAYVDFASIRHSAKEASDDISVHGPMTQSQFLGSLG INFRVEALLQNCATDEQAESLRTGYWRLVGDGEAPFWEGPDDQTPIGMGTRYLAMAIVNKKQGTPVPFE
>gi|119480871|ref|XP_001260464.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181] MMNSATKRALTRHFRTYQCRNLQIGSHRCSSTFDQRQWSTPLAKTLANAIKVTGPIPIAAFMRQVLTSPE GGYYTTRPEGGGEVFGKKGDFVTSPEISQVFGELVGIWTITEWMAQGLKRSGVQLIEVGPGKGTLMDDML RTFRNFKSFASSLEAIYLVEASPTLREVQKQRLCGDAAMEETDIGHKSISKYFNVPVIWVEDIRLLPHEE DKTPFIFAHEFFDALPIHAFESIPPAPENQSEQKEIMTPTGPAKLHQPMKPANTPQWREIMVTLNPKAVE ENIEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSTPGSTIEVSPESRIYASDFARRIGGSSQPPRTVG SRNAPAAQPKKVPSGAALIMDYGTMSTIPINSLRGIQHHRTVPALSSPGQVDVSADVDFMALAEAAIEAS EGVEVHGPVEQGDFLQVMGIAERMQQLLKGVQDEEKRKTLESGWKRLIERGGGGMGKIYKFMAIIPENGG RRRPVGFGGSVQM
>gi|92118562|ref|YP_578291.1| protein of unknown function DUF185 [Nitrobacter hamburgensis X14] MTEASPLLPDIKKLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFITSPEVSQMFGELIGLWAA SVWRAMGSPTTLRLIELGPGRGTMMADALRALRVLPPMHQALSVHLVEINPVLREKQKAALSDARTIQWH ASLDEVPQGPAIILANEYFDVLPVHQMVKRDDGWYERVVDIDGSGQLVFGTTAAPTPRFDALLPPLVRAA PVGAIFEWRPDAEMMTIATRVRDHGGAALIIDYGHVRSDAGDTFQAIAGHSFADPLKYPGQADVTAHVDF QALARAAEDIGARVHGPVTQGEFLQRLGIEARAVNLMAKATPEISEGISTALKRLTEGGRGGMGSMFKAI GVSDSSLTELAGLSDRKRRGGIRAP
>gi|146342824|ref|YP_001207872.1| hypothetical protein BRADO6003 [Bradyrhizobium sp. ORS278] MIETSPLQPEIKRLIKASGPMPVWRYMELCLMHPEHGYYISRDPLGREGDFTTAPEVSQMFGELLGLWAA SIWKAAGSPQQFRLIELGPGRGTMMSDALRALRVLPPLYQTISVHLVEINPVLREKQKATLTGLRNVTWH DSFDEVPEGPSVIFANEYFDVLPVHQMVRRETGWHERVVELDDDENFVYGTAADPTPGFELLLSPLVRAA PAGAIFEWRPDTQMMAIARRLREQRGAAVIIDYGHVRSDVGDTFQAIARHSFADPLKTPGLADITAHVDF DALSRTAEAVGARVHGPITQGEFLQRLGIETRALTLMQKASPEVSEDIASGLKRLTSGGRGGMGSLFKVL GVSDPSIPVLAGISDEHTSEKTGGA
>gi|146322884|ref|XP_755307.2| DUF185 domain protein [Aspergillus fumigatus Af293] MNSATKSAWTRHFRTYQYRNLRIGSHRCSSTFEKRQWSTPLAKTLANAIKVTGPIPIAAFMRQVLTSPEG GYYTTRPEGGGEVFGKKGDFVTSPEISQVFGELVGIWTITEWMAQGSKRSGVQLIEVGPGKGTLMDDMLR TFRNFKSFASSLEAIYLVEASPTLREVQKQRLCGDAAMEETDIGHKSISKYFNVPVLWVEDIRLLPHEED KTPFIFAHEFFDALPIHAFESIPPAPENSPEQKEIITPTGPAKLHQPMKPANTPQWREIMVTLNPKAVED NIEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSTPGSTIEVSPESRIYASDFARRIGGSSQPPRTVGS RNSPAAQPKKIPSGAALIMDYGTMSTIPINSLRGIQHHRTVPALSSPGQVDVSADVDFMALAEAAIEASE GVEVHGPVEQGDFLQVMGIAERMQQLLRGVQDEEKRKTLESGWKRLIERGGGGMGKIYKFMAIIPENGGR RRPVGFGGTVQM >gi|78693301|ref|ZP_00857815.1| conserved hypothetical protein [Bradyrhizobium sp. BTAi1] MIELSPLHSEIKRVIKASGPMPVWRYMELCLMHPEHGYYISRDPLGREGDFTTAPEVSQMFGELLGLWAA SVWKASGSPQQFRLIELGPGRGTMMSDALRALRVLPPLYQTISVHMVEINPVLREKQKATLTGLRNITWH ESFDDVPEGPSVIFANEYFDVLPIHQMLKRETGWHERVVELDAEENFAYGTAAEPTPGFELLLPPLVRAA PLGAIFEWRPNNEIMAIAKRIREQRGAAVIIDYGHVRSDVGDTFQAIARHSFADPLKTPGLADITAHVDF EALARAADAVGARVHGPITQSEFLRRLGIETRALTLMQKASPDISRDIASGLKRLIEGGRGGMGSLFKVL GLSDASIPVLAGISDEHTGGKPGGA
>gi|84499690|ref|ZP_00997978.1| hypothetical protein OB2597_07165 [Oceanicola batsensis HTCC2597] MAVTPLLDRIRHRIGAQGPMTLAEYMQIALLDPDHGYYATRDPFGTAGDFITAPETSQMFGELVGLALAQ SWIDQGRPAPFILAEPGPGRGTLMADILRATRSVPGFHDGLSLVLIEASPVLRDIQARTLSGYRAEWIDD LGALPEAPLFLVANEFFDALPIRQFRRRGDGWAEVMVTVSGSGLATALAAPVPLPELAHRLGDTREDDVV ELCPAAARAAAHIGARIADQGGAAVIVDYGDWRSLGDTFQALKGHAPVDPLAAPGTADLTAHVDFERLAK AATPAWASGMIPQGVFLERLGITARAQALATRLQGPDLDAHVAAHRRLTHPEEMGTLFKVLALSPPDAPP VPGTTDPEWPTE
>gi|67539742|ref|XP_663645.1| hypothetical protein AN6041.2 [Aspergillus nidulans FGSC A4] MNCSTQRIVNQFSRQTARRRFNIRSRRWNSTFETREWSTPLARTLANVIKTTGPVPIAAFMRQVLTSPEG GYYTTKPGGGGEVFGKKGDFVTSPEISQVFGELVGIWTIAEWMAQGGKKSGVQLMEIGPGKGTLMDDMLR TFRNFKPFTSSLEAIYLVEASPTLREVQKQLLCGNAVMEETDIGHRCTSKYFNVPVIWVEDIRLLPHEED KTPFIFAHEFFDALPIHAFESVPPSPENEQQEQEIMTPTGRTKLQRPPKAANTPQWRELMVTLNPKAVDE NIKDEPEFKLTLAKASTPSSLVIPEISERYRALKSQPGSTIEVSPESRIYASDIARRIGGSSQPPRTAAG RNASAPSAIAKRIPSGAALIMDYGTMSTVPINSLRGIQNHKIVPALSSPGRVDVSADVDFTSLAEAALEA SEGVEVHGPVEQGHFLQAMGIAERMQQLLSTVKDEKKRKILETGWQRLVERGGGGMGKLYKVMTIIPENG GRRRPVGFGGGVPL
>gi|85716018|ref|ZP_01046995.1| hypothetical protein NB311A_14415 [Nitrobacter sp. Nb-311A] MTEPAPLLADIKRLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFITSPEVSQMFGELLGLWGA SVWRTIGSPLTLRLIELGPGRGTMMADALRALRVLPPMYESLSVHMVEINPVLREKQMAALSDAPNIQWH ASLDEVPQGPAIIFANEYFDVLPVHQMVKGDDGWHERVVDIDGGQLVFGVSATPTPRFDVLLPPLVRAAP VGAIFEWRPDAEIMSIATRVRDQGGAALIIDYGHERSDAGDTFQAIARHSFADPLKYPGRVDVTAHVDFE ALARAAEDVGARVHGPVTQGEFLRRLGIEARAVNLMAKATAEVSDGIASALKRLTEGGRGGMGSMFKVIG VSSPGLTELAGLSDRERRGGIKAP
>gi|145252682|ref|XP_001397854.1| hypothetical protein An16g05460 [Aspergillus niger] MNQATRRAVRQLLRKHPNQTLFLKSQRWSSTTPASSTSTSETRKWSTPLAQTLANAIKVTGPVPIAAFMR QVLTNPEGGYYTTRPEGHGAVFGKKGDFVTSPEISQVFGELVGIWTIAEWMAQGRKRSGVQLMEVGPGKG TLMDDMLRTFRNFKMFSSSMEAIYLVEASATLREVQKKLLCGDAVMEATDIGHKSTCKYFDVPIVWVEDI RLLPHEEEKTPFIFAHEFFDALPIHAFESIPPSPENQPEQKEIMTPTGPAKLHQPLKPANTPQWREIMVT LNPKAVEENIEGEPEFKLTLAKASTPSSLVIPEISPRYRALKSQPGSTIEVSPESRIYAADFARRIGGAS EPPRTATKGAAASAPAPAKRVSSGAALIMDYGTLNTIPINSLRGIQEHKNVPPLSSPGQVDVSADVDFTA LAEAAIEASEGVEVHGPVEQGDFLQAMGIEERMQQLLKKVEDEEKRKTLETGWKRLVEKGGGSMGKIYKV MAIVPENDGKRRPIGFGGGLVM
>gi|116058192|emb|CAL53381.1| ATP synthase beta subunit/transcription termination factor rho-like (ISS) [Ostreococcus tauri] MGGGEHGKGERTGMIGHLKRAMAFAGGSIPVSEYVRECLTNPEHGYYMRGDVFGRDGDFVTSPEISQVFG EVLGVWAALQHEALGSPGTLRVVEFGPGRGTLMADLLRGTSKFEKFRSAVSVHLIEVSPALREVQARTLR CVDVETTSAAADDGGARVRVPKNALEAEEGEVDKRSAADGPSGEAHTRGTSEISGAKVFWHDGLESVPNG PTLVICHEFFDALPVRQFQRTDRGWCEKLVTIDAELASTAETVEETTPRRELAMVLSPGPTPASHMLVPR RLKGLPKEQVDSLRLLELSPPSMTLWDKLADRIEKNSGAVLAIDYGEEGPLGNTLEAIKDHKFVHVLDSP GEADLSAYVDFGALRQIVEEKPQRGVTCYGPVTQQQLLLSLGLVARLEQLVENAASEDQANALVKGCERL VGDGAGNAETGEPPGMGVRYKAMCMVSRGLPKPVGFS
>gi|83309693|ref|YP_419957.1| hypothetical protein amb0594 [Magnetospirillum magneticum AMB-1] MTLADILADRIRATGPIPVSEFMAEALGHPEYGYYMGRDPFGMAGDFITAPEISQMFGELIGLWCALVWQ SMGAPKRVVLAEIGPGRGTLMADLLRAAQALPPFALALNVHLIETSPSLRNRQAQALTDRSVEWHERFED LPDGPLLLVANELFDALPIRQLEKAGGVWRERVVALDEAGAFAFAQGPVVAEPPLAPAVLGAADGAVAEL CPQGRALAGTIARRLAHQGGAALIIDYGYGKSAAGDSLQALKSHKRHPVLSGPGTADITAHVDFQALAEA ASGLARAHGPVPQGSFLARLGLEERVRMLMQNATPEQAAHLASGARRLIDPGEMGTLFKVLALAAPLLPV PPGLEHD
>gi|121525100|ref|ZP_01658036.1| conserved hypothetical protein [Parvibaculum lavamentivorans DS-1] MTSPLARQIARLIEQTGPIPLSQYMALALGHPEHGYYMTRDPLGARGDFVTAPEISQMFGELVGLWLADQ WLEQGSPKPFVLAELGPGRGTLMADALRAIAAVPHMVEAASIHLVETSPVLRNAQSKRIPQAHWHEHVDD LPDLPLFLVANEFFDALPVTQYQRTERGWCERFVSMAEGRFVPVLAPVPLADDSGLPAAMKAAQEGSIAE VSPASTSITETIAHRIARRGGAALVIDYGHVSSAPGDTLQALRDHKFADPFEAPGEADLTAHVDFEALSH AASAAGAAAHGAVEQGRFLMALGIEARAEALSRNATPAQREDIASAMQRLTARDGMGSLFKVLGITPRGA PSPAGF
>gi|75676687|ref|YP_319108.1| Protein of unknown function DUF185 [Nitrobacter winogradskyi Nb-255] MRRRARIRALQPRSIVTEPAPLLTYIKKLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFVTSP EVSQMFGELLGLWAASVWRMMGSPDPLRLIELGPGRGTLMADALRALRVLPPMYESLSVHMVEINPVLVE KQMAALSDAPNIEWHTSLDQVPQGPAIILANEYFDVLPVHQMVRRDGGWHERVVDIDGSGQLVFGVSAEP TPRFDVLLPPLVRAAPVGAIFEWRPDAEMMSIATRVRDNGGAALIIDYGHVRSDAGDTFQAISRHSFADP LKYPGRVDVTAHVDFEALARAAEDVGARVHGPVPQGEFLRRLGIEARAVNLMAKATPELSDDIATALKRL TEGGRGGMGSMFKVIGVSDPSLSVLVGLSDQAHGGGIRTP
>gi|89359500|ref|ZP_01197321.1| conserved hypothetical protein [Xanthobacter autotrophicus Py2] MAVGAAGRRRRRPSALRPEAQAGRVTTPLSKEISALIAAEGPMPLSRYMALCLGHPRHGYYMTRDPLGAR GDFTTAPEISQMFGELLGLWAVAQWQAMGSPPAFRLVELGPGRGTLMADALRAARLVPDFGAAARIHLVE TSPVLRAAQARTLAAHADRVSWHDRVEEVPDGPALVLANEFFDALPIDQYVFHAGHWHERRVGLDDGGRL VLGLDPAPSRAAPAFAAHLPPPAEGVVLEHLESGPARALSERLKTQGGAALIIDYGHAGGYGDTFQALEQ HRFADPLAAPGNADLTAHVDFSALARIGRAAGLRAFGPLEQGAFLARLGLAQRAERLKRDATDELRAGVD AAARRLAGDGAGEMGRLFKVLVLAHPEIGLPPAFDSTEEWVR
>gi|115523453|ref|YP_780364.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisA53] MTDQPLHDTIKKLIRSAGPMPVWRYMELCLTHPEHGYYVSRDPLGREGDFITSPEVSQMFGELLGLWAAS VWKAIGSPQQVRLIELGPGRGTLMADAMRALRVLPPLYQAISVHLVEINPVLRDKQRDTLANLSNVAWHA DLDEVPGGTSIIFANEYFDVLPVHQAVRGEHGWHERVIEIDAEGDLTFGAAAEPIPQFEVLLPPLVRAAP PGAVFEWRADSEIMKIASRVRDEGGAALIIDYGHLRSDAGDTFQAIAKHSFADPLANPGQADVTAHVDFQ ALAQAAEAVGARVHGPVTQGEFLRRLGIETRALALMAKASHEISEDVANALKRLTGGGRGGMGSMFKVIG VSEADLTEVTGLSDAVRAEVGA
>gi|121715340|ref|XP_001275279.1| DUF185 domain protein [Aspergillus clavatus NRRL 1] MNNARRAWTRHVRACQPRNLRLGSHRCSSTFDQRQWSTPLAKTLANAIKITGPIPISAFMRQVLTSPEGG YYTTRPEGGGEVFGKKGDFVTSPEISQVFGELVAVWTITEWMAQGRKRSGVQLIEVGPGKGTLMDDMLRT FQNFKSFSSSIEAIYLVEASPTLREVQKQRLCGDAPMEETDIGHRSTSKYFNVPVIWVEDIRLLPHEEGT TPFIFAHEFFDALPIHAFESVPPAPESQTEQSEIMTPTGPAKLHQPMKPANTPQWREIMVTLNPEAVEEN KEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSQPGSTIEISPESRVYASDFARRIGGSSQPPRTVNRQ DAGPVQPKRVPSGAALIMDYGTMSTIPVNSLRGIQNHRNVPALSSPGQVDVSADVDFIALAEAAIDASEG VEVHGPVEQGDFLQVMGIAERMQQLLKGIKDEEKRKTLESGWKRLVERGGGGMGKIYKFMAIIPENGGQR RPVGFGGTVQM
>gi|86751272|ref|YP_487768.1| Protein of unknown function DUF185 [Rhodopseudomonas palustris HaA2] MNDDSPLLAEIKRLIETAGPMPVWRYMELCLAHPEYGYYVSRDPLGREGDFTTSPEISQMFGELIGLWTA SVWKAVGEPGVLRLIEIGPGRGTMIADALRALRVLPPLYQSLSVHLVEINPVLRAKQQATLAGIRNVHWH EDFAEVPEGPAVVLANEYFDVLPIHQAVKRDGGWHERVIEISASGDLVFGVADDPIPRFEVLLPPLVQMA PAGTVFEWRPDNEIMAIAARLRDQGGAALIIDYGHVRSDVGDTFQAIARHSFADPLQHPGGADLTAHVDF QALGRAAETIGARIHGPVTQGEFLKRLGIETRALSLMAKASAQVSEDIAGALKRLTGEGRGGMGAMFKVI GVSDPSITSLVALSDDAERAAEGQKA
>gi|56695809|ref|YP_166160.1| hypothetical protein SPO0907 [Silicibacter pomeroyi DSS-3] MSLTGLLLERIAQQGPLSLADYMAECLLHPEYGYYTTRDPLGVAGDFTTAPEISQMFGELIGLALAQAWM DQGRPAPFTLVELGPGRGTLMADALRATRAVPGFHEAARLWLVEASPVLRATQAQALAGHDPQWCDTVSD LPAGPLFGVANEFFDALPVRQFQRAGAVWRERLVGARDGALCWGLGAEALQPALAHRLEDTREGDLVELC PAAGLILSELASRIAADGGAALIVDYGDWRSLGDTVQALRNHAPADPLADPGQADLTAHVDFEVLAMTAR AAGCAHSRLSTQGVFLERLGIAQRAQALARHADEAALDRLITAHRRLTHPEEMGNLFKVLGLYPSDATPP PGLEP
>gi|90422923|ref|YP_531293.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisB18] MTEPFSLQDVIKKLIKSAGPMPVWRYMELCLTHPEFGYYVSRDPLGREGDFTTAPEVSQMFGELLGLWAA SVWRSIGSPQLVRLIEFGPGRGTMMADALRALRVVPPLFQALHVHLIEINPVLREKQKATLAGAQNLHWH ASLDEVPGGSTIIFANEYFDVLPIHQMVRGEHGWHERTVEIDAAERLVFGVAPEPVPHFEQLLPPLVRAA PQGAVFEWRPDAEIMKIASRVRDEGGAALIIDYGHPRSDAGDTFQAIARHSFADPLQNPGRADVTAHVDF QALARGAQDVGARVHGPVTQGEFLKRLGIENRAVALMAKASLEVSEDVASALKRLVEGGRGGMGSMFKVM AVSEPEIEQIAGLSDQPEAARTAAQ
>gi|111069170|gb|EAT90290.1| hypothetical protein SNOG_02078 [Phaeosphaeria nodorum SN15] MRATSLSLLRTATRCSQPQANDIRRRSAQCAIRYTSSLTSTSSGAERKWSTPLAKMLGEAITTTGPISVA AYMRQCLTAPEGGYYTRQTSAGQDQFGTTGDFVTSPEISQVFGELIGIWIYAEWLAQGKKEKVQIIEVGP GRGTLMDDVLRTISSFKAFANSIEAIYLVEASPHLQKQQGKLLAGTEELQKSDIGLTAPLKYIPGVNIQW CEDIRNIEETSSPFILAHEFFDALPIHVFQNIAQSSIPASSMIMTPTGPIKPKHGATVPKNTWHELVVSP TNPYSSAGTITTTSSTSTKQEEKPDFELTVSKTPTPHSLYLPKKSDRYKKLEDTNDAIIEISPESMAYIS DFAVRIGGDNPPSKAEPTSSSAKLQRTEPAPFSKPQPAGAALILDYGPANTIPANTFRGIRGHQTVSPFT SPGLVDLSADVDFLALAESALDASPGVEVHGPVEQSFFLSTMGIKERAERLLKGAKDEATRQRLETGWKR LIDRGPNGMGKTYKAMALLPYIKGSKVRRPVGFGGDIAA
>gi|110680664|ref|YP_683671.1| hypothetical membrane protein [Roseobacter denitrificans OCh 114] MSLKDQLIARIKAHGPMSVAEYMGDCLLHPTLGYYTTQHPFGGSGDFITAPETSQMFGELIGLCLVQAWV DQGRPSPFALVELGPGRGVLMADILRAAAQVPDFARAAEVILVEASPKLQEIQRDTLKAHAVTFVKDVAS LPQCPLFVVANEFFDALPIRQFVRSGPHWRERQVGCDAEQLIFGMGAQTPQPALNARLSDTKEHDLVEYA PAAAPIMSELGSRIDTHGGAGLIIDYGDWRSLGDTLQAVRQHEYTGVLDHPGESDLTAHVDFEALAQAVP CAFSRLTPQGVFLERLGITQRAQRLAQNLPKDLLEQHIKAHRRLTHPEEMGNLFKVLGLFPHGKAPPAGL EI
>gi|85704768|ref|ZP_01035869.1| hypothetical protein ROS217_06800 [Roseovarius sp. 217] MSGLEAQLRARIAEAGPISLADYMAACLMHPEFGYYATRDPFGAGGDFVTAPEISQMFGELLGLCLAQVW LDQGRPARFVLAELGPGRGTLMADVLRATQRVPGFRDAAEVHLVEGSAVLRAAQRRAIAGDVIWHERVES LPEGPLYLLANEFFDALPIRQFQRFGDGWRERVVGLSDDRLALGLSGPVAPPALVERLAETREGDVVEIC GPGEAVAAEIGARIAGHGGAALIVDYGDWRSLGDTFQAVKGHAPVDPLAAPGLADLTAHVDFEALARAAS PAVYTRLTPQGVFLERLGIGARSEVLARNLSGQALENHLAAYQRLTGAEEMGTLFKVLGLYPEGTTPPPG LDP
>gi|116502505|gb|EAU85400.1| hypothetical protein CC1G_07094 [Coprinopsis cinerea okayama7#130] MQLCLSHPTHGYYMNPNNAVFGTSGDFITSPEISQVFGELVGVWLVSQWADAGTPPAIRLVELGPGRGTL MDDILRIVKKFLPEKALTGVHLVETSEALRSVQKAKLGEKCDLHFHNGIHEIPRNPSVYTMLVAHEFFDA LPVHVVQKTEAGWNEVMIASNDSLSSSESSPSQPQTQKQGVLRRVLNPLPSPASTLLGNSSLRFRNLPIG STIEVSPTSFRIAHQIGRLLSARGLEEKPLDLVEASQQETGVGGCGLVIDYGADHAFGDSFRAFKEHKIV DVFHRPGECDITANVDFAYLKEAMTVEPHGPITQADFLERMALQTRVEALVRNASSEERKKVILDAANRL VDRSGMGTQYKVLGITSSPKSSSTGVWPFDVQNQALES
>gi|126739895|ref|ZP_01755586.1| hypothetical protein RSK20926_14444 [Roseobacter sp. SK209-2-6] MSSLEQQLVARIQENGPISLAEYMSECLLHPEFGYYSTRDPFGQSGDFVTAPEISQMFGELLGLCLAQCW LDQGAPSPFALVELGPGRGTLMRDLLRATAGVTGFHQAMQVFLVEASPKLQREQAKALEEYDVSWVAEPM ELPNLPVFLVANEFFDALPARQFVRDSDGWRERLIGLEEGKLGFGLGSATDQPALAYRLEDTRPGDLVEL CSPAATLLEPIADRISTFGGAALIIDYGDWRSLGDTLQALRAHKSTPPLENPGKADLTLHVDFEFLAQST KSTGCAHSRVTPQGVFLERLGITERAQALSRALQGEALDTLIAAHRRLTHPEEMGNLFKVLGLYPAQFSP PPGLLS
>gi|114767217|ref|ZP_01446082.1| hypothetical protein R2601_09240 [Roseovarius sp. HTCC2601] MSPLEEILHRRIAAEGPMTIAEYMATCLGHPRYGYYPTRDPLGAAGDFTTAPEISQMFGELLGLCLAQCW LEQGRPSSFVLAELGPGRGTLMADATRAMRGVPGMLEAARLHLVETSPRLRDEQHRRLAPLMPVWHDSVA NLPEAPLYLLANEFFDALPIRQFLRSGEGWCERVVGLSEGRLAFGLTEPAPHGELEHRLADTREGDLVET CAPATGIAEDIGRRIASQGGAALIVDYGSARSLGDTFQAVRRHDKVSPLDAPGTADLTAHVDFGALATAM PCATTTLTPQGVFLERLGITDRARALAARLSGAQLDSHVAAHRRLTHPEEMGSLFKTLGAFPEGAAPPPG LLDT
>gi|86136505|ref|ZP_01055084.1| hypothetical protein MED193_20319 [Roseobacter sp. MED193] MSPLTDQLLARISSDGPISLADFMAECLLHPEHGYYTTRSPFGTQGDFTTAPEISQMFGELLGLSLAQSW LNQGAPDTFTLAELGPGRGTLMADLLRATRGVPGFHTALQLYLVEASPNLQEQQAKALARYDATWVDTAD ALPQQPLFLVANEFFDALPIRQFVRDGDGWREKRIGLVDGGLGFGLGPAAPQPALEHRLRDTTDGDLVEL SPGAAPILSSLAQRIASHGGAALIVDYGDWRSLGDTLQALKSHTPVEPLETPGEADLTAHVDFEVLCSVA AGAGCAHSKVTPQGVFLERLGITDRARNLAAGLEGEALESLIAAHRRLTHPSEMGNLFKVLGLTPADIAP PPGLNA
>gi|39937419|ref|NP_949695.1| DUF185 [Rhodopseudomonas palustris CGA009] MIDQTALATEIKRLIKAAGPMPVWRYMELCLGHPEHGYYVTRDPLGREGDFTTSPEISQMFGELLGLWSA SVWKAADEPQTLRLIEIGPGRGTMMADALRALRVLPILYQSLSVHLVEINPVLRQKQQTLLAGIRNIHWH DSFEDVPEGPAVILANEYFDVLPIHQAIKRETGWHERVIEIGASGELVFGVAADPIPGFEALLPPLARLS PPGAVFEWRPDTEILKIASRVRDQGGAALIIDYGHLRSDVGDTFQAIASHSYADPLQHPGRADLTAHVDF DALGRAAESIGARAHGPVTQGAFLKRLGIETRALSLMAKATPQVSEDIAGALQRLTGEGRGAMGSMFKVI GVSDPKIETLVALSDDTDREAERRQGTHG
>gi|118591907|ref|ZP_01549302.1| hypothetical protein SIAM614_20945 [Stappia aggregata IAM 12614] MTGLKDRIKARIATEGPLSVAQYMSVCLGDPDAGYYMTREPFGSEGDFITAPEVSQMFGELIGAACLSAW QALGEPAEFQLVELGPGRGTLMADLLRMASLRPAFIKAARLNMVETSPRLREIQTATLSRGPLTPHFRNR FQDVPGGPLILVANEFFDALPIHQFVKTARGWQERQIGLSQDGELMFGVGTARLPDDAIPADLSSAPEGA IFETQPAANAIAEEIGHRIAGNGGAAILIDYGYLNTAAGDTLQALYKHAYDDVLAHPGEADLTAHVNFEA LAAATVRAGAQALAPLTQGEFLLRSGLLERAGALGAGKSHSEQEAIRDAVERLAAPGQMGDLFKVLAVTN SGISFPPFDSAS
>gi|15889490|ref|NP_355171.1| hypothetical protein AGR_C_4024 [Agrobacterium tumefaciens str. C58] MTTPLAQRIKSLIRLNGPLSVTDFFSLCLADPEHGYYKSREPFGRSGDFITAPEVSQLFGEMLGVFVVHA WQRHGAPAQTQLVEIGPGRGTMMSDMLRVIRRIAPPLYETMRVHLVETSPRLSAIQKETLTAHADRLTWH DSFDDVPEGFLLLVANELFDAIPIRQFVRTPQGFRERVVSLDANGELVFSTGLAGIDPTLLPPQPERQQL GTVFEVSPAREAVMTAICQRLSVHGGTALAIDYGHLVAGYGDTLQAMRNHAFDPPLAHPGEADLTSHVDF ESLVKTAQATGVHVNGALRQGDFLHGLGLKERASALAAKATPDQTLEIAEAVNRLAGEGAGKMGELFKVI AVSSPALHLLPFRAVD
>gi|84683545|ref|ZP_01011448.1| hypothetical protein RB2654_19268 [Rhodobacterales bacterium HTCC2654] MSLADKLRARIEGTGPMSVADFMAECLLDPEHGYYTTRDPFGSAGDFTTAPEISQMFGELVGLCLAQGWM DQGSPAPFVLAELGPGRGTLMADILRATRGVPGFHDAARIVLVEASPRLRERQQATLTGYGVTWVDSLED APDGPLFLVANEFFDALPVRQFQRDADDWRERQVGLKDGALTFGLGGPTAHAPLDRWTDAQPGDLVELRP AADAVMAEIDRRIAAQGGAALVIDYGDWHSLGDTLQAVAKHEAADPLANPGAADLTAHVDFEALALAATR LTHSRLTPQGVFLERLGITARAQALAARLEGEPLTRHVAAHRRLTHPDEMGTLFKTLAFVPEGAAMLPGL ET
>gi|83854800|ref|ZP_00948330.1| hypothetical protein NAS141_08731 [Sulfitobacter sp. NAS-14.1] MTTLRDILHSRIASNGPMRIDEYMATCLLHPTQGYYTTRDPFGTQGDFTTAPEISQMFGELLGLCLAQSW IAQDAPSAFTLAELGPGRGTLMADILRATRNVPGFIEAAQITLVEASPTLRDVQAKTLAEHQVIWADGTD ALPDQPLFLVANEFFDALPIRQFVRGETSWRERQVGLADGALSFGLGPELPQPALADRLADTTPGDLVED CTQLAPILHPVSERIATHGGAALIVDYGDWHSLGDTLQALQGHEKADPLVAPGQADLTAHVDFEKLALAA APASYTRITPQGVFLERLGITQRAQTLAKGMTEDALNAHVAAHRRLTHPSEMGNLFKVMGIYPPHHSPPP GLEP
>gi|118735910|ref|ZP_01584383.1| protein of unknown function DUF185 [Dinoroseobacter shibae DFL 12] MADRARPPHRAMTPLAEILAARIAATGPITVAEFMAECLLHPTHGYYTTRTPFGQAGDFTTAPEISQMFG ELLGLALAQAWHDQGAPPGAILAEIGPGRGTLMADIRRVLKQVPGAATLRPHLVEASPALRAEQATRVPE AVRLDRVEDLPDAPLLLVANEFFDALPIRQFERHAAGWAERQIGLAEGALAFGRAQPAALASLAHRMADT GPGDLVETCAPAQPIIAEIAGRIARHGGAAIIADYGDWRSKGDTLQAVRAHRPDPVLAHPGQADLTAHVD FEPLAQAARTAGASVSAMIPQGVFLERLGITTRAQALATGLEGAALQSHIAAHRRLTHPEEMGTLFKVLC VSPPSAPPVPGSIAPPNPGKPDAP
>gi|21434891|gb|AAM53573.1| Aby [Azospirillum brasilense] MSDAAPDSLAHHLARRILMDGPLSVAAFMAEALGHPRFGYYMRQDPFGVSGDFTTAPEISQMFGELAGLW CVDTWARLGGPAPVHLVELGPGRGTLMQDALRAAALVPAFREATRVHLVETSPTLRARQKETLAGIPVAW HDRLEDVPEGPTLILANEFFDALPIRQVQKTNHGWFERLIDIDNTESMDTPRFRFVLEAFGSAGARLIPP ALRDAPEGSVVEVSPASQPVARLIGERLAAHPGAALVIDYGYRGGPAVGDTLQALRRHAYAPVLDAPGEA DLTAHVDFAAIAAAAREGGAESFGPVDQGDWLVRLGIQPRATALKRSATTKQAADIDSALARLIHRCRPA
>gi|99080460|ref|YP_612614.1| protein of unknown function DUF185 [Silicibacter sp. TM1040] MSLMQSLRRRIELDGPMTVADYMSECLLHPDYGYYTTAPAIGAEGDFITAPEISQMFGELLGLVLVQSWL DQGRPQPFTLAELGPGRGTLMADMLRATRAVPGFHEAMELLLIEASPRLRDLQRQALAPYAPRWVPSVED LPQHPLFLVANEFFDALPIRQFQREGNQWRERRVGLAEDASGLTLGLGAPAPQPALAHRLEDTKDGDLVE HCEVAAVVTEAIAQRIGDHGGVALLVDYGDWRSLGDTLQALRAHAPTDPLAEPGQADLTAHVDFEAICTA ASATGCAHTRLTPQGVFLERLGITDRANALASGAAGEPLAQIIAAHRRLTHPEEMGNLFKVLGLYPAKFA PPAGLEK
>gi|126733062|ref|ZP_01748818.1| hypothetical protein SSE37_14379 [Sagittula stellata E-37] MSALKDIITRQISRTGPLTLADYMALCLSHPEHGYYATRDPLGAEGDFTTAPEISQMFGELIGLALAQSW MDQGAPTRFVLSELGPGRGTLMADALRATTRVPGFHDALELHLVETSPALRAEQAARLPDATWHESVASL PEAPLFLIANEFFDALPIRQFLRHAQGWQERVVGLKDGQPTLGLTDPAPHDALDHRLADTEPGQIVENCA PAQAIVQETGRRIASHGGTALIVDYGDWRSRGDTFQALYRHKPAEPFARPGEADLTAHVDFEALAKAAHP AAHSALTPQGVFLEHLGITARAQALARRLGGAALESHVAAHRRLTHPGEMGSLFKVLALFPHDAPPPPGT GLPADPS
>gi|84515351|ref|ZP_01002713.1| hypothetical protein SKA53_01796 [Loktanella vestfoldensis SKA53] MTTLADLLLTRIARDGPISIASFMTDALMHPAHGYYATRDPFGAAGDFITAPEISQMFGELIGLSLAQAW LDQGAPDPVTLAELGPGRGTLMADILRATAAVPGFHAAVTVHFVETSPHLRALQAERVPQATWHDRIDTL PDAPLLLVANEFFDALPIRQFVRAGAGWRERMVGAQDGTLCFGLSDAAALAVLTPRLDDTQDGDLVEHCP ALPGIVAAIAGRIATNGGAALVIDYGDWQSLGDTFQALAGHAPTDPLAAPGAADLTAHVDFAAIAAHAAP ARHSRLTPQGVFLERLGITARALKLASGLTGEALDAHVAAHRRLTHPAEMGDLFKVMALYPATAMPPPGL DP
>gi|83941322|ref|ZP_00953784.1| hypothetical protein EE36_03798 [Sulfitobacter sp. EE-36] MTTLRDILHSRIASNGPMRIDEYMATCLLHPTQGYYTTRDPFGTQGDFTTAPEISQMFGELLGLCLAQSW LAQDAPSAFTLAELGPGRGTLMADILRATRNVPGFIEAARITLVEASPTLRDVQAKTLAGHQVIWADGTD ALPDQPLFLVANEFFDALPIRQFVRGETSWRERQIGLADGALSFGLGPELPQPALADRLADTKPGDLVED CTQLAPILHPVSERIATHGGAALIVDYGDWHSLGDTLQALQGHEKADPLAAPGQADLTAHVDFEKLALAA APASHTRITPQGVFLERLGITQRAQTLAKGMTEDALNAHVAAHRRLTHPSEMGNLFKVMGIYPPHHSPPP GLEP
>gi|114770207|ref|ZP_01447745.1| hypothetical protein OM2255_11240 [alpha proteobacterium HTCC2255] MTALSNIIKKQIKRFGPMPVSEYMTLCLLHPEHGYYTNRDALGATGDFTTAPEISQMFGELIGLSIAQSW IDQEMPTPFILAELGPGNGTLMADILRATKSVPNFHASMDLHLIEASPEMRKRQETALNGFNVTWLNYFS ELPQKPLFLIANEFFDCLPIKQYRRTDEGWQEQMIAVENEQLHFILGTATSEEVFSKTNDVPSADMLEVS PPTVAFASAIGEHIQGNGGCAIIVDYGEWDSDGDSLQALKDHRKIDPLTHCGTADLTAHVSFKDLTNAAS KYAKVSSTIPQGILLERLGITQRAQTLAKNMSGKKLENHISAHKRLTHPDEMGSLFKAIAIIPENTDLPA GFNE
>gi|67464609|pdb|1ZKD|A Chain A, X-Ray Structure Of The Putative Protein Q6n1p6 From Rhodopseudomonas Palustris At The Resolution 2.1 A , Northeast Structural Genomics Consortium Target Rpr58 MIDQTALATEIKRLIKAAGPXPVWRYXELCLGHPEHGYYVTRDPLGREGDFTTSPEISQXFGELLGLWSA SVWKAADEPQTLRLIEIGPGRGTXXADALRALRVLPILYQSLSVHLVEINPVLRQKQQTLLAGIRNIHWH DSFEDVPEGPAVILANEYFDVLPIHQAIKRETGWHERVIEIGASGELVFGVAADPIPGFEALLPPLARLS PPGAVFEWRPDTEILKIASRVRDQGGAALIIDYGHLRSDVGDTFQAIASHSYADPLQHPGRADLTAHVDF DALGRAAESIGARAHGPVTQGAFLKRLGIETRALSLXAKATPQVSEDIAGALQRLTGEGRGAXGSXFKVI GVSDPKIETLVALSDDTDREAERRQGTHGLEHHHHHH
>gi|83859400|ref|ZP_00952921.1| hypothetical protein OA2633_13385 [Oceanicaulis alexandrii HTCC2633] MSDDEFRPAQMISERLAERIRTEGSLSVAAFMAEALFHPMAGFYATKDPLGAANDFITAPEISQMFGELL GLWAAECWMQMGAPSRFELIELGPGTGRMMSDMLRAGRAAPGFLDAVHVTLIEASPALKMVQGQTLASAS VPINWAKDFDKAPSGPAVVIGNEFLDCLPIRQAIRHKGQWRERVVTLHPEDEARFVYGLGPVLGEADVAF IAPGLREADDGTLVELRPGDQQQIDQLAARFDRDPGYALFVDYGSAKPETGDTLQAIRAHQKVDPLDAPG TADLTAWVDFDRLLRLGEDAGLSAFGPMTQGDFLTELGIEQRAAVLSRSVDEAGQAKLKRQMHRLVSPED MGTLFKLAAFSSEGLPPAPGIAPFKRSR