Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mfla_0483 |
Symbol | |
ID | 3999636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacillus flagellatus KT |
Kingdom | Bacteria |
Replicon accession | NC_007947 |
Strand | - |
Start bp | 495108 |
End bp | 501104 |
Gene Length | 5997 bp |
Protein Length | 1998 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637937380 |
Product | adhesin |
Protein accession | YP_544594 |
Protein GI | 91774838 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGC ACCGTTACCG CCTGATATAC AACCGTGCCC GGGGCCTCAT GATGGCCGTG GCGGAAAACA CCGCCACCCG CCGCACCACA CTCGGCACAC ACTCCGACGC GGTGCGAGAA TCCCCCGCTC TCGCGCTGGC CCACATTGCC CCACTGCCCC TAACCATCAT GCTCATGCTG GGAACAGTCA TCATCATGCC CGGCACAGCC AAGGCCGAGA TCATCGCCGA TCCATCCGCA CCTCCCAGCC AGCAACCTAT CATCGACAAC ACCGCGAACG GCCTGCCTCT GGTCAATATT CAGACACCCA GCACAGCCGG GGTATCACGC AACACTTATA GCCAGTTCGA CATCAACACC CATGGCGCCA TCCTTAATAA TAGCAACACG AATGTGCAAA CCCAGCTCGG CGGCTGGATA CAGGGTAACC CACTGCTCGC AGGTGGCATG GCCCGCGTCA TCCTCAACGA GGTCAACAGC AGCAGCCCCA GCCTGCTCAA CGGCTTTATC GAGATCGCTG GTAGCCGCGC CCAACTCGTC ATCGCCAACC CGGCCGGCAT CAGCTGCAAC GGCTGCGGCT TCATCAACGC CAACCGCGCC ACCCTCACCA CTGGCAATCC CATCGTCAAT GGCGGCAACC TGCTCGGCTA TCGCGTAGGC GGCGGCAACA TCAGCTTCCT GGGCAACGGC ATGGACGCCC GTCATGCCGA CTACACTGAC ATCATCACAC GGGCCGTGGA AATCAACGCA GGCATCTGGG CCAAGCAACT CAACATTACC ACCGGCAACA ACCACATCAA CATCGACAAC CACGGCAACC CAACCGTGAT CTCCCGGCTC ACCCCCGCAA GCTCCGCCAC GGCCCCAGCC TTTGCCGTGG ACGTGGCGGC CCTGGGCGGC ATGTACGCAG GCAAGATCCA CCTCATCGGC ACCGAGGCCG GGCTGGGCGT GCGCAATGCG GGCAGCATCG GGGCCAGCGT GGGCGAAATC GTGGTAACGG CCGATGGACG CATTGTCAAT ACAGGGACAC TGGCAGCACA TACCAATCTC GACATCGCCA CCAATCAAGA CTTGCAAAAT GCGGGCACCA TCAATGCCAA TGCCAGGGTG AGCCTGAACG CGGGGGACAT CGACAACACC GGGGAAATTT CTTCAGCCAG CACCATCCTG CTGAGCCAAG CAACATTGAC CAATCGCGGC ACAATAGACG GGGATAACAC CCATATCAAT GCAGGCAGAC TGGACAACAT CGGGCACGGC AGCATCTTCG GCGACCACCT CAGCATCCAG GCTGGCGAGA TCAACAACCT GGCTGAAAGC GTGAATGGCA CCACCACAGC CGCCATCATG GCCGGGCGTA CCCGTCTCGA CATAGCCACC AGCACACTAC TCAACAGCGA CAACGCCATG CTGTTCAGCG GAGGCGATAT CGCCATCGGG GGCATTCTCG ACGCAGCGGG AATAGCCACA GGCCATGCCG CCTCCATCAT CAACCGGGAC GCCACCATCG AAGCGCTGGG CAACATGGTC ATCAACGCAA ATGGGCTGCA AAACCTCAAC ACCGGCCTGC AAGTGGGAAG GACAGCGGAA ACCACCGCAG GCTACGACAA ATTCACCCCA GCCGGCCAAG GTGTCGTCTG GGATAGCGCG GACTATCCCG GCGCGCAGAT AGGCAATGTT CATGTCGTAT GGCGCTCCGC CGGGCCATAC ACATTCCGCG AATACACGCG CTACCAGGGC ACCCTTTATA CCTCCCATAC CGAAGTGATC GCATCCACTC CCGGCAGGAT ACTCGCCGAA GGCAATATGC AGATCACAGG CAACCTACTG AACAGCGATA GCCAGATCAT TGCCGGTGAA GTATTGGACA TCAGCGGCGC AACGGTACAG AACGTCAATA CCGAAGGCCA GACCATTGCC CGCCACAGCG GCGGCGCCTG GTATTACGAT TGGGACGGCA AGGATAACGA ATACGACATC GACTACTTGG GCAACCACAG TCCCGGCAAC ATCGTCACCA CCTACAATCT CGCCACTACC CGCCTAGAAG GCAATGCCGC GCCCCCAACC AGTGGTACCG TGGTGGCAGG TAACCCAAGC GCAGTCAGCA ACAACGCATT GTTCCAGGCG CTTCCAGATA CAGCGGCAGA CTATCTCATC GAGACCAACC CACGCTTCAC CAATTACCGC ACGTGGCTAA GCTCCGACTA CATGGTGCAG CAACTCTCGT TCGACCCGGC TATCACGCAA AAGCGTCTGG GCGATGGCTT CTACGAACAA CGCCTGGTGC GGGAACAGAT TGCCGAACTG ACCGGCAGGC GCTTCCTGGA AAGCTACGCC GACGACGAAG CCCAATACCA AGCCCTGATG AACGCAGGAC TCACCATGGC CAGCCAAATG CAACTCATCC CCGGCGTCGC GCTTTCAGCG GAGCAAATCG CACAGCTCAC CAGTGACATC GTCTGGCTGG TGGAACAGGA AGTCACCCTG CCCGGCGGCA CTATCGCCAA AGCCCTCGTG CCGCAGGTAT ACGTACGCCT GCAGGACGGC GACCTAAACC CCACGACCGG CATCATGGCG GGCAACAGCA TCAACATCAA TGCCACAGAC ACCATCACCA ATGGCGGCAC CCTGGCTGGG CGCACCCTGG CCGTGCTCAA TGCAGACAAC ATCCAGAACC TGGGCGGCAA CATCAGTGCT GATGTCACGG TCTTGAAAGC CACCGGCAAC ATCGACAATA TCGGTGGCAC CATCATGGCA GGAAATGCAC TGGTACTCGA AGCCGGCAGT GACATCAACA TCGAAAGCAC CACGCAAAGC CATGCCCAGA CCATAGGCGC CAGCAGTTTC AGCCGCACCA ACCTCGACAG GGTTGCCGGT TTATACGTGA GCAACCCCGA TGCCGTCCTC GTCGCCAGCG CAGGCAACGA TCTCAACCTC ATGGCTGCCA GCATTGTCAA CAGCGGCGAT CATGGCGCCA CCTCGCTCGA GGCGGGCCAC GACATCAACC TGGGCACGGT GCAGGTCGCC GAACAGAACA ATAGCGTCCG CAACGCCAAG AATCGCGTCA GCTACGGCTC AATCCAAGAC ATCGGCACCA ATATCCACAC CAGCGGCAAT ATCTCCCTGC AAGCTGGTAG CGACATCAGC ATCAAGGCAG GCAACCTCAA CAGCGAAAAC GGCAACCTCG GCATCGCCGC ATCAGGGGAC ATCACCATCA CCTCAGGCGA AGCTGGCAGC AACTTCGATC TTGCCCGCAA GACCAAGCGC AGCAGCGGGC TAAGCAGCAC GACCAGGACC TACCGCACCA TCGCAGAATC CACCGAGGCC ATCGGCAGCA CCCTGAGTGC AGACACCATC AGCCTGCAAG CCGGCACACC TGCCGGAATC GCATCCGCGG ATACGGGCAA TATCAACATC ACCGGCAGTC ATGTCGTCTC CACCAACGCC ACTTCTCTCA ATGCCAGCGG CGATATCAAC CTCAATACGG CTGACAACAC CAGCTACACC TTCAACCAGA AAACCACCAA GAAATCAGGC TTCTCCGCCA GCGGCTCCAG CATCGGCTAC GGCACCAGCA ACCTGAGCCA AAAGCAAACC GGCACCACCA TCACCCAGAC CGGCAGCACC GTGGGCAGCG TCGAAGGCGA TGTCACCATC CAGGCAGGCA ACACCTACAC GCAAATCGCC AGCGATGTGC TCGCGCCACA AGGCGATATT GAGATCAGCG CCCAACAGGT CAACATCACC GCAGGCAACA ACACCTCCAC CCAGACCACC GAGACGAAGT TCAAGCAGAG CGGCCTGACG CTGGCGATCA CCAGCCCGGT CATCTCGGCG ATCCAGACCG CAGAGCAGAT GAAGCAGGCC GCCAGCAATA CTTCCGGTGT CCGCATGCAG GCATTGGCAG CAGGCACCAC GGCATTGGCT GCCAAGAACG CTTTCGATTC CACCCAGCAG GCATTGTCCG CAGCACCGAC AAGCAATAGC GTCACCAATG CCGCCAATCA GGCAGGCGGA GTGAATCTGT CGCTCAGTAT TGGTACTTCA AAGAGCAGTA GCACCTCCAC GCAAACCAGC AGCACAGCCC AAGGCAGCAC GGTGATGGCG GGCGGTGATG TCAACATCAC GGCGACGGGG GCTGGGGAGC AGTCGGATAT CAACGTCATC GGCAGTACAA TCAAGGCGGA TGGGGATGTG AGCTTAAAGG CGGATGATAA AGTAAACCTA ATCGCTGCGC AGAACACGGA AACACTGAAC AACAAGAATA AAAACAGCAG CGCTAGTGTC GGCATCAGCA TCGGTTCCAA CGGGCTTGCC GTCACCGCCA GCGCATCACA GGGTAAAGGC AAGGCCAATG GCACAGATAT CACCTGGACT GAGAGCGTAG TTGAGGCTGG CAACAAGGTC ACGCTGGAAA GCGGTACAGA TACCAACATC ATCGGTTCGC AAGTCAGGGG GGATCAAGTA GTGGCCGATG TGGGCACTAC AGGCAAAGGC AACCTCAATA TCCAGAGCTT GCAGGACATC AGCACTTACG ACAGCAAGCA GAAAAGCACT GGTATCAGCG TCAGCATGCC GATTGGGGCT GGCATGGCAG GCGGTTCGTT CAGTTCGTCC AATACCAAGA TTAAGAGCGA CTATGCCTCT GTCAACGAAC AGGCAGGCAT CTATTCAGGC GATGGCGGGT TCCAGATCAA CGTCAAGGGG AATACCGATC TTAAAGGTGC GGTGATCGCA TCTACTGGAA CCGCGATTGC GGAAAACAGG AACAACCTCT CCACGGGAAC GCTCACCGTC TCAGATATCA AGAACCAAGC AGAGTATGAT GCCAAGGCGA CATCGGCGAC GATAGGCGGT GGTATCCAGG CTGGGTTGCC GCAGCTATCC GGGGCGGGCA TGGGCTCGGA TACCGGTCAA GCCAGCAGTA CTACAGTCAG CGCGATCAGT GGCGGTACAG TCGATATTAC AAATGACGCA GAACAGCAGA CAAAGTCCGG CACGGATGCC GCCACCACAG TCGCCCTACT CAACCGCGAC GTGCATGTGG ATGAGCAGGG CAATGCGGTG GATAGCCAAG GTAACAGTAC GGCTAATACC ATTGCGCCTA TCTTCGATGC GGAGAAGGTA GCCAAGGAGA TTCAGGCGCA GGTGCAGATT ACGGAGGCGT TTGGACAGCA GGCCCATTAT GCTGTAGGCC AGTATGTTCA GGAAAAACGT CGAGCATTAC AGCAACAACT TAAAGAAGCA GTAACGCCTG AGGAAAAATC AATCATTCAG TCGCAACTCA AAGATTTGCG ACTGCAAGAG CAGGTTATGA ATGTACTGAT TGGGGCCGTC ACCGGACAAT TGGATACAGT GCTTGGCAAG GAAGCTTTAT CTACTGCAGC AGAAAAAATG AGAGAAATCA CCATCGCAAA CTCAAGCCTG TTTAAAGGAA TCACCGATGG CAACACGGTG CTCAACAACA CATCAGGCGA AAGCGAAGGC GTTCGCGGCG ACGGTATAAA AGGCGGAGGA ACCAGGGTTG ATTTGGATAA TATCTGTGGC CCAGATAATG CACGATGCGT AACTCGAAAA GATGAAAATG GAAATAATGT ACTTGTTCTA AAAGATGGCA TGGTGCAATG GCACCCAGAC AACAATATAT CACTAGCCGA ATTCCTCAAG TCCCCTGAAG GACAAAAGGC TGCTGGTGCA ACAGGCGGCA TTCAAGGGTG GTTAGGAACA TTATTTGGTC ATGAATATCA GCCGGGTAGC TGGCAAGATA AATTGATAGA GGCCTTCGGT GGAACCCATG ACTATATCGG AGGGCAGATC ACGGGCTTGT ATGACGAACA AGGAAATACA AAACGTGGAA TGGAGAGCAC AGAACGATTC ATTCGGGATC GAGTATCCGA ATTGGCAATT TTACCATCCA CTCCATTTGC CATGGCGGAG CTGTTGCCTC CTGAAGTTTG GACTGCTATT TCCATTTTAT TGAAAGGCGC TAAATGA
|
Protein sequence | MNKHRYRLIY NRARGLMMAV AENTATRRTT LGTHSDAVRE SPALALAHIA PLPLTIMLML GTVIIMPGTA KAEIIADPSA PPSQQPIIDN TANGLPLVNI QTPSTAGVSR NTYSQFDINT HGAILNNSNT NVQTQLGGWI QGNPLLAGGM ARVILNEVNS SSPSLLNGFI EIAGSRAQLV IANPAGISCN GCGFINANRA TLTTGNPIVN GGNLLGYRVG GGNISFLGNG MDARHADYTD IITRAVEINA GIWAKQLNIT TGNNHINIDN HGNPTVISRL TPASSATAPA FAVDVAALGG MYAGKIHLIG TEAGLGVRNA GSIGASVGEI VVTADGRIVN TGTLAAHTNL DIATNQDLQN AGTINANARV SLNAGDIDNT GEISSASTIL LSQATLTNRG TIDGDNTHIN AGRLDNIGHG SIFGDHLSIQ AGEINNLAES VNGTTTAAIM AGRTRLDIAT STLLNSDNAM LFSGGDIAIG GILDAAGIAT GHAASIINRD ATIEALGNMV INANGLQNLN TGLQVGRTAE TTAGYDKFTP AGQGVVWDSA DYPGAQIGNV HVVWRSAGPY TFREYTRYQG TLYTSHTEVI ASTPGRILAE GNMQITGNLL NSDSQIIAGE VLDISGATVQ NVNTEGQTIA RHSGGAWYYD WDGKDNEYDI DYLGNHSPGN IVTTYNLATT RLEGNAAPPT SGTVVAGNPS AVSNNALFQA LPDTAADYLI ETNPRFTNYR TWLSSDYMVQ QLSFDPAITQ KRLGDGFYEQ RLVREQIAEL TGRRFLESYA DDEAQYQALM NAGLTMASQM QLIPGVALSA EQIAQLTSDI VWLVEQEVTL PGGTIAKALV PQVYVRLQDG DLNPTTGIMA GNSININATD TITNGGTLAG RTLAVLNADN IQNLGGNISA DVTVLKATGN IDNIGGTIMA GNALVLEAGS DINIESTTQS HAQTIGASSF SRTNLDRVAG LYVSNPDAVL VASAGNDLNL MAASIVNSGD HGATSLEAGH DINLGTVQVA EQNNSVRNAK NRVSYGSIQD IGTNIHTSGN ISLQAGSDIS IKAGNLNSEN GNLGIAASGD ITITSGEAGS NFDLARKTKR SSGLSSTTRT YRTIAESTEA IGSTLSADTI SLQAGTPAGI ASADTGNINI TGSHVVSTNA TSLNASGDIN LNTADNTSYT FNQKTTKKSG FSASGSSIGY GTSNLSQKQT GTTITQTGST VGSVEGDVTI QAGNTYTQIA SDVLAPQGDI EISAQQVNIT AGNNTSTQTT ETKFKQSGLT LAITSPVISA IQTAEQMKQA ASNTSGVRMQ ALAAGTTALA AKNAFDSTQQ ALSAAPTSNS VTNAANQAGG VNLSLSIGTS KSSSTSTQTS STAQGSTVMA GGDVNITATG AGEQSDINVI GSTIKADGDV SLKADDKVNL IAAQNTETLN NKNKNSSASV GISIGSNGLA VTASASQGKG KANGTDITWT ESVVEAGNKV TLESGTDTNI IGSQVRGDQV VADVGTTGKG NLNIQSLQDI STYDSKQKST GISVSMPIGA GMAGGSFSSS NTKIKSDYAS VNEQAGIYSG DGGFQINVKG NTDLKGAVIA STGTAIAENR NNLSTGTLTV SDIKNQAEYD AKATSATIGG GIQAGLPQLS GAGMGSDTGQ ASSTTVSAIS GGTVDITNDA EQQTKSGTDA ATTVALLNRD VHVDEQGNAV DSQGNSTANT IAPIFDAEKV AKEIQAQVQI TEAFGQQAHY AVGQYVQEKR RALQQQLKEA VTPEEKSIIQ SQLKDLRLQE QVMNVLIGAV TGQLDTVLGK EALSTAAEKM REITIANSSL FKGITDGNTV LNNTSGESEG VRGDGIKGGG TRVDLDNICG PDNARCVTRK DENGNNVLVL KDGMVQWHPD NNISLAEFLK SPEGQKAAGA TGGIQGWLGT LFGHEYQPGS WQDKLIEAFG GTHDYIGGQI TGLYDEQGNT KRGMESTERF IRDRVSELAI LPSTPFAMAE LLPPEVWTAI SILLKGAK
|
| |