Gene Mfla_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_1547 
Symbol 
ID4001137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp1654419 
End bp1656803 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content50% 
IMG OID637938458 
ProductTonB-dependent receptor 
Protein accessionYP_545656 
Protein GI91775900 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0831797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGGA AAAGTATGCT ACCAACTGAA TCGGCCTCAA GCCCATTTAG AAAACGACAA 
CTCATCATTG CCGTAGCTGC CGCCTTTGCA GGCAGCCTTT TTATCTCCCC AACATATGCG
GCTGATGAGC AAACCTCTCA AGCTGGAGAT ACCTCGATAA CAAAAGCCGC CCCTCACAAA
GAAAAGACTA CTCCAGCCGA AGCGAAGGAT GTTGAATCAT CTCTTGGAAG CATCGTTGTA
ACGGCCCAGC GCCGCGAAGA AAATGCGCAG GAAGTACCGA CCGCCATTTC CGTACTTGGA
GGTGAAGAAT TCCTTCAACG AGGCATTGGT CGTTCTGCAA GCGAAATCCT GAACAATGTT
CCCAACTCAT CGGCCGGGAC TATTCAGAAT GGCCGTCCAC GCTGGTGGAT CCGTGGTGTA
GGCGCAGGTC AGCAACAACT GGACTTCCCT AACCCTGTCG GCTTCTATAT CGATGACGTA
TTTATCAGCA ATGCCAGTGC TACCGGCCTG CCTATTTTCG ACCTGGAACG TGTAGAGGTT
CTTCGCGGCC CACAAGGTAC GCTCTGGGGC AAGAATACGA CAGGAGGTGC CATCAACATC
GTTTCCAAGA AACCTACCTT CTCGGAAAAT CCTGAGGGTT ATGCCAAAGT GGATTACGGC
AGCTTTGGAG ACAAAGTGAT CCAAGGTGCT GTAGGCGGTA CTATCGTTGA TGAGCGTATC
GCGGGTCGAA TTTCCTTCTA CAATCAAGAC CTCGATGGTC GCTTTACCAA CCAGTTCAAT
GGCAAGACCA GTGGCGGCCT TGAAGACTCG GTCATTCGTG GTCAGCTGCT ATTTGCGTTG
ACCCCTGACC TAGATGCGCT GCTCAACGTC TATCACCGCA AGTATAAAAC CGACGGTAAT
GTGGGTACGG TCTATACCAG GACCGGCGAG TTCCGCAACG GCTACAGACC CAGCCGGGAT
ATCAATCATG TCAGCAGCAA TGCCGAAGAC AGCAATGATG TTACCCAAAA CGGCATCTCA
TTGAATGTAA ATTGGCAGTT GGGCAAACTA ACATTAACCT CGATCACGGC CTATGCTGAT
TACAAGGCAA ATACATTCAG CGACTCAGAC AATACTCCTT TAGAAATCGG CCGCGGCTAC
ACAGATGCAA AAAGTAAGCA GTGGACGCAG GAATTTCGCT TGGCTTCTCC TCGTGAAGAC
CGTTGGAACT GGCTGACAGG CTTCTTTTAT TTCAAGGAAG ACATCGACTC ATTTAGTGCG
GCTGCCAGAC TACCGAATGG AGCTGTGCCG CAATTGCAGG GCTCCTCACA GGCCAATACA
TTCAATCGTA CCGACTTGGC TCATGAAACT GAAAGCTACG CTATTTTTGG CAGCACCACT
TACAACTTCA CCGACAAATT CGATACCACG CTTGGCGCTC GCTGGACAAC AGAAGAAAAA
AACTATGATC TAGATCGCCG CAGCAATGCA TTACTAGGGG GCCCCAATGC CGGTACCGAC
ACAAGCTGGT CAGACTATGG GGCATGGTGG AACTCATACA CTGGCGCCAT TGGCGGCACC
GGGACTTTTG TAGATGCCCG CGACAAGCGC TGGAACGCCT TTACTTACGA CATTACGCCA
CAATACAAGA TCACGGAGAC TGACCGCATT TATTTCAAGT TTGCTCGTGG CATCAAGTCT
GGAGGCTTCA ATACGGCTGC CACCAATCCC CTGGCACTCA ATACGTTGAA ACCGGAAGAA
TTGAATTCCT ATGAAATCGG CTATAAGTCG GAATGGTTGA ATGGTCGCTT GAACTTCAAT
GCCAACGCTT TCTATTACGA TTACAGTAAT GTACAAGTCA ATGTCGTAGG TACCAATCTT
GCCGTGCCGA TTTCCTACTT GCAAAACGTA GAAAAAGCCA GCGTCAAGGG AGCTGAGTTT
GAAATTGAAG CATTGCCAAC CAACCATCTG CATCTGAATG CCAATATCGG CATCCTTAAG
ACAGAGTTCG AAAAGTTTGA CGTACTCAAT GGTGGTGGCA ACCACGATGG TAACGAGTTC
GTGCGTGCCC CTCGCTGGAG CGCTCAAATA CGCGGTACCT ACAATATCCC ACTCGAAAAC
GGCAGCAGGA TATTGTTGGG AGCAGATGCG CGCTACCTGG GTAAACAATA TTTCTTTGTT
GTCCCTCAGG ATAATGACCT GCTGAATCAG GGAGCATACA CCTTGGTGAA CGCACGCATC
AGCTACCTGA CTAAGAACGA TAGGGTCGAG ATCACTGGAT ATGTCAACAA TTTGTTCGAT
AAGGAATACC GCTACCATGC GTTGCCTGCC AGCAATGCCA GCGGCAACAC CGTGTATTGG
GGTAACCCGA GAACGATAGG CGCATCATTG ACCTACCGCT TCTAA
 
Protein sequence
MSRKSMLPTE SASSPFRKRQ LIIAVAAAFA GSLFISPTYA ADEQTSQAGD TSITKAAPHK 
EKTTPAEAKD VESSLGSIVV TAQRREENAQ EVPTAISVLG GEEFLQRGIG RSASEILNNV
PNSSAGTIQN GRPRWWIRGV GAGQQQLDFP NPVGFYIDDV FISNASATGL PIFDLERVEV
LRGPQGTLWG KNTTGGAINI VSKKPTFSEN PEGYAKVDYG SFGDKVIQGA VGGTIVDERI
AGRISFYNQD LDGRFTNQFN GKTSGGLEDS VIRGQLLFAL TPDLDALLNV YHRKYKTDGN
VGTVYTRTGE FRNGYRPSRD INHVSSNAED SNDVTQNGIS LNVNWQLGKL TLTSITAYAD
YKANTFSDSD NTPLEIGRGY TDAKSKQWTQ EFRLASPRED RWNWLTGFFY FKEDIDSFSA
AARLPNGAVP QLQGSSQANT FNRTDLAHET ESYAIFGSTT YNFTDKFDTT LGARWTTEEK
NYDLDRRSNA LLGGPNAGTD TSWSDYGAWW NSYTGAIGGT GTFVDARDKR WNAFTYDITP
QYKITETDRI YFKFARGIKS GGFNTAATNP LALNTLKPEE LNSYEIGYKS EWLNGRLNFN
ANAFYYDYSN VQVNVVGTNL AVPISYLQNV EKASVKGAEF EIEALPTNHL HLNANIGILK
TEFEKFDVLN GGGNHDGNEF VRAPRWSAQI RGTYNIPLEN GSRILLGADA RYLGKQYFFV
VPQDNDLLNQ GAYTLVNARI SYLTKNDRVE ITGYVNNLFD KEYRYHALPA SNASGNTVYW
GNPRTIGASL TYRF