Gene Mfla_0535 details


Gene Information

Locus tag: Mfla_0535
Symbol:
ID: 3999405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 560102
End bp: 562030
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 57%
IMG OID: 637937433
Product: NHL repeat-containing protein
Protein accession: YP_544646
Protein GI: 91774890
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases; [COG3386] Gluconolactonase
TIGRFAM ID:
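
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(560102, 562030)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the listed Gene Length (1929 bp) and Protein Length (642 aa).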


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000386483
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAA TAAGTAGATT GATTGGCTTG CTGGCTGCGG TGGGTTGGAT GACTGCAGCC 
AACGCTGATC TCGCAGACGT TGAGGCTAAC CTGGGCCGGC TCAAGGTGCC GGAGGGGTTC
AAGGTGGAGG TCTATGCCGA AGTGCCGGGC GCACGGCAAA TGGCACTGGG AACTTCCGGT
ACAGTCTACG TCGGCACTCG CGGCAACAAG GTCTATGCTG TCGTGGACAA GAACAAGGAT
CACAAGGCAG ACGAGGTGAT CACTATCCTT GATGACCTCA AGGTAGGCAA TGGTGTCGCC
ATGTGGGAGG GGAATCTCTA CGTGGCCGAG CAGCACCGCA TCACCCGTTA CGCCGCCCCC
GATTTTGACC TCAACCTGCC GTTCAAGCAG ATGCGCGAGG TGATTTACGA CCAACTGCCC
GACAAGGTAC ATCATGGCTG GCGCTATATC GCCTTCGGCC CCGACAAGAA GCTCTATGCG
ACGATTGGCG CGCCATGCAA CGTCTGTGAC CCGCAAGGCA TCGAAGCATC CATCATTCGC
ATGGATCCGG ATGGCAAGAA TGTGGAGGTG TTTGCCAAAG GCGTGCGTAA TTCGGTCGGC
ATGGATTTTC AGCCGGGCAC CAATGTCCTG TATTTCACCG ATAATGGTGT GGACATGATG
GGCGATGATA TTCCGCCTGA CGAACTCAAT GCTGCTCCCC AGGCTGGACT GCATTTCGGT
TTCCCTTACG TCGGTGGCCG GGATGCGCGT CCTAAGGACT GGCAGAACAA GAAGCCGCCC
CAGGCCGTGA CGCCGCCTGT TGTCGAATTC CAGGCGCATA GCGCAAACCT GGGTTTCAAG
TTCTATACCG GCAAGCAGTT TCCCCGTGAT TATCAGGGCA ATGCCATCGT TGCCCAGCAT
GGTTCATGGA ACCGCAGCCA GCCCGTCGGT TACCAGCTGA TGCGTGTCGT GTTCGACGAG
CAGCATCAGG TCAAGTCGCA CGAAGTTTTT ATCGAGGGAT GGCTCAACGA TGGTGAAGCA
TGGGGTCGTC CTGTTGACGT ATTGCAGCTC AATGACGGTT CCTTGCTGGT ATCGGATGAT
TACAGCGGCG TCATCTACCG CGTCAGCTAT GGCGAGTCTT CTGCCGCTGC TGCGCCTGGC
CGTGGCCAGG CGTCCAAAGT GACCGGCTTG AACATGCCGG AGTCTGCTGT TGCACATCCA
GACGGGCGTA TCTTCGTCAG CGAGATTGGC GAGTTCGGCA AGTCCGGTGA CGGTAAGATT
ACCGTGATCA ACAAGGACGG CAGTCGCCGG ACGCTGGCCG ACGGTTTGAA TGACCCCAAG
GGGCTCGACC TCTTCAACAA TCAGCTGTAT GTCGCGGATA TGGACCAGGT CGTGAGGGTT
GGTCTGGACG GCAGCAAAAC CGTCATTGCC AAGTCCGGGG ACTTCCCGGA GAAGCCAATG
TTCCTCAATG ACATTGAGAT TGATGGGCTG GGCAACGTCT ATGTCTCGGA TAGTGGGGAC
GATGATGGCA AGCATGGCGC GATCTACCAG ATATCCCCTG AAGGGAAGAT CACCCAGCTG
ATCAATGACA AGTCCGGCAT CAAGCGTCCG AACGGGCTGT TGCTGGATGG CCCCGGCAAG
CTGCTGGTGG CCGATTTCGG CAATGGCAAG CTATTCCAGG TGAATTTTGC CAGCAAGAAG
GCGAGCGTCA CGCTGCTCAA CCAGGGCTTT GGCGGCGCTG ACGGGCTGGT GCGCGATACT
GATGGGTTGC TTTATGTCAG CGACTGGGCC GGCGGCAAGG TCTGGCAATT GACTGAGCCG
CGCGCAACGC CGCAATTGAT CACCGAAGGC CATCAGTCTG CCGCTGATAT TGCCCTATCG
GCCGATGGGC GCTTCCTGCT GGTGCCTGAT ATGAAGGCTG GCGAGCTGGT GCCATTGCCC
ATCAAGTAA
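
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as the GC content reported in the record. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases over the whole sequence; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Demonstration on the first two blocks of the gene sequence only.
first_blocks = "ATGAAAACAA TAAGTAGATT"
print(len(clean_sequence(first_blocks)), gc_percent(first_blocks))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the rounded GC content in the record.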
 
Protein sequence
MKTISRLIGL LAAVGWMTAA NADLADVEAN LGRLKVPEGF KVEVYAEVPG ARQMALGTSG 
TVYVGTRGNK VYAVVDKNKD HKADEVITIL DDLKVGNGVA MWEGNLYVAE QHRITRYAAP
DFDLNLPFKQ MREVIYDQLP DKVHHGWRYI AFGPDKKLYA TIGAPCNVCD PQGIEASIIR
MDPDGKNVEV FAKGVRNSVG MDFQPGTNVL YFTDNGVDMM GDDIPPDELN AAPQAGLHFG
FPYVGGRDAR PKDWQNKKPP QAVTPPVVEF QAHSANLGFK FYTGKQFPRD YQGNAIVAQH
GSWNRSQPVG YQLMRVVFDE QHQVKSHEVF IEGWLNDGEA WGRPVDVLQL NDGSLLVSDD
YSGVIYRVSY GESSAAAAPG RGQASKVTGL NMPESAVAHP DGRIFVSEIG EFGKSGDGKI
TVINKDGSRR TLADGLNDPK GLDLFNNQLY VADMDQVVRV GLDGSKTVIA KSGDFPEKPM
FLNDIEIDGL GNVYVSDSGD DDGKHGAIYQ ISPEGKITQL INDKSGIKRP NGLLLDGPGK
LLVADFGNGK LFQVNFASKK ASVTLLNQGF GGADGLVRDT DGLLYVSDWA GGKVWQLTEP
RATPQLITEG HQSAADIALS ADGRFLLVPD MKAGELVPLP IK
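
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 uses the standard amino acid assignments (it differs from the standard code in its permitted start codons) and that the gene starts with ATG, as it does here; an alternative start codon would need extra handling that this sketch omits. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAAAACAA TAAGTAGATT GATTGGCTTG"))
```

The first three blocks translate to MKTISRLIGL, which matches the first block of the protein sequence above.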