Gene Mfla_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_1037 
Symbol 
ID4000103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp1078857 
End bp1081232 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content59% 
IMG OID637937937 
Productputative phosphoketolase 
Protein accessionYP_545146 
Protein GI91775390 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3957] Phosphoketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCCC AAGCCGCGGA CCATGTCCTG AATGCTCAAC AATTGGCGCA ATTGCATGCC 
TACTGGCGCG CCGCCAATTA TCTCTCGGTC GGCCAGATCT ACCTCTATGC CAACCCGTTG
CTCAAGACGC CTTTGCTGCC AGAGCATATC AAGCCGCGCT TGCTGGGCCA CTGGGGAACG
ACACCGGGGC TTAATTTCGT CTATGTCCAC GCCAACCGGG TCATTCGCGA GCACGACCTC
AACATGATCT ACATTACCGG CCCGGGGCAT GGCGGTCCGG CCGTCGTCGC CAATACCTAC
CTGGAAGGCA GTTATACCGA GCTTTACCCG GAAATCACCG AGGACGAGAC CGGCATGCTC
AAGCTGTTCA AGCAGTTCTC CTTCCCTGGC GGCATCCCCA GCCATGCCTC GGCGCAGACA
CCGGGGTCGA TCAACGAGGG AGGCGAACTG GGCTATTGCC TGCTGCATGC GTTCGGCGCA
GTGTTCGACA ACCCGGACCT GATCGCCTGC TGCGTAGTCG GCGACGGCGA GGCGGAAACC
GGCCCGCTGG CCACCTCCTG GCACGGCAAC AAATTCCTCA ATCCCAAGAC CGACGGCGTG
GTATTGCCTA TCCTGCACCT CAACGGCTAC AAGATCGCCA ACCCCACCAT CCTGTCGCGC
ATTGGCGATG AAGAGCTGCA GAAACTGTTC GAAGGCTACG GCTACACGCC CTATTTCGTC
GAAGGCAGCG ACCCTGACTT CATGCATCAG CGCATGGCCA GCGTCATGGA CCAGGTCATC
GGGGAAATAC AACGGATCAA GCGCGAAGCC AGCCTGAAAC GCGAAGTCAG CCGTCCGCGC
TGGCCCATGA TCATCCTGCG TTCGCCGAAG GGCTGGACCG GACCTGCGGA AGTGGACGGC
AAACCGAACA CTGGCACCTT TCACGCGCAC CAAGTGCCGT TCGGTGAGTT AGACAAGCCG
GAGCACATCA AGCTGTTGCA GGACTGGATG CAATCCTACG CGCCGCACGA ACTGTTCGAC
GATACAGGCA GGCTCAAGGA AGAGCTGCGC GCGCTTGCCC CCAAGGGCGA ACGGCGCATG
GGCGCCAACC CGCACGCCAA TGGCGGCCTG CTGCTCAAGC CTCTGGCACT GCCGGATTTC
CGCAAGTTTG CCTATGCCGT GACCCAGCCG GGCGAACGCC AGACCAGCAC CACGGGCGAG
CTCGGCAAGT ATCTGGCCGA AGTTTGGCGG CTCAATCCGA ACAACTTCCG CGTATTCAGC
CCGGATGAAA ATAACTCCAA TCGCCTGCAG GCATTGTTCG AGGTCACCAA TCGGCAATGG
ATGGCCGAAC GTCTGGAAAG CGACGAGCAT CTTGCAGCGC ATGGCGGCGT GATGGAAATA
CTCTCAGAAC ATAGCTGCGA AGGCTGGCTG GAAGGTTACC TGCTCACTGG CCGCCATGGA
TTCTTTTCCT GCTACGAGGC ATTTATCCAT ATCATCGACT CCATGTTCAA CCAGCATGCC
AAGTGGCTCA AGGAATGTCG CCACATTCCC TGGCGCAAAC CGATTGCCTC GCTCAACATC
CTGCTGAGCT CGCATGTCTG GCGCCAGGAC CATAATGGCT TCTCGCATCA GGATCCCGGC
TTCATCGACC ATGTCGCCAA CAAAAAGCCG GACATCATCC GCATCTACCT GCCTGCCGAC
GCCAATACCC TGATCGCAAT CACCGATCAT TGCCTGCAAA GCCGCAACCT CATCAATGTC
ATCGTTGCAG GCAAGCAACC GCAAGGGCAA TGGCTGGACA TGCCCGCCGC GATTGAACAT
GCGCGCCGTG GCGCCAGTAT TTGGCACTGG GCCGGCAACT GCGCAGCAGA CGAGGAACCC
GACTTGGTCA TCGCCAGCGC GGGCGATGTG CCGGTCATGG AGTCGCTTGC CGCCGTCATG
CTGTTGCGCG AGCATCTGCC CGCGCTCAAG ATACGCTACG TCAACGTGGT CGACCTCATG
ACCCTAGTTC CTCACGACGT GCATCCACAC GGCCTCACCG ACGATGCCTT CGACGAACTG
TTCACCCGGG ATAAACCAGT GATCTTCGCT TTCCATGGCT ATCCTGGGCT GATCCACCGC
CTGGCCTACA AGCGGCACAA TCATGCCAAC TTCCATGTGC ACGGCTTCCT GGAAGAAGGC
ACGACCACTA CGCCTTTCGA CATGACGGTG CTGAACAGGC TGGATCGCTA CAATCTGGTC
AAGAATACCT TGCGCTGGCT ACCGCATATC GACAATGCTC CGGTTCTGGA TGCGCTCATG
GACGAAAAGC TCGCGCAACA CAAGCAATAC ATCTGGCAGC ATGGCGAGGA CATGCCGGAA
ATCCGGGATT GGAAATGGCA GCCTGCAGAT CATTGA
 
Protein sequence
MQSQAADHVL NAQQLAQLHA YWRAANYLSV GQIYLYANPL LKTPLLPEHI KPRLLGHWGT 
TPGLNFVYVH ANRVIREHDL NMIYITGPGH GGPAVVANTY LEGSYTELYP EITEDETGML
KLFKQFSFPG GIPSHASAQT PGSINEGGEL GYCLLHAFGA VFDNPDLIAC CVVGDGEAET
GPLATSWHGN KFLNPKTDGV VLPILHLNGY KIANPTILSR IGDEELQKLF EGYGYTPYFV
EGSDPDFMHQ RMASVMDQVI GEIQRIKREA SLKREVSRPR WPMIILRSPK GWTGPAEVDG
KPNTGTFHAH QVPFGELDKP EHIKLLQDWM QSYAPHELFD DTGRLKEELR ALAPKGERRM
GANPHANGGL LLKPLALPDF RKFAYAVTQP GERQTSTTGE LGKYLAEVWR LNPNNFRVFS
PDENNSNRLQ ALFEVTNRQW MAERLESDEH LAAHGGVMEI LSEHSCEGWL EGYLLTGRHG
FFSCYEAFIH IIDSMFNQHA KWLKECRHIP WRKPIASLNI LLSSHVWRQD HNGFSHQDPG
FIDHVANKKP DIIRIYLPAD ANTLIAITDH CLQSRNLINV IVAGKQPQGQ WLDMPAAIEH
ARRGASIWHW AGNCAADEEP DLVIASAGDV PVMESLAAVM LLREHLPALK IRYVNVVDLM
TLVPHDVHPH GLTDDAFDEL FTRDKPVIFA FHGYPGLIHR LAYKRHNHAN FHVHGFLEEG
TTTTPFDMTV LNRLDRYNLV KNTLRWLPHI DNAPVLDALM DEKLAQHKQY IWQHGEDMPE
IRDWKWQPAD H