Gene Mkms_4686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4686 
Symbol 
ID4616101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4910184 
End bp4911863 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content66% 
IMG OID639794378 
Productalpha amylase, catalytic region 
Protein accessionYP_940667 
Protein GI119870715 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAGA TCGAGACCGG CGACCTGTGG TGGAAGAACG CCGTCTTCTA CTGCGCCGAC 
ATCGAGACGT TCTACGACTG GAACGGCGAC GGCACCGTCG ACATCCGGGG CATGACCGAG
CGCATCGAGT ACCTCTTCGA CCTCGGGGTC ACCTGCCTGT GGCTGATGCC GTTCTATCCG
ACCGCCCGCA AGGACGACGG CTACGACATC ATCGACTTCT TCGGGGTCGA TCAGCGCCTG
GGCAACCTCG GTGACTTCGT CGAGCTGGTG CGAACCGCAC GATCCAAGGG CATCCGGGTG
ATCGTCGACT TCGTCATGAA CCACACCTCC GATGCCCACC CCTGGTTCAA ATCCGCCCGG
CGCAGCAAGG ACGACCCGTA CCGCGATTAC TACGTGTGGA GTGAGACCGA GCCCAAGTCG
AGTCCGGCCG ACGTCGTCTT CCCCGACCAG GAGGACAGCA TCTGGGAGCA GGAACCCAAA
ACCGGCGAGT GGTACCTGCA TCACTTCTAC AAGCACCAGC CCGACCTGAA CATCGCCAAT
CCCAAAGTGC AGGAGGAGAT TTCGCGCACA CTCGGCTTCT GGTTGCAACT CGGCGTCTCG
GGTTTCCGGG TCGACGCCGT CCCCTTCCTG TTCGCCCGCG ACGGAGTGCC CGGCGACCCC
GGCGTCTTCG ATCCCTACGA ATACCTCGGG GACGTCCGCA ACTTCGTCAA CCGGCGGGTT
GGTGACGCGG TGCTGCTCGG CGAGGTGAAC GTGCCCTACG ACGATCAGAA GTCGTTCTTC
GGGGGCCCCG ACGGCGACGG ACTCAACATG CAGTTCGACT TCATCGGCAT GCAGAACATG
TACCTCTCGC TGGCCCGGGG CGACGCCGGC CCGATCGCCG CGGCGCTGCG CGAACGCCCA
CCGCTCGACG AGACGAGCCA GTGGGCGAAC TTCGTGCGCA ACCACGACGA GCTCACGCTC
GACAAACTGG GCGACGCGGA ACGGCAGGAG ATCTTCGACG CCTTCGGACC CGAGCGCGAC
ATGCAGCTCT ACGGCCGCGG GCTGCGCCGA CGCCTGCCGG CGATGCTCGG CGGGGACGAA
CGGCGGATGC GCATGGTCTA CTCGCTGGCG TTCTCGCTGC CCGGCACTCC GGTCCTCTAC
TACGGCGAGG AGATCGGGAT GGCCGAGAAC CTCGACATCC CAGGGCGGTT CGCCGTGCGG
GTTCCGATGC AGTGGACCAG CGACGACAAC GGCGGTTTCT CCAGGGCCGC CGCGCGCCGA
CTCCCCCGCC CGATCACCGA CGGCCTGTAC GGCCCCGACC GGGTGAACGT CGCCGACCAG
CGCCACGACC ACCAGTCGTT CTGGTGGTTC ATCCGCGATC TCATCTACAC CTATCGCCAG
CAGCCGGAGA TCGGCTGGTC GACGGCCGAG GTGCTCGACC AGCCCAACCC TGCGGTGCTC
GCGCACTGCT GCCGGGAGAA GAGTGGGTGG ACGATGGTGG GGCTGCACAA TTTCGGCGCC
GACCGGTGCA TGGTGCCGAT CCAGATCGAG GACGCCCCGG AGGGGTCGAC GCTCGTCGAC
CTGCTGGACG GGCTGTCGGT GCACGAACTG GACGCGAAGG GACGGCTCGA GATCGGCCTG
GACGGTTACG GTTACCGTTG GCTACGCCTG GTCCGGCCGG GCGACGACCC GATCATCTGA
 
Protein sequence
MRKIETGDLW WKNAVFYCAD IETFYDWNGD GTVDIRGMTE RIEYLFDLGV TCLWLMPFYP 
TARKDDGYDI IDFFGVDQRL GNLGDFVELV RTARSKGIRV IVDFVMNHTS DAHPWFKSAR
RSKDDPYRDY YVWSETEPKS SPADVVFPDQ EDSIWEQEPK TGEWYLHHFY KHQPDLNIAN
PKVQEEISRT LGFWLQLGVS GFRVDAVPFL FARDGVPGDP GVFDPYEYLG DVRNFVNRRV
GDAVLLGEVN VPYDDQKSFF GGPDGDGLNM QFDFIGMQNM YLSLARGDAG PIAAALRERP
PLDETSQWAN FVRNHDELTL DKLGDAERQE IFDAFGPERD MQLYGRGLRR RLPAMLGGDE
RRMRMVYSLA FSLPGTPVLY YGEEIGMAEN LDIPGRFAVR VPMQWTSDDN GGFSRAAARR
LPRPITDGLY GPDRVNVADQ RHDHQSFWWF IRDLIYTYRQ QPEIGWSTAE VLDQPNPAVL
AHCCREKSGW TMVGLHNFGA DRCMVPIQIE DAPEGSTLVD LLDGLSVHEL DAKGRLEIGL
DGYGYRWLRL VRPGDDPII