Gene Mkms_3684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3684 
Symbol 
ID4611616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3883074 
End bp3884693 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content70% 
IMG OID639793362 
Productalpha amylase, catalytic region 
Protein accessionYP_939668 
Protein GI119869716 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.942466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCAA TGGCTCACCC CCCGACAGCT GACCAGCCCT GGTGGTCGCG CGCGGTGTTC 
TACCAGGTCT ATCCCCGGTC CTTCCACGAC AGCGACGGCG ACGGTGTCGG CGACCTCGAC
GGCGTGACCG CCAAACTCGA CTATCTGAGT GAACTCGGCG TCGACGCACT GTGGCTCAAC
CCGGTCACCG TCTCCCCCAT GGCCGACCAC GGCTACGACG TCGCCGACCC CCGCGACATC
GACCCGCTGT TCGGCGGCAT CGACGCCCTG GACCGGCTCA TCGCCGCCGC GCACGACCGC
GGTATCCGCA TCACCATGGA TCTGGTGCCC AACCACACCA GCTCGGCGCA CCCGTGGTTC
CAGGCGGCGC TGGCGGCGGG TCCGGGCAGC GCCCAGCGGG AGCGCTACAT CTTCCGGGAG
GGCACCGGTC CCGACGGGCT GCTCCCGCCC AACAACTGGA TCTCCGTCTT CGGCGGGCCC
GCGTGGACGC GGATCGTCGA ACCCGACGGG CAGCCCGGCC AGTGGTACCT GCACCTGTTC
GACCCGGAGC AGCCCGACCT CAACTGGGAC AACCCCGAGG TCTTCGAGGA TCTCGAGAAG
ACGCTGCGCT TCTGGCTCGA CCGCGGCGTC GACGGTTTCC GGATCGACGT GGCGCATGGG
ATGGCCAAAC CGCCGGACCT GCCCGATATG GAGATCGCCG AGAACAGGAT GCTCGCCGAG
ACCGCCAGCG ATCCGCGGTT CGACCACCAG GGCGTCCACG ACATCCACCG CAACATCCGC
TCCGTGCTCG ACGACTATCC CGGCGCGGTC GCCGTCGGCG AGGTGTGGGT CTACGACAAC
GCCGCGTTCG CCGCCTACCT GCGGGCCGAC GAACTGCATC TGGGCTTCAA CTTCCGGCTG
GTGCGCGCCG ACTTCGACGC CGACGAGATC CACGACGCGA TCGAGAACTC GCTGGCCGCC
GTCGCCCTGG AAAACGCGAC GCCGACGTGG ACGCTGTCCA ACCACGACGT CGAGCGGGAG
GTCACCCGGT ACGGCGGCGG GGCGCTCGGG CTGGCGCGGG CCCGGGCGAT GGCCCTGGTG
ATGCTGGCGC TGCCCGGCGT GGTGTTCGTC TACAACGGCG AAGAACTCGG CCTGCCCAAC
GTCGACCTGC CCGACGAAGT GCTCCAGGAC CCGGTGTGGG AACGCTCCGA CCGCACCGAA
CGCGGGCGCG ACGGATGCCG CGTGCCGATG CCGTGGAGCG GTGACGCTCC CCCGTTCGGG
TTCTCGACGA CGGCCGACAC CTGGCTGCCG ATGCCGGCGG AATGGTCGTC GCTGACCGTC
GAACGCCAGC TGGCCGAGCC GGACTCCATG CTGCACTTCT TCCGCCGGGC GCTGCGCCTG
CGCCGGGACC GCTGTGGCGT CGACGGGGCC ACGCTGACGC AGCTGTCCGC CGAGGACGGG
GTGGTCACGT TCCGCACCGA CGGCGGACTC ACCTGCGTGC TCAACGCCGG TGAGCGCCCG
GTCGACCTGC CCCCCGGTGA GGTGCTGCTC GCCAGCGCGC CCCTTCAGGC ACATTCCCCG
TCGCTTCGCT CGCCCCAGGA TCGGCGGCTA CCCCCCGACA CGGCCGCCTG GGTGGTCTAA
 
Protein sequence
MGAMAHPPTA DQPWWSRAVF YQVYPRSFHD SDGDGVGDLD GVTAKLDYLS ELGVDALWLN 
PVTVSPMADH GYDVADPRDI DPLFGGIDAL DRLIAAAHDR GIRITMDLVP NHTSSAHPWF
QAALAAGPGS AQRERYIFRE GTGPDGLLPP NNWISVFGGP AWTRIVEPDG QPGQWYLHLF
DPEQPDLNWD NPEVFEDLEK TLRFWLDRGV DGFRIDVAHG MAKPPDLPDM EIAENRMLAE
TASDPRFDHQ GVHDIHRNIR SVLDDYPGAV AVGEVWVYDN AAFAAYLRAD ELHLGFNFRL
VRADFDADEI HDAIENSLAA VALENATPTW TLSNHDVERE VTRYGGGALG LARARAMALV
MLALPGVVFV YNGEELGLPN VDLPDEVLQD PVWERSDRTE RGRDGCRVPM PWSGDAPPFG
FSTTADTWLP MPAEWSSLTV ERQLAEPDSM LHFFRRALRL RRDRCGVDGA TLTQLSAEDG
VVTFRTDGGL TCVLNAGERP VDLPPGEVLL ASAPLQAHSP SLRSPQDRRL PPDTAAWVV