Gene Mkms_5210 details

Gene Information

Locus tag: Mkms_5210
Symbol:
ID: 4612893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5453019
End bp: 5454809
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 65%
IMG OID: 639794907
Product: trehalose synthase
Protein accession: YP_941189
Protein GI: 119871237
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
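
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(5453019, 5454809)
print(length)                     # 1791
print(protein_length_aa(length))  # 596
```

Under these assumptions the computed values agree with the Gene Length (1791 bp) and Protein Length (596 aa) fields of this record.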


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGGACCACA GCAGCGGCTC ACCCGCCCAC CCCGATCACG ATCCGGCCGA GGGCAGCCAC 
ATCGAGGACG GGGTGGTCGA ACATCCGACC GCAGGGGACT TCGGCCACGC GCGGATGGTC
CCCGAGGACC GGACGTGGTT CAAGCGGGCC GTGTTCTACG AGGTGCTCGT GCGTGCGTTC
CACGATTCGG ACGCCGACGG TTCCGGTGAC CTGCGCGGGC TGACCGAACG ACTGGACTAC
CTGCAGTGGC TCGGCGTCGA CTGTCTGTGG CTGCCGCCGT TCTACGATTC ACCGCTGCGC
GACGGTGGAT ACGACATCCG CGACTTCTAC AAGGTGCTGC CCGAATTCGG CACCGTCGAG
GACTTCGTCA CGCTGCTCGA CGCCGCCCAC CGCCGCGGCA TCCGGGTGAT CACCGACCTG
GTGATGAACC ACACCTCGGA CTCCCACCCG TGGTTCCAGG AGTCGCGCCG CGACCCGGAC
GGACCCTACG GCGACTTCTA CGTCTGGAGC GACACCAGCG ACAGGTACGC CGACGCGCGG
ATCATCTTCG TCGACACCGA GGAGTCCAAC TGGACCTTCG ACCCGGTGCG GCGGCAGTTC
TATTGGCACC GCTTCTTCTC CCACCAGCCG GATCTGAACT ACGACAACCC GGCCGTGCAG
GAGGCGATGC TCGACGTGCT GCGCTTCTGG CTCGACCTCG GCATCGACGG GTTCCGGCTC
GACGCCGTGC CGTACCTGTT CGAACGCGAG GGCACCAACT GCGAGAACCT GCCGGAGACC
CATGCGTTCC TGCGGCACTG CCGCAAGGTG ATCGACGACG AGTATCCGGG CCGGGTGCTG
CTGGCCGAGG CCAACCAGTG GCCGGCCGAC GTGGTCGCGT ACTTCGGTGA CCCGGACACC
GGCGGCGACG AGTGCCATAT GGCGTTCCAT TTCCCGCTGA TGCCAAGGAT TTTCATGGCC
GTCCGGCGCG AGTCGCGGTT CCCGATCTCC GAGATCCTCG CGCAGACACC GGAGATCCCG
GATATGGCGC AGTGGGGGAT CTTCCTGCGC AACCACGACG AGTTGACCCT CGAGATGGTC
ACCGACGAAG AACGTGACTA CATGTACTCC GAATACGCCA AAGACCCACG GATGAAAGCG
AATGTCGGCA TCCGGCGGCG TCTGGCACCA CTACTGGAGA ACGACCGCAA TCAGATCGAA
TTGTTCACCG CGCTGCTGCT CTCACTCCCC GGGTCACCGG TGCTGTACTA CGGCGACGAG
ATCGGCATGG GCGACATCAT CTGGCTCGGT GACCGCGACG GTGTCCGCAC CCCGATGCAG
TGGACGCCGG ACCGCAACGC GGGCTTCTCG AAGGCCACGC CCGGCCGCCT GTATCTGCCG
CCCAACCAGG ACGCCATCTA CGGTTACCAG GCGGTGAATG TCGAAGCGCA GCGGGACAGT
TCGAATTCGC TGCTGAACTG GACGAAGACC ATGCTCGGGG TGCGCAGACG CCACGACGCG
TTCGCGATCG GCGCGTTCCG CGAACTCGGC GGGTCGAACC CGTCGGTGCT GGCGTTCGTG
CGTGAGACCG CCACCGACAC GGTGCTCTGC GTCAACAACC TGTCCCGCTT CCCGCAGCCC
ATCGAACTGA ATCTGCAGCA GTGGAACGGT TTCACGCCGG TCGAGATGAC CGGCTACGTC
GACTTCCCGA GTATCGGGGC GCTGCCCTAC CTGCTGACCC TGCCCGGCCA CGGGTTCTAC
TGGTTCCAGC TACGCGCCCC CGACCCCGAA CCCGAAGGAG TGCAGCCATG A
 
Protein sequence
MDHSSGSPAH PDHDPAEGSH IEDGVVEHPT AGDFGHARMV PEDRTWFKRA VFYEVLVRAF 
HDSDADGSGD LRGLTERLDY LQWLGVDCLW LPPFYDSPLR DGGYDIRDFY KVLPEFGTVE
DFVTLLDAAH RRGIRVITDL VMNHTSDSHP WFQESRRDPD GPYGDFYVWS DTSDRYADAR
IIFVDTEESN WTFDPVRRQF YWHRFFSHQP DLNYDNPAVQ EAMLDVLRFW LDLGIDGFRL
DAVPYLFERE GTNCENLPET HAFLRHCRKV IDDEYPGRVL LAEANQWPAD VVAYFGDPDT
GGDECHMAFH FPLMPRIFMA VRRESRFPIS EILAQTPEIP DMAQWGIFLR NHDELTLEMV
TDEERDYMYS EYAKDPRMKA NVGIRRRLAP LLENDRNQIE LFTALLLSLP GSPVLYYGDE
IGMGDIIWLG DRDGVRTPMQ WTPDRNAGFS KATPGRLYLP PNQDAIYGYQ AVNVEAQRDS
SNSLLNWTKT MLGVRRRHDA FAIGAFRELG GSNPSVLAFV RETATDTVLC VNNLSRFPQP
IELNLQQWNG FTPVEMTGYV DFPSIGALPY LLTLPGHGFY WFQLRAPDPE PEGVQP
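
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
# Codon table in the conventional TCAG ordering; stop codons are "*".
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGGACCACA GCAGCGGCTC ACCCGCCCAC CCCGATCACG ATCCGGCCGA GGGCAGCCAC"
last_row = "TGGTTCCAGC TACGCGCCCC CGACCCCGAA CCCGAAGGAG TGCAGCCATG A"
print(translate(first_row))  # MDHSSGSPAHPDHDPAEGSH
print(translate(last_row))   # WFQLRAPDPEPEGVQP
```

The two outputs correspond to the first 20 and last 16 residues of the protein sequence above. The last row can be translated on its own because the 1740 bases preceding it are a multiple of 3, so it begins in frame. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.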