Gene Mmcs_5121 details


Gene Information

Locus tag: Mmcs_5121
Symbol:
ID: 4113950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 5415397
End bp: 5417187
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 65%
IMG OID: 638034279
Product: trehalose synthase-like protein
Protein accession: YP_642281
Protein GI: 108802084
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
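
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the numbers in the record but are not stated on the page. The function names are illustrative only.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
START_BP = 5415397
END_BP = 5417187


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1791
print(protein_length(length_bp))  # 596
```

Both results match the Gene Length and Protein Length fields listed in the record.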


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCACA GCAGCGGCTC ACCCGCCCAC CCCGATCACG ATCCGGCCGA GGGCAGCCAC 
ATCGAGGACG GGGTGGTCGA ACATCCGACC GCAGGGGACT TCGGCCACGC GCGGATGGTC
CCCGAGGACC GGACGTGGTT CAAGCGGGCC GTGTTCTACG AGGTGCTCGT GCGTGCGTTC
CACGATTCGG ACGCCGACGG TTCCGGTGAC CTGCGCGGGC TGACCGAACG ACTGGACTAC
CTGCAGTGGC TCGGCGTCGA CTGTCTGTGG CTGCCGCCGT TCTACGATTC ACCGCTGCGC
GACGGTGGAT ACGACATCCG CGACTTCTAC AAGGTGCTGC CCGAATTCGG CACCGTCGAG
GACTTCGTCA CGCTGCTCGA CGCCGCCCAC CGCCGCGGCA TCCGGGTGAT CACCGACCTG
GTGATGAACC ACACCTCGGA CTCCCACCCG TGGTTCCAGG AGTCGCGCCG CGACCCGGAC
GGACCCTACG GCGACTTCTA CGTCTGGAGC GACACCAGCG ACAGGTACGC CGACGCGCGG
ATCATCTTCG TCGACACCGA GGAGTCCAAC TGGACCTTCG ACCCGGTGCG GCGGCAGTTC
TATTGGCACC GCTTCTTCTC CCACCAGCCG GATCTGAACT ACGACAACCC GGCCGTGCAG
GAGGCGATGC TCGACGTGCT GCGCTTCTGG CTCGACCTCG GCATCGACGG GTTCCGGCTC
GACGCCGTGC CGTACCTGTT CGAACGCGAG GGCACCAACT GCGAGAACCT GCCGGAGACC
CATGCGTTCC TGCGGCACTG CCGCAAGGTG ATCGACGACG AGTATCCGGG CCGGGTGCTG
CTGGCCGAGG CCAACCAGTG GCCGGCCGAC GTGGTCGCGT ACTTCGGTGA CCCGGACACC
GGCGGCGACG AGTGCCATAT GGCGTTCCAT TTCCCGCTGA TGCCAAGGAT TTTCATGGCC
GTCCGGCGCG AGTCGCGGTT CCCGATCTCC GAGATCCTCG CGCAGACACC GGAGATCCCG
GATATGGCGC AGTGGGGGAT CTTCCTGCGC AACCACGACG AGTTGACCCT CGAGATGGTC
ACCGACGAAG AACGTGACTA CATGTACTCC GAATACGCCA AAGACCCACG GATGAAAGCG
AATGTCGGCA TCCGGCGGCG TCTGGCACCA CTACTGGAGA ACGACCGCAA TCAGATCGAA
TTGTTCACCG CGCTGCTGCT CTCACTCCCC GGGTCACCGG TGCTGTACTA CGGCGACGAG
ATCGGCATGG GCGACATCAT CTGGCTCGGT GACCGCGACG GTGTCCGCAC CCCGATGCAG
TGGACGCCGG ACCGCAACGC GGGCTTCTCG AAGGCCACGC CCGGCCGCCT GTATCTGCCG
CCCAACCAGG ACGCCATCTA CGGTTACCAG GCGGTGAATG TCGAAGCGCA GCGGGACAGT
TCGAATTCGC TGCTGAACTG GACGAAGACC ATGCTCGGGG TGCGCAGACG CCACGACGCG
TTCGCGATCG GCGCGTTCCG CGAACTCGGC GGGTCGAACC CGTCGGTGCT GGCGTTCGTG
CGTGAGACCG CCACCGACAC GGTGCTCTGC GTCAACAACC TGTCCCGCTT CCCGCAGCCC
ATCGAACTGA ATCTGCAGCA GTGGAACGGT TTCACGCCGG TCGAGATGAC CGGCTACGTC
GACTTCCCGA GTATCGGGGC GCTGCCCTAC CTGCTGACCC TGCCCGGCCA CGGGTTCTAC
TGGTTCCAGC TACGCGCCCC CGACCCCGAA CCCGAAGGAG TGCAGCCATG A
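
The GC content field can be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted in as shown, in whitespace-separated blocks of ten bases; `gc_fraction` is a hypothetical helper name. The example call uses a short toy string rather than the full gene, so the printed value is not the gene's GC content.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Toy input: 6 of the 8 bases are G or C.
print(round(gc_fraction("ATGC GGCC") * 100))  # 75
```

Running the same function on the full gene sequence gives a value to compare against the 65% reported in the record, provided that figure was computed over the same 1791 bases.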
 
Protein sequence
MDHSSGSPAH PDHDPAEGSH IEDGVVEHPT AGDFGHARMV PEDRTWFKRA VFYEVLVRAF 
HDSDADGSGD LRGLTERLDY LQWLGVDCLW LPPFYDSPLR DGGYDIRDFY KVLPEFGTVE
DFVTLLDAAH RRGIRVITDL VMNHTSDSHP WFQESRRDPD GPYGDFYVWS DTSDRYADAR
IIFVDTEESN WTFDPVRRQF YWHRFFSHQP DLNYDNPAVQ EAMLDVLRFW LDLGIDGFRL
DAVPYLFERE GTNCENLPET HAFLRHCRKV IDDEYPGRVL LAEANQWPAD VVAYFGDPDT
GGDECHMAFH FPLMPRIFMA VRRESRFPIS EILAQTPEIP DMAQWGIFLR NHDELTLEMV
TDEERDYMYS EYAKDPRMKA NVGIRRRLAP LLENDRNQIE LFTALLLSLP GSPVLYYGDE
IGMGDIIWLG DRDGVRTPMQ WTPDRNAGFS KATPGRLYLP PNQDAIYGYQ AVNVEAQRDS
SNSLLNWTKT MLGVRRRHDA FAIGAFRELG GSNPSVLAFV RETATDTVLC VNNLSRFPQP
IELNLQQWNG FTPVEMTGYV DFPSIGALPY LLTLPGHGFY WFQLRAPDPE PEGVQP
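
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, which does not matter here because the gene begins with ATG. The helper names are illustrative, and only the first 30 and last 12 bases of the gene are used in the example.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS given in whitespace-separated blocks; drop a final stop."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence.
print(translate_cds("ATGGACCACA GCAGCGGCTC ACCCGCCCAC"))  # MDHSSGSPAH
# Last 12 bases, ending in the TGA stop codon.
print(translate_cds("GTGCAGCCATG A"))                     # VQP
```

The two outputs agree with the first ten and last three residues of the protein sequence in the record.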