Gene Mkms_0356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0356 
Symbol 
ID4615239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp388823 
End bp391093 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content68% 
IMG OID639790031 
Producttrehalose synthase 
Protein accessionYP_936363 
Protein GI119866411 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02455] trehalose synthase, Pseudomonas stutzeri type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.375645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGG CGCCAGAGAC GGACTACGAC GACCAGGCCG GGTCGGGCAC CGACGATCAG 
ACGAACGAGC CCAGCGAGAT CACCTACGAC GAGTACCTGC ATCCGGCGCG TCCGCGGGCG
CTGCGGTTCC GTCCCCGCGT CAAATCGCCG TTCACCCGCC GCTCGGTGGC CCAGGACGGT
CCGGCGCGCG CGGACAACCC GGCGTACGTG TCGTGGCTGC TGTCGCAGTC GATGCTCGCC
GACGCCAACG AGATCAGCCA GCAGTTCTCC GGTCAGGGCT CGATGTGGCA GAACCCGTAC
GCCACACCGA GCCCGCGCGG CGCGGTGGAG ACGGCGTCGG TGTGGTTCAC CGCCTACCCG
TTGTCGTTGA TCACCCGGCC CAACGAGTCG TTCCTCAAGG CGCTGGCCGA CGACGAGATG
TGGAAGGCGT TCGCCGACAT CGGTATCGAG GCCATCCACA CCGGACCGGT CAAGCGCGCG
GGCGGGATCT CGGGGTGGCA GACCACCCCG AGCGTCGACG GCCACTTCGA CCGCATCAGC
ACCGAGATCG ACCCGGCCTT CGGCACCGAG GAGGAGTTCC GCAAGATGTG CGGGACGGCC
AACTGGTACG GCGGCACGAT CATCGACGAC ATCGTGCCCG GCCACACCGG TAAGGGCGCC
GACTTCCGCC TCGCGGAGAT GAAGTACGCC GACTATCCGG GGATCTACCA CATGGTGGAG
GTGGATCCCC GCGACTGGGA ACACCTGCCC GCCGTGCCGC CCGGTGTCGA CTCGGTCAAC
ATCGACCAGG CCACCGAGGA GTGGCTGGAC AAGGCCGGTT ACATCATCGG CCGCCTGCAA
CGGGTGATCT TCTACGCCGA GGGCATCAAG GAGACCAACT GGAGCGTCAC CCGGCCGGTG
CTCGGCGTCG ACGGCGTGGA GCGCCGCTGG GTGTACCTGC ACTACTTCAA GGACGGTCAG
CCGTCGATCA ACTGGCTCGA CCCGTCGTTC GCCGGGATGC GCCTCGTGAT CGGGGACGCG
CTGCATTCGC TCGCCGACCT GGGCACCGGC GGACTTCGTT TGGACGCCAA CGGTTTTCTC
GGCGCGGAGA AGACCGCGGC CGAGGACAGC ACCGCCTGGT CGGAGGGTCA TCCGCTGTCC
GAGGCGGCCA ACCATCTGAT CGCCAGCATG GTGCGCAAGG TGGGCGGGTT CACCTTCCAG
GAGCTCAACC TCACCATCGA CGACATCCGC CAGATCGGCG AAGCGGGCGC GGACCTGTCC
TACGACTTCA TCAACCGGCC CGCCTACCAG CACGCCCTGG CCACCGCGGA CACCGAGTTC
CTGCGGCTGA CGCTGCGCAC CACGCTGGAA CTCGGCGTCG ACCCGGCGTC GTTGATTCAC
GCCCTGCAGA ACCACGACGA GTTGACCTAC GAACTCGTGC ACTGGTCGAA CGGACACCGC
GACGACATCT ACACCTACAA GGGTGAGGAG ATCACCGGAG AGGCGCTCGG CGAGACGATC
CGCCGGGACC TCAGCGTACG GCTCACCGGC GAGAACGCCC CGTACAACCT GGTTTTCACC
ACCAACGGGA TCGCCTGCAC CACCGCGACG GTCATCGCGG CCACCCTCGG CATCACCGAT
CTGGACGAGA TCGACGACGA TCCGGTGCGC ATCGACCGCA TCCGGCGAGC CCATCTGCTG
CTGGCGATGT TCAACGCGCT GCAGCCGGGG GTGTTCGCGC TGTCGGGGTG GGACCTGTGC
GGCATGCTCA CGTTGCCGGC CGGGCAGGTC GGCGAACTCC TGCGCGGCGG CGACACCCGG
TGGATCCACC GGGCCGCCCA CGATCTGATG GGCGTGAATC CCGGTGCGAC ACAGTCGTTG
GCGGGGATGC CGCGGGGGCG GAGCCTCTAC GGGTCGATCC CCGATCAGCT CGACGACGAG
ACCAGCTTCC TACGCCAGTT GCAGGCGATC CTGCGGGTGC GTTCGCACTA CGGCATCGCG
ACGAGCCGCC AGGTCGACAT CCCCGAGGTC TCCCACCGCG GCATGCTCGT GCTGGTGCAC
CAGCTGGCCG ACGAGGGCCG CTATCAGTTG ACCGTGCTGA ACTTCGCCAA CGAGGAGGTC
GCCGGAACGG TGCGCTCCGA GGCGCTGCCA CCCGGTGCCG AGGTGTCGGA CATGTTCACC
GGCCACGCCT TCGCCACGGT CGACGACCTG CACAGCTTCA CCGTCGAGAT GCCCGCGCAT
CACGGGATGT CCCTGCTGGT CGAGGTCACC GCCGACGACT CCGAGACGTG A
 
Protein sequence
MSEAPETDYD DQAGSGTDDQ TNEPSEITYD EYLHPARPRA LRFRPRVKSP FTRRSVAQDG 
PARADNPAYV SWLLSQSMLA DANEISQQFS GQGSMWQNPY ATPSPRGAVE TASVWFTAYP
LSLITRPNES FLKALADDEM WKAFADIGIE AIHTGPVKRA GGISGWQTTP SVDGHFDRIS
TEIDPAFGTE EEFRKMCGTA NWYGGTIIDD IVPGHTGKGA DFRLAEMKYA DYPGIYHMVE
VDPRDWEHLP AVPPGVDSVN IDQATEEWLD KAGYIIGRLQ RVIFYAEGIK ETNWSVTRPV
LGVDGVERRW VYLHYFKDGQ PSINWLDPSF AGMRLVIGDA LHSLADLGTG GLRLDANGFL
GAEKTAAEDS TAWSEGHPLS EAANHLIASM VRKVGGFTFQ ELNLTIDDIR QIGEAGADLS
YDFINRPAYQ HALATADTEF LRLTLRTTLE LGVDPASLIH ALQNHDELTY ELVHWSNGHR
DDIYTYKGEE ITGEALGETI RRDLSVRLTG ENAPYNLVFT TNGIACTTAT VIAATLGITD
LDEIDDDPVR IDRIRRAHLL LAMFNALQPG VFALSGWDLC GMLTLPAGQV GELLRGGDTR
WIHRAAHDLM GVNPGATQSL AGMPRGRSLY GSIPDQLDDE TSFLRQLQAI LRVRSHYGIA
TSRQVDIPEV SHRGMLVLVH QLADEGRYQL TVLNFANEEV AGTVRSEALP PGAEVSDMFT
GHAFATVDDL HSFTVEMPAH HGMSLLVEVT ADDSET