Gene Mkms_3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3141 
Symbol 
ID4610976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3285914 
End bp3288238 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content70% 
IMG OID639792812 
Productmalto-oligosyltrehalose synthase 
Protein accessionYP_939125 
Protein GI119869173 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3280] Maltooligosyl trehalose synthase 
TIGRFAM ID[TIGR02401] malto-oligosyltrehalose synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.675938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.272499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG ACCGGCCCGT GCTGTCGACC TACCGGTTGC AGATGCGCGG AGACGCGTTC 
ACCCTCGCCG ACGCCGAGGC GCTGGTGGAC TACCTCGACG AACTCGGCGT CTCACACCTG
TACCTGTCGC CGATCCTGAC CGCGGTCGAG GGTTCGACCC ACGGTTACGA CGTCACCGAT
CCCACGACGG TCTCGGCGGC CCTCGGCGGC GCCGACGCGC TCGCATCGCT GTCGTCGGCG
GCCGGGGCCC GCGGTATGGG GTTGGTCGTT GACATCGTGC CCAACCACGT CGGCGTCGAC
GATCCCAGGC AGAACCGGTG GTGGTGGGAT CTGCTCACCC ACGGCCGCGA CTCCGCCTAT
GCCGACTACT TCGACGTCGA CTGGACCGCC GACCCGGACG GCCGGATCCT GTTGCCCGTG
CTCGGATCCA ACGGCGACGT CGCGGATCTC ACCGTGGACG GGGACACGCT GCGACTGGGC
GACCTGGTGT TCCCGATCGC GCCGGGTACC GCCGACGGTG CCGGTCCGGA CGTGCACGAC
CGCCAGCACT ACCGGTTGAT CGGGTGGAAG CCCCGGGCGA GCGAAGCGCC GGGGAATCCG
TCCGGCATCT GCGGATACCG CCGCTTCTTC TCGATCACCT CGCTGGCCGG GCTGCGGCAG
GAAGACCCCG CGGTGTTCGA GGCGACCCAT ACCGAGGTCA AGCGGTGGTT CACCGACGGC
CTCGTCGACG GTATCCGCAT CGACCACCCC GACGGATTGT CCGATCCGCC CGGGTATCTG
GTCCGGCTGC GCGAACTCGT CGGCCCGCAG GCGTGGATCG TGATCGAGAA GATCCTCGCC
GTCGACGAGC CACTGGACCC CACGCTGCCG GTCGCGGGCA CCACCGGATA CGACGCGCTG
CGCGAGGCCG GTGGGGTGTT CGTCGATCCC GCAGGCGCCG AGGCGCTCGG CACGCTCTAC
GAATCCACCG GTGTCGACTA CAGCGCGATG CCGCGGGCCG CGCGGGCGCT CAAGGTGAAG
GCGGTGACCA CCACCCTGGG CAGTGAGCTG GCCCGGCTGT GCCGCACGAT CGTCGCCGCC
ACCGGTCAGG ACCACGACGA CCTGCCGGAC GCGGTCGCCG CGGTGATCAG CCACCTCGGC
GTCTACCGCT CCGACTATTC GGTTCTGGCG ACGGTGTTGC CGGTCGCGAT CGCCGAAACA
CTCTCGCAGC AGCCGCGTTT CGCGCCGGCC CTGCAGATCG TGTCGACCGC GCTGTCCAGC
AGCCGGGAGA CCGCGGTGCG GTTCCAGCAG CTGTGCGGCG CGGCCACCGC GAAGGCCATG
GAGGACTGCC TGTTCTACCG CGACGCCCGC CTGGTGTCGC TCAACGAGGT CGGCGGTGAG
CCCGAGTGGT TCGGGGTCAG CATCGCCGAA TTCCACCAGC GCGCCACGGC CCGCGCCGCC
ATGTGGCCGC ACGCGATGAC GACGCTCACG ACCCACGACA CCAAACGCGG TGAGGACGTG
CGGGCCCGCA TCGGCGTGCT GTCCCAGGTG CCGTCGCTGT GGGCGCAGTA CGTCGAGGCG
TGGGCGGAAC GAACCCCGCC ACCCGATTCC GGCACCGGAC TCTTCTTGCT GCAGAACATG
TTCGGCGTCT GGCCGGTCGA CGGGGTGGTC ACCGACGAAC TGCGCGATCG CCTGCACGCC
TACGCCGAGA AGGCGATCCG CGAAGCCGCC ACCCACACCA CGTGGAACGA TCCGGACAGC
GAGTTCGAGA GCGGCGTACA CGCGTGGCTG GACAGCGTGA TCGACGGCCC GGTCGGCACC
GAACTCGGCC GCCTGGTCGA ACAACTCGAC CCGCACGCCC GCAACGACAG CCTCGGCCAG
AAACTGTTGG CGCTCACCGG CCCCGGTGTC CCGGACGTCT ACCAGGGCAC CGAACTCTGG
GAGGACAGCC TGGTCGATCC CGACAACCGC CGTCCGGTCG ACTACACGCA GCGTCGCGCG
GAACTGGAGG CACTGCGGCA TCCGAAGATG CGCGTGGTGC ACGCCGCGCT GCGGGCGCGC
CGCGACCGGC CGGCCAGCTT CCTCGACGGT GGGTACACCC CGGTGCTCGC ACGCGGTGCG
GCCGCCGACC ACGTGGTCGC GTTCCTGCGC GGCGACGATG TGCTCACCGC GGTCAGCCGC
CACACCGTCC GGCTACGCGA CACCGGTTGG GGTGACACCG AATTGACCCT TCCCGAGGGC
AACTGGACCG ACCGCATCAC CGGCGGCCGG TTCACCGGCA CCGTGACCGC GGCCGATCTG
TTCGCCGAAC TGCCCGCCGT GCTGCTGGAG CGTGACCATG CCTGA
 
Protein sequence
MAADRPVLST YRLQMRGDAF TLADAEALVD YLDELGVSHL YLSPILTAVE GSTHGYDVTD 
PTTVSAALGG ADALASLSSA AGARGMGLVV DIVPNHVGVD DPRQNRWWWD LLTHGRDSAY
ADYFDVDWTA DPDGRILLPV LGSNGDVADL TVDGDTLRLG DLVFPIAPGT ADGAGPDVHD
RQHYRLIGWK PRASEAPGNP SGICGYRRFF SITSLAGLRQ EDPAVFEATH TEVKRWFTDG
LVDGIRIDHP DGLSDPPGYL VRLRELVGPQ AWIVIEKILA VDEPLDPTLP VAGTTGYDAL
REAGGVFVDP AGAEALGTLY ESTGVDYSAM PRAARALKVK AVTTTLGSEL ARLCRTIVAA
TGQDHDDLPD AVAAVISHLG VYRSDYSVLA TVLPVAIAET LSQQPRFAPA LQIVSTALSS
SRETAVRFQQ LCGAATAKAM EDCLFYRDAR LVSLNEVGGE PEWFGVSIAE FHQRATARAA
MWPHAMTTLT THDTKRGEDV RARIGVLSQV PSLWAQYVEA WAERTPPPDS GTGLFLLQNM
FGVWPVDGVV TDELRDRLHA YAEKAIREAA THTTWNDPDS EFESGVHAWL DSVIDGPVGT
ELGRLVEQLD PHARNDSLGQ KLLALTGPGV PDVYQGTELW EDSLVDPDNR RPVDYTQRRA
ELEALRHPKM RVVHAALRAR RDRPASFLDG GYTPVLARGA AADHVVAFLR GDDVLTAVSR
HTVRLRDTGW GDTELTLPEG NWTDRITGGR FTGTVTAADL FAELPAVLLE RDHA