Gene Mkms_3659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3659 
Symbol 
ID4611591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3855998 
End bp3857149 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content70% 
IMG OID639793337 
Productglycosyl transferase, group 1 
Protein accessionYP_939643 
Protein GI119869691 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.650107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGACG CGCGGAACCG CGGTGGCCGG CGTGTCGATC TCCTATTCGA TGCGCGGCAT 
ATCCGCCAAA GCGGGATCGG CACGTACATC CGGACCCAGC TGCCCTACCT GGAGGAGGCG
GCCGACCGTG ACGGCCAGAC CCTGGCGGTG CTCGCCGACC CGCAGGCCGT TCCTGCGCTG
CGCCCGAGCA CCGAGCTCAT CCTGGCCTCC CCTGCGCAGG CGCCGATGTA CTCGGCCGCC
GAACAATCGG TGTGGCGCCG CGCATTCGAG GCGGCGCGAC CGCGCGCGGT GTGGCTGCCG
CACTACCCCT ACCCGTTGGC CCGTTTCCTT CCGGGCAACC GCCGCACGGC GCTGTACGTC
ACCGTCCACG ACACGATTCA TCTTCTGCCC GAGGCCATCA GCGGGCAGAG TCGGGCCCGG
CGGCTGTATG CCCGCGCCAT GCTGGGTGCC GACGCCAGGT TCTGCCGCCG GATCTTCACG
GTGTCGGAGG CGACGGCCAC GACGCTGAGG GACATCGAAC CGTCCGCGCC GGTGCTGGTG
ACACCGATCC CCGTGGACGA GGTGTGGCTC GATCCGGTGG ATCCGGCCCT GTCGCCGGTC
GGTGGCCGTT ACCTGCTGTA CGTCGGAAAC ACGAAGATCT ACAAGAACCT TCCGCTCGTT
CTCGAGGTGT TCGCCGATCT CACCGACGAG ATTCCTCACA AGCTCGTGAT CGCCGGCGGC
GGCGCCACCC TGCGGACCAT GGACGACCGG GTCCGCAGAC TCGCCGAGGA CAATCCGGAC
CGCGTCCTGG TGACCGGCCA GCTCCCGTTC GCCGCGCTGC GGTCCTTGGT TGCGTCCGCG
GAGTTGCTGA TCATGCCGTC CCTGTACGAG GGCGCCGGCC TGCCGCCGCT GGAAGCCATG
GCCTCGCGCA CACCGGTGTT GGCGTCGGAC ATCCCCTCGA TCCGTGAGAC GTGCGGCGAC
GGCGCCGAGT TCTTCGACCC GCACCGGCCC ACGGAACTGG CCGACCTGCT CCGGCGGTAC
TGCTCAGACG ACGCGTCACG AGCCGACCTG GCCAGGCGCG GCCACGCGCA CGTGCTGGCA
CGCCAGCAGC AGATCCGCCC GACCGCGGCG GCCGATGCGA TCTTCGGCGA ACTCGTCGGC
AGCCGCACGT GA
 
Protein sequence
MKDARNRGGR RVDLLFDARH IRQSGIGTYI RTQLPYLEEA ADRDGQTLAV LADPQAVPAL 
RPSTELILAS PAQAPMYSAA EQSVWRRAFE AARPRAVWLP HYPYPLARFL PGNRRTALYV
TVHDTIHLLP EAISGQSRAR RLYARAMLGA DARFCRRIFT VSEATATTLR DIEPSAPVLV
TPIPVDEVWL DPVDPALSPV GGRYLLYVGN TKIYKNLPLV LEVFADLTDE IPHKLVIAGG
GATLRTMDDR VRRLAEDNPD RVLVTGQLPF AALRSLVASA ELLIMPSLYE GAGLPPLEAM
ASRTPVLASD IPSIRETCGD GAEFFDPHRP TELADLLRRY CSDDASRADL ARRGHAHVLA
RQQQIRPTAA ADAIFGELVG SRT