Gene Mkms_3646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3646 
Symbol 
ID4611576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3839375 
End bp3840679 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content71% 
IMG OID639793322 
Productglycosyl transferase family protein 
Protein accessionYP_939630 
Protein GI119869678 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.293186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAAG TACTGATCGC GTCCATGTCG CCGATAGGCC ACCTCGGGCC GTTGCTCAAC 
CTCGCCCGCG GCCTCGTCGA CCGCGGTGAC CGGGTCACGG TCCTGACCTC GGCCGCGCGC
GCCGGGATGA TCCGCGCGGC CGGGGCACGA CCCCGCGCCC TGCCGCCGCA GACCGACATC
GACGAGAGCC GGCTCAACGA AAGCCTCCCC GGGCGCGAGA AGACGTCCGG CATCAAACGG
GTCGACTTCG ACATCACCAA CGTCTTCGTG ACCCCGATGC CCCATCAGGC GGCGGCCCTG
GCCGAGGCGT TCGCCGAGAC ACGGTATGAC GCCGTCATCG TCGACGCGAT GTTCCTGGGC
ATCCTGCCGT TCCTGCTCGG TGAACACGCC GCCCGCCCAC CGGTGCTGGC CTACTCGACC
ACGCCGCTGT TGATCAGCAG CCGGGACACC GCTCCTCCGG GGTTGGGTCT GCCGCCGTCG
TCGAGCCCGC TCGGGCGGCT GCGCAACTCG GCGCTGACCA CGCTGACGCA CCGGGTCCTC
CTGCGAGGCT GCCACCGGGC CGCCGACGAG GCGCTGCACC GGATGAACAG CCGCCCGCTG
CCGATGTTCG TCACCGACGC CGCGTTGCTC GCCGACCGCT TCATCGCCCC TACCGTCCCC
GAATTCGACT ATCCGCGCGG CGATCTGCCG CCTCATGTGC GCTACGTGGG CGCCGTGCAT
CCCGCACGGA CGCAGACGTT CACCCCGCCC CCGTGGTGGG GGGCGCTCGA CGGCGAACGC
CCGGTGGTGC ACGTCACCCA GGGCACCGTC GACAACGCCG ACCCCCGGCG GCTACTGCTG
CCGACCGTCG AGGCGCTGGC CGGTGAGGAG GTCACCGTGG TGGTCACCAC CGGTGGCCGT
GGACTTTCCG TACCTCACAC CGCCCTGCCG ACGAATACCC ATGTGGCCGA ATTCATTCCG
CACGACGTGT TGCTTCCGAA GGTCGACGTG ATGGTCACCA ACGGCGGGTT CGGTGCGGTG
CAGCGCGCGC TGTCCCTCGG CGTGCCGCTC GTGGTCGCGG GCGACACCGA GGACAAGCCG
GAGGTCGCCG CGCGCGTCGC CTGGACCGGT GCCGGTGTCG ACCTGCGCAC CGGCACGCCG
ACTCCCGGTG CGATCCGCTC GGCGGTCCGC GACGTGCTCG ACCGCGCGCA CTACCGGGAG
AACGCCCGAC GGCTCGAGGT CGCCTTCACA CGCCGCGACG GGGTGGCCGA GATCGCCGCG
GTGATCGACG AAGTCCTCGC CGAGCGTCGT CAGACAGTGC GGTGA
 
Protein sequence
MPEVLIASMS PIGHLGPLLN LARGLVDRGD RVTVLTSAAR AGMIRAAGAR PRALPPQTDI 
DESRLNESLP GREKTSGIKR VDFDITNVFV TPMPHQAAAL AEAFAETRYD AVIVDAMFLG
ILPFLLGEHA ARPPVLAYST TPLLISSRDT APPGLGLPPS SSPLGRLRNS ALTTLTHRVL
LRGCHRAADE ALHRMNSRPL PMFVTDAALL ADRFIAPTVP EFDYPRGDLP PHVRYVGAVH
PARTQTFTPP PWWGALDGER PVVHVTQGTV DNADPRRLLL PTVEALAGEE VTVVVTTGGR
GLSVPHTALP TNTHVAEFIP HDVLLPKVDV MVTNGGFGAV QRALSLGVPL VVAGDTEDKP
EVAARVAWTG AGVDLRTGTP TPGAIRSAVR DVLDRAHYRE NARRLEVAFT RRDGVAEIAA
VIDEVLAERR QTVR