Gene Mkms_1286 details

Gene Information

Locus tag: Mkms_1286
Symbol:
ID: 4614299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 1382103
End bp: 1383383
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 71%
IMG OID: 639790961
Product: glycosyl transferase family protein
Protein accession: YP_937288
Protein GI: 119867336
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
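
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, not stated in the record). The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 1382103
END_BP = 1383383


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Both values agree with the Gene Length and Protein Length fields listed above.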


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.540508
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid Coverage clones: 15
Fosmid unclonability p-value: 0.192884
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGA TCCTGGCCTA CACCTCACCG TCGGCGGGCC ACCTCTTTCC GATGCTCGCG 
CTCCTCGGCG AACTGGCCGG GCGCGGTCAC CGGGTGCACG TGCGCACCTA CGCCGGCGGA
GTCCCCGCCG CCCGCGCGGC GGGGCTGACC GCCGACGCCG TGGATCCCCG CATCGAGGAC
ATCGTGAGCG AGGACTGGCG GGCCACCGGT GGACGTGCGG TGCTGCGTAT GACGATCGAG
ACGTTCGGTG GCCGCGGCGC CCACGAACTC GACGACCTCG ACGATGCGAT CGCGCTCGTC
GGGCCTGATC TGCTGCTGCT CGACATCAAC TGCTGGGGTG CGATGGCCGC CGCTGACGCC
GGCGACATCC TGTGGGCGGT GTTCTCCCCG TACACCCCGT TCCTGAACTC ACCGGGGATG
CCGCCGGTGG GCGCCGGGAT GGCGCCGTGG CCGGGATTCG TCGGCCGGGT GCGGGACGCG
GGTGTGCGCG CCGTGGTTCA GCAGGTGTTC GACGTGCCGA TGATGGCTAA CGTCAACGGT
TTTCGTGCGA AACGGGGGCT ACCCGCCCTG CGCGATGTGG ACGCCGTCCT ACGCCGTGCG
CCGCTGATGC TGGTGGCCGG CGGCGAGCCG TTCGAGTATC CGCACCCGGG CTGGGGTGCG
GCGGTGCAGA TGATCGGACC GTGTGAATAC GACCCGAAAC CCGCCGCGGC GCCGTCGTGG
CTCGACGGCA TCGACCGTCC GGTGGTCCTC GTCACGACCT CGTCGGTGAA ACAGGCCGAC
TCCGCGCTGG TTACCACGGC GCTGACCGCG TTGGCGGACA AAGACATTCA CATCGTCGCG
ACCTGCCCCT CGGGTATTCC CGGCGGAATC ACGGTGCCGC GCAACGCCAC CGTGACCGGA
TTCCTCCCGC ACGGTCCGGT GCTCGACCGG GCGGTCTGTG CGATCACGCA CGGCGGTATG
GGTGTCACCC AGAAGGCGCT GTCCCGCGGG GTGCCGGTGT GCGCGGTGCC GCACGGCCGC
GATCAGTTCG AGGTGGCCCG CCGGGTGCAG GCCGCCCGGT GCGGCACCCG CCTGCCGGCG
CGGCGCCTGA CACCGCAGCG GTTGCGCACC GCGGTCGAAC GCGCGCTGAC GATGACCGCG
GGGGCGCGTC GCGTCGCCGC CGGCTTCGCC GCCACCGGCG GCGTGGCACG CGGCGCGGAT
TTGCTGGAAC AGCGGCTGAT CGGTCGGTCG GCTACCCGAA GTGCACGCCT TGAGCCAGAG
GTAACTCGGC GGAGTAGTTG A
 
Protein sequence
MATILAYTSP SAGHLFPMLA LLGELAGRGH RVHVRTYAGG VPAARAAGLT ADAVDPRIED 
IVSEDWRATG GRAVLRMTIE TFGGRGAHEL DDLDDAIALV GPDLLLLDIN CWGAMAAADA
GDILWAVFSP YTPFLNSPGM PPVGAGMAPW PGFVGRVRDA GVRAVVQQVF DVPMMANVNG
FRAKRGLPAL RDVDAVLRRA PLMLVAGGEP FEYPHPGWGA AVQMIGPCEY DPKPAAAPSW
LDGIDRPVVL VTTSSVKQAD SALVTTALTA LADKDIHIVA TCPSGIPGGI TVPRNATVTG
FLPHGPVLDR AVCAITHGGM GVTQKALSRG VPVCAVPHGR DQFEVARRVQ AARCGTRLPA
RRLTPQRLRT AVERALTMTA GARRVAAGFA ATGGVARGAD LLEQRLIGRS ATRSARLEPE
VTRRSS
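
As a minimal sketch of how the protein sequence and the GC content field relate to the gene sequence, the code below translates a DNA string with the codon assignments of translation table 11 and computes the percentage of G and C bases. It uses only the Python standard library. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the code does not apply the table 11 rule that alternative start codons are read as methionine, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGCGACGA TCCTGGCCTA CACCTCACCG"))  # MATILAYTSP
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a check against the protein sequence and the GC content field of the record.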