Gene Mkms_0229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0229 
Symbol 
ID4615458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp246960 
End bp248120 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content70% 
IMG OID639789904 
Productglycosyl transferase, group 1 
Protein accessionYP_936236 
Protein GI119866284 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0736189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGC GCCCGCTTCG TTCCGTCCTG CTCTTGTGCT GGCGGGACAC CGGTCACCCC 
CAGGGTGGCG GAAGCGAGAT GTATCTGCAG CGCATCGGTG AACAGCTGGC CGCCCCGGGG
GTGCGGGTCA CACTGCGCAC CGCGTCCTAT CCGGGTGCGG CCCGCCGCGA GGTGGTCGAC
GGGGTACACG TCAGCCGTGG GGGCGGCCGG TACTCGGTCT ATCCGTGGGC GCTGCTGGTG
ATGGCGCTGG CCCGGATCGG TCTCGGGCCG CTGCGCGGGG CGCGCCCCGA CGTCGTGGTG
GACACCCAGA ACGGGGTGCC GTTCCTGGCC CGGCTGGTCT ACGGCCGCCG CACCGCCGTG
CTCGTCCACC ACTGCCACCG CGAACAGTGG CCGGTCGCCG GACCGGTGCT GGGCCGGCTC
GGGTGGTTCG TCGAGTCGGT CGTGTCGCCG CGGGTGCACC GGCGGGCGCA GTATCTGACA
GTGTCGCTGC CGTCGTCGCG TGATCTGACC GACCTCGGGG TCGACGGTTC CCGGATCGCG
GTGGTGCGCA ACGGGCTCGA CGAGGCGCCC GCGTCGACGC TCACCTTGCC GCGGTCGGTG
ACGCCGCGGA TCGCGGTGTT GTCGCGGCTG GTGCCGCACA AGCAGATCGA AGACGCGCTG
GACGCCGTCG CCGCGCTGCG GCCGCACATC CCCGATCTGC ACCTCGACGT GCTCGGCGGC
GGGTGGTGGC AGCAGAAGCT GGTCGACCAC GCCCGGCTTT CGGGAATCTC CGATGCGGTC
ACGTTCCACG GGCATGTCGA CGACATCACC AAACACGAAG TGCTGCAACA TAGCTGGGTG
CATGTGCTGC CGTCTCGCAA AGAGGGCTGG GGGCTCGCGG TGACCGAGGC GGGTCAGCAC
GCGGTCCCGA CCATCGGTTA CCGGTCGTCC GGCGGGCTGA CGGACTCGAT CGTCGACGGT
GTAACCGGCC TGCTGGTAGA CGACCGCGAC GAACTCGTCG AGGCGCTGAG GCAGTTGCTC
GGTGACCATG TGCTGCGCGA GCAGCTGGGC GCGAAGGCCC AGGCGCGCAG TGTCGAGTTC
TCGTGGCGGC AAAGCGCTTC GGCGATGCGT GAGGTGTTCG ACGCGATGCT GAGCGGCCGG
TACGTCAGCG GCGTCGTCTA G
 
Protein sequence
MSARPLRSVL LLCWRDTGHP QGGGSEMYLQ RIGEQLAAPG VRVTLRTASY PGAARREVVD 
GVHVSRGGGR YSVYPWALLV MALARIGLGP LRGARPDVVV DTQNGVPFLA RLVYGRRTAV
LVHHCHREQW PVAGPVLGRL GWFVESVVSP RVHRRAQYLT VSLPSSRDLT DLGVDGSRIA
VVRNGLDEAP ASTLTLPRSV TPRIAVLSRL VPHKQIEDAL DAVAALRPHI PDLHLDVLGG
GWWQQKLVDH ARLSGISDAV TFHGHVDDIT KHEVLQHSWV HVLPSRKEGW GLAVTEAGQH
AVPTIGYRSS GGLTDSIVDG VTGLLVDDRD ELVEALRQLL GDHVLREQLG AKAQARSVEF
SWRQSASAMR EVFDAMLSGR YVSGVV