Gene Mjls_0650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_0650 
Symbol 
ID4876394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp705320 
End bp706639 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content72% 
IMG OID640137963 
Productglycosyl transferase, group 1 
Protein accessionYP_001068952 
Protein GI126433261 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.303118 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.416053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCCTAG CAACGGACCA GCTCGGTAGG CCGCCCCAGC GCGTGGCCGT GTTGTCGGTG 
CACACGTCGC CGCTGGCGCA GCCCGGCACC GGCGATGCCG GCGGGATGAA CGTGTACGTA
CTGCAGAGCG CGCTGCACAT GGCGCGCCGC GGCGTCGAGG TGGAGATCTT CACCCGCGCC
ACGACATCGG CCGATCCGCC CGTCGTGCGG GTGGCGCCGG GTGTACTCGT GCGCAACGTC
GTGGCAGGCC CATTCGAAGG GCTCGACAAA TACGATCTGC CCACCCAGCT GTGCGCGTTC
ACCGCCGGGG TGCTGCGCGC CGAAGCCACC CACGAACCCG GTTACTACGA CATCGTGCAC
TCGCACTACT GGCTGTCCGG TCAGGTCGGC TGGCTGGCCC GGGACCGCTG GGCGGTGCCG
CTGGTGCACA CCGCCCACAC GCTGGCGGCG GTGAAGAACG CCGCACTGGC CGAGGGTGAC
TCACCCGAAC CACCCCTGCG TGCCGTCGGC GAGCAGCAGG TGGTCGACGA GGCCGACCGG
CTGATCGTCA ACACCGAACT CGAAGCCGAA CAACTGGTTT CGTTGCACAA CGCCGACCCG
TCACGCATCG ACGTCGTGCA CCCCGGGGTC GACCTGGACA CCTTCACCCC CGGTGATCAG
GCAGCCGCCC GCGCCGCGCT CGGCCTGGAC CCGCGCGAGA CCGTCGTCGC GTTCGTCGGC
CGGATCCAGC CGCTCAAGGC GCCCGACATC CTGCTGCGGG CGGCGGCGAA GCTGCCCGAC
GTGCGGGTGC TGGTCGCCGG CGGACCCTCG GGGTCGGGCC TGGCCGCGCC GGACAACCTG
GTTGCCCTCG CCGACGAACT GGGTATCTCA GAGCGCGTGA CGTTCCTGCC CCCGCAGTCG
CGCGAGGATC TGGTGCGGGT GTATCGCGCC GCCGATCTTG TTGCGGTGCC GAGCTACTCG
GAGTCGTTCG GGCTGGTTGC CGTCGAGGCG CAGGCCTGCG GCACCCCGGT GGTCGCGGCC
GCCGTGGGCG GGCTGCCGGT GGCGGTGCGC GACGGGGTGA CCGGTGCGCT CGTCGACGGT
CACGACGTCG GCGACTGGGC GCACACCATC GACTCGCTGC TGTCGCGCGG TCCGGCCACC
ATGCGGCGGG CCGCAGTCGA GCATGCGGCC ACCTTCTCCT GGGCCCACAC CGTCGACGAC
CTGCTGGCCA GCTACGGCCG GGCGATCAGC GACTACCGCG ACCGGCATCC GCACGCCGAC
GAAACGCTGT CCCGCCGCAC CGCACGGCGG TTCTCGAGGC GACGGGGGGT CCGGGCGTGA
 
Protein sequence
MRLATDQLGR PPQRVAVLSV HTSPLAQPGT GDAGGMNVYV LQSALHMARR GVEVEIFTRA 
TTSADPPVVR VAPGVLVRNV VAGPFEGLDK YDLPTQLCAF TAGVLRAEAT HEPGYYDIVH
SHYWLSGQVG WLARDRWAVP LVHTAHTLAA VKNAALAEGD SPEPPLRAVG EQQVVDEADR
LIVNTELEAE QLVSLHNADP SRIDVVHPGV DLDTFTPGDQ AAARAALGLD PRETVVAFVG
RIQPLKAPDI LLRAAAKLPD VRVLVAGGPS GSGLAAPDNL VALADELGIS ERVTFLPPQS
REDLVRVYRA ADLVAVPSYS ESFGLVAVEA QACGTPVVAA AVGGLPVAVR DGVTGALVDG
HDVGDWAHTI DSLLSRGPAT MRRAAVEHAA TFSWAHTVDD LLASYGRAIS DYRDRHPHAD
ETLSRRTARR FSRRRGVRA