Gene Mjls_4945 details


Gene Information

Locus tag: Mjls_4945
Symbol:
ID: 4880644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5185532
End bp: 5187487
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 68%
IMG OID: 640142255
Product: glycosyl transferase family protein
Protein accession: YP_001073201
Protein GI: 126437510
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
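
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; neither convention is stated on this page. The variable names are illustrative only. Under those assumptions the coordinates give 1956 bp and 651 aa, matching the listed values.

```python
# Values copied from the Gene Information table.
start_bp = 5185532
end_bp = 5187487
gene_length_bp = 1956
protein_length_aa = 651

# Assumption: coordinates are 1-based and inclusive.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```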


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.420909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.200608
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGAAC CGGTAGCGCG GCAGCACCGG GACTCCCAGA CGCCACCGGC ACCGCCGCCG 
CAGATACCGC AGAAGCCCAA AGCTGTTGTC AGGCACTATC GTTCGATCGA CAGCGCTCCG
CCGGCGTATG CGATCAAACG TCCGCCCAGC GCTGTCAACG GTGCGCTGCT CATCTTTGTC
GCGTTGAGCA GTCTGACGCT GCTGCTCGGA ACCGTACAGG CCAGGGCGTG GGGAGACCAC
GCCCGCGACC TGGTCTTCGC CACGGCCGAC GGGCAGGCGG CCGCGATACC GGTGCGAGCG
TTCCTGCTGG TGATGGTCAC CTCGGTGGCG TGGTCGCTGG ACACGAACTT CTGGCGCCGG
TTGGCTGTGC ACCTCGAGCT GACCGGCGTG CTGATCCTGG TCTGCGCGGT GGTGGATTTC
TCGGCTTACC TCGGCTACCA CGTCGGACTC TTCTATCCGC AGATCGTCGG CCAGCAGTTG
GCGTCAAGTC TGGCGGCGAT GGTGCTGTTG CCGTTCACCG TGATGCGGCA CGCCCGGCTG
CCGAGGCCGG CGCGCCTGCG GCCGGCGGGG AGGATGCGTT GGCATGCCTG GGTGCGGCTG
GCGGTTCCGC TGGCGGTGGC GTTCGTGGCA GCCGCCTGGA TCGAGGACCG CATGCCGGTC
CCGGTGGCCT GGATGCGGGA GTGGGCGCTG ATGGGTGGCG TGGGTCCGGG GATCTTCCTG
GTCCAGCAGC TGTTCGGCAT CCTCGCCGCG GGGATCGGGC TGGTGATGAT CCGCCGGTCG
CGCCGCGCAC GTTTCGCGCC GCCCCTCGCG GTGATCATCC CGGCGCACAA CGAGGCCCAC
GACATCACCG CCACGATCGA GGCCGTCGAC CGGGCCGCGG CCCGGTACGC CGAGACGGTC
CACATCTATG TCATCGACAA CGCCTCCACC GACGACACCG CGGACGTCGC ACAGACCGCC
ATCGCCGCCT GCGCACACTC CACCGGGGAG GTGCACGAAT GCGCGGTCCC CGGGAAGGCG
GTGGCGCTCA ACTACGGCCT GTCGGTGATC CGGGAGGAGT TCGTCGTGCG TATCGATGCC
GACACCGTGA TCGGCGAGAA CTGCCTCGAC GTCACGCTGC GTCATTTCAC CGATGCGAAG
GTCGCCGCCG TCGGCGGGAT GCCGCGGCCG GAACGTATCC GAACCTTCTT CGACCGGGTG
CGATTGGTCG AGGTGCTCGT CAAACACGGC TTCTTCCAGG TCGCGATGAT GGGCTACGAC
GGGATCATCG GCGAGCCCGG CATGTTCGTG GTCTACCGGC GCCGCGTCGT CGAAGAGGTC
GGCGGCATCG TGCAGGGCAT GAACGGTGAG GACACCGACA TCTGCATGAG GATGAGCAGT
CAGGGCTACC TGAGCCTGGT CGACCCCACC GCGGTCTACT TCAGCGAGAC CCCGCAGAGC
TGGGCGCATC TGCGCGAACA ACGCACCCGC TGGTTTCGCA GCATCTACCA CATCGCCGCC
CACAACCGGC ACGCGATCCT GAGCCGGAGT TCGATGGCCG GGGCGGTGAT GCTGCCGTTC
CAGCTCGCCA ACGCGGCGCG CCGAGCGATG ATGCTGCCCC TGCTGTTGTT CGGCCTCCTG
ATCTTCGGAC TGTTCCGCGA GTCGTTCCCC GGTCTGCACC CCGAGCGGCT CCTCGCGGTG
TTCCTCGGGC TGCCGCTGCT GGTGGCACTC GGCGTATGCC TCGTGCGTCA GCCCCGAGCG
GTCCTCTACC TCCCCGAGTA CCTCGTATTC CGGATAGTGC GCAGCTATTT CACCCTCGCC
GCGGTGCTGA GCCTGGTGTT TCCGCCGCTG CATCCCCGGC AGGCGCTGCG GGAGCGACGG
CGAACGCGTA GGCGACCCCG TCACCGACGC AACCGTGTCA CCCCCGCCGA TCGCAGTTCC
AGCGCCGCAA GCCCGGATAT CGCGGCGACG TCCTGA
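
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch: the helper name `gc_percent` is hypothetical, and it assumes the sequence contains only A, C, G and T once whitespace is removed. How the page rounds its own figure is not stated.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) used for block formatting is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Toy input, not taken from the gene: 4 of 8 bases are G or C.
print(gc_percent("GGCC AATT"))
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared with the listed GC content.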
 
Protein sequence
MSEPVARQHR DSQTPPAPPP QIPQKPKAVV RHYRSIDSAP PAYAIKRPPS AVNGALLIFV 
ALSSLTLLLG TVQARAWGDH ARDLVFATAD GQAAAIPVRA FLLVMVTSVA WSLDTNFWRR
LAVHLELTGV LILVCAVVDF SAYLGYHVGL FYPQIVGQQL ASSLAAMVLL PFTVMRHARL
PRPARLRPAG RMRWHAWVRL AVPLAVAFVA AAWIEDRMPV PVAWMREWAL MGGVGPGIFL
VQQLFGILAA GIGLVMIRRS RRARFAPPLA VIIPAHNEAH DITATIEAVD RAAARYAETV
HIYVIDNAST DDTADVAQTA IAACAHSTGE VHECAVPGKA VALNYGLSVI REEFVVRIDA
DTVIGENCLD VTLRHFTDAK VAAVGGMPRP ERIRTFFDRV RLVEVLVKHG FFQVAMMGYD
GIIGEPGMFV VYRRRVVEEV GGIVQGMNGE DTDICMRMSS QGYLSLVDPT AVYFSETPQS
WAHLREQRTR WFRSIYHIAA HNRHAILSRS SMAGAVMLPF QLANAARRAM MLPLLLFGLL
IFGLFRESFP GLHPERLLAV FLGLPLLVAL GVCLVRQPRA VLYLPEYLVF RIVRSYFTLA
AVLSLVFPPL HPRQALRERR RTRRRPRHRR NRVTPADRSS SAASPDIAAT S
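
The relationship between the gene sequence and the protein sequence can be illustrated with a small translator. This is a sketch, not the method used to produce this page. It assumes that the amino acid assignments of translation table 11 are the same as those of the standard code, that the first codon is a valid start codon and is therefore read as M (which is why the leading GTG appears as M rather than V), and that translation stops at the first in-frame stop codon. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the usual TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]  # KeyError on non-ACGT input
        if aa == "*":
            break  # stop codon: end of the protein
        # Assumption: the first codon is a start codon and codes for M.
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence; prints MSEPVARQHR,
# the first ten residues of the protein sequence.
print(translate_cds("GTGAGCGAAC CGGTAGCGCG GCAGCACCGG"))
```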