Gene Mjls_0394 details

Gene Information

Locus tag: Mjls_0394
Symbol:
ID: 4876140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 437110
End bp: 439086
Gene Length: 1977 bp
Protein Length: 658 aa
Translation table: 11
GC content: 66%
IMG OID: 640137708
Product: cellulose synthase (UDP-forming)
Protein accession: YP_001068698
Protein GI: 126433007
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
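
The coordinate, gene-length, and protein-length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are hypothetical.
start_bp = 437110
end_bp = 439086
gene_length_bp = 1977
protein_length_aa = 658

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)
```

Under these assumptions the span is a whole number of codons and both derived values agree with the lengths listed in the record.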


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.193019
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 18
Fosmid unclonability p-value: 0.528861
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGCACC AACTGCCCGA AATCTCCCCC GGCAAATACC CGTCGCCCTG GTTGCGGCTG 
CTGATCCTCT GCACCGCGCT GTTGGGCATC AACTACATCG TGTGGCGCTG GTTCGGGTCG
ATCAACTGGG CCGCCTGGTG GATCGCGGTA CCGCTGGTGA TCGCCGAGAC CTACAGCGTC
ATCGACTCGC TGCTGTTCGC GATGACGATG TGGAAGATGT TGCGGCGCAA CCCACCTCCG
CCGCCGCCCG ACGACGCGAC CGTCGACGTC TTCATCACCA CCTACAACGA GCCGATCGAC
ATGGTGCTGG AGACGGCCGA GGCCGCCCAG CGGATCCGCT TCCCGCACTC GACCTGGATC
CTCGACGACG GCGACCGCCA CGACCTGGCC GAAGCGGCCG CCGAGCGCGG CATCGGCTAC
ATCACCCGAT CGTCGAGTTG GACCCCCGAC AAACCGCGCC ATGCCAAGGC GGGCAACCTC
AACAACGCGC TGTTCGAGAC CCACGGCGAG TTCATCCTCG TACTCGACGC CGACCAGGTG
CCCGAACCCG AGATCCTCGA CAGGACCCTG GGCTACTTCC GCGATCCGCA CATGGCGCTG
GTGCAGACAC CGCAGTACTT CCACAACGTC CCGTTCAGCG ACCCGCTGGG TAGCCAGGCG
CCGCTGTTCT ACGGGCCGAT CCAACAGGGC AAGGACGGGT GGAACGCGGC CTACTTCTGC
GGGTCGAATG CGGTGTTGCG CCGGGAAGCG CTGATGCGAT TGGGGATTCG CGGATACGTG
CGCGCCGTCG AGGAGGGCGT CCGGCGGACG CTCTATGCGG CCCGCAAGAT GATCAGGACC
GCGCGTAAAC AACCGGGCGC CGACCAGCCC GAGGTGCAGG AGGCGCTGGA GTCGGTGCTG
CAGGCGGTGC GCGACGCCCG CCGTCAGTTG CGGGACAAGC GGGCCTTGGC CGACATCACC
TTCGACTTCC AGCAGCGCGT CGACGCGGCC GCACGCACGG TGGTCGACGC CGACATCACC
GCCATGCGGG CCGATCTCGA GGTGATCACC GCGCTGAGCG AACACCCCGA GGCCACCGCC
ACGACAGTGG TGTTCGACGA CGAAGCGCTG GAGTCGCTGG CGGGCCGGGA GTGGTCCCCG
CTCGGCGCGA TCGAGTCGAT CGGCGCGATG ATCCGCGCCG TCGACGTGGG CCGCGACGAC
GAGGCGCAAC CGATGCTGCC GATGGCCACC ATCTCGGTCA CCGAGGACAT GGCCACCTGT
ATGCGGCTGC ACGCACTGGG CTGGCGCTCG GCCTACCACC ACGAGGTCCT CGCCCGCGGT
CTGGCGCCCG ACGACGTGCG GACCATGCTC ACCCAGCGGC TGCGCTGGGC GCAGGGCACC
ATCCAGGTGA TGCTGCGGGA GAACCCGTTC GTGCAGAAGG GACTCTCGAT CGGCCAGAAA
CTCATGTACT GGGCGACCAT GTACAGCTAT CTGGCCGGAT TCGCCGCGCT GGCCTACATC
GCCGCCCCGG CGATCTACCT GATCTTCGGC ATCATGCCGG TGACCGCGTA CAGCTGGGAC
TTCTTCGGGC GGCTCATCCC GTTCCTCGTG CTCAACCAGC TGATGTTCAT CATCATCAGC
CGCGGCACCC CGACCTGGCG CGGCCAGCAG TACAGCCTCG CGCTGTTCCC GGTGTGGATC
CGGGCCTGTT ACACGGCGTT CCTCAACGTG GTGTTCGGGC GACCGCTGGG CTTCGCGGTC
ACCCCGAAGA CCAGACAGGA GGCGACGGCG ATCCCGTGGC ACCTGGTGAA GTGGCAACTC
GCCGCGATGG CCATGTTGGT CGTCGCATCG ATCATCGGCA TCGTGCAGCT GTACTTCGGT
GCGATCTCCG TGCTCGGTGT CGGTGTGAAC CTCTTCTGGG TGATATTCGA CCTGTTGATT
CTGAGCGTGG TGATCCAGGC GGTGCGCTTC CGCGGACACC AGGACGAAGG AGTGTGA
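
A GC-content figure like the one in the record can be recomputed from a sequence pasted in the 10-base block layout used here. This is a minimal sketch; the helper name `gc_percent` is hypothetical, and the database may round or count differently.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Illustration on the first 10-base block of the gene sequence only.
first_block_gc = gc_percent("GTGAGGCACC")
print(first_block_gc)
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives the value for the whole gene.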
 
Protein sequence
MRHQLPEISP GKYPSPWLRL LILCTALLGI NYIVWRWFGS INWAAWWIAV PLVIAETYSV 
IDSLLFAMTM WKMLRRNPPP PPPDDATVDV FITTYNEPID MVLETAEAAQ RIRFPHSTWI
LDDGDRHDLA EAAAERGIGY ITRSSSWTPD KPRHAKAGNL NNALFETHGE FILVLDADQV
PEPEILDRTL GYFRDPHMAL VQTPQYFHNV PFSDPLGSQA PLFYGPIQQG KDGWNAAYFC
GSNAVLRREA LMRLGIRGYV RAVEEGVRRT LYAARKMIRT ARKQPGADQP EVQEALESVL
QAVRDARRQL RDKRALADIT FDFQQRVDAA ARTVVDADIT AMRADLEVIT ALSEHPEATA
TTVVFDDEAL ESLAGREWSP LGAIESIGAM IRAVDVGRDD EAQPMLPMAT ISVTEDMATC
MRLHALGWRS AYHHEVLARG LAPDDVRTML TQRLRWAQGT IQVMLRENPF VQKGLSIGQK
LMYWATMYSY LAGFAALAYI AAPAIYLIFG IMPVTAYSWD FFGRLIPFLV LNQLMFIIIS
RGTPTWRGQQ YSLALFPVWI RACYTAFLNV VFGRPLGFAV TPKTRQEATA IPWHLVKWQL
AAMAMLVVAS IIGIVQLYFG AISVLGVGVN LFWVIFDLLI LSVVIQAVRF RGHQDEGV
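
The gene and protein sequences can be related with a short translation routine. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares, and assumes the first codon is a valid start codon that is read as methionine (the gene begins with GTG while the protein begins with M). The function name `translate_cds` is hypothetical, and the code does not check that the first codon is an accepted start.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments
# and differs in which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as Met (assumption)."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: first codon is the initiator codon
    return "".join(residues).rstrip("*")  # drop a terminal stop codon, if present


# First three 10-base blocks of the gene sequence (30 bases, 10 codons).
prefix = translate_cds("GTGAGGCACC AACTGCCCGA AATCTCCCCC")
print(prefix)
```

The 10 residues produced from this prefix match the first block of the protein sequence in the record, MRHQLPEISP.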