Gene Mmcs_0406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0406 
Symbol 
ID4109252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp453129 
End bp455105 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content67% 
IMG OID638029531 
Productcellulose synthase (UDP-forming) 
Protein accessionYP_637583 
Protein GI108797386 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGCACC AACTGCCCGA AATCTCCCCC GGCAAATACC CGTCGCCCTG GTTGCGGCTG 
CTGATCCTCT GCACCGCGCT GTTGGGCATC AACTACATCG TGTGGCGCTG GTTCGGGTCG
ATCAACTGGG CCGCCTGGTG GATCGCGGTA CCGCTGGTGA TCGCCGAGAC CTACAGCGTC
ATCGACTCGC TGCTGTTCGC GATGACGATG TGGAAGATGT TGCGGCGCAA CCCACCTCCG
CCGCCGCCCG ACGACGCGAC CGTCGACGTC TTCATCACCA CCTACAACGA GCCGATCGAC
ATGGTGCTGG AGACGGCCGA GGCCGCCCAG CGGATCCGCT TCCCGCACTC GACCTGGATC
CTCGACGACG GCGACCGCCA CGACCTGGCC GAAGCGGCCG CCGAGCGCGG CATCGGCTAC
ATCACCCGAT CGTCGAGTTG GACCCCCGAC AAACCGCGCC ATGCCAAGGC GGGCAACCTC
AACAACGCGC TGTTCGAGAC CCACGGCGAG TTCATCCTCG TACTCGACGC CGACCAGGTG
CCCGAACCCG AGATCCTCGA CAGGACCCTG GGCTACTTCC GCGATCCGCA CATGGCGCTG
GTGCAGACGC CGCAGTACTT CCACAACGTC CCGTTCAGCG ACCCGCTGGG TAGCCAGGCG
CCGCTGTTCT ACGGGCCGAT CCAACAGGGT AAGGACGGGT GGAACGCGGC CTACTTCTGC
GGGTCGAATG CGGTGTTGCG CCGGGAAGCG CTGATGCGAT TGGGGATTCG CGGATACGTG
CGCGCCGTCG AGGAGGGCGT CCGGCGGACG CTCTATGCGG CCCGCAAGAT GATCAGGACC
GCGCGTAAAC AACCGGGCGC CGACCAGCCC GAGGTGCAGG AGGCGCTGGA GTCGGTGCTG
CAGGCGGTGC GCGACGCGCG CCGTCAGTTG CGGGACAAGC GGGCCCTGGC CGACATCACC
TTCGACTTCC AGCAGCGCGT CGACGCGGCC GCACGCACGG TGGTCGACGC CGACATCACC
GCCATGCGGG CCGATCTCGA GGTGATCACC GCGCTGAGCG AACACCCCGA GGCCACCGCC
ACGACAGTGG TGTTCGACGA CGAAGCGCTG GAGTCGCTGG CGGGCCGGGA GTGGTCCCCG
CTCGGCGCGA TCGAGTCGAT CGGCGCGATG ATCCGCGCCG TCGACGTGGG CCGCGACGAC
GAGGCGCAAC CCATGCTGCC GATGGCCACC ATCTCGGTCA CCGAGGACAT GGCCACCTGT
ATGCGGCTGC ACGCACTGGG CTGGCGCTCG GCCTACCACC ACGAGGTCCT CGCCCGCGGT
CTGGCGCCCG ACGACGTGCG GACCATGCTC ACCCAGCGGC TGCGCTGGGC GCAGGGCACC
ATCCAGGTGA TGCTGCGGGA GAACCCGTTC GTGCAGAAGG GACTCTCGAT CGGCCAGAAG
CTCATGTACT GGGCGACCAT GTACAGCTAT CTGGCCGGGT TCGCCGCGCT GGCCTACATC
GCCGCCCCGG CGATCTACCT GATCTTCGGC ATCATGCCGG TGACCGCGTA CAGCTGGGAC
TTCTTCGGGC GGCTCATCCC GTTCCTCGTG CTCAACCAGC TGATGTTCAT CATCATCAGC
CGCGGCACCC CGACCTGGCG CGGCCAGCAG TACAGCCTCG CGCTGTTCCC GGTGTGGATC
CGGGCCTGTT ACACGGCGTT CCTCAACGTG GTGTTCGGGC GACCGCTGGG CTTCGCGGTC
ACCCCGAAGA CCAGACAGGA GGCGACGGCG ATCCCGTGGC ACCTGGTGAA GTGGCAACTC
GCCGCGATGG CCATGCTGGT TGTCGCATCG ATCATCGGCA TCGTGCAGCT GTACTTCGGT
GCGATCTCCG TGCTCGGTGT CGGTGTGAAC CTCTTCTGGG TGATATTCGA CCTGTTGATT
CTGAGCGTGG TGATCCAGGC GGTGCGCTTC CGCGGGCACC AGGACGAAGG AGTGTGA
 
Protein sequence
MRHQLPEISP GKYPSPWLRL LILCTALLGI NYIVWRWFGS INWAAWWIAV PLVIAETYSV 
IDSLLFAMTM WKMLRRNPPP PPPDDATVDV FITTYNEPID MVLETAEAAQ RIRFPHSTWI
LDDGDRHDLA EAAAERGIGY ITRSSSWTPD KPRHAKAGNL NNALFETHGE FILVLDADQV
PEPEILDRTL GYFRDPHMAL VQTPQYFHNV PFSDPLGSQA PLFYGPIQQG KDGWNAAYFC
GSNAVLRREA LMRLGIRGYV RAVEEGVRRT LYAARKMIRT ARKQPGADQP EVQEALESVL
QAVRDARRQL RDKRALADIT FDFQQRVDAA ARTVVDADIT AMRADLEVIT ALSEHPEATA
TTVVFDDEAL ESLAGREWSP LGAIESIGAM IRAVDVGRDD EAQPMLPMAT ISVTEDMATC
MRLHALGWRS AYHHEVLARG LAPDDVRTML TQRLRWAQGT IQVMLRENPF VQKGLSIGQK
LMYWATMYSY LAGFAALAYI AAPAIYLIFG IMPVTAYSWD FFGRLIPFLV LNQLMFIIIS
RGTPTWRGQQ YSLALFPVWI RACYTAFLNV VFGRPLGFAV TPKTRQEATA IPWHLVKWQL
AAMAMLVVAS IIGIVQLYFG AISVLGVGVN LFWVIFDLLI LSVVIQAVRF RGHQDEGV