Gene Cthe_2949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2949 
Symbol 
ID4810837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3466795 
End bp3468498 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content41% 
IMG OID640108372 
Productpectinesterase 
Protein accessionYP_001039340 
Protein GI125975430 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4677] Pectin methylesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.765546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAA ACAAAAAAAT ATTACTTTTA CTATGCATTA ATCTTTGCCT GTTCTTTTTA 
CTAATCAATA GAATTCAGCT TGTGTCTTCC GCAGCGGTCA ATGCGGATAT AATAGTTGCC
AAAGACGGTA CAGGCAATTT CACAACCATA CAGGCCGCAA TTGATTCAGT ACCGTCAAAC
AGTTCAAAAA GAACCGTTAT ATTTGTCAAA AACGGTACAT ACAAAGAAGT TGTTACAATC
AGGAAAAACA ACATACACCT CATCGGAGAA AGCAATACAA AAACAATCAT TACATATGAC
AATTATGCGG GTAAACTAAA ACCTGACGGC ACCACATACG GTACATCCGG TTCCGCATCA
TTCTATCTCT ATGGAACTGA CACAATCCTT GAAAACATCA CAATTGAAAA TTCCTTTGAT
GAAAGTATCG ACGTAAAAGA CAAGCAAGCC GTAGCTGCTT ATATCCGCGG CGACAGGCAA
ATAATCAAAA ATTGTATTTT TATCGGAAAT CAGGATACCT TGTATGCACA CTCGGGCAGA
CAGTATTATG TGAACTGCAA AATCATAGGT GACACGGATT TTATATTTGG CGGCGCCACA
GCTGTATTTG AAAACTGCGA AATTGTTTCA ACACCCAAAG GGGGATATGT CACTGCTGCA
AGCACTGATC TCGAAAATTA CGGATTTCTG TTCTTAAACT GCAGATTGAC AAGCGATGCT
CCCAAAAATT CAACATATCT TGGAAGACCC TGGCGTCCCA ATGCATATGT AGTTTACAAA
ACATGTTATT TGGGAGCGCA TATAAAGGAG TCCGGCTGGA CCAGCATGAG TGGTAATTTG
CCTGAAAATG CGCGCTTTTT TGAGTACAAA AACACAGGCC CGGGAGCGGT GGTCAACTCA
TCGAGAAGAC AGTTGTCATA CGCCGAAGCC GCAAAATTTA CTCCGCAAAA CCTATTAAAA
GGTACTGACA ATTGGAATCC GGTTGCGCTC GTGTCTCAAA CGTCAACTTT AACACCGACT
CAAAAACCCA CCTCAACACC GGCTCCAACC CCAATGGACG GCCAATTGAT AAAATCATTA
ACGGTAAAGG ATTCGGCAAA TTCGTCCAAT TGGTCCATAC AGTCGAATTT ACGGGTTGGT
GATACAGTTT TTGGTGACAG AACATACAAG TTTGTCACAA TTCCAAATGA GTTCCTTGGC
TCCGAATGGA TCAGGACAGC CTGTGACTCG AAAAAATCCA CAGAAGACCT GGCCTACTTT
ACCGCCAAAG CTGACATAAC CGTATATGTG GGTCTGGACT CAAGGGTTGC AACCATACCG
TCATGGCTTA ACGATTGGAC CAAAACCTCA CTGACAATAA CCGACGACGG TTCACCACAG
GTTACCTACA ACCTTTACAA AAAGAATTTC AGTGCAAACT CCGTTGTAAC CCTTGGTCCT
AATGGGGCTT CAAGCGGGGC TGTGAATTAT ATTGTCATAG TTAAACAAAA CAATCAAAAT
ATAGTATATG GTGATTTGAA CGGAGACGGA CTGGTAAACT CAAGCGACTA TTCATTATTA
AAAAGATACA TACTTAAACA AATAGATTTG ACGGAGGAAA AACTTAAGGC GGCAGACCTG
AATAGAAACG GCTCCGTTGA CTCAGTGGAT TATTCCATAT TAAAGAGATT TTTGTTGAAA
ACAATTACAC AATTGCCTGT ATAA
 
Protein sequence
MIKNKKILLL LCINLCLFFL LINRIQLVSS AAVNADIIVA KDGTGNFTTI QAAIDSVPSN 
SSKRTVIFVK NGTYKEVVTI RKNNIHLIGE SNTKTIITYD NYAGKLKPDG TTYGTSGSAS
FYLYGTDTIL ENITIENSFD ESIDVKDKQA VAAYIRGDRQ IIKNCIFIGN QDTLYAHSGR
QYYVNCKIIG DTDFIFGGAT AVFENCEIVS TPKGGYVTAA STDLENYGFL FLNCRLTSDA
PKNSTYLGRP WRPNAYVVYK TCYLGAHIKE SGWTSMSGNL PENARFFEYK NTGPGAVVNS
SRRQLSYAEA AKFTPQNLLK GTDNWNPVAL VSQTSTLTPT QKPTSTPAPT PMDGQLIKSL
TVKDSANSSN WSIQSNLRVG DTVFGDRTYK FVTIPNEFLG SEWIRTACDS KKSTEDLAYF
TAKADITVYV GLDSRVATIP SWLNDWTKTS LTITDDGSPQ VTYNLYKKNF SANSVVTLGP
NGASSGAVNY IVIVKQNNQN IVYGDLNGDG LVNSSDYSLL KRYILKQIDL TEEKLKAADL
NRNGSVDSVD YSILKRFLLK TITQLPV