Gene Information |
Locus tag | Cthe_2949 |
Symbol | |
ID | 4810837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3466795 |
End bp | 3468498 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108372 |
Product | pectinesterase |
Protein accession | YP_001039340 |
Protein GI | 125975430 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4677] Pectin methylesterase |
TIGRFAM ID | |
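The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminating stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 3466795
END_BP = 3468498
GENE_LENGTH_BP = 1704
PROTEIN_LENGTH_AA = 567


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1704
    print(expected_protein_length(GENE_LENGTH_BP))  # 567
```

Under these assumptions both values agree with the Gene Length and Protein Length rows.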
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.765546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA ACAAAAAAAT ATTACTTTTA CTATGCATTA ATCTTTGCCT GTTCTTTTTA CTAATCAATA GAATTCAGCT TGTGTCTTCC GCAGCGGTCA ATGCGGATAT AATAGTTGCC AAAGACGGTA CAGGCAATTT CACAACCATA CAGGCCGCAA TTGATTCAGT ACCGTCAAAC AGTTCAAAAA GAACCGTTAT ATTTGTCAAA AACGGTACAT ACAAAGAAGT TGTTACAATC AGGAAAAACA ACATACACCT CATCGGAGAA AGCAATACAA AAACAATCAT TACATATGAC AATTATGCGG GTAAACTAAA ACCTGACGGC ACCACATACG GTACATCCGG TTCCGCATCA TTCTATCTCT ATGGAACTGA CACAATCCTT GAAAACATCA CAATTGAAAA TTCCTTTGAT GAAAGTATCG ACGTAAAAGA CAAGCAAGCC GTAGCTGCTT ATATCCGCGG CGACAGGCAA ATAATCAAAA ATTGTATTTT TATCGGAAAT CAGGATACCT TGTATGCACA CTCGGGCAGA CAGTATTATG TGAACTGCAA AATCATAGGT GACACGGATT TTATATTTGG CGGCGCCACA GCTGTATTTG AAAACTGCGA AATTGTTTCA ACACCCAAAG GGGGATATGT CACTGCTGCA AGCACTGATC TCGAAAATTA CGGATTTCTG TTCTTAAACT GCAGATTGAC AAGCGATGCT CCCAAAAATT CAACATATCT TGGAAGACCC TGGCGTCCCA ATGCATATGT AGTTTACAAA ACATGTTATT TGGGAGCGCA TATAAAGGAG TCCGGCTGGA CCAGCATGAG TGGTAATTTG CCTGAAAATG CGCGCTTTTT TGAGTACAAA AACACAGGCC CGGGAGCGGT GGTCAACTCA TCGAGAAGAC AGTTGTCATA CGCCGAAGCC GCAAAATTTA CTCCGCAAAA CCTATTAAAA GGTACTGACA ATTGGAATCC GGTTGCGCTC GTGTCTCAAA CGTCAACTTT AACACCGACT CAAAAACCCA CCTCAACACC GGCTCCAACC CCAATGGACG GCCAATTGAT AAAATCATTA ACGGTAAAGG ATTCGGCAAA TTCGTCCAAT TGGTCCATAC AGTCGAATTT ACGGGTTGGT GATACAGTTT TTGGTGACAG AACATACAAG TTTGTCACAA TTCCAAATGA GTTCCTTGGC TCCGAATGGA TCAGGACAGC CTGTGACTCG AAAAAATCCA CAGAAGACCT GGCCTACTTT ACCGCCAAAG CTGACATAAC CGTATATGTG GGTCTGGACT CAAGGGTTGC AACCATACCG TCATGGCTTA ACGATTGGAC CAAAACCTCA CTGACAATAA CCGACGACGG TTCACCACAG GTTACCTACA ACCTTTACAA AAAGAATTTC AGTGCAAACT CCGTTGTAAC CCTTGGTCCT AATGGGGCTT CAAGCGGGGC TGTGAATTAT ATTGTCATAG TTAAACAAAA CAATCAAAAT ATAGTATATG GTGATTTGAA CGGAGACGGA CTGGTAAACT CAAGCGACTA TTCATTATTA AAAAGATACA TACTTAAACA AATAGATTTG ACGGAGGAAA AACTTAAGGC GGCAGACCTG AATAGAAACG GCTCCGTTGA CTCAGTGGAT TATTCCATAT TAAAGAGATT TTTGTTGAAA ACAATTACAC AATTGCCTGT ATAA
|
Protein sequence | MIKNKKILLL LCINLCLFFL LINRIQLVSS AAVNADIIVA KDGTGNFTTI QAAIDSVPSN SSKRTVIFVK NGTYKEVVTI RKNNIHLIGE SNTKTIITYD NYAGKLKPDG TTYGTSGSAS FYLYGTDTIL ENITIENSFD ESIDVKDKQA VAAYIRGDRQ IIKNCIFIGN QDTLYAHSGR QYYVNCKIIG DTDFIFGGAT AVFENCEIVS TPKGGYVTAA STDLENYGFL FLNCRLTSDA PKNSTYLGRP WRPNAYVVYK TCYLGAHIKE SGWTSMSGNL PENARFFEYK NTGPGAVVNS SRRQLSYAEA AKFTPQNLLK GTDNWNPVAL VSQTSTLTPT QKPTSTPAPT PMDGQLIKSL TVKDSANSSN WSIQSNLRVG DTVFGDRTYK FVTIPNEFLG SEWIRTACDS KKSTEDLAYF TAKADITVYV GLDSRVATIP SWLNDWTKTS LTITDDGSPQ VTYNLYKKNF SANSVVTLGP NGASSGAVNY IVIVKQNNQN IVYGDLNGDG LVNSSDYSLL KRYILKQIDL TEEKLKAADL NRNGSVDSVD YSILKRFLLK TITQLPV
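Both sequences are printed in space-separated blocks of 10 letters, so the spaces have to be removed before any analysis. The following is a minimal sketch of that clean-up plus two basic checks. The helper names are hypothetical, and the start-codon test is a simplification: translation table 11 also allows alternative start codons such as GTG and TTG, which this check ignores.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an unblocked DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Length is a multiple of 3, starts with ATG, ends with a stop codon.

    Simplified: alternative start codons of table 11 are not accepted here.
    """
    stops = {"TAA", "TAG", "TGA"}
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in stops


# First two blocks of the gene sequence above, for illustration only.
fragment = strip_blocks("ATGATAAAAA ACAAAAAAAT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.1
```

Passing the full gene sequence through `strip_blocks` and then `gc_fraction` gives a value that can be compared with the GC content row.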