Gene Information |
Locus tag | Cthe_3078 |
Symbol | |
ID | 4809952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3625229 |
End bp | 3632170 |
Gene Length | 6942 bp |
Protein Length | 2313 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640108502 |
Product | cellulosome anchoring protein, cohesin region |
Protein accession | YP_001039467 |
Protein GI | 125975557 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
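The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (both assumptions are consistent with the values in this record; the function names are illustrative only):

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3625229
END_BP = 3632170


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 6942
print(protein_length(length_bp))  # 2313
```

Both results agree with the Gene Length and Protein Length rows of the record.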
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACGAA AAAATAAAGT ATTATCAATT TTGTTAACTC TGCTGCTAAT AATCTCTACC ACATCCGTAA ACATGTCTTT TGCTGAAGCA ACTCCAAGTA TTGAAATGGT TCTTGATAAA ACTGAAGTCC ATGTAGGAGA TGTAATAACG GCCACAATAA AAGTCAATAA CATTAGAAAA TTGGCGGGAT ATCAGCTAAA TATCAAATTT GACCCTGAAG TTTTACAGCC GGTAGACCCT GCAACAGGAG AGGAATTTAC TGATAAGTCC ATGCCGGTAA ATAGGGTTTT GCTGACAAAC AGCAAATATG GACCTACTCC TGTGGCGGGT AACGATATAA AGTCAGGAAT TATTAATTTT GCTACGGGAT ATAACAATTT AACAGCGTAC AAATCCAGCG GAATAGACGA ACATACAGGA ATAATAGGAG AGATTGGTTT TAAAGTTTTA AAGAAACAAA ATACGTCTAT TAGGTTTGAA GATACATTAT CGATGCCCGG GGCAATATCG GGAACAAGTT TGTTTGACTG GGATGCAGAA ACTATAACAG GATATGAGGT AATACAGCCG GATCTTATAG TTGTAGAGGC AGAACCGTTA AAAGACGCCA GCGTGGCTCT GGAACTGGAT AAGACGAAGG TAAAAGTAGG GGACATAATA ACAGCGACGA TAAAGATAGA GAACATGAAG AATTTTGCAG GGTACCAGTT GAATATCAAG TATGACCCGA CCATGTTGGA GGCAATAGAA CTGGAGACAG GAAGTGCGAT AGCGAAGAGG ACATGGCCGG TTACAGGAGG TACTGTTCTG CAAAGTGACA ATTATGGAAA GACGACTGCG GTAGCGAATG ATGTAGGAGC AGGTATAATA AACTTTGCTG AGGCATACTC GAACCTTACC AAATACAGAG AGACAGGTGT GGCAGAAGAG ACAGGTATAA TAGGAAAGAT AGGCTTCAGA GTGCTGAAGG CAGGAAGTAC GGCTATAAGA TTTGAGGATA CGACAGCGAT GCCGGGAGCA ATAGAAGGAA CATACATGTT CGACTGGTAT GGCGAGAACA TCAAAGGGTA TAGCGTAGTA CAGCCTGGGG AAATAGTGGT AGAAGGAGAA GAGCCGGGTG AAGAGCCGAC AGAAGAGCCT GTACCGACAG AGACATCGGT AGATCCCACA CCGACAGTGA CAGAAGAGCC TGTACCTTCA GAGCTTCCAG ATTCCTATGT GATAATGGAA CTGGATAAGA CGAAGGTAAA AGTAGGGGAC ATAATAACAG CGACGATAAA GATAGAGAAC ATGAAGAATT TTGCAGGGTA CCAGTTGAAT ATCAAGTATG ACCCGACCAT GTTGGAGGCA ATAGAACTGG AGACAGGAAG CGCGATAGCG AAGAGGACAT GGCCGGTTAC AGGAGGTACT GTTCTGCAAA GTGACAATTA TGGAAAGACG ACTGCGGTAG CGAATGATGT AGGAGCAGGT ATAATAAACT TTGCTGAGGC ATACTCGAAC CTTACCAAAT ACAGAGAGAC AGGTGTGGCA GAGGAGACAG GTATAATAGG AAAGATAGGC TTCAGAGTGC TGAAGGCAGG AAGTACGGCT ATAAGATTTG AGGATACGAC AGCGATGCCG GGAGCAATAG AAGGAACATA CATGTTCGAC TGGTATGGCG AGAACATCAA AGGGTATAGC GTAGTACAGC CTGGGGAAAT AGTGGTAGAA GGAGAAGAGC CGGGTGAAGA GCCGACAGAA GAGCCTGTAC CGACAGAGAC ATCGGTAGAT CCCACACCGA CAGTGACAGA AGAGCCTGTA 
CCTTCAGAGC TTCCAGATTC CTATGTGATA ATGGAACTGG ATAAGACGAA GGTAAAAGTA GGGGACATAA TAACAGCGAC GATAAAGATA GAGAACATGA AGAATTTTGC AGGGTACCAG TTGAATATCA AGTATGACCC GACCATGTTG GAGGCAATAG AACTGGAGAC AGGAAGCGCG ATAGCGAAGA GGACATGGCC GGTTACAGGA GGTACTGTTC TGCAAAGTGA CAATTATGGA AAGACGACTG CGGTAGCGAA TGATGTAGGA GCAGGTATAA TAAACTTTGC TGAGGCATAC TCGAACCTTA CCAAATACAG AGAGACAGGT GTGGCAGAGG AGACAGGTAT AATAGGAAAG ATAGGCTTCA GAGTGCTGAA GGCAGGAAGT ACGGCTATAA GATTTGAGGA TACGACAGCG ATGCCGGGAG CAATAGAAGG AACATACATG TTCGACTGGT ATGGCGAGAA CATCAAAGGG TATAGCGTAG TACAGCCTGG GGAAATAGTG GCAGAAGGAG AAGAGCCGGG TGAAGAGCCG ACAGAAGAGC CTGTACCGAC AGAGACATCG GCAGATCCCA CACCGACAGT GACAGAAGAG CCTGTACCTT CAGAGCTTCC AGATTCCTAT GTGATAATGG AACTGGATAA GACGAAGGTA AAAGTAGGGG ACATAATAAC AGCGACGATA AAGATAGAGA ACATGAAGAA TTTTGCAGGG TACCAGTTGA ATATCAAGTA TGACCCGACC ATGTTGGAGG CAATAGAACT GGAGACAGGA AGTGCGATAG CGAAGAGGAC ATGGCCGGTT ACAGGAGGTA CTGTTCTGCA AAGTGACAAT TATGGAAAGA CGACTGCGGT AGCGAATGAT GTAGGAGCAG GTATAATAAA CTTTGCTGAG GCATACTCGA ACCTTACCAA ATACAGAGAG ACAGGTGTGG CAGAGGAGAC AGGTATAATA GGAAAGATAG GCTTCAGAGT ACTGAAGGCA GGAAGTACGG CTATAAGATT TGAGGATACG ACAGCGATGC CGGGAGCAAT AGAAGGAACA TACATGTTCG ACTGGTATGG CGAGAACATC AAAGGGTATA GCGTAGTACA GCCTGGGGAA ATAGTGGCAG AAGGAGAAGA GCCGGGTGAA GAGCCGACAG AAGAGCCTGT ACCGACAGAG ACACCAGTAG ATCCCACACC GACAGTGACA GAAGAGCCTG TACCTTCAGA GCTTCCAGAT TCCTATGTAA TAATGGAACT GGATAAGACG AAGGTAAAAG TAGGGGACAT AATAACAGCG ACGATAAAGA TAGAGAACAT GAAGAATTTT GCAGGGTACC AGTTGAATAT CAAGTATGAC CCGACCATGT TGGAGGCAAT AGAACTGGAG ACAGGAAGTG CGATAGCGAA GAGGACATGG CCGGTTACAG GAGGTACTGT TCTGCAAAGT GACAATTATG GAAAGACGAC TGCGGTAGCG AATGATGTAG GAGCAGGTAT AATAAACTTT GCTGAGGCAT ACTCGAACCT TACCAAATAC AGAGAGACAG GTGTGGCAGA GGAGACAGGT ATAATAGGAA AGATAGGCTT CAGAGTACTG AAGGCAGGAA GTACGGCTAT AAGATTTGAG GATACGACAG CGATGCCGGG AGCAATAGAA GGAACATACA TGTTCGACTG GTATGGCGAG AACATCAAAG GGTATAGCGT AGTACAGCCT GGGGAAATAG TGGCGGAAGG AGAAGAGCCG ACAGAAGAGC CTGTACCGAC AGAGACACCA GTAGATCCCA CACCGACAGT GACAGAAGAG CCTGTACCTT 
CAGAGCTTCC AGATTCCTAT GTGATAATGG AATTGGATAA GACGAAGGTA AAAGAAGGCG ACGTAATAAT AGCAACAATA AGAGTAAATA ACATAAAGAA TCTTGCCGGA TATCAGATAG GCATCAAATA TGACCCGAAA GTATTAGAGG CATTTAATAT CGAGACAGGG GACCCAATAG ATGAAGGAAC ATGGCCTGCA GTAGGGGGAA CAATACTGAA GAATAGAGAT TACCTGCCGA CTGGGGTAGC AATAAACAAT GTATCTAAAG GAATACTGAA TTTTGCTGCT TATTACGTTT ACTTCGATGA CTATAGAGAG GAAGGAAAGT CAGAAGATAC AGGAATTATA GGAAATATAG GCTTTAGAGT ACTGAAGGCG GAAGATACAA CGATAAGATT TGAAGAGCTG GAGTCAATGC CGGGTTCAAT AGACGGAACA TATATGTTGG ATTGGTATCT TAATAGAATC TCTGGCTATG TAGTAATACA ACCGGCGCCT ATAAAGGCGG CTAGTGACGA ACCAATACCA ACGGATACAC CATCAGATGA ACCGACACCG TCAGACGAGC CAACGCCATC TGACGAACCG ACACCGTCTG ATGAGCCAAC ACCGTCAGAT GAACCGACTC CGTCAGAGAC ACCTGAGGAG CCGATACCGA CGGATACACC ATCAGATGAA CCGACACCAT CAGACGAGCC AACGCCATCT GATGAACCAA CACCGTCTGA TGAGCCAACA CCATCTGATG AACCGACTCC GTCAGAGACA CCTGAGGAGC CGATACCGAC GGATACACCA TCAGATGAAC CGACACCGTC AGACGAGCCA ACGCCATCTG ACGAACCAAC ACCGTCTGAT GAGCCAACAC CGTCAGATGA ACCGACTCCG TCAGAGACAC CTGAGGAGCC GATACCGACG GATACACCAT CAGATGAACC GACACCGTCA GACGAGCCAA CGCCATCTGA CGAACCAACA CCGTCTGATG AGCCAACACC GTCAGATGAA CCGACTCCGT CAGAGACACC TGAGGAGCCG ATACCGACGG ATACACCATC AGATGAACCG ACACCGTCAG ACGAGCCGAC ACCATCTGAC GAACCAACAC CGTCAGACGA GCCAACGCCA TCTGACGAAC CGACACCGTC TGATGAGCCA ACACCATCTG ATGAACCGAC TCCGTCAGAG ACACCTGAGG AGCCGATACC GACGGATACA CCATCAGATG AACCGACACC GTCAGACGAG CCGACACCAT CTGACGAACC AACACCGTCA GACGAGCCAA CGCCATCTGA CGAACCGACA CCGTCTGATG AGCCAACACC ATCTGATGAA CCGACTCCGT CAGAGACACC TGAGGAGCCG ATACCGACGG ATACACCATC AGATGAACCG ACACCGTCAG ACGAGCCGAC ACCATCTGAC GAACCAACAC CGTCTGATGA GCCAACACCG TCAGATGAAC CGACTCCGTC AGAGACACCT GAGGAGCCGA TACCGACGGA TACACCATCA GATGAACCGA CACCGTCAGA CGAGCCAACG CCATCTGACG AACCGACACC GTCTGATGAG CCAACACCGT CAGATGAACC GACTCCGTCA GAGACACCTG AGGAGCCGAT ACCGACGGAT ACACCATCAG ATGAACCGAC ACCGTCAGAC GAGCCAACGC CATCTGACGA ACCGACACCG TCTGATGAGC CAACACCGTC AGATGAACCG ACTCCGTCAG AGACACCTGA GGAGCCGATA CCGACGGATA CACCATCAGA TGAACCGACA CCATCAGACG AGCCAACGCC 
ATCTGATGAA CCAACACCGT CTGATGAGCC AACACCATCT GATGAACCGA CTCCGTCAGA GACACCTGAG GAGCCGATAC CGACGGATAC ACCATCAGAT GAACCGACAC CGTCAGACGA GCCAACGCCA TCTGACGAAC CAACACCGTC TGATGAGCCA ACACCGTCAG ATGAACCGAC TCCGTCAGAG ACACCTGAGG AGCCGATACC GACGGATACA CCATCAGATG AACCGACACC GTCAGACGAG CCAACGCCAT CTGACGAACC AACACCGTCT GATGAGCCAA CACCGTCAGA TGAACCGACT CCGTCAGAGA CACCTGAGGA GCCGATACCG ACGGATACAC CATCAGATGA ACCGACACCG TCAGACGAGC CGACACCATC TGACGAACCA ACACCGTCAG ACGAGCCAAC GCCATCTGAC GAACCGACAC CGTCTGATGA GCCAACACCA TCTGATGAAC CGACTCCGTC AGAGACACCT GAGGAGCCGA TACCGACGGA TACACCATCA GATGAACCGA CACCGTCAGA CGAGCCGACA CCATCTGACG AACCAACACC GTCAGACGAG CCAACGCCAT CTGACGAACC GACACCGTCT GATGAGCCAA CACCATCTGA TGAACCGACT CCGTCAGAGA CACCTGAGGA GCCGACACCG ACTACTACAC CGACACCAAC ACCGTCGACA ACGCCTACAA GTGGCAGCGG AGGCAGTGGT GGAAGCGGTG GTGGCGGCGG AGGTGGTGGA GGAACTGTAC CTACATCTCC AACACCGACA CCGACATCTA AACCGACGTC TACACCTGCA CCGACAGAAA TCGAAGAGCC TACACCATCT GATGTGCCTG GTGCAATCGG TGGAGAACAT AGAGCATACT TAAGAGGATA TCCGGATGGA AGCTTCAGGC CTGAAAGAAA TATAACAAGA GCTGAAGCGG CGGTAATCTT TGCTAAGTTG CTTGGAGCCG ATGAAAGCTA TGGAGCTCAG TCTGCAAGTC CATATAGTGA TTTGGCTGAT ACTCACTGGG CTGCATGGGC AATCAAATTT GCAACAAGCC AGGGCTTGTT CAAAGGATAT CCGGACGGTA CGTTTAAACC TGATCAGAAC ATAACGAGAG CGGAATTCGC AACTGTGGTA CTCCACTTCC TGACAAAAGT TAAGGGTCAG GAAATAATGA GCAAGCTTGC AACAATAGAT ATAAGTAATC CGAAGTTTGA CGATTGTGTC GGACATTGGG CACAAGAGTT TATTGAGAAA TTGACAAGCT TGGGTTATAT TAGTGGCTAT CCTGACGGAA CGTTCAAGCC GCAAAACTAT ATTAAACGTT CCGAAAGTGT GGCACTGATT AACAGAGCTC TGGAGAGAGG TCCGCTTAAT GGAGCGCCGA AGCTCTTCCC GGATGTTAAC GAATCATACT GGGCATTTGG CGACATTATG GACGGTGCTC TCGACCACAG TTACATTATC GAAGATGAGA AAGAAAAATT CGTTAAATTG CTCGAAGATT AA
|
Protein sequence | MKRKNKVLSI LLTLLLIIST TSVNMSFAEA TPSIEMVLDK TEVHVGDVIT ATIKVNNIRK LAGYQLNIKF DPEVLQPVDP ATGEEFTDKS MPVNRVLLTN SKYGPTPVAG NDIKSGIINF ATGYNNLTAY KSSGIDEHTG IIGEIGFKVL KKQNTSIRFE DTLSMPGAIS GTSLFDWDAE TITGYEVIQP DLIVVEAEPL KDASVALELD KTKVKVGDII TATIKIENMK NFAGYQLNIK YDPTMLEAIE LETGSAIAKR TWPVTGGTVL QSDNYGKTTA VANDVGAGII NFAEAYSNLT KYRETGVAEE TGIIGKIGFR VLKAGSTAIR FEDTTAMPGA IEGTYMFDWY GENIKGYSVV QPGEIVVEGE EPGEEPTEEP VPTETSVDPT PTVTEEPVPS ELPDSYVIME LDKTKVKVGD IITATIKIEN MKNFAGYQLN IKYDPTMLEA IELETGSAIA KRTWPVTGGT VLQSDNYGKT TAVANDVGAG IINFAEAYSN LTKYRETGVA EETGIIGKIG FRVLKAGSTA IRFEDTTAMP GAIEGTYMFD WYGENIKGYS VVQPGEIVVE GEEPGEEPTE EPVPTETSVD PTPTVTEEPV PSELPDSYVI MELDKTKVKV GDIITATIKI ENMKNFAGYQ LNIKYDPTML EAIELETGSA IAKRTWPVTG GTVLQSDNYG KTTAVANDVG AGIINFAEAY SNLTKYRETG VAEETGIIGK IGFRVLKAGS TAIRFEDTTA MPGAIEGTYM FDWYGENIKG YSVVQPGEIV AEGEEPGEEP TEEPVPTETS ADPTPTVTEE PVPSELPDSY VIMELDKTKV KVGDIITATI KIENMKNFAG YQLNIKYDPT MLEAIELETG SAIAKRTWPV TGGTVLQSDN YGKTTAVAND VGAGIINFAE AYSNLTKYRE TGVAEETGII GKIGFRVLKA GSTAIRFEDT TAMPGAIEGT YMFDWYGENI KGYSVVQPGE IVAEGEEPGE EPTEEPVPTE TPVDPTPTVT EEPVPSELPD SYVIMELDKT KVKVGDIITA TIKIENMKNF AGYQLNIKYD PTMLEAIELE TGSAIAKRTW PVTGGTVLQS DNYGKTTAVA NDVGAGIINF AEAYSNLTKY RETGVAEETG IIGKIGFRVL KAGSTAIRFE DTTAMPGAIE GTYMFDWYGE NIKGYSVVQP GEIVAEGEEP TEEPVPTETP VDPTPTVTEE PVPSELPDSY VIMELDKTKV KEGDVIIATI RVNNIKNLAG YQIGIKYDPK VLEAFNIETG DPIDEGTWPA VGGTILKNRD YLPTGVAINN VSKGILNFAA YYVYFDDYRE EGKSEDTGII GNIGFRVLKA EDTTIRFEEL ESMPGSIDGT YMLDWYLNRI SGYVVIQPAP IKAASDEPIP TDTPSDEPTP SDEPTPSDEP TPSDEPTPSD EPTPSETPEE PIPTDTPSDE PTPSDEPTPS DEPTPSDEPT PSDEPTPSET PEEPIPTDTP SDEPTPSDEP TPSDEPTPSD EPTPSDEPTP SETPEEPIPT DTPSDEPTPS DEPTPSDEPT PSDEPTPSDE PTPSETPEEP IPTDTPSDEP TPSDEPTPSD EPTPSDEPTP SDEPTPSDEP TPSDEPTPSE TPEEPIPTDT PSDEPTPSDE PTPSDEPTPS DEPTPSDEPT PSDEPTPSDE PTPSETPEEP IPTDTPSDEP TPSDEPTPSD EPTPSDEPTP SDEPTPSETP EEPIPTDTPS DEPTPSDEPT PSDEPTPSDE PTPSDEPTPS ETPEEPIPTD TPSDEPTPSD EPTPSDEPTP SDEPTPSDEP TPSETPEEPI PTDTPSDEPT 
PSDEPTPSDE PTPSDEPTPS DEPTPSETPE EPIPTDTPSD EPTPSDEPTP SDEPTPSDEP TPSDEPTPSE TPEEPIPTDT PSDEPTPSDE PTPSDEPTPS DEPTPSDEPT PSETPEEPIP TDTPSDEPTP SDEPTPSDEP TPSDEPTPSD EPTPSDEPTP SDEPTPSETP EEPIPTDTPS DEPTPSDEPT PSDEPTPSDE PTPSDEPTPS DEPTPSDEPT PSETPEEPTP TTTPTPTPST TPTSGSGGSG GSGGGGGGGG GTVPTSPTPT PTSKPTSTPA PTEIEEPTPS DVPGAIGGEH RAYLRGYPDG SFRPERNITR AEAAVIFAKL LGADESYGAQ SASPYSDLAD THWAAWAIKF ATSQGLFKGY PDGTFKPDQN ITRAEFATVV LHFLTKVKGQ EIMSKLATID ISNPKFDDCV GHWAQEFIEK LTSLGYISGY PDGTFKPQNY IKRSESVALI NRALERGPLN GAPKLFPDVN ESYWAFGDIM DGALDHSYII EDEKEKFVKL LED
|
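The sequences above are printed in space-separated blocks of ten characters. A minimal sketch, not part of the original record, of how such a block-formatted gene sequence could be cleaned, checked for GC content, and translated. It assumes only the amino acid assignments of translation table 11, which match the standard code; the alternative start codons of table 11 are not handled, which does not matter here because this gene starts with ATG. All function names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks of the block layout; uppercase."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAAACGAA AAAATAAAGT ATTATCAATT"
print(translate(first_blocks))  # MKRKNKVLSI
```

Passing the full Gene sequence field to `translate` and `gc_content` should allow a comparison with the Protein sequence and GC content fields of the record. The output shown covers only the first ten codons, which match the start of the listed protein.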