Gene Cthe_3078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3078 
Symbol 
ID4809952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3625229 
End bp3632170 
Gene Length6942 bp 
Protein Length2313 aa 
Translation table11 
GC content48% 
IMG OID640108502 
Productcellulosome anchoring protein, cohesin region 
Protein accessionYP_001039467 
Protein GI125975557 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAA AAAATAAAGT ATTATCAATT TTGTTAACTC TGCTGCTAAT AATCTCTACC 
ACATCCGTAA ACATGTCTTT TGCTGAAGCA ACTCCAAGTA TTGAAATGGT TCTTGATAAA
ACTGAAGTCC ATGTAGGAGA TGTAATAACG GCCACAATAA AAGTCAATAA CATTAGAAAA
TTGGCGGGAT ATCAGCTAAA TATCAAATTT GACCCTGAAG TTTTACAGCC GGTAGACCCT
GCAACAGGAG AGGAATTTAC TGATAAGTCC ATGCCGGTAA ATAGGGTTTT GCTGACAAAC
AGCAAATATG GACCTACTCC TGTGGCGGGT AACGATATAA AGTCAGGAAT TATTAATTTT
GCTACGGGAT ATAACAATTT AACAGCGTAC AAATCCAGCG GAATAGACGA ACATACAGGA
ATAATAGGAG AGATTGGTTT TAAAGTTTTA AAGAAACAAA ATACGTCTAT TAGGTTTGAA
GATACATTAT CGATGCCCGG GGCAATATCG GGAACAAGTT TGTTTGACTG GGATGCAGAA
ACTATAACAG GATATGAGGT AATACAGCCG GATCTTATAG TTGTAGAGGC AGAACCGTTA
AAAGACGCCA GCGTGGCTCT GGAACTGGAT AAGACGAAGG TAAAAGTAGG GGACATAATA
ACAGCGACGA TAAAGATAGA GAACATGAAG AATTTTGCAG GGTACCAGTT GAATATCAAG
TATGACCCGA CCATGTTGGA GGCAATAGAA CTGGAGACAG GAAGTGCGAT AGCGAAGAGG
ACATGGCCGG TTACAGGAGG TACTGTTCTG CAAAGTGACA ATTATGGAAA GACGACTGCG
GTAGCGAATG ATGTAGGAGC AGGTATAATA AACTTTGCTG AGGCATACTC GAACCTTACC
AAATACAGAG AGACAGGTGT GGCAGAAGAG ACAGGTATAA TAGGAAAGAT AGGCTTCAGA
GTGCTGAAGG CAGGAAGTAC GGCTATAAGA TTTGAGGATA CGACAGCGAT GCCGGGAGCA
ATAGAAGGAA CATACATGTT CGACTGGTAT GGCGAGAACA TCAAAGGGTA TAGCGTAGTA
CAGCCTGGGG AAATAGTGGT AGAAGGAGAA GAGCCGGGTG AAGAGCCGAC AGAAGAGCCT
GTACCGACAG AGACATCGGT AGATCCCACA CCGACAGTGA CAGAAGAGCC TGTACCTTCA
GAGCTTCCAG ATTCCTATGT GATAATGGAA CTGGATAAGA CGAAGGTAAA AGTAGGGGAC
ATAATAACAG CGACGATAAA GATAGAGAAC ATGAAGAATT TTGCAGGGTA CCAGTTGAAT
ATCAAGTATG ACCCGACCAT GTTGGAGGCA ATAGAACTGG AGACAGGAAG CGCGATAGCG
AAGAGGACAT GGCCGGTTAC AGGAGGTACT GTTCTGCAAA GTGACAATTA TGGAAAGACG
ACTGCGGTAG CGAATGATGT AGGAGCAGGT ATAATAAACT TTGCTGAGGC ATACTCGAAC
CTTACCAAAT ACAGAGAGAC AGGTGTGGCA GAGGAGACAG GTATAATAGG AAAGATAGGC
TTCAGAGTGC TGAAGGCAGG AAGTACGGCT ATAAGATTTG AGGATACGAC AGCGATGCCG
GGAGCAATAG AAGGAACATA CATGTTCGAC TGGTATGGCG AGAACATCAA AGGGTATAGC
GTAGTACAGC CTGGGGAAAT AGTGGTAGAA GGAGAAGAGC CGGGTGAAGA GCCGACAGAA
GAGCCTGTAC CGACAGAGAC ATCGGTAGAT CCCACACCGA CAGTGACAGA AGAGCCTGTA
CCTTCAGAGC TTCCAGATTC CTATGTGATA ATGGAACTGG ATAAGACGAA GGTAAAAGTA
GGGGACATAA TAACAGCGAC GATAAAGATA GAGAACATGA AGAATTTTGC AGGGTACCAG
TTGAATATCA AGTATGACCC GACCATGTTG GAGGCAATAG AACTGGAGAC AGGAAGCGCG
ATAGCGAAGA GGACATGGCC GGTTACAGGA GGTACTGTTC TGCAAAGTGA CAATTATGGA
AAGACGACTG CGGTAGCGAA TGATGTAGGA GCAGGTATAA TAAACTTTGC TGAGGCATAC
TCGAACCTTA CCAAATACAG AGAGACAGGT GTGGCAGAGG AGACAGGTAT AATAGGAAAG
ATAGGCTTCA GAGTGCTGAA GGCAGGAAGT ACGGCTATAA GATTTGAGGA TACGACAGCG
ATGCCGGGAG CAATAGAAGG AACATACATG TTCGACTGGT ATGGCGAGAA CATCAAAGGG
TATAGCGTAG TACAGCCTGG GGAAATAGTG GCAGAAGGAG AAGAGCCGGG TGAAGAGCCG
ACAGAAGAGC CTGTACCGAC AGAGACATCG GCAGATCCCA CACCGACAGT GACAGAAGAG
CCTGTACCTT CAGAGCTTCC AGATTCCTAT GTGATAATGG AACTGGATAA GACGAAGGTA
AAAGTAGGGG ACATAATAAC AGCGACGATA AAGATAGAGA ACATGAAGAA TTTTGCAGGG
TACCAGTTGA ATATCAAGTA TGACCCGACC ATGTTGGAGG CAATAGAACT GGAGACAGGA
AGTGCGATAG CGAAGAGGAC ATGGCCGGTT ACAGGAGGTA CTGTTCTGCA AAGTGACAAT
TATGGAAAGA CGACTGCGGT AGCGAATGAT GTAGGAGCAG GTATAATAAA CTTTGCTGAG
GCATACTCGA ACCTTACCAA ATACAGAGAG ACAGGTGTGG CAGAGGAGAC AGGTATAATA
GGAAAGATAG GCTTCAGAGT ACTGAAGGCA GGAAGTACGG CTATAAGATT TGAGGATACG
ACAGCGATGC CGGGAGCAAT AGAAGGAACA TACATGTTCG ACTGGTATGG CGAGAACATC
AAAGGGTATA GCGTAGTACA GCCTGGGGAA ATAGTGGCAG AAGGAGAAGA GCCGGGTGAA
GAGCCGACAG AAGAGCCTGT ACCGACAGAG ACACCAGTAG ATCCCACACC GACAGTGACA
GAAGAGCCTG TACCTTCAGA GCTTCCAGAT TCCTATGTAA TAATGGAACT GGATAAGACG
AAGGTAAAAG TAGGGGACAT AATAACAGCG ACGATAAAGA TAGAGAACAT GAAGAATTTT
GCAGGGTACC AGTTGAATAT CAAGTATGAC CCGACCATGT TGGAGGCAAT AGAACTGGAG
ACAGGAAGTG CGATAGCGAA GAGGACATGG CCGGTTACAG GAGGTACTGT TCTGCAAAGT
GACAATTATG GAAAGACGAC TGCGGTAGCG AATGATGTAG GAGCAGGTAT AATAAACTTT
GCTGAGGCAT ACTCGAACCT TACCAAATAC AGAGAGACAG GTGTGGCAGA GGAGACAGGT
ATAATAGGAA AGATAGGCTT CAGAGTACTG AAGGCAGGAA GTACGGCTAT AAGATTTGAG
GATACGACAG CGATGCCGGG AGCAATAGAA GGAACATACA TGTTCGACTG GTATGGCGAG
AACATCAAAG GGTATAGCGT AGTACAGCCT GGGGAAATAG TGGCGGAAGG AGAAGAGCCG
ACAGAAGAGC CTGTACCGAC AGAGACACCA GTAGATCCCA CACCGACAGT GACAGAAGAG
CCTGTACCTT CAGAGCTTCC AGATTCCTAT GTGATAATGG AATTGGATAA GACGAAGGTA
AAAGAAGGCG ACGTAATAAT AGCAACAATA AGAGTAAATA ACATAAAGAA TCTTGCCGGA
TATCAGATAG GCATCAAATA TGACCCGAAA GTATTAGAGG CATTTAATAT CGAGACAGGG
GACCCAATAG ATGAAGGAAC ATGGCCTGCA GTAGGGGGAA CAATACTGAA GAATAGAGAT
TACCTGCCGA CTGGGGTAGC AATAAACAAT GTATCTAAAG GAATACTGAA TTTTGCTGCT
TATTACGTTT ACTTCGATGA CTATAGAGAG GAAGGAAAGT CAGAAGATAC AGGAATTATA
GGAAATATAG GCTTTAGAGT ACTGAAGGCG GAAGATACAA CGATAAGATT TGAAGAGCTG
GAGTCAATGC CGGGTTCAAT AGACGGAACA TATATGTTGG ATTGGTATCT TAATAGAATC
TCTGGCTATG TAGTAATACA ACCGGCGCCT ATAAAGGCGG CTAGTGACGA ACCAATACCA
ACGGATACAC CATCAGATGA ACCGACACCG TCAGACGAGC CAACGCCATC TGACGAACCG
ACACCGTCTG ATGAGCCAAC ACCGTCAGAT GAACCGACTC CGTCAGAGAC ACCTGAGGAG
CCGATACCGA CGGATACACC ATCAGATGAA CCGACACCAT CAGACGAGCC AACGCCATCT
GATGAACCAA CACCGTCTGA TGAGCCAACA CCATCTGATG AACCGACTCC GTCAGAGACA
CCTGAGGAGC CGATACCGAC GGATACACCA TCAGATGAAC CGACACCGTC AGACGAGCCA
ACGCCATCTG ACGAACCAAC ACCGTCTGAT GAGCCAACAC CGTCAGATGA ACCGACTCCG
TCAGAGACAC CTGAGGAGCC GATACCGACG GATACACCAT CAGATGAACC GACACCGTCA
GACGAGCCAA CGCCATCTGA CGAACCAACA CCGTCTGATG AGCCAACACC GTCAGATGAA
CCGACTCCGT CAGAGACACC TGAGGAGCCG ATACCGACGG ATACACCATC AGATGAACCG
ACACCGTCAG ACGAGCCGAC ACCATCTGAC GAACCAACAC CGTCAGACGA GCCAACGCCA
TCTGACGAAC CGACACCGTC TGATGAGCCA ACACCATCTG ATGAACCGAC TCCGTCAGAG
ACACCTGAGG AGCCGATACC GACGGATACA CCATCAGATG AACCGACACC GTCAGACGAG
CCGACACCAT CTGACGAACC AACACCGTCA GACGAGCCAA CGCCATCTGA CGAACCGACA
CCGTCTGATG AGCCAACACC ATCTGATGAA CCGACTCCGT CAGAGACACC TGAGGAGCCG
ATACCGACGG ATACACCATC AGATGAACCG ACACCGTCAG ACGAGCCGAC ACCATCTGAC
GAACCAACAC CGTCTGATGA GCCAACACCG TCAGATGAAC CGACTCCGTC AGAGACACCT
GAGGAGCCGA TACCGACGGA TACACCATCA GATGAACCGA CACCGTCAGA CGAGCCAACG
CCATCTGACG AACCGACACC GTCTGATGAG CCAACACCGT CAGATGAACC GACTCCGTCA
GAGACACCTG AGGAGCCGAT ACCGACGGAT ACACCATCAG ATGAACCGAC ACCGTCAGAC
GAGCCAACGC CATCTGACGA ACCGACACCG TCTGATGAGC CAACACCGTC AGATGAACCG
ACTCCGTCAG AGACACCTGA GGAGCCGATA CCGACGGATA CACCATCAGA TGAACCGACA
CCATCAGACG AGCCAACGCC ATCTGATGAA CCAACACCGT CTGATGAGCC AACACCATCT
GATGAACCGA CTCCGTCAGA GACACCTGAG GAGCCGATAC CGACGGATAC ACCATCAGAT
GAACCGACAC CGTCAGACGA GCCAACGCCA TCTGACGAAC CAACACCGTC TGATGAGCCA
ACACCGTCAG ATGAACCGAC TCCGTCAGAG ACACCTGAGG AGCCGATACC GACGGATACA
CCATCAGATG AACCGACACC GTCAGACGAG CCAACGCCAT CTGACGAACC AACACCGTCT
GATGAGCCAA CACCGTCAGA TGAACCGACT CCGTCAGAGA CACCTGAGGA GCCGATACCG
ACGGATACAC CATCAGATGA ACCGACACCG TCAGACGAGC CGACACCATC TGACGAACCA
ACACCGTCAG ACGAGCCAAC GCCATCTGAC GAACCGACAC CGTCTGATGA GCCAACACCA
TCTGATGAAC CGACTCCGTC AGAGACACCT GAGGAGCCGA TACCGACGGA TACACCATCA
GATGAACCGA CACCGTCAGA CGAGCCGACA CCATCTGACG AACCAACACC GTCAGACGAG
CCAACGCCAT CTGACGAACC GACACCGTCT GATGAGCCAA CACCATCTGA TGAACCGACT
CCGTCAGAGA CACCTGAGGA GCCGACACCG ACTACTACAC CGACACCAAC ACCGTCGACA
ACGCCTACAA GTGGCAGCGG AGGCAGTGGT GGAAGCGGTG GTGGCGGCGG AGGTGGTGGA
GGAACTGTAC CTACATCTCC AACACCGACA CCGACATCTA AACCGACGTC TACACCTGCA
CCGACAGAAA TCGAAGAGCC TACACCATCT GATGTGCCTG GTGCAATCGG TGGAGAACAT
AGAGCATACT TAAGAGGATA TCCGGATGGA AGCTTCAGGC CTGAAAGAAA TATAACAAGA
GCTGAAGCGG CGGTAATCTT TGCTAAGTTG CTTGGAGCCG ATGAAAGCTA TGGAGCTCAG
TCTGCAAGTC CATATAGTGA TTTGGCTGAT ACTCACTGGG CTGCATGGGC AATCAAATTT
GCAACAAGCC AGGGCTTGTT CAAAGGATAT CCGGACGGTA CGTTTAAACC TGATCAGAAC
ATAACGAGAG CGGAATTCGC AACTGTGGTA CTCCACTTCC TGACAAAAGT TAAGGGTCAG
GAAATAATGA GCAAGCTTGC AACAATAGAT ATAAGTAATC CGAAGTTTGA CGATTGTGTC
GGACATTGGG CACAAGAGTT TATTGAGAAA TTGACAAGCT TGGGTTATAT TAGTGGCTAT
CCTGACGGAA CGTTCAAGCC GCAAAACTAT ATTAAACGTT CCGAAAGTGT GGCACTGATT
AACAGAGCTC TGGAGAGAGG TCCGCTTAAT GGAGCGCCGA AGCTCTTCCC GGATGTTAAC
GAATCATACT GGGCATTTGG CGACATTATG GACGGTGCTC TCGACCACAG TTACATTATC
GAAGATGAGA AAGAAAAATT CGTTAAATTG CTCGAAGATT AA
 
Protein sequence
MKRKNKVLSI LLTLLLIIST TSVNMSFAEA TPSIEMVLDK TEVHVGDVIT ATIKVNNIRK 
LAGYQLNIKF DPEVLQPVDP ATGEEFTDKS MPVNRVLLTN SKYGPTPVAG NDIKSGIINF
ATGYNNLTAY KSSGIDEHTG IIGEIGFKVL KKQNTSIRFE DTLSMPGAIS GTSLFDWDAE
TITGYEVIQP DLIVVEAEPL KDASVALELD KTKVKVGDII TATIKIENMK NFAGYQLNIK
YDPTMLEAIE LETGSAIAKR TWPVTGGTVL QSDNYGKTTA VANDVGAGII NFAEAYSNLT
KYRETGVAEE TGIIGKIGFR VLKAGSTAIR FEDTTAMPGA IEGTYMFDWY GENIKGYSVV
QPGEIVVEGE EPGEEPTEEP VPTETSVDPT PTVTEEPVPS ELPDSYVIME LDKTKVKVGD
IITATIKIEN MKNFAGYQLN IKYDPTMLEA IELETGSAIA KRTWPVTGGT VLQSDNYGKT
TAVANDVGAG IINFAEAYSN LTKYRETGVA EETGIIGKIG FRVLKAGSTA IRFEDTTAMP
GAIEGTYMFD WYGENIKGYS VVQPGEIVVE GEEPGEEPTE EPVPTETSVD PTPTVTEEPV
PSELPDSYVI MELDKTKVKV GDIITATIKI ENMKNFAGYQ LNIKYDPTML EAIELETGSA
IAKRTWPVTG GTVLQSDNYG KTTAVANDVG AGIINFAEAY SNLTKYRETG VAEETGIIGK
IGFRVLKAGS TAIRFEDTTA MPGAIEGTYM FDWYGENIKG YSVVQPGEIV AEGEEPGEEP
TEEPVPTETS ADPTPTVTEE PVPSELPDSY VIMELDKTKV KVGDIITATI KIENMKNFAG
YQLNIKYDPT MLEAIELETG SAIAKRTWPV TGGTVLQSDN YGKTTAVAND VGAGIINFAE
AYSNLTKYRE TGVAEETGII GKIGFRVLKA GSTAIRFEDT TAMPGAIEGT YMFDWYGENI
KGYSVVQPGE IVAEGEEPGE EPTEEPVPTE TPVDPTPTVT EEPVPSELPD SYVIMELDKT
KVKVGDIITA TIKIENMKNF AGYQLNIKYD PTMLEAIELE TGSAIAKRTW PVTGGTVLQS
DNYGKTTAVA NDVGAGIINF AEAYSNLTKY RETGVAEETG IIGKIGFRVL KAGSTAIRFE
DTTAMPGAIE GTYMFDWYGE NIKGYSVVQP GEIVAEGEEP TEEPVPTETP VDPTPTVTEE
PVPSELPDSY VIMELDKTKV KEGDVIIATI RVNNIKNLAG YQIGIKYDPK VLEAFNIETG
DPIDEGTWPA VGGTILKNRD YLPTGVAINN VSKGILNFAA YYVYFDDYRE EGKSEDTGII
GNIGFRVLKA EDTTIRFEEL ESMPGSIDGT YMLDWYLNRI SGYVVIQPAP IKAASDEPIP
TDTPSDEPTP SDEPTPSDEP TPSDEPTPSD EPTPSETPEE PIPTDTPSDE PTPSDEPTPS
DEPTPSDEPT PSDEPTPSET PEEPIPTDTP SDEPTPSDEP TPSDEPTPSD EPTPSDEPTP
SETPEEPIPT DTPSDEPTPS DEPTPSDEPT PSDEPTPSDE PTPSETPEEP IPTDTPSDEP
TPSDEPTPSD EPTPSDEPTP SDEPTPSDEP TPSDEPTPSE TPEEPIPTDT PSDEPTPSDE
PTPSDEPTPS DEPTPSDEPT PSDEPTPSDE PTPSETPEEP IPTDTPSDEP TPSDEPTPSD
EPTPSDEPTP SDEPTPSETP EEPIPTDTPS DEPTPSDEPT PSDEPTPSDE PTPSDEPTPS
ETPEEPIPTD TPSDEPTPSD EPTPSDEPTP SDEPTPSDEP TPSETPEEPI PTDTPSDEPT
PSDEPTPSDE PTPSDEPTPS DEPTPSETPE EPIPTDTPSD EPTPSDEPTP SDEPTPSDEP
TPSDEPTPSE TPEEPIPTDT PSDEPTPSDE PTPSDEPTPS DEPTPSDEPT PSETPEEPIP
TDTPSDEPTP SDEPTPSDEP TPSDEPTPSD EPTPSDEPTP SDEPTPSETP EEPIPTDTPS
DEPTPSDEPT PSDEPTPSDE PTPSDEPTPS DEPTPSDEPT PSETPEEPTP TTTPTPTPST
TPTSGSGGSG GSGGGGGGGG GTVPTSPTPT PTSKPTSTPA PTEIEEPTPS DVPGAIGGEH
RAYLRGYPDG SFRPERNITR AEAAVIFAKL LGADESYGAQ SASPYSDLAD THWAAWAIKF
ATSQGLFKGY PDGTFKPDQN ITRAEFATVV LHFLTKVKGQ EIMSKLATID ISNPKFDDCV
GHWAQEFIEK LTSLGYISGY PDGTFKPQNY IKRSESVALI NRALERGPLN GAPKLFPDVN
ESYWAFGDIM DGALDHSYII EDEKEKFVKL LED