Gene Cthe_3077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3077 
Symbol 
ID4809951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3619516 
End bp3625077 
Gene Length5562 bp 
Protein Length1853 aa 
Translation table11 
GC content42% 
IMG OID640108501 
Productcellulosome anchoring protein, cohesin region 
Protein accessionYP_001039466 
Protein GI125975556 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAG TCATCAGTAT GCTCTTAGTT GTGGCTATGC TGACGACGAT TTTTGCGGCG 
ATGATACCGC AGACAGTATC GGCGGCCACA ATGACAGTCG AGATCGGCAA AGTTACAGCA
GCCGTTGGAT CAAAAGTAGA AATACCTATA ACCCTGAAAG GAGTGCCATC CAAAGGAATG
GCCAATTGCG ACTTCGTATT GGGTTATGAT CCAAATGTGC TGGAAGTAAC AGAAGTAAAA
CCAGGAAGCA TAATAAAAGA TCCGGATCCT AGCAAGAGCT TTGATAGCGC AATATATCCG
GATCGAAAGA TGATTGTATT TCTGTTTGCA GAAGACAGTG GAAGAGGAAC GTATGCAATA
ACTCAGGATG GAGTATTTGC AACAATTGTA GCCACTGTCA AATCAGCTGC AGCGGCACCG
ATTACTTTGC TTGAAGTAGG TGCATTTGCG GACAACGATT TAGTAGAAAT AAGCACAACT
TTTGTCGCGG GCGGAGTAAA TCTTGGTAGT TCCGTACCGA CAACACAGCC AAATGTTCCG
TCAGACGGTG TGGTAGTAGA AATTGGCAAA GTTACGGGAT CTGTTGGAAC TACAGTTGAA
ATACCTGTAT ATTTCAGAGG AGTTCCATCC AAAGGAATAG CAAACTGCGA CTTTGTGTTC
AGATATGATC CGAATGTATT GGAAATTATA GGGATAGATC CCGGAGACAT AATAGTTGAC
CCGAATCCTA CCAAGAGCTT TGATACTGCA ATATATCCTG ACAGAAAGAT AATAGTATTC
CTGTTTGCGG AAGACAGCGG AACAGGAGCG TATGCAATAA CTAAAGACGG AGTATTTGCA
AAAATAAGAG CAACTGTAAA ATCAAGTGCT CCGGGCTATA TTACTTTCGA CGAAGTAGGT
GGATTTGCAG ATAATGACCT GGTAGAACAG AAGGTATCAT TTATAGACGG TGGTGTTAAC
GTTGGCAATG CAACACCGAC CAAGGGAGCA ACACCAACAA ATACAGCTAC GCCGACAAAA
TCAGCTACGG CTACGCCCAC CAGGCCATCG GTACCGACAA ACACACCGAC AAACACACCG
GCAAATACAC CGGTATCAGG CAATTTGAAG GTTGAATTCT ACAACAGCAA TCCTTCAGAT
ACTACTAACT CAATCAATCC TCAGTTCAAG GTTACTAATA CCGGAAGCAG TGCAATTGAT
TTGTCCAAAC TCACATTGAG ATATTATTAT ACAGTAGACG GACAGAAAGA TCAGACCTTC
TGGTGTGACC ATGCTGCAAT AATCGGCAGT AACGGCAGCT ACAACGGAAT TACTTCAAAT
GTAAAAGGAA CATTTGTAAA AATGAGTTCC TCAACAAATA ACGCAGACAC CTACCTTGAA
ATAAGCTTTA CAGGCGGAAC TCTTGAACCG GGTGCACATG TTCAGATACA AGGTAGATTT
GCAAAGAATG ACTGGAGTAA CTATACACAG TCAAATGACT ACTCATTCAA GTCTGCTTCA
CAGTTTGTTG AATGGGATCA GGTAACAGCA TACTTGAACG GTGTTCTTGT ATGGGGTAAA
GAACCCGGTG GCAGTGTAGT ACCATCAACA CAGCCTGTAA CAACACCACC TGCAACAACA
AAACCACCTG CAACAACAAA ACCACCTGCA ACAACAATAC CGCCGTCAGA TGATCCGAAT
GCAATAAAGA TTAAGGTGGA CACAGTAAAT GCAAAACCGG GAGACACAGT AAATATACCT
GTAAGATTCA GTGGTATACC ATCCAAGGGA ATAGCAAACT GTGACTTTGT ATACAGCTAT
GACCCGAATG TACTTGAGAT AATAGAGATA AAACCGGGAG AATTGATAGT TGACCCGAAT
CCTGACAAGA GCTTTGATAC TGCAGTATAT CCTGACAGAA AGATAATAGT ATTCCTGTTT
GCAGAAGACA GCGGAACAGG AGCGTATGCA ATAACTAAAG ACGGAGTATT TGCTACGATA
GTAGCGAAAG TAAAATCCGG AGCACCTAAC GGACTCAGTG TAATCAAATT TGTAGAAGTA
GGCGGATTTG CGAACAATGA CCTTGTAGAA CAGAGGACAC AGTTCTTTGA CGGTGGAGTA
AATGTTGGAG ATACAACAGT ACCTACAACA CCTACAACAC CTGTAACAAC ACCGACAGAT
GATTCGAATG CAGTAAGGAT TAAGGTGGAC ACAGTAAATG CAAAACCGGG AGACACAGTA
AGAATACCTG TAAGATTCAG CGGTATACCA TCCAAGGGAA TAGCAAACTG TGACTTTGTA
TACAGCTATG ACCCGAATGT ACTTGAGATA ATAGAGATAG AACCGGGAGA CATAATAGTT
GACCCGAATC CTGACAAGAG CTTTGATACT GCAGTATATC CTGACAGAAA GATAATAGTA
TTCCTGTTTG CGGAAGACAG CGGAACAGGA GCGTATGCAA TAACTAAAGA CGGAGTATTT
GCTACGATAG TAGCGAAAGT AAAATCCGGA GCACCTAACG GACTCAGTGT AATCAAATTT
GTAGAAGTAG GCGGATTTGC GAACAATGAC CTTGTAGAAC AGAAGACACA GTTCTTTGAC
GGTGGAGTAA ATGTTGGAGA TACAACAGAA CCTGCAACAC CTACAACACC TGTAACAACA
CCGACAACAA CAGATGATCT GGATGCAGTA AGGATTAAAG TGGACACAGT AAATGCAAAA
CCGGGAGACA CAGTAAGAAT ACCTGTAAGA TTCAGCGGTA TACCATCCAA GGGAATAGCA
AACTGTGACT TTGTATACAG CTATGACCCG AATGTACTTG AGATAATAGA GATAGAACCG
GGAGACATAA TAGTTGACCC GAATCCTGAC AAGAGCTTTG ATACTGCAGT ATATCCTGAC
AGAAAGATAA TAGTATTCCT GTTTGCGGAA GACAGCGGAA CAGGAGCGTA TGCAATAACT
AAAGACGGAG TATTTGCTAC GATAGTAGCG AAAGTAAAAT CCGGAGCACC TAACGGACTC
AGTGTAATCA AATTTGTAGA AGTAGGCGGA TTTGCGAACA ATGACCTTGT AGAACAGAAG
ACACAGTTCT TTGACGGTGG AGTAAATGTT GGAGATACAA CAGAACCTGC AACACCTACA
ACACCTGTAA CAACACCGAC AACAACAGAT GATCTGGATG CAGTAAGGAT TAAAGTGGAC
ACAGTAAATG CAAAACCGGG AGACACAGTA AGAATACCTG TAAGATTCAG CGGTATACCA
TCCAAGGGAA TAGCAAACTG TGACTTTGTA TACAGCTATG ACCCGAATGT ACTTGAGATA
ATAGAGATAG AACCGGGAGA CATAATAGTT GACCCGAATC CTGACAAGAG CTTTGATACT
GCAGTATATC CTGACAGAAA GATAATAGTA TTCCTGTTTG CAGAAGACAG CGGAACAGGA
GCGTATGCAA TAACTAAAGA CGGAGTATTT GCTACGATAG TAGCGAAAGT AAAAGAAGGA
GCACCTAACG GACTCAGTGT AATCAAATTT GTAGAAGTAG GCGGATTTGC GAACAATGAC
CTTGTAGAAC AGAAGACACA GTTCTTTGAC GGTGGAGTAA ATGTTGGAGA TACAACAGAA
CCTGCAACAC CTACAACACC TGTAACAACA CCGACAACAA CAGATGATCT GGATGCAGTA
AGGATTAAAG TGGACACAGT AAATGCAAAA CCGGGAGACA CAGTAAGAAT ACCTGTAAGA
TTCAGCGGTA TACCATCCAA GGGAATAGCA AACTGTGACT TTGTATACAG CTATGACCCG
AATGTACTTG AGATAATAGA GATAGAACCG GGAGAATTGA TAGTTGACCC GAATCCTACC
AAGAGCTTTG ATACTGCAGT ATATCCTGAC AGAAAGATGA TAGTATTCCT GTTTGCGGAA
GACAGCGGAA CAGGAGCGTA TGCAATAACT GAAGATGGAG TATTTGCTAC GATAGTAGCG
AAAGTAAAAT CCGGAGCACC TAACGGACTC AGTGTAATCA AATTTGTAGA AGTAGGCGGA
TTTGCGAACA ATGACCTTGT AGAACAGAAG ACACAGTTCT TTGACGGTGG AGTAAATGTT
GGAGATACAA CAGAACCTGC AACACCTACA ACACCTGTAA CAACACCGAC AACAACAGAT
GATCTGGATG CAGTAAGGAT TAAAGTGGAC ACAGTAAATG CAAAACCGGG AGACACAGTA
AGAATACCTG TAAGATTCAG CGGTATACCA TCCAAGGGAA TAGCAAACTG TGACTTTGTA
TACAGCTATG ACCCGAATGT ACTTGAGATA ATAGAGATAG AACCGGGAGA CATAATAGTT
GACCCGAATC CTGACAAGAG CTTTGATACT GCAGTATATC CTGACAGAAA GATAATAGTA
TTCCTGTTTG CAGAAGACAG CGGAACGGGA GCGTATGCAA TAACTAAAGA CGGAGTATTT
GCTACGATAG TAGCGAAAGT AAAAGAAGGA GCACCTAACG GACTCAGTGT AATCAAATTT
GTAGAAGTAG GCGGATTTGC GAACAATGAC CTTGTAGAAC AGAAGACACA GTTCTTTGAC
GGTGGAGTAA ATGTTGGAGA TACAACAGTA CCTACAACAT CGCCGACAAC AACACCGCCA
GAGCCGACGA TAACTCCGAA CAAGTTGACA CTTAAGATAG GCAGAGCAGA AGGAAGACCT
GGAGACACGG TGGAAATACC GGTTAACTTG TATGGAGTAC CTCAAAAAGG AATAGCAAGC
GGTGACTTCG TAGTAAGCTA TGACCCGAAT GTACTTGAGA TAATAGAGAT AGAACCGGGA
GAATTGATAG TTGACCCGAA TCCTACCAAG AGCTTTGATA CTGCAGTATA TCCTGACAGA
AAGATGATAG TATTCCTGTT TGCGGAAGAC AGCGGAACAG GAGCGTATGC AATAACTGAA
GATGGAGTAT TTGCTACGAT AGTAGCGAAA GTAAAAGAAG GAGCACCTGA AGGATTCAGT
GCAATAGAAA TTTCTGAGTT TGGTGCATTT GCAGATAATG ATCTGGTAGA AGTGGAAACT
GACCTTATCA ATGGTGGAGT ACTTGTAACT AATAAACCTG TAATAGAAGG ATATAAAGTA
TCCGGATACA TTTTGCCAGA CTTCTCCTTC GACGCTACTG TTGCACCACT TGTAAAGGCC
GGATTCAAAG TTGAAATAGT AGGAACAGAA TTGTATGCAG TAACAGATGC AAACGGATAC
TTTGAAATAA CCGGAGTACC TGCAAATGCA AGCGGATATA CATTGAAGAT TTCAAGAGCA
ACTTACTTGG ACAGAGTAAT TGCAAATGTT GTAGTAACGG GAGATACTTC AGTTTCAACT
TCACAGGCTC CAATAATGAT GTGGGTAGGA GACATAGTGA AAGACAATTC TATCAACCTG
TTGGACGTTG CAGAAGTTAT CCGTTGCTTC AACGCTACTA AAGGAAGCGC AAACTACGTA
GAAGAACTTG ACATTAATAG AAACGGCGCA ATTAACATGC AAGACATAAT GATTGTTCAT
AAGCACTTTG GAGCTACATC AAGTGATTAC GACGCACAGT AA
 
Protein sequence
MRKVISMLLV VAMLTTIFAA MIPQTVSAAT MTVEIGKVTA AVGSKVEIPI TLKGVPSKGM 
ANCDFVLGYD PNVLEVTEVK PGSIIKDPDP SKSFDSAIYP DRKMIVFLFA EDSGRGTYAI
TQDGVFATIV ATVKSAAAAP ITLLEVGAFA DNDLVEISTT FVAGGVNLGS SVPTTQPNVP
SDGVVVEIGK VTGSVGTTVE IPVYFRGVPS KGIANCDFVF RYDPNVLEII GIDPGDIIVD
PNPTKSFDTA IYPDRKIIVF LFAEDSGTGA YAITKDGVFA KIRATVKSSA PGYITFDEVG
GFADNDLVEQ KVSFIDGGVN VGNATPTKGA TPTNTATPTK SATATPTRPS VPTNTPTNTP
ANTPVSGNLK VEFYNSNPSD TTNSINPQFK VTNTGSSAID LSKLTLRYYY TVDGQKDQTF
WCDHAAIIGS NGSYNGITSN VKGTFVKMSS STNNADTYLE ISFTGGTLEP GAHVQIQGRF
AKNDWSNYTQ SNDYSFKSAS QFVEWDQVTA YLNGVLVWGK EPGGSVVPST QPVTTPPATT
KPPATTKPPA TTIPPSDDPN AIKIKVDTVN AKPGDTVNIP VRFSGIPSKG IANCDFVYSY
DPNVLEIIEI KPGELIVDPN PDKSFDTAVY PDRKIIVFLF AEDSGTGAYA ITKDGVFATI
VAKVKSGAPN GLSVIKFVEV GGFANNDLVE QRTQFFDGGV NVGDTTVPTT PTTPVTTPTD
DSNAVRIKVD TVNAKPGDTV RIPVRFSGIP SKGIANCDFV YSYDPNVLEI IEIEPGDIIV
DPNPDKSFDT AVYPDRKIIV FLFAEDSGTG AYAITKDGVF ATIVAKVKSG APNGLSVIKF
VEVGGFANND LVEQKTQFFD GGVNVGDTTE PATPTTPVTT PTTTDDLDAV RIKVDTVNAK
PGDTVRIPVR FSGIPSKGIA NCDFVYSYDP NVLEIIEIEP GDIIVDPNPD KSFDTAVYPD
RKIIVFLFAE DSGTGAYAIT KDGVFATIVA KVKSGAPNGL SVIKFVEVGG FANNDLVEQK
TQFFDGGVNV GDTTEPATPT TPVTTPTTTD DLDAVRIKVD TVNAKPGDTV RIPVRFSGIP
SKGIANCDFV YSYDPNVLEI IEIEPGDIIV DPNPDKSFDT AVYPDRKIIV FLFAEDSGTG
AYAITKDGVF ATIVAKVKEG APNGLSVIKF VEVGGFANND LVEQKTQFFD GGVNVGDTTE
PATPTTPVTT PTTTDDLDAV RIKVDTVNAK PGDTVRIPVR FSGIPSKGIA NCDFVYSYDP
NVLEIIEIEP GELIVDPNPT KSFDTAVYPD RKMIVFLFAE DSGTGAYAIT EDGVFATIVA
KVKSGAPNGL SVIKFVEVGG FANNDLVEQK TQFFDGGVNV GDTTEPATPT TPVTTPTTTD
DLDAVRIKVD TVNAKPGDTV RIPVRFSGIP SKGIANCDFV YSYDPNVLEI IEIEPGDIIV
DPNPDKSFDT AVYPDRKIIV FLFAEDSGTG AYAITKDGVF ATIVAKVKEG APNGLSVIKF
VEVGGFANND LVEQKTQFFD GGVNVGDTTV PTTSPTTTPP EPTITPNKLT LKIGRAEGRP
GDTVEIPVNL YGVPQKGIAS GDFVVSYDPN VLEIIEIEPG ELIVDPNPTK SFDTAVYPDR
KMIVFLFAED SGTGAYAITE DGVFATIVAK VKEGAPEGFS AIEISEFGAF ADNDLVEVET
DLINGGVLVT NKPVIEGYKV SGYILPDFSF DATVAPLVKA GFKVEIVGTE LYAVTDANGY
FEITGVPANA SGYTLKISRA TYLDRVIANV VVTGDTSVST SQAPIMMWVG DIVKDNSINL
LDVAEVIRCF NATKGSANYV EELDINRNGA INMQDIMIVH KHFGATSSDY DAQ