Gene Information |
Locus tag | Cthe_3077 |
Symbol | |
ID | 4809951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3619516 |
End bp | 3625077 |
Gene Length | 5562 bp |
Protein Length | 1853 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108501 |
Product | cellulosome anchoring protein, cohesin region |
Protein accession | YP_001039466 |
Protein GI | 125975556 |
COG category | |
COG ID | |
TIGRFAM ID | |
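The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3619516, 3625077)
print(length)                  # 5562
print(protein_length(length))  # 1853
```

Both results match the Gene Length (5562 bp) and Protein Length (1853 aa) rows of the record.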
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGAAAAG TCATCAGTAT GCTCTTAGTT GTGGCTATGC TGACGACGAT TTTTGCGGCG ATGATACCGC AGACAGTATC GGCGGCCACA ATGACAGTCG AGATCGGCAA AGTTACAGCA GCCGTTGGAT CAAAAGTAGA AATACCTATA ACCCTGAAAG GAGTGCCATC CAAAGGAATG GCCAATTGCG ACTTCGTATT GGGTTATGAT CCAAATGTGC TGGAAGTAAC AGAAGTAAAA CCAGGAAGCA TAATAAAAGA TCCGGATCCT AGCAAGAGCT TTGATAGCGC AATATATCCG GATCGAAAGA TGATTGTATT TCTGTTTGCA GAAGACAGTG GAAGAGGAAC GTATGCAATA ACTCAGGATG GAGTATTTGC AACAATTGTA GCCACTGTCA AATCAGCTGC AGCGGCACCG ATTACTTTGC TTGAAGTAGG TGCATTTGCG GACAACGATT TAGTAGAAAT AAGCACAACT TTTGTCGCGG GCGGAGTAAA TCTTGGTAGT TCCGTACCGA CAACACAGCC AAATGTTCCG TCAGACGGTG TGGTAGTAGA AATTGGCAAA GTTACGGGAT CTGTTGGAAC TACAGTTGAA ATACCTGTAT ATTTCAGAGG AGTTCCATCC AAAGGAATAG CAAACTGCGA CTTTGTGTTC AGATATGATC CGAATGTATT GGAAATTATA GGGATAGATC CCGGAGACAT AATAGTTGAC CCGAATCCTA CCAAGAGCTT TGATACTGCA ATATATCCTG ACAGAAAGAT AATAGTATTC CTGTTTGCGG AAGACAGCGG AACAGGAGCG TATGCAATAA CTAAAGACGG AGTATTTGCA AAAATAAGAG CAACTGTAAA ATCAAGTGCT CCGGGCTATA TTACTTTCGA CGAAGTAGGT GGATTTGCAG ATAATGACCT GGTAGAACAG AAGGTATCAT TTATAGACGG TGGTGTTAAC GTTGGCAATG CAACACCGAC CAAGGGAGCA ACACCAACAA ATACAGCTAC GCCGACAAAA TCAGCTACGG CTACGCCCAC CAGGCCATCG GTACCGACAA ACACACCGAC AAACACACCG GCAAATACAC CGGTATCAGG CAATTTGAAG GTTGAATTCT ACAACAGCAA TCCTTCAGAT ACTACTAACT CAATCAATCC TCAGTTCAAG GTTACTAATA CCGGAAGCAG TGCAATTGAT TTGTCCAAAC TCACATTGAG ATATTATTAT ACAGTAGACG GACAGAAAGA TCAGACCTTC TGGTGTGACC ATGCTGCAAT AATCGGCAGT AACGGCAGCT ACAACGGAAT TACTTCAAAT GTAAAAGGAA CATTTGTAAA AATGAGTTCC TCAACAAATA ACGCAGACAC CTACCTTGAA ATAAGCTTTA CAGGCGGAAC TCTTGAACCG GGTGCACATG TTCAGATACA AGGTAGATTT GCAAAGAATG ACTGGAGTAA CTATACACAG TCAAATGACT ACTCATTCAA GTCTGCTTCA CAGTTTGTTG AATGGGATCA GGTAACAGCA TACTTGAACG GTGTTCTTGT ATGGGGTAAA GAACCCGGTG GCAGTGTAGT ACCATCAACA CAGCCTGTAA CAACACCACC TGCAACAACA AAACCACCTG CAACAACAAA ACCACCTGCA ACAACAATAC CGCCGTCAGA TGATCCGAAT GCAATAAAGA TTAAGGTGGA CACAGTAAAT GCAAAACCGG GAGACACAGT AAATATACCT GTAAGATTCA GTGGTATACC ATCCAAGGGA ATAGCAAACT GTGACTTTGT ATACAGCTAT 
GACCCGAATG TACTTGAGAT AATAGAGATA AAACCGGGAG AATTGATAGT TGACCCGAAT CCTGACAAGA GCTTTGATAC TGCAGTATAT CCTGACAGAA AGATAATAGT ATTCCTGTTT GCAGAAGACA GCGGAACAGG AGCGTATGCA ATAACTAAAG ACGGAGTATT TGCTACGATA GTAGCGAAAG TAAAATCCGG AGCACCTAAC GGACTCAGTG TAATCAAATT TGTAGAAGTA GGCGGATTTG CGAACAATGA CCTTGTAGAA CAGAGGACAC AGTTCTTTGA CGGTGGAGTA AATGTTGGAG ATACAACAGT ACCTACAACA CCTACAACAC CTGTAACAAC ACCGACAGAT GATTCGAATG CAGTAAGGAT TAAGGTGGAC ACAGTAAATG CAAAACCGGG AGACACAGTA AGAATACCTG TAAGATTCAG CGGTATACCA TCCAAGGGAA TAGCAAACTG TGACTTTGTA TACAGCTATG ACCCGAATGT ACTTGAGATA ATAGAGATAG AACCGGGAGA CATAATAGTT GACCCGAATC CTGACAAGAG CTTTGATACT GCAGTATATC CTGACAGAAA GATAATAGTA TTCCTGTTTG CGGAAGACAG CGGAACAGGA GCGTATGCAA TAACTAAAGA CGGAGTATTT GCTACGATAG TAGCGAAAGT AAAATCCGGA GCACCTAACG GACTCAGTGT AATCAAATTT GTAGAAGTAG GCGGATTTGC GAACAATGAC CTTGTAGAAC AGAAGACACA GTTCTTTGAC GGTGGAGTAA ATGTTGGAGA TACAACAGAA CCTGCAACAC CTACAACACC TGTAACAACA CCGACAACAA CAGATGATCT GGATGCAGTA AGGATTAAAG TGGACACAGT AAATGCAAAA CCGGGAGACA CAGTAAGAAT ACCTGTAAGA TTCAGCGGTA TACCATCCAA GGGAATAGCA AACTGTGACT TTGTATACAG CTATGACCCG AATGTACTTG AGATAATAGA GATAGAACCG GGAGACATAA TAGTTGACCC GAATCCTGAC AAGAGCTTTG ATACTGCAGT ATATCCTGAC AGAAAGATAA TAGTATTCCT GTTTGCGGAA GACAGCGGAA CAGGAGCGTA TGCAATAACT AAAGACGGAG TATTTGCTAC GATAGTAGCG AAAGTAAAAT CCGGAGCACC TAACGGACTC AGTGTAATCA AATTTGTAGA AGTAGGCGGA TTTGCGAACA ATGACCTTGT AGAACAGAAG ACACAGTTCT TTGACGGTGG AGTAAATGTT GGAGATACAA CAGAACCTGC AACACCTACA ACACCTGTAA CAACACCGAC AACAACAGAT GATCTGGATG CAGTAAGGAT TAAAGTGGAC ACAGTAAATG CAAAACCGGG AGACACAGTA AGAATACCTG TAAGATTCAG CGGTATACCA TCCAAGGGAA TAGCAAACTG TGACTTTGTA TACAGCTATG ACCCGAATGT ACTTGAGATA ATAGAGATAG AACCGGGAGA CATAATAGTT GACCCGAATC CTGACAAGAG CTTTGATACT GCAGTATATC CTGACAGAAA GATAATAGTA TTCCTGTTTG CAGAAGACAG CGGAACAGGA GCGTATGCAA TAACTAAAGA CGGAGTATTT GCTACGATAG TAGCGAAAGT AAAAGAAGGA GCACCTAACG GACTCAGTGT AATCAAATTT GTAGAAGTAG GCGGATTTGC GAACAATGAC CTTGTAGAAC AGAAGACACA GTTCTTTGAC GGTGGAGTAA ATGTTGGAGA TACAACAGAA CCTGCAACAC 
CTACAACACC TGTAACAACA CCGACAACAA CAGATGATCT GGATGCAGTA AGGATTAAAG TGGACACAGT AAATGCAAAA CCGGGAGACA CAGTAAGAAT ACCTGTAAGA TTCAGCGGTA TACCATCCAA GGGAATAGCA AACTGTGACT TTGTATACAG CTATGACCCG AATGTACTTG AGATAATAGA GATAGAACCG GGAGAATTGA TAGTTGACCC GAATCCTACC AAGAGCTTTG ATACTGCAGT ATATCCTGAC AGAAAGATGA TAGTATTCCT GTTTGCGGAA GACAGCGGAA CAGGAGCGTA TGCAATAACT GAAGATGGAG TATTTGCTAC GATAGTAGCG AAAGTAAAAT CCGGAGCACC TAACGGACTC AGTGTAATCA AATTTGTAGA AGTAGGCGGA TTTGCGAACA ATGACCTTGT AGAACAGAAG ACACAGTTCT TTGACGGTGG AGTAAATGTT GGAGATACAA CAGAACCTGC AACACCTACA ACACCTGTAA CAACACCGAC AACAACAGAT GATCTGGATG CAGTAAGGAT TAAAGTGGAC ACAGTAAATG CAAAACCGGG AGACACAGTA AGAATACCTG TAAGATTCAG CGGTATACCA TCCAAGGGAA TAGCAAACTG TGACTTTGTA TACAGCTATG ACCCGAATGT ACTTGAGATA ATAGAGATAG AACCGGGAGA CATAATAGTT GACCCGAATC CTGACAAGAG CTTTGATACT GCAGTATATC CTGACAGAAA GATAATAGTA TTCCTGTTTG CAGAAGACAG CGGAACGGGA GCGTATGCAA TAACTAAAGA CGGAGTATTT GCTACGATAG TAGCGAAAGT AAAAGAAGGA GCACCTAACG GACTCAGTGT AATCAAATTT GTAGAAGTAG GCGGATTTGC GAACAATGAC CTTGTAGAAC AGAAGACACA GTTCTTTGAC GGTGGAGTAA ATGTTGGAGA TACAACAGTA CCTACAACAT CGCCGACAAC AACACCGCCA GAGCCGACGA TAACTCCGAA CAAGTTGACA CTTAAGATAG GCAGAGCAGA AGGAAGACCT GGAGACACGG TGGAAATACC GGTTAACTTG TATGGAGTAC CTCAAAAAGG AATAGCAAGC GGTGACTTCG TAGTAAGCTA TGACCCGAAT GTACTTGAGA TAATAGAGAT AGAACCGGGA GAATTGATAG TTGACCCGAA TCCTACCAAG AGCTTTGATA CTGCAGTATA TCCTGACAGA AAGATGATAG TATTCCTGTT TGCGGAAGAC AGCGGAACAG GAGCGTATGC AATAACTGAA GATGGAGTAT TTGCTACGAT AGTAGCGAAA GTAAAAGAAG GAGCACCTGA AGGATTCAGT GCAATAGAAA TTTCTGAGTT TGGTGCATTT GCAGATAATG ATCTGGTAGA AGTGGAAACT GACCTTATCA ATGGTGGAGT ACTTGTAACT AATAAACCTG TAATAGAAGG ATATAAAGTA TCCGGATACA TTTTGCCAGA CTTCTCCTTC GACGCTACTG TTGCACCACT TGTAAAGGCC GGATTCAAAG TTGAAATAGT AGGAACAGAA TTGTATGCAG TAACAGATGC AAACGGATAC TTTGAAATAA CCGGAGTACC TGCAAATGCA AGCGGATATA CATTGAAGAT TTCAAGAGCA ACTTACTTGG ACAGAGTAAT TGCAAATGTT GTAGTAACGG GAGATACTTC AGTTTCAACT TCACAGGCTC CAATAATGAT GTGGGTAGGA GACATAGTGA AAGACAATTC TATCAACCTG TTGGACGTTG CAGAAGTTAT 
CCGTTGCTTC AACGCTACTA AAGGAAGCGC AAACTACGTA GAAGAACTTG ACATTAATAG AAACGGCGCA ATTAACATGC AAGACATAAT GATTGTTCAT AAGCACTTTG GAGCTACATC AAGTGATTAC GACGCACAGT AA
Protein sequence | MRKVISMLLV VAMLTTIFAA MIPQTVSAAT MTVEIGKVTA AVGSKVEIPI TLKGVPSKGM ANCDFVLGYD PNVLEVTEVK PGSIIKDPDP SKSFDSAIYP DRKMIVFLFA EDSGRGTYAI TQDGVFATIV ATVKSAAAAP ITLLEVGAFA DNDLVEISTT FVAGGVNLGS SVPTTQPNVP SDGVVVEIGK VTGSVGTTVE IPVYFRGVPS KGIANCDFVF RYDPNVLEII GIDPGDIIVD PNPTKSFDTA IYPDRKIIVF LFAEDSGTGA YAITKDGVFA KIRATVKSSA PGYITFDEVG GFADNDLVEQ KVSFIDGGVN VGNATPTKGA TPTNTATPTK SATATPTRPS VPTNTPTNTP ANTPVSGNLK VEFYNSNPSD TTNSINPQFK VTNTGSSAID LSKLTLRYYY TVDGQKDQTF WCDHAAIIGS NGSYNGITSN VKGTFVKMSS STNNADTYLE ISFTGGTLEP GAHVQIQGRF AKNDWSNYTQ SNDYSFKSAS QFVEWDQVTA YLNGVLVWGK EPGGSVVPST QPVTTPPATT KPPATTKPPA TTIPPSDDPN AIKIKVDTVN AKPGDTVNIP VRFSGIPSKG IANCDFVYSY DPNVLEIIEI KPGELIVDPN PDKSFDTAVY PDRKIIVFLF AEDSGTGAYA ITKDGVFATI VAKVKSGAPN GLSVIKFVEV GGFANNDLVE QRTQFFDGGV NVGDTTVPTT PTTPVTTPTD DSNAVRIKVD TVNAKPGDTV RIPVRFSGIP SKGIANCDFV YSYDPNVLEI IEIEPGDIIV DPNPDKSFDT AVYPDRKIIV FLFAEDSGTG AYAITKDGVF ATIVAKVKSG APNGLSVIKF VEVGGFANND LVEQKTQFFD GGVNVGDTTE PATPTTPVTT PTTTDDLDAV RIKVDTVNAK PGDTVRIPVR FSGIPSKGIA NCDFVYSYDP NVLEIIEIEP GDIIVDPNPD KSFDTAVYPD RKIIVFLFAE DSGTGAYAIT KDGVFATIVA KVKSGAPNGL SVIKFVEVGG FANNDLVEQK TQFFDGGVNV GDTTEPATPT TPVTTPTTTD DLDAVRIKVD TVNAKPGDTV RIPVRFSGIP SKGIANCDFV YSYDPNVLEI IEIEPGDIIV DPNPDKSFDT AVYPDRKIIV FLFAEDSGTG AYAITKDGVF ATIVAKVKEG APNGLSVIKF VEVGGFANND LVEQKTQFFD GGVNVGDTTE PATPTTPVTT PTTTDDLDAV RIKVDTVNAK PGDTVRIPVR FSGIPSKGIA NCDFVYSYDP NVLEIIEIEP GELIVDPNPT KSFDTAVYPD RKMIVFLFAE DSGTGAYAIT EDGVFATIVA KVKSGAPNGL SVIKFVEVGG FANNDLVEQK TQFFDGGVNV GDTTEPATPT TPVTTPTTTD DLDAVRIKVD TVNAKPGDTV RIPVRFSGIP SKGIANCDFV YSYDPNVLEI IEIEPGDIIV DPNPDKSFDT AVYPDRKIIV FLFAEDSGTG AYAITKDGVF ATIVAKVKEG APNGLSVIKF VEVGGFANND LVEQKTQFFD GGVNVGDTTV PTTSPTTTPP EPTITPNKLT LKIGRAEGRP GDTVEIPVNL YGVPQKGIAS GDFVVSYDPN VLEIIEIEPG ELIVDPNPTK SFDTAVYPDR KMIVFLFAED SGTGAYAITE DGVFATIVAK VKEGAPEGFS AIEISEFGAF ADNDLVEVET DLINGGVLVT NKPVIEGYKV SGYILPDFSF DATVAPLVKA GFKVEIVGTE LYAVTDANGY FEITGVPANA SGYTLKISRA TYLDRVIANV VVTGDTSVST SQAPIMMWVG DIVKDNSINL 
LDVAEVIRCF NATKGSANYV EELDINRNGA INMQDIMIVH KHFGATSSDY DAQ
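The sequences above are displayed in space-separated blocks of ten characters wrapped over several lines, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction helper and a simple translation routine. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons). All function names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown in the record.
fragment = clean_sequence("ATGAGAAAAG TCATCAGTAT\nGCTCTTAGTT")
print(len(fragment))        # 30
print(translate(fragment))  # MRKVISMLLV
```

The translated fragment matches the first block of the protein sequence (MRKVISMLLV). Running `gc_fraction` on the full cleaned gene sequence would give a value to compare against the GC content row.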