Gene Information |
Locus tag | Cthe_1257 |
Symbol | |
ID | 4809762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1525272 |
End bp | 1528424 |
Gene Length | 3153 bp |
Protein Length | 1050 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106680 |
Product | carbohydrate-binding, CenC-like protein |
Protein accession | YP_001037682 |
Protein GI | 125973772 |
COG category | |
COG ID | |
TIGRFAM ID | |
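
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1525272
end_bp = 1528424
gene_length_bp = 3153
protein_length_aa = 1050

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 3153 0 1050
```

Under these assumptions the reported gene length (3153 bp) and protein length (1050 aa) agree with the listed coordinates.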
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACGGA GATTATCATA TAAGCAAAGT GCATTGCTTA TTGCAGTGTG TATATTAATA CAAATCATAT CTTTAGCTTC TTTTGGTTTG AATGTCTATG CAAATTCTGC AAACATAAGT TTGGAATTCT ATAATGGAGA TTTTGGTGCT TCTGTTTCTT CCATTTCTAT GAATTTTAGA ATTACCAACA ATGGTTCTTC TCAAATTTCA CTTTCGGACA TTAAATTAAG ATACTATTTT ACTGATGACG GAGTGTCTCC AATTACTGTT TTTATCGATT ATGCCAACAA TAACGGCAGA GGAATTAACA ACGATGTTAC TTATACTATA AAAGATATAA ATTCATCCGG AGCAAACAAA TATATTGAAT TTGGATTTAA CGCTCAGGCG GGAAGTCTTG AACCCAATAC ATCCGTGTTA ATGAGAGCCA GAGCCTATCA ATCCGAATAC AAACAGAGTT TTACACAAAC CAATGATTAT TCTTTCTGTC AGTCTAATAA CGATTTTGCG GCATGGAACA AGGTTACTGG GTATTTGAAC GGCGTTTTGT TCAGTGGAAC GGAACCGGTA ATGTATAGTC CCACCCCTTC TTTGTCAACT CCGACTCCGC AAATCAGTCC GACGCCGGTT GCAACACCTT CGTCTGTTCC GACTTACCAG GCCACGCCAA CACCTTCAGG TTTTGAGACT CCTGTAAATT GGGATGGAAA AGCAGTTCCC AACGGAGATT TTGAGAGCGG ATCGGTATTT TGGAGTTTTT ACTGCGACAG CTTATCCGGT GCGAATGCCA CAAATTTGAT CCATTCCGAG CCTTCCGGAA ATAAAATGTC CAAAACTTCT ATTACGAATG CAGGCTCGAA TCACTGGGCA ATTCAATTGA AGCATGACGG AATTGTACTT GAAAATTTAA AAACCTACAG GCTTACTTTT GATGCAAAGT CAACGGTTCC GAGAAATATA AGGGTGTCAT TGCAGAATGC CACAAGCAGT ATGATTGAGT ATTTCGGCAA GATAGTGGAA GTTGAGCCAA AGATGAAGAC TTATACCTGT GAATTCACAT TTAACAGCAC AACAGGTACA AATGTGGCGA TTGTGTTTGA AATGGGGAAA ATCGGAACAG AGACGGATAA AGCCCATGAC ATTGTTCTTG ACAATGTACA TATTGAAAAA ATAGCTTCGC CTTCCGTATC GCCCGTGCCG TCGGAGGACC CTCAGGGAGC CGGTATCACC GCTTCGAGAA GTTCCGTATA TGAGGCGGAG CTTGGAGAAG AAGTTGACAT AACATTATCC CAAAGCGGAG AAATTGCTTT GGAAGGCAGG ATGGACACAG AAAAAGAAAT TGTGCTTGTA TTGGATAACT CAGGTGTGTT AAATTCTTAT GTGGAAGATA TTTTATCACC GCTGGACTTT GGAATATATT CAAATCATAA TTTAACTGTA CAGGGAAAAG ATGCTTCCAT TAACGGAAGT GTGCATGCAA ATGATGTATT TACTTCTACA GCGGATAGTA TAAGTATTTC TCAAACTTGC TCAGCTGCCA GCTTCCACAT TACAAGTAAA AACGTAAATA TAAATGAATA TAAAAACATT ACGATTCCAA TAGAAATGCC CAATTTCCAT TCTAAACTTA TTGATGATGC GATGCGCAAT TCCATGGTTT TCAGACCCGA GGATTATTTT TTGAGTTGGT TCCCCCAGCC AATGCCGGGT CAAGAAGATA TCTTTATATT TTATAACTTA ATTGCCGGAC GGTTTGAAAT ATTTGGAGCA GGTACACTTG TTATAAATTC TTCAATGTAT 
TTTATGGGCA ATGTTCTTAT ATCACTGACA AATACCAATA ACGTCGGTGA GGGATTTATT GTTGCTGACG GAAATATAAT TATACAGGGA CAAAACTTGT ATCCCAACGG ACCTAATGAC AAGTTGTATG TTTACTCCAT AGGGGGAAAT ATAGAGTTTC AGACCAGTAA CTCTACTATA AACGGAATAG TTTATGCACC TGGAAATCCT GCAAACCCTA ATTCAGGAAA AATTTTCTTC TCAGGTGACA AAAATACAAT TAACGGTTCT ATTGCAGCGA ACGAACTTGA TTTTTTTGCC GGCGGCCTTG TGGTAAATCA TACTGAAGGG CAATTTGATA CTGTTGAGGA AAAGTATATT GACAAATCCA CTTATTTAAA ATTGGTAAAA GATGCTGCAA AGAACTTTGT AGACAAGTTT GCGGGTTCTA AGACCAAAAT GGCCGTTATT CAGTATTCCG ATTCTGCCAA TGATAATGAT TTTAAAAAGT ATGATTTGTC TTTGCCTGAT AAAGGAGCTG CTTTGAAGGA GACTATTGAC AAAATTAAGC CCGGAACATC AGGCCTGAGC AACATGGGTG ACGGAATGAG AAGAGCCTAT CATATTCTTA ATGGTCCTCC TCCAAAAGGT CAGATTTCAA AATATATAGT CGTTATTACA GGCTCTGTAC CGAACCGGTG GACGGCAGTG GACAATAAAA AGAATGAACC GAAGACAGAC AACGGACGTG CTGATTTTAT AAAGGCTGAC AATGAGTCAT ACAATTCACT GGATTATGCA AAGGATATGG GAAGAATCAT TACTTCAAAG GGTATCAATC TTGTGTTTAT AGACTTTTCA GAAGAGGATA TTGGCGACGT ACTGGAAGAA ATTGCAGCAG AAAGCGGAGC AAAACCGTTA GAAGGAACAG ACAGACACTA TTACAAGGCA AATAATTTCC TGGAACTTCT GGATATTTTA AACAATATGA CTTTGAAGAT ATATTATGAT GTTGTGCTTG ACAAGGTTTT GTACGAAGAG ATTCTTCCGC AGGGAGTGCT GCTGGTTGAG GCTCCTGAGT GGATAAGCAC GGAAAGTGTG CCGATGGGCG GAGTTAACAG GATAAAGCTT ACCGGTGAGA TAAACAACAT TCCGTTTACG TTTACAGGCA CGGGTTACAG TTTTGTAGTT GAAAGCTTCA AAATAAAAGT AAAATTCCTC AAACCGGGCA CAATAGTATT TGACGGGGCA GATTCCAGAC TAAGATACAA TTTTAATTAC GTTGACGGAG CAGGAAACAT TCATTCAAAG AGCGTGGACA AACATTTTGA CGACATGACG GTGAATGTCA CGATGAAGGT TGATATAAAC TGA
|
Protein sequence | MLRRLSYKQS ALLIAVCILI QIISLASFGL NVYANSANIS LEFYNGDFGA SVSSISMNFR ITNNGSSQIS LSDIKLRYYF TDDGVSPITV FIDYANNNGR GINNDVTYTI KDINSSGANK YIEFGFNAQA GSLEPNTSVL MRARAYQSEY KQSFTQTNDY SFCQSNNDFA AWNKVTGYLN GVLFSGTEPV MYSPTPSLST PTPQISPTPV ATPSSVPTYQ ATPTPSGFET PVNWDGKAVP NGDFESGSVF WSFYCDSLSG ANATNLIHSE PSGNKMSKTS ITNAGSNHWA IQLKHDGIVL ENLKTYRLTF DAKSTVPRNI RVSLQNATSS MIEYFGKIVE VEPKMKTYTC EFTFNSTTGT NVAIVFEMGK IGTETDKAHD IVLDNVHIEK IASPSVSPVP SEDPQGAGIT ASRSSVYEAE LGEEVDITLS QSGEIALEGR MDTEKEIVLV LDNSGVLNSY VEDILSPLDF GIYSNHNLTV QGKDASINGS VHANDVFTST ADSISISQTC SAASFHITSK NVNINEYKNI TIPIEMPNFH SKLIDDAMRN SMVFRPEDYF LSWFPQPMPG QEDIFIFYNL IAGRFEIFGA GTLVINSSMY FMGNVLISLT NTNNVGEGFI VADGNIIIQG QNLYPNGPND KLYVYSIGGN IEFQTSNSTI NGIVYAPGNP ANPNSGKIFF SGDKNTINGS IAANELDFFA GGLVVNHTEG QFDTVEEKYI DKSTYLKLVK DAAKNFVDKF AGSKTKMAVI QYSDSANDND FKKYDLSLPD KGAALKETID KIKPGTSGLS NMGDGMRRAY HILNGPPPKG QISKYIVVIT GSVPNRWTAV DNKKNEPKTD NGRADFIKAD NESYNSLDYA KDMGRIITSK GINLVFIDFS EEDIGDVLEE IAAESGAKPL EGTDRHYYKA NNFLELLDIL NNMTLKIYYD VVLDKVLYEE ILPQGVLLVE APEWISTESV PMGGVNRIKL TGEINNIPFT FTGTGYSFVV ESFKIKVKFL KPGTIVFDGA DSRLRYNFNY VDGAGNIHSK SVDKHFDDMT VNVTMKVDIN
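
Although the gene lies on the minus strand, the gene sequence above appears to be given in coding orientation: it begins with ATG and ends with TGA. The sketch below illustrates how the first and last bases of the gene sequence map onto the two ends of the protein sequence. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in their permitted start codons), and it builds the codon table by hand rather than relying on an external library. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 33 bases of the gene sequence, copied from the record.
first_bases = "ATGCTACGGA GATTATCATA TAAGCAAAGT"
last_bases = "GTGAATGTCA CGATGAAGGT TGATATAAAC TGA"

print(translate(first_bases))  # prints: MLRRLSYKQS
print(translate(last_bases))   # prints: VNVTMKVDIN
```

Both outputs match the corresponding ends of the protein sequence. The helper raises a `KeyError` on any codon containing a character other than A, C, G, or T, so ambiguous bases would need separate handling.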