Gene Information |
Locus tag | Cthe_0413 |
Symbol | |
ID | 4808416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 519467 |
End bp | 523141 |
Gene Length | 3675 bp |
Protein Length | 1224 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105827 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036844 |
Protein GI | 125972934 |
COG category | |
COG ID | |
TIGRFAM ID | |
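The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (so the protein is one codon shorter than the CDS). The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
START_BP = 519467
END_BP = 523141


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 3675 1224
```

Both values agree with the Gene Length (3675 bp) and Protein Length (1224 aa) fields.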
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAATTTA GAAGGTCAAT TTGTACTGCT GTTTTGTTGG CGGTTTTATT GACACTTCTG GTACCGACAT CCGTGTTTGC CTTAGAAGAT AATTCTTCGA CTTTGCCGCC GTATAAAAAC GACCTTTTGT ATGAGAGGAC TTTTGATGAG GGACTTTGTT ATCCATGGCA TACCTGTGAA GACAGCGGAG GAAAATGCTC CTTTGATGTG GTCGATGTTC CGGGGCAGCC CGGTAATAAA GCATTTGCCG TTACTGTTCT TGACAAAGGG CAAAACAGAT GGAGCGTTCA GATGAGACAC CGTGGTCTTA CTCTTGAACA GGGACATACA TATAGAGTAC GGCTTAAGAT TTGGGCAGAT GCGTCCTGTA AAGTTTATAT AAAAATAGGA CAAATGGGCG AGCCCTATGC TGAATATTGG AACAACAAGT GGAGTCCATA CACACTGACA GCAGGTAAGG TATTGGAAAT TGACGAGACG TTTGTTATGG ACAAGCCAAC TGACGACACA TGCGAATTTA CATTCCATTT AGGTGGCGAA TTGGCAGCAA CTCCTCCATA TACAGTTTAT CTTGATGATG TATCCCTTTA TGACCCAGAA TATACGAAGC CTGTTGAATA TATACTTCCG CAGCCTGATG TACGTGTGAA CCAGGTTGGC TACCTGCCGG AGGGCAAGAA AGTTGCCACT GTGGTATGCA ATTCAACTCA GCCGGTAAAA TGGCAGCTTA AGAATGCTGC AGGCGTTGTA GTTTTGGAAG GTTATACCGA ACCAAAGGGT CTTGACAAAG ACTCGCAGGA TTATGTACAT TGGCTTGATT TTTCCGATTT TGCAACCGAA GGAATTGGTT ACTATTTTGA ACTTCCGACT GTAAACAGTC CTACAAACTA CAGTCATCCA TTTGACATTC GCAAAGACAT CTATACTCAG ATGAAATATG ATGCATTGGC ATTCTTCTAT CACAAGAGAA GCGGTATTCC TATTGAAATG CCGTATGCAG GAGGAGAACA GTGGACCAGA CCTGCAGGAC ATATCGGAAT TGAGCCGAAC AAGGGAGATA CAAATGTTCC TACATGGCCT CAGGATGATG AGTATGCAGG AATACCTCAG AAGAATTATA CAAAGGATGT AACCGGTGGA TGGTATGATG CCGGTGACCA CGGTAAATAT GTTGTAAACG GCGGTATAGC CGTCTGGACA TTAATGAACA TGTATGAGAG GGCAAAAATT AGAGGTCTTG ACAACTGGGG ACCATACAGG GACGGCGGAA TGAACATACC GGAGCAGAAT AACGGTTATC CGGACATTCT TGATGAAGCA AGATGGGAAA TTGAGTTCTT TAAGAAAATG CAGGTAACTG AAAAAGAGGA TCCTTCCATA GCCGGAATGG TACACCACAA AATTCACGAC TTCAGATGGA CTGCTTTGGG TATGTTGCCT CACGAAGATC CCCAGCCACG TTACTTAAGG CCGGTAAGTA CGGCTGCGAC TTTGAACTTT GCGGCAACTT TGGCACAAAG TGCACGTCTT TGGAAAGATT ATGATCCGAC TTTTGCTGCT GACTGTTTGG AAAAGGCTGA AATAGCATGG CAGGCGGCAT TAAAGCATCC TGATATTTAT GCTGAGTATA CTCCCGGTAG CGGTGGTCCC GGAGGCGGAC CATACAATGA CGACTATGTC GGAGACGAAT TCTACTGGGC AGCCTGCGAA CTTTATGTAA CAACAGGAAA AGACGAATAT AAGAATTACC TGATGAATTC ACCTCACTAT CTTGAAATGC CTGCAAAGAT GGGTGAAAAC 
GGTGGAGCAA ACGGAGAAGA CAACGGATTG TGGGGATGCT TCACCTGGGG AACTACTCAA GGATTGGGAA CCATTACTCT TGCATTGGTT GAAAACGGAT TGCCTGCTAC AGACATTCAA AAGGCAAGAA ACAATATAGC TAAAGCTGCT GACAGATGGC TTGAGAATAT TGAAGAGCAA GGTTACAGAC TGCCGATCAA ACAGGCGGAG GATGAGAGAG GCGGTTATCC ATGGGGTTCA AACTCCTTCA TTTTGAACCA GATGATAGTT ATGGGATATG CCTATGACTT TACAGGTGAC TCCAAATATC TCGATGGAAT GTTTGACGGC ATAAGCTACC TGTTGGGAAG AAACGCAATG GATCAGTCCT ATGTAACAGG GTATGGTGAG CGTCCGCTTC AGAATCCTCA TGACAGGTTC TGGACGCCGC AGACAAGTAA GAGATTCCCT GCTCCACCTC CGGGTATAAT TTCCGGCGGT CCGAACTCCC GTTTCGAGGA CCCGACAATA AATGCGGCCG TTAAGAAGGA TACACCGCCA CAGAAATGTT TTATCGACCA TACAGACTCA TGGTCAACCA ACGAGATAAC TGTTAACTGG AATGCTCCGT TTGCATGGGT TACAGCTTAT CTTGACGAGC AGTACACAGA CAGTGAAACC GATAAGGTAA CTATTGATTC GCCTGTTGCA GGAGAAAGAT TTGAAGCCGG TAAAGACATT AATATAAGCG CAACTGTTAA ATCAAAAACT CCTGTAAGCA AAGTAGAGTT TTACAATGGA GATACGCTTA TTTCCAGTGA CACAACTGCA CCTTACACAG CAAAGATAAC AGGAGCCGCT GTCGGAGCAT ATAACCTTAA AGCGGTTGCA GTGCTGTCTG ACGGAAGAAG AATTGAGTCA CCGGTAACTC CTGTACTTGT TAAGGTAATT GTGAAACCTA CTGTAAAACT TACTGCACCC AAGTCAAATG TTGTGGCTTA TGGAAATGAG TTCCTGAAGA TTACAGCAAC AGCCAGTGAC TCTGACGGCA AAATCTCCAG GGTTGATTTC CTTGTTGACG GTGAAGTAAT CGGTTCAGAC AGGGAAGCAC CTTATGAATA TGAGTGGAAA GCTGTGGAAG GCAATCACGA AATAAGTGTA ATTGCTTATG ATGATGACGA TGCGGCTTCA ACACCTGATT CCGTAAAAAT ATTTGTAAAA CAGGCACGGG ATGTAAAAGT ACAGTATTTG TGCGAAAATA CGCAAACATC CACTCAGGAA ATCAAGGGTA AATTCAATAT AGTTAACACA GGAAACAGAG ATTATTCGCT GAAAGATATA GTATTAAGAT ACTACTTTAC CAAGGAGCAC AATTCACAGC TTCAGTTTAT CTGCTATTAT ACACCCATAG GCTCCGGAAA TCTCATTCCG TCCTTTGGCG GCTCGGGTGA CGAGCATTAT CTGCAGCTGG AATTCAAAGA TGTCAAGCTG CCTGCCGGCG GTCAGACTGG GGAAATACAG TTTGTTATAA GATATGCAGA TAACTCCTTC CATGATCAGT CGAACGACTA TTCGTTCGAT CCAACTATAA AAGCGTTCCA GGATTATGGC AAGGTTACCC TGTATAAGAA TGGAGAACTT GTTTGGGGAA CGCCGCCGGG CGGTACAGAA CCTGAAGAAC CGGAAGAGCC TGCGATAGTT TACGGCGACT GTAATGATGA CGGCAAAGTA AATTCAACAG ACGTCGCAGT AATGAAGAGA TATTTAAAGA AAGAAAATGT TAATATTAAT CTTGACAATG CAGATGTGAA TGCGGACGGC AAAGTTAACT 
CAACAGACTT CTCAATACTT AAGAGATATG TTATGAAGAA CATAGAAGAA TTGCCATATC GATAA
Protein sequence | MKFRRSICTA VLLAVLLTLL VPTSVFALED NSSTLPPYKN DLLYERTFDE GLCYPWHTCE DSGGKCSFDV VDVPGQPGNK AFAVTVLDKG QNRWSVQMRH RGLTLEQGHT YRVRLKIWAD ASCKVYIKIG QMGEPYAEYW NNKWSPYTLT AGKVLEIDET FVMDKPTDDT CEFTFHLGGE LAATPPYTVY LDDVSLYDPE YTKPVEYILP QPDVRVNQVG YLPEGKKVAT VVCNSTQPVK WQLKNAAGVV VLEGYTEPKG LDKDSQDYVH WLDFSDFATE GIGYYFELPT VNSPTNYSHP FDIRKDIYTQ MKYDALAFFY HKRSGIPIEM PYAGGEQWTR PAGHIGIEPN KGDTNVPTWP QDDEYAGIPQ KNYTKDVTGG WYDAGDHGKY VVNGGIAVWT LMNMYERAKI RGLDNWGPYR DGGMNIPEQN NGYPDILDEA RWEIEFFKKM QVTEKEDPSI AGMVHHKIHD FRWTALGMLP HEDPQPRYLR PVSTAATLNF AATLAQSARL WKDYDPTFAA DCLEKAEIAW QAALKHPDIY AEYTPGSGGP GGGPYNDDYV GDEFYWAACE LYVTTGKDEY KNYLMNSPHY LEMPAKMGEN GGANGEDNGL WGCFTWGTTQ GLGTITLALV ENGLPATDIQ KARNNIAKAA DRWLENIEEQ GYRLPIKQAE DERGGYPWGS NSFILNQMIV MGYAYDFTGD SKYLDGMFDG ISYLLGRNAM DQSYVTGYGE RPLQNPHDRF WTPQTSKRFP APPPGIISGG PNSRFEDPTI NAAVKKDTPP QKCFIDHTDS WSTNEITVNW NAPFAWVTAY LDEQYTDSET DKVTIDSPVA GERFEAGKDI NISATVKSKT PVSKVEFYNG DTLISSDTTA PYTAKITGAA VGAYNLKAVA VLSDGRRIES PVTPVLVKVI VKPTVKLTAP KSNVVAYGNE FLKITATASD SDGKISRVDF LVDGEVIGSD REAPYEYEWK AVEGNHEISV IAYDDDDAAS TPDSVKIFVK QARDVKVQYL CENTQTSTQE IKGKFNIVNT GNRDYSLKDI VLRYYFTKEH NSQLQFICYY TPIGSGNLIP SFGGSGDEHY LQLEFKDVKL PAGGQTGEIQ FVIRYADNSF HDQSNDYSFD PTIKAFQDYG KVTLYKNGEL VWGTPPGGTE PEEPEEPAIV YGDCNDDGKV NSTDVAVMKR YLKKENVNIN LDNADVNADG KVNSTDFSIL KRYVMKNIEE LPYR
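The relationship between the gene sequence and the protein sequence can be spot-checked by translating the ends of the CDS. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string (translation table 11 uses the same codon-to-residue assignments as the standard code and differs only in which start codons are permitted), ignores the whitespace that groups the bases in blocks of ten, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard NCBI codon ordering: each base position iterates over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - 2, 3):  # trailing partial codon is ignored
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 20 bases and last 15 bases of the gene sequence shown above.
print(translate("ATGAAATTTA GAAGGTCAAT"))  # MKFRRS
print(translate("TTGCCATATC GATAA"))       # LPYR
```

The two results match the first six residues (MKFRRS) and the last four residues (LPYR) of the protein sequence in the record.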