Gene Information |
Locus tag | Cthe_0412 |
Symbol | |
ID | 4808415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 516253 |
End bp | 518940 |
Gene Length | 2688 bp |
Protein Length | 895 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105826 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036843 |
Protein GI | 125972933 |
COG category | |
COG ID | |
TIGRFAM ID | |
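
The coordinate and length fields in the table above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table.
START_BP = 516253
END_BP = 518940


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)
```

Under these assumptions the computed values agree with the reported Gene Length (2688 bp) and Protein Length (895 aa).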
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000331233 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTCA GAAGAATGTT GTGCGCAGCC ATAGTGTTGA CAATTGTACT GTCCATTATG CTGCCGTCAA CTGTTTTTGC TTTGGAAGAC AAGTCTCCAA AGTTGCCGGA TTATAAAAAC GACCTTTTGT ATGAAAGAAC ATTCGACGAA GGTCTTTGCT TTCCGTGGCA TACTTGCGAA GACAGTGGAG GAAAATGTGA TTTCGCTGTT GTTGATGTTC CAGGAGAGCC TGGGAACAAA GCTTTCCGCT TGACAGTAAT TGACAAAGGA CAAAACAAGT GGAGTGTCCA GATGAGACAC AGAGGTATTA CCCTCGAGCA AGGACATACA TACACGGTAA GGTTTACGAT TTGGTCTGAC AAATCCTGTA GGGTTTATGC TAAAATTGGT CAGATGGGTG AACCCTATAC TGAATATTGG AACAATAACT GGAATCCATT CAACCTTACA CCAGGACAGA AGCTTACAGT TGAACAGAAT TTTACAATGA ACTATCCTAC TGATGACACA TGCGAGTTCA CATTCCATTT GGGTGGAGAA CTTGCTGCAG GTACACCTTA CTATGTTTAC CTTGATGATG TATCTCTCTA CGATCCTAGG TTTGTAAAGC CTGTTGAATA TGTACTTCCG CAGCCGGATG TACGTGTTAA CCAGGTAGGA TACTTACCGT TTGCAAAGAA GTATGCTACT GTTGTATCTT CTTCAACCAG CCCGCTTAAG TGGCAGCTTC TCAATTCGGC AAATCAGGTT GTTTTGGAAG GTAATACAAT ACCAAAAGGA CTTGACAAAG ATTCACAGGA TTATGTACAT TGGATAGATT TCTCCAACTT TAAGACTGAA GGAAAAGGTT ATTACTTCAA GCTTCCGACT GTAAACAGCG ATACAAATTA CAGCCATCCT TTCGATATCA GTGCTGATAT TTACTCCAAG ATGAAATTTG ATGCATTGGC ATTCTTCTAT CACAAGAGAA GCGGTATTCC TATTGAAATG CCGTATGCAG GAGGAGAACA GTGGACCAGA CCTGCAGGAC ATATTGGTGT TGCTCCGAAC AAAGGAGACA CAAATGTTCC TACATGGCCT CAGGATGATG AATATGCAGG AAGACCTCAA AAATATTATA CAAAAGATGT AACCGGTGGA TGGTATGATG CCGGTGACCA CGGTAAATAT GTTGTAAACG GCGGTATAGC TGTTTGGACA TTGATGAACA TGTATGAAAG GGCAAAAATC AGAGGCATAG CTAATCAAGG TGCTTATAAA GACGGTGGAA TGAACATACC GGAGAGAAAT AACGGTTATC CGGACATTCT TGATGAAGCA AGATGGGAAA TTGAGTTCTT TAAGAAAATG CAGGTAACTG AAAAAGAGGA TCCTTCCATA GCCGGAATGG TACACCACAA AATTCACGAC TTCAGATGGA CTGCTTTGGG TATGTTGCCT CACGAAGATC CCCAGCCACG TTACTTAAGG CCGGTAAGTA CGGCTGCGAC TTTGAACTTT GCGGCAACTT TGGCACAAAG TGCACGTCTT TGGAAAGATT ATGATCCGAC TTTTGCTGCT GACTGTTTGG AAAAGGCTGA AATAGCATGG CAGGCGGCAT TAAAGCATCC TGATATTTAT GCTGAGTATA CTCCCGGTAG CGGTGGTCCC GGAGGCGGAC CATACAATGA CGACTATGTC GGAGACGAAT TCTACTGGGC AGCCTGCGAA CTTTATGTAA CAACAGGAAA AGACGAATAT AAGAATTACC TGATGAATTC ACCTCACTAT CTTGAAATGC CTGCAAAGAT GGGTGAAAAC 
GGTGGAGCAA ACGGAGAAGA CAACGGATTG TGGGGATGCT TCACCTGGGG AACTACTCAA GGATTGGGAA CTATTACTCT TGCATTAGTT GAAAACGGAT TGCCGTCTGC AGACATTCAA AAGGCAAGAA ACAATATAGC TAAAGCTGCA GACAAATGGC TTGAGAATAT TGAAGAGCAA GGTTACAGAC TGCCGATCAA ACAGGCGGAG GATGAGAGAG GCGGTTATCC ATGGGGTTCA AACTCCTTCA TTTTGAACCA GATGATAGTT ATGGGATACG CATATGACTT TACAGGCAAC AGCAAGTATC TTGACGGAAT GCAGGATGGT ATGAGCTACC TGTTGGGAAG AAACGGACTG GATCAGTCCT ATGTAACAGG GTATGGTGAG CGTCCACTTC AGAATCCTCA TGACAGATTC TGGACGCCAC AGACAAGTAA GAAATTCCCT GCTCCACCTC CGGGTATAAT TGCCGGTGGT CCGAACTCCC GTTTCGAAGA CCCGACAATA ACTGCAGCAG TTAAGAAGGA TACACCGCCG CAGAAGTGCT ACATTGACCA TACAGACTCA TGGTCAACCA ACGAGATAAC TATTAACTGG AATGCTCCGT TTGCATGGGT TACAGCTTAT CTCGATGAAA TTGACTTAAT AACACCGCCA GGAGGAGTAG ACCCAGAAGA ACCGGAGGTT ATTTATGGTG ACTGCAATGG CGACGGAAAA GTTAATTCAA CTGACGCTGT GGCATTGAAG AGATATATCT TGAGATCAGG TATAAGCATC AACACTGATA ATGCTGATGT AAATGCTGAT GGCAGAGTTA ACTCTACAGA CTTGGCAATA TTGAAGAGAT ATATTCTTAA AGAGATAGAT GTATTGCCAC ATAAATAA
|
Protein sequence | MNFRRMLCAA IVLTIVLSIM LPSTVFALED KSPKLPDYKN DLLYERTFDE GLCFPWHTCE DSGGKCDFAV VDVPGEPGNK AFRLTVIDKG QNKWSVQMRH RGITLEQGHT YTVRFTIWSD KSCRVYAKIG QMGEPYTEYW NNNWNPFNLT PGQKLTVEQN FTMNYPTDDT CEFTFHLGGE LAAGTPYYVY LDDVSLYDPR FVKPVEYVLP QPDVRVNQVG YLPFAKKYAT VVSSSTSPLK WQLLNSANQV VLEGNTIPKG LDKDSQDYVH WIDFSNFKTE GKGYYFKLPT VNSDTNYSHP FDISADIYSK MKFDALAFFY HKRSGIPIEM PYAGGEQWTR PAGHIGVAPN KGDTNVPTWP QDDEYAGRPQ KYYTKDVTGG WYDAGDHGKY VVNGGIAVWT LMNMYERAKI RGIANQGAYK DGGMNIPERN NGYPDILDEA RWEIEFFKKM QVTEKEDPSI AGMVHHKIHD FRWTALGMLP HEDPQPRYLR PVSTAATLNF AATLAQSARL WKDYDPTFAA DCLEKAEIAW QAALKHPDIY AEYTPGSGGP GGGPYNDDYV GDEFYWAACE LYVTTGKDEY KNYLMNSPHY LEMPAKMGEN GGANGEDNGL WGCFTWGTTQ GLGTITLALV ENGLPSADIQ KARNNIAKAA DKWLENIEEQ GYRLPIKQAE DERGGYPWGS NSFILNQMIV MGYAYDFTGN SKYLDGMQDG MSYLLGRNGL DQSYVTGYGE RPLQNPHDRF WTPQTSKKFP APPPGIIAGG PNSRFEDPTI TAAVKKDTPP QKCYIDHTDS WSTNEITINW NAPFAWVTAY LDEIDLITPP GGVDPEEPEV IYGDCNGDGK VNSTDAVALK RYILRSGISI NTDNADVNAD GRVNSTDLAI LKRYILKEID VLPHK