Gene Information |
Locus tag | Cthe_2194 |
Symbol | |
ID | 4811059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2616652 |
End bp | 2618157 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107600 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001038589 |
Protein GI | 125974679 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2382] Enterochelin esterase and related enzymes |
TIGRFAM ID | |
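The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2616652
end_bp = 2618157
gene_length_bp = 1506
protein_length_aa = 501

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the number of coding codons is (length / 3) - 1.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span from coordinates:", span_bp)
print("coding codons:", coding_codons)
print("span matches gene length:", span_bp == gene_length_bp)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the coordinate span equals the listed gene length, and the codon count equals the listed protein length.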
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.29628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAGAA AGGTTCTTAG TGTATTATTA ATTTGCCTGG TGCTTATAGC ATGTTTGGGC ACCGCAGTAA ACATTTCATC GGCAGCATCG CTGCCAACTA TGCCGCCGTC GGGGTATGAC CAGGTAAGGG GTGGCATCCA GAGAGGGCAG GTTGTTAATA TTTCTTATTA TTCCACAGCA ACAAACGGTA CACGGCCCGC AAAAGTTTAT TTGCCACCGG GATACTCGAC CAGTAAAAGG TATAGCGTTT TGTACCTATT GCATGGAATA GGGGGAAGCG AAGGCGATTG GTTTGCCGAT TGGGGAGGCA GAGCCAGCAT AATTGCCGAT AATCTGATTG CAGAGGGAAA AATCAAGCCT TTGATAATAG TTACACCCAA TACTAACGCA GCAGGACCTG GGATAGGTGA TGGTTACGAA AACTTTACAA AGGATTTAAT TAATTGCCTT ATTCCCTATA TAGAATCACG CTATTCCGTT TATACTGACC GTGAACATCG GGCAATTGCC GGTCTTTCAA TGGGAGGAGG TCAATCCTTT AATATTGGTT TGACCAACCT GGATAAATTT GCCTATATTG GTCCTATTTC TTCAGCTCCG AACACCTATC CCAATAACAG GCTGTTCCCC GATGGAGGAG CTGCTGCAAG GCAGAAGCTG AAATTGCTCT TCATTGCATG CGGAACCAAT GATTCTCTGA TAGGATTCGG ACAAAGGGTA CACGAATTTT GCGTTGCCAA TAATATTAAC CATATCTATT GGCTTATCCA GGGAGGAGGA CACGATTATA ATGTTTGGAA AGCGGGTTTG TGGAACTTCC TCCAATTAGC GGAACAGGCA GGATTAACAG ATTATAATGC GCCAACACCA CCGCCACCGG CTCCAAGGTC AGCTTTTACA CGTATCGAAG CGGAAGACTT CGATAACATG TCGGGAATAG AAAATGAAAG TTGTAGTGAA GGCGGACTGA ATATAGGTTA TATAGAGAAT GGGGATTATG TTGCTTACAG TAATATAGAT TTTGGTAACG GAGCAAAGGA ATTTCAGGCC AGGGTGGCAA GTGCTACCAG TGGAGGAAAA ATCGAGATAA GGCTTGACAG TATTACAGGT CCATTAATAG GAACGTGCTC GGTTTCAGGT ACCGGCGGTT GGCAGCAATG GGTTGATGTG AAATGCGAGG TCAGCGGCGT AAGCGGAACT CATGATCTCT ATTTGAAATT TACGGGTGGC AGCGGTTATC TGTTCAATAT AAACTGGTGG AAGTTCACTC AGGCCGATTC AAACCCAACG CCAACACCAC CGCCCAATGA GAATTTGGGC GATTTGAACG GAGACGGAAA TATAAACTCG ACAGACCTTC AGATTTTAAA GAAGCATTTA CTCCGTATAA CTTTGCTTAC GGGAAAAGAA CTTTCCAATG CGGATGTAAC CAAAGACGGC AAAGTAGATT CAACCGATTT AACTTTATTG AAAAGATATA TACTTCGGTT TGTAACGAAT TTTTAG
|
Protein sequence | MLRKVLSVLL ICLVLIACLG TAVNISSAAS LPTMPPSGYD QVRGGIQRGQ VVNISYYSTA TNGTRPAKVY LPPGYSTSKR YSVLYLLHGI GGSEGDWFAD WGGRASIIAD NLIAEGKIKP LIIVTPNTNA AGPGIGDGYE NFTKDLINCL IPYIESRYSV YTDREHRAIA GLSMGGGQSF NIGLTNLDKF AYIGPISSAP NTYPNNRLFP DGGAAARQKL KLLFIACGTN DSLIGFGQRV HEFCVANNIN HIYWLIQGGG HDYNVWKAGL WNFLQLAEQA GLTDYNAPTP PPPAPRSAFT RIEAEDFDNM SGIENESCSE GGLNIGYIEN GDYVAYSNID FGNGAKEFQA RVASATSGGK IEIRLDSITG PLIGTCSVSG TGGWQQWVDV KCEVSGVSGT HDLYLKFTGG SGYLFNINWW KFTQADSNPT PTPPPNENLG DLNGDGNINS TDLQILKKHL LRITLLTGKE LSNADVTKDG KVDSTDLTLL KRYILRFVTN F
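The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a nucleotide string with the codon assignments of translation table 11 (which uses the same amino acid assignments as the standard code). It assumes the gene sequence is already given in coding orientation, which its leading ATG suggests even though the gene lies on the minus strand. The helper names `clean`, `gc_fraction` and `translate` are illustrative and do not belong to any library.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# The amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between the ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping before the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGTTAAGAA AGGTTCTTAG TGTATTATTA")
print(translate(prefix))
```

Pasting the full gene sequence into `clean` and passing the result to `gc_fraction` and `translate` allows a comparison with the listed GC content and protein sequence.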