Gene Information |
Locus tag | Cthe_2822 |
Symbol | |
ID | 4809659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3337478 |
End bp | 3338464 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108242 |
Product | oxidoreductase-like protein |
Protein accession | YP_001039214 |
Protein GI | 125975304 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
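The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3337478
END_BP = 3338464
GENE_LENGTH_BP = 987
PROTEIN_LENGTH_AA = 328


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3338464 - 3337478 + 1 gives 987 bp, and 987 / 3 - 1 gives 328 residues, which agrees with the table.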
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.340246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGATA AAATAAGATG GGGAATATTA GGTTGTGGAA GAATAGCAAG AACTTTTGCC GAAAGTTTGC AGCACGTACC TGATGCAAAA TTGGCGGCTG TAGCTTCAAA AACACCCGGA CGGGCTGCCG AATTTGCAAA AATTTTTAAT GCCGAGCGTT ATTATGAGGA CTATCAGGAA CTTGTAAAAG ACGAGTCTTT GGATGTCATA TATGTAGCCA CCACACACAA CTTTCACTAC GAACACGCTC TTCTTTGTCT CGAAAATAAA AAAGCCGTCC TGTGCGAAAA ACCTTTCACC GTTAATTCAA AACAGGCCGA AGATTTAATT AAAGTTGCAA GGCGAAACAA AGTTTTCTTG ATGGAAGCGA TGTGGACAAG ATTTTTGCCG TGCATAGTCG AGCTGAACCG TTTGCTTTCT CAGAACATAA TAGGAGACAT CGGATTTTTG CGGGCTGACT TTGGAAACAG TACCAAAGGT CGGGATGTGC CAAAATACTA CTTTGACCCC AATCTTGCCG GAGGTGCGCT GCTGGATTTG GGAGTGTATC CTGTTTCTTT TGCGAGACTT GTGTTTAAAA AATCTCCTGT TGGTATAAAG ACTTTTGGAT ATATAGCCGA TTCAAAAGTT GATGTGCATG CAGCCTATAT GTTTGAGTTT GGATTTGGAA AAGTGGCAAT GCTGTCATCC TCATATAGTG TTGACATGCC TCATGATGCT GTAATTTGCG GAACGGAAGG ATACATAAAG GTTCCGGATT TCTTTCATCC CACTCAGTTT TCAATTCATT TGAACGGGCG GGAAGAAAAG GTTGTGAGGA TTCCTTTTGT ATCCAACGGC TTGAACTACC AGGTTCAGGA AGTAAATAAA TGCCTTAAGG AAGGAAAGTT GGAAAGTGAT ATTATGCCTT TGAATGAGAG CCTGGAAGTC ATGAAAATTT TGGATGTACT CAGGGCAAAG TGGGGACTTA AATATCCTAT GGAATAA
|
Protein sequence | MADKIRWGIL GCGRIARTFA ESLQHVPDAK LAAVASKTPG RAAEFAKIFN AERYYEDYQE LVKDESLDVI YVATTHNFHY EHALLCLENK KAVLCEKPFT VNSKQAEDLI KVARRNKVFL MEAMWTRFLP CIVELNRLLS QNIIGDIGFL RADFGNSTKG RDVPKYYFDP NLAGGALLDL GVYPVSFARL VFKKSPVGIK TFGYIADSKV DVHAAYMFEF GFGKVAMLSS SYSVDMPHDA VICGTEGYIK VPDFFHPTQF SIHLNGREEK VVRIPFVSNG LNYQVQEVNK CLKEGKLESD IMPLNESLEV MKILDVLRAK WGLKYPME
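The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes that whitespace in the sequences is only display formatting (blocks of 10) and that internal codons in translation table 11 map to the same amino acids as the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, not part of any database tooling.

```python
# Codon table built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above.
    print(translate("ATGGCGGATA AAATAAGATG"))
    # Last four codons of the gene sequence above.
    print(translate("CCTATGGAATAA"))
```

The first call yields the residues that open the protein sequence (MADKIR), and the second yields its final three residues (PME) before the TAA stop codon. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.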