Gene Information |
Locus tag | Cthe_3020 |
Symbol | |
ID | 4811168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3543236 |
End bp | 3544315 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108441 |
Product | ech hydrogenase subunit E |
Protein accession | YP_001039409 |
Protein GI | 125975499 |
COG category | [C] Energy production and conversion |
COG ID | [COG3261] Ni,Fe-hydrogenase III large subunit |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAAGA AAACAGTAAT CCCCTTCGGC CCTCAACATC CGGTTTTACC GGAGCCCATA CATTTAGATC TTGTGCTTGA GGATGAAACA GTGGTAGAGG CAATACCTTC TATTGGATAT ATACACCGCG GTCTGGAAAA ACTTGTGGAA AAAAAGGACT ATCAGCAGTT TGTTTATGTA GCGGAAAGAA TTTGCGGCAT TTGCTCTTTC ATGCACGGCA TGGGTTACTG CATGTCGATT GAAAACATAA TGGGAGTGCA AATTCCTGAA AGAGCAGAGT TTTTAAGAAC CATCTGGGCA GAGCTGTCAC GCATACACAG CCACATGCTT TGGTTGGGGC TTTTAGCCGA CGCCCTTGGA TTTGAAAGCC TGTTTATGCA TTCTTGGAGG CTAAGAGAGC AGATTCTTGA CATATTCGAA GAAACCACCG GAGGAAGAGT AATATTCTCC GTCTGCGATA TTGGCGGTGT AAGAAGAGAT ATAGATTCTG AAATGCTGAA AAAAATAAAC TCAATATTGG ATGGTTTTGA AAAAGAATTT TCAGAAATCA CAAAAGTATT TTTGAATGAT TCTTCCGTAA AACTTCGTAC CCAAGGCCTT GGTGTGCTTT CCCGTGAAGA GGCTTTTGAA CTGGGAGCAG TCGGGCCTAT GGCGAGAGCC AGCGGTATCG ATATTGACAT GAGAAAAAGC GGCTATGCCG CATACGGAAA ATTAAAGATA GAACCCGTTG TTGAAACCGC CGGAGATTGC TATGCCAGAA CATCGGTAAG AATCAGAGAA GTTTTTCAAT CCATTGACCT GATTCGCCAG TGCATATCCC TCATTCCTGA CGGTGAAATC AAGGTAAAGA TTGTGGGAAA TCCAAGCGGT GAATACTTTA CCCGCCTGGA GCAGCCCCGC GGAGAAGTTT TATATTATGT AAAGGCAAAC GGAACAAAGT TTCTGGAAAG ATTCAGGGTT CGTACTCCAA CCTTTGCAAA TATTCCGGCT CTGCTTCACA CGCTGAAAGG ATGTCAGCTT GCAGACGTCC CGGTATTGAT TCTGACCATT GACCCTTGCA TAAGCTGTAC CGAAAGATAA
|
Protein sequence | MGKKTVIPFG PQHPVLPEPI HLDLVLEDET VVEAIPSIGY IHRGLEKLVE KKDYQQFVYV AERICGICSF MHGMGYCMSI ENIMGVQIPE RAEFLRTIWA ELSRIHSHML WLGLLADALG FESLFMHSWR LREQILDIFE ETTGGRVIFS VCDIGGVRRD IDSEMLKKIN SILDGFEKEF SEITKVFLND SSVKLRTQGL GVLSREEAFE LGAVGPMARA SGIDIDMRKS GYAAYGKLKI EPVVETAGDC YARTSVRIRE VFQSIDLIRQ CISLIPDGEI KVKIVGNPSG EYFTRLEQPR GEVLYYVKAN GTKFLERFRV RTPTFANIPA LLHTLKGCQL ADVPVLILTI DPCISCTER
|
| |
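The length fields in the record above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names (`clean_sequence`, `gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced here for illustration and are not part of any database API.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that separates the 10-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(3543236, 3544315)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))
    # First two blocks of the gene sequence, as formatted in the record.
    print("cleaned:", clean_sequence("ATGGGTAAGA AAACAGTAAT"))
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the GC content field; the record's value is presumably rounded to a whole percent.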