Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3090 |
Symbol | |
ID | 4809964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3644371 |
End bp | 3645213 |
Gene Length | 843 bp |
Protein Length | 280 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108513 |
Product | fumarate hydratase |
Protein accession | YP_001039478 |
Protein GI | 125975568 |
COG category | [C] Energy production and conversion |
COG ID | [COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain |
TIGRFAM ID | [TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000169234 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACAA TTCATGTTGA TAGTATTACT GAAGCTGTGG AAAAGCTTTG CATGGACTCA AATTATTATC TTAACGATGA TATAATAAAT GGCTTGGAAA AAGGGCTTGA AAAAGAAGAA TCGGACAATG GCAAAGAGAT TTTAAGCAAG ATTATTGAGA ATGCGCAGAT AGCAAGGGAA AAGGCCGTGG CAATCTGTCA GGATACGGGA ATGGCCGTTG TTTTTATGGA CATAGGCCAG GATGTACATA TCACGGGAGG TAACCTCACT GATGCAATCA ATGAAGGGGT TAGAAGGGGC TATGAGAAGG GCTATCTCAG AAAGTCGGTT GTAAATGACC CTATAGAAAG GATAAACACA AAGGATAATA CTCCGGCGGT GATTCATTAC AACATTGTTG ACGGAGACAA AATAAAAATA ACTGTAGCTC CTAAAGGTTT TGGAAGTGAA AATATGAGTG CCCTGAAAAT GCTTACCCCT TCCCAGGGAA TAGAAGGAGT TAAGAATTTT ATAATTGAGA CGGTGGAAAA AGCGGGGCCG AATCCCTGTC CTCCTATTGT CGTCGGCGTA GGTATCGGAG GAACCATGGA AAAGGCGGCT TTTTTGGCCA AAAAAGCGCT TTTGAGGCCG ATAGACAAAA GGAATGACAT ACCGTATTTA AAGGAGCTGG AAGAGGAAAT GCTTGAGCGT ATAAACAGGC TGGGCATAGG TCCTTCGGGA CTGGGAGGAA GAATTACGGC ATTAGGTGTA AATATTGAAG TTTTTCCAAC CCATATTGCA GGTCTTCCGG TGGCCGTGAA TATAAACTGT CATGCAACAA GACATGCAGA AATAATTATT TAA
|
Protein sequence | MRTIHVDSIT EAVEKLCMDS NYYLNDDIIN GLEKGLEKEE SDNGKEILSK IIENAQIARE KAVAICQDTG MAVVFMDIGQ DVHITGGNLT DAINEGVRRG YEKGYLRKSV VNDPIERINT KDNTPAVIHY NIVDGDKIKI TVAPKGFGSE NMSALKMLTP SQGIEGVKNF IIETVEKAGP NPCPPIVVGV GIGGTMEKAA FLAKKALLRP IDKRNDIPYL KELEEEMLER INRLGIGPSG LGGRITALGV NIEVFPTHIA GLPVAVNINC HATRHAEIII
|
| |