Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0335 |
Symbol | |
ID | 4808484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 424321 |
End bp | 425667 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105749 |
Product | hydrogenase large subunit-like protein |
Protein accession | YP_001036766 |
Protein GI | 125972856 |
COG category | [R] General function prediction only |
COG ID | [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCCGT ACTTCCACTC GGTGACTCTG GACGAGGTCA AATGTAAAGG TTGTACCAAT TGCATAAAAA GATGCCCCAC GGAAGCAATC AGAGTAAGAA AGAGCAAAGC CAGAATCATA AATGAAAGAT GCATAGATTG CGGAGAGTGT ATAAGGGTTT GTCCTTATCA TGCCAAAAAG GCGATAACCG ACCCTATTGG CCTGATAAAT GACTATAAAT ACAAAGTTGC GATACCTGCC CCTTCCCTGT ATGGACAGCT AAAGTCAGCG CCTTCAAGGG ATCATATCCT CACTGCTCTT AAAATGTGCG GATTTGACGA TGTTTTTGAG GTTGCAAGGG CAGCCGAGAT TATTACCAGT GAGACTAAAA AAATACTTGC ACAGCAAAGA TTTGAAAAGC CCCTTATATC TTCAGCATGC CCTGCTGTAG TAAGGCTCAT ACAAGTGAGG TTTCCAAATC TTATCGATAA TATATTAAAG CTAAAGTCCC CCATGGAGGT TGCGGCCAAA ATCGCAAAAG ATGAAATATC ACGAAACAGG AAAATTCGCC AGGAGGACAT CGGCGTATTT TTTATTACTC CCTGTGCCGC AAAAGTGACC AGCATAAAAG CTCCTTTGGA CAAGGAGAAG TCCTTTGTGG ACGGAGCAAT TTCAATTTCG GAAATATATT TAAAAATACT TTCTGCCCTG GGGAAAATTG ATAAACCTGA AAAGCTTTCA AGAGCCGGGT TTGTCGGAGT ACGCTGGGCA AATTCCGGAG GAGAAAGCAT CGCCCTTGGA ACGGACAAAT TTCTTGCGGT TGACGGTATT CACAATGTAA TTGCGATTCT TGAGGAAATT GAGGACGAAA AGCTGGAGGA TATAGATTTT GTTGAAGCTC TGGCATGTCA GGGAGGCTGT CTTGGCGGTC CACTTAACGT GGAAAATATG TATGTTGCCA GAAAAACAAT AAAGAAATTT ATTGATGATG CAAAAGAAAA AGGGTTGGAA ACGGCGGCAG ACGAAAATTG CAATTATGAT ATTGCCTGGA CGGGAAAGGT TGAATACAAA CCTGTCATGA AGCTGGACAA GGATTTTGAC GTGGCAATGA AGAAGCTGGA GACTCTTAAT GTCATAAACA GCGGACTTCC CGGACTGGAT TGCGGTGCGT GCGGTGCACC GAGCTGCAGA GCCCTTGCGG AAGACATAGT AAGGGGGAAT GCCAACGAGA CGGATTGCAT ATTTAAACTG AGAGAAAAAG TAAGAGACCT TGCATTTCAG ATGATGGAGC TGGAAGCAAA AATGCCTCCT GTGCTGGACA AGGATGAAAC TTCGGAAAAC AAATCAAAGA AAAGGGGAGA GTTTTGA
|
Protein sequence | MSPYFHSVTL DEVKCKGCTN CIKRCPTEAI RVRKSKARII NERCIDCGEC IRVCPYHAKK AITDPIGLIN DYKYKVAIPA PSLYGQLKSA PSRDHILTAL KMCGFDDVFE VARAAEIITS ETKKILAQQR FEKPLISSAC PAVVRLIQVR FPNLIDNILK LKSPMEVAAK IAKDEISRNR KIRQEDIGVF FITPCAAKVT SIKAPLDKEK SFVDGAISIS EIYLKILSAL GKIDKPEKLS RAGFVGVRWA NSGGESIALG TDKFLAVDGI HNVIAILEEI EDEKLEDIDF VEALACQGGC LGGPLNVENM YVARKTIKKF IDDAKEKGLE TAADENCNYD IAWTGKVEYK PVMKLDKDFD VAMKKLETLN VINSGLPGLD CGACGAPSCR ALAEDIVRGN ANETDCIFKL REKVRDLAFQ MMELEAKMPP VLDKDETSEN KSKKRGEF
|
| |