Gene Information |
Locus tag | Cthe_0430 |
Symbol | |
ID | 4808358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 540242 |
End bp | 541942 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105844 |
Product | hydrogenase, Fe-only |
Protein accession | YP_001036861 |
Protein GI | 125972951 |
COG category | [C] Energy production and conversion; [R] General function prediction only |
COG ID | [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G); [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
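
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, when has_stop is True,
    that the final codon is a stop codon that is not translated.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = length_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the record: Start bp 540242, End bp 541942.
length = gene_length(540242, 541942)
print(length)                           # 1701
print(expected_protein_length(length))  # 566
```

Both results match the Gene Length (1701 bp) and Protein Length (566 aa) fields listed in the record.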
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.735126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAACA GAGAGTATAT GTTGATAGAC GGAATTCCTG TTGAGATAAA CGGTGAAAAA AACCTTCTTG AGCTTATTCG AAAAGCGGGC ATCAAGCTTC CGACATTTTG TTACCATTCC GAATTGTCGG TTTACGGTGC CTGCCGTATG TGCATGGTTG AAAATGAATG GGGCGGATTG GATGCTGCAT GTTCCACCCC GCCCAGAGCG GGCATGAGTA TAAAAACAAA CACTGAAAGA CTTCAAAAAT ACCGGAAAAT GATTTTGGAG CTTTTGCTTG CCAATCACTG TCGGGATTGT ACAACCTGCA ATAACAACGG AAAATGCAAA CTGCAGGATC TTGCGATGAG GTACAATATA AGTCATATTC GATTCCCAAA CACTGCTTCA AATCCGGATG TGGACGATTC TTCCCTTTGC ATAACAAGGG ACCGGAGCAA ATGCATACTC TGCGGAGACT GTGTTCGTGT ATGCAATGAA GTACAGAATG TGGGTGCTAT TGATTTTGCC TACAGAGGCT CTAAAATGAC AATCAGTACG GTATTTGACA AACCTATCTT TGAATCAAAT TGTGTGGGCT GCGGACAATG TGCCCTGGCA TGTCCGACAG GAGCAATTGT TGTAAAAGAT GATACACAGA AAGTATGGAA GGAAATCTAT GATAAAAATA CCCGGGTATC GGTGCAGATA GCTCCTGCTG TCCGTGTTGC TTTAGGAAAA GAACTTGGCT TAAATGACGG AGAAAATGCC ATAGGAAAAA TTGTGGCGGC ACTGCGGCGC ATGGGATTTG ATGATATATT TGACACGTCA ACCGGGGCGG ATTTGACGGT TCTGGAAGAA TCGGCAGAAC TGCTAAGGCG TATTAGAGAA GGAAAAAATG ACATGCCTCT TTTTACGTCA TGCTGTCCTG CATGGGTAAA CTACTGCGAA AAGTTCTATC CCGAGCTTTT GCCCCATGTT TCCACATGCC GTTCACCGAT GCAGATGTTT GCCTCCATAA TAAAAGAGGA ATATTCAACT TCCAGCAAAC GGTTGGTGCA TGTTGCAGTT ATGCCCTGCA CGGCAAAGAA ATTTGAAGCG GCCCGCAAAG AGTTTAAAGT CAATGGAGTT CCAAATGTTG ATTATGTTCT GACAACCCAG GAGCTTGTTC GAATGATCAA AGAATCGGGG ATAGTATTCT CTGAACTGGA GCCTGAAGCA ATTGACATGC CCTTTGGAAC GTATACAGGA GCCGGTGTGA TATTTGGGGT TTCCGGGGGT GTCACTGAAG CGGTTTTAAG AAGAGTGGTT TCGGACAAAT CTCCGACATC TTTCAGATCA CTTGCCTATA CCGGTGTTCG GGGTATGAAC GGAGTAAAAG AAGCTTCGGT GATGTACGGT GACAGAAAGC TTAAAGTTGC AGTGGTCAGC GGGCTTAAAA ATGCCGGTGA TTTGATTGAA AGGATTAAAG CCGGTGAGCA TTATGATCTT GTCGAGGTTA TGGCATGTCC GGGAGGTTGT ATAAACGGAG GAGGACAGCC CTTTGTTCAA AGTGAGGAAA GAGAAAAGAG GGGCAAGGGG TTGTACAGTG CGGATAAACT CTGCAATATC AAATCTTCTG AAGAAAATCC TCTTATGATG ACACTTTATA AAGGCATTTT AAAGGGCAGG GTTCATGAAC TTCTTCATGT TGATTATGCT TCAAAAAAGG AGGCAAAGTA G
|
Protein sequence | MDNREYMLID GIPVEINGEK NLLELIRKAG IKLPTFCYHS ELSVYGACRM CMVENEWGGL DAACSTPPRA GMSIKTNTER LQKYRKMILE LLLANHCRDC TTCNNNGKCK LQDLAMRYNI SHIRFPNTAS NPDVDDSSLC ITRDRSKCIL CGDCVRVCNE VQNVGAIDFA YRGSKMTIST VFDKPIFESN CVGCGQCALA CPTGAIVVKD DTQKVWKEIY DKNTRVSVQI APAVRVALGK ELGLNDGENA IGKIVAALRR MGFDDIFDTS TGADLTVLEE SAELLRRIRE GKNDMPLFTS CCPAWVNYCE KFYPELLPHV STCRSPMQMF ASIIKEEYST SSKRLVHVAV MPCTAKKFEA ARKEFKVNGV PNVDYVLTTQ ELVRMIKESG IVFSELEPEA IDMPFGTYTG AGVIFGVSGG VTEAVLRRVV SDKSPTSFRS LAYTGVRGMN GVKEASVMYG DRKLKVAVVS GLKNAGDLIE RIKAGEHYDL VEVMACPGGC INGGGQPFVQ SEEREKRGKG LYSADKLCNI KSSEENPLMM TLYKGILKGR VHELLHVDYA SKKEAK
|
| |
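
The sequences above are displayed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that grouping and recompute simple properties such as GC content. It is an illustration under stated assumptions: the helper names are hypothetical, and the stop codons checked (TAA, TAG, TGA) are those of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Usage with the first two display blocks of the gene sequence above;
# paste the full "Gene sequence" field to check the whole gene.
fragment = clean_sequence("ATGGATAACA GAGAGTATAT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.3
```

Applied to the full gene sequence, `gc_fraction` can be compared against the GC content field, and `ends_with_stop` against the final codon shown in the record.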