Gene Cthe_0430 details

Gene Information

Locus tag: Cthe_0430
Symbol:
ID: 4808358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 540242
End bp: 541942
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 44%
IMG OID: 640105844
Product: hydrogenase, Fe-only
Protein accession: YP_001036861
Protein GI: 125972951
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
        [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
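
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; neither is stated in the record, although the listed values are consistent with both. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(540242, 541942)
    print(length)                  # 1701
    print(protein_length(length))  # 566
```

Under these assumptions the coordinates reproduce both the 1701 bp gene length and the 566 aa protein length given in the record.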


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.735126
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAACA GAGAGTATAT GTTGATAGAC GGAATTCCTG TTGAGATAAA CGGTGAAAAA 
AACCTTCTTG AGCTTATTCG AAAAGCGGGC ATCAAGCTTC CGACATTTTG TTACCATTCC
GAATTGTCGG TTTACGGTGC CTGCCGTATG TGCATGGTTG AAAATGAATG GGGCGGATTG
GATGCTGCAT GTTCCACCCC GCCCAGAGCG GGCATGAGTA TAAAAACAAA CACTGAAAGA
CTTCAAAAAT ACCGGAAAAT GATTTTGGAG CTTTTGCTTG CCAATCACTG TCGGGATTGT
ACAACCTGCA ATAACAACGG AAAATGCAAA CTGCAGGATC TTGCGATGAG GTACAATATA
AGTCATATTC GATTCCCAAA CACTGCTTCA AATCCGGATG TGGACGATTC TTCCCTTTGC
ATAACAAGGG ACCGGAGCAA ATGCATACTC TGCGGAGACT GTGTTCGTGT ATGCAATGAA
GTACAGAATG TGGGTGCTAT TGATTTTGCC TACAGAGGCT CTAAAATGAC AATCAGTACG
GTATTTGACA AACCTATCTT TGAATCAAAT TGTGTGGGCT GCGGACAATG TGCCCTGGCA
TGTCCGACAG GAGCAATTGT TGTAAAAGAT GATACACAGA AAGTATGGAA GGAAATCTAT
GATAAAAATA CCCGGGTATC GGTGCAGATA GCTCCTGCTG TCCGTGTTGC TTTAGGAAAA
GAACTTGGCT TAAATGACGG AGAAAATGCC ATAGGAAAAA TTGTGGCGGC ACTGCGGCGC
ATGGGATTTG ATGATATATT TGACACGTCA ACCGGGGCGG ATTTGACGGT TCTGGAAGAA
TCGGCAGAAC TGCTAAGGCG TATTAGAGAA GGAAAAAATG ACATGCCTCT TTTTACGTCA
TGCTGTCCTG CATGGGTAAA CTACTGCGAA AAGTTCTATC CCGAGCTTTT GCCCCATGTT
TCCACATGCC GTTCACCGAT GCAGATGTTT GCCTCCATAA TAAAAGAGGA ATATTCAACT
TCCAGCAAAC GGTTGGTGCA TGTTGCAGTT ATGCCCTGCA CGGCAAAGAA ATTTGAAGCG
GCCCGCAAAG AGTTTAAAGT CAATGGAGTT CCAAATGTTG ATTATGTTCT GACAACCCAG
GAGCTTGTTC GAATGATCAA AGAATCGGGG ATAGTATTCT CTGAACTGGA GCCTGAAGCA
ATTGACATGC CCTTTGGAAC GTATACAGGA GCCGGTGTGA TATTTGGGGT TTCCGGGGGT
GTCACTGAAG CGGTTTTAAG AAGAGTGGTT TCGGACAAAT CTCCGACATC TTTCAGATCA
CTTGCCTATA CCGGTGTTCG GGGTATGAAC GGAGTAAAAG AAGCTTCGGT GATGTACGGT
GACAGAAAGC TTAAAGTTGC AGTGGTCAGC GGGCTTAAAA ATGCCGGTGA TTTGATTGAA
AGGATTAAAG CCGGTGAGCA TTATGATCTT GTCGAGGTTA TGGCATGTCC GGGAGGTTGT
ATAAACGGAG GAGGACAGCC CTTTGTTCAA AGTGAGGAAA GAGAAAAGAG GGGCAAGGGG
TTGTACAGTG CGGATAAACT CTGCAATATC AAATCTTCTG AAGAAAATCC TCTTATGATG
ACACTTTATA AAGGCATTTT AAAGGGCAGG GTTCATGAAC TTCTTCATGT TGATTATGCT
TCAAAAAAGG AGGCAAAGTA G
 
Protein sequence
MDNREYMLID GIPVEINGEK NLLELIRKAG IKLPTFCYHS ELSVYGACRM CMVENEWGGL 
DAACSTPPRA GMSIKTNTER LQKYRKMILE LLLANHCRDC TTCNNNGKCK LQDLAMRYNI
SHIRFPNTAS NPDVDDSSLC ITRDRSKCIL CGDCVRVCNE VQNVGAIDFA YRGSKMTIST
VFDKPIFESN CVGCGQCALA CPTGAIVVKD DTQKVWKEIY DKNTRVSVQI APAVRVALGK
ELGLNDGENA IGKIVAALRR MGFDDIFDTS TGADLTVLEE SAELLRRIRE GKNDMPLFTS
CCPAWVNYCE KFYPELLPHV STCRSPMQMF ASIIKEEYST SSKRLVHVAV MPCTAKKFEA
ARKEFKVNGV PNVDYVLTTQ ELVRMIKESG IVFSELEPEA IDMPFGTYTG AGVIFGVSGG
VTEAVLRRVV SDKSPTSFRS LAYTGVRGMN GVKEASVMYG DRKLKVAVVS GLKNAGDLIE
RIKAGEHYDL VEVMACPGGC INGGGQPFVQ SEEREKRGKG LYSADKLCNI KSSEENPLMM
TLYKGILKGR VHELLHVDYA SKKEAK
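
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record: it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ only in the set of permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical. Whitespace is stripped first because the sequences are printed in blocks of ten.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence as printed above.
    print(translate("ATGGATAACA GAGAGTATAT GTTGATAGAC"))  # MDNREYMLID
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) is one way to confirm that the two listings agree; `gc_content` on the same input can be compared with the GC content field.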