Gene Cthe_3013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3013 
Symbol 
ID4811161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3537193 
End bp3538170 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content44% 
IMG OID640108434 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001039402 
Protein GI125975492 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA ACATGTCCCA CGGCAGCGGC GGAAAACAGA CAAGCGATTT AATAAACCGG 
ATATTTTTAA AGCATTTCGG CAACAACATA TTAAACAGGC TTGAAGATGC CGCAGTGCTG
GATATAAAGG GTAAAATTGC CTATACCACC GATTCCTTTG TGGTAACTCC CCTGTTTTTC
AAGGGCGGTG ACATTGGAAA ACTTGCCGTT TGCGGCACAG TAAACGACAT TTGCATGATG
GGCGCCATTC CAAAATACCT CACGGCAGGC TTTATCATTG AGGAAGGAGC GGAAATTGAA
ACCATTGATA AAATTGCCCT TTCAATGAAG CTTGCCGCGG AAGAAGCAGG AATCAAAATT
GTTGCGGGAG ACACCAAAGT AATCGAAGGC CACGGCGGAA TCTATATAAA CACATCCGGT
ATCGGTGAAA TAGTAAAAAG CGGCATCAGT ATTTCCAATT GCCAAAAAGG CGATGTCATC
ATACTTTCAG GCAATTTGGG CGACCACCAC GCCGCTGTAA TGTCGGAGCG AATGGAGATT
GAGAACAATA TAAAAAGCGA CTGCGCTCCC CTTGTCCAAA TAGTAAAAAA TCTGATTGAA
AGCAATATAG AAATCCATTG CATGCGGGAC ATAACCAGGG GCGGTCTTGC AACAGTGCTC
AACGAAATAT CGTCAGCCTC AAACTGCGGC ATTGAGATAC ACGAAGCCGT TTTGCCCATC
AGCAATGAAG TAAGAGGATT TTGCAGTATC CTCGGGCTTG ACCCCCTTTA TATGGCAAAC
GAAGGGAAAA TGATAGCCGT TATACCCGAA AATGAGGCTA ACAAGGCTCT TGAAGTAATC
AGAAAAAGCA AATACGGAGA AAACGCCCAA ATTATCGGTC GTATTGTGGA CGGAAGCGGA
GTAACCATGA TTACAACCCT TCAAGGAAAC AGGATATTGG ACATTCTGTA TGGCGAAGGA
CTTCCCCGCA TTTGCTAA
 
Protein sequence
MKINMSHGSG GKQTSDLINR IFLKHFGNNI LNRLEDAAVL DIKGKIAYTT DSFVVTPLFF 
KGGDIGKLAV CGTVNDICMM GAIPKYLTAG FIIEEGAEIE TIDKIALSMK LAAEEAGIKI
VAGDTKVIEG HGGIYINTSG IGEIVKSGIS ISNCQKGDVI ILSGNLGDHH AAVMSERMEI
ENNIKSDCAP LVQIVKNLIE SNIEIHCMRD ITRGGLATVL NEISSASNCG IEIHEAVLPI
SNEVRGFCSI LGLDPLYMAN EGKMIAVIPE NEANKALEVI RKSKYGENAQ IIGRIVDGSG
VTMITTLQGN RILDILYGEG LPRIC