Gene Cthe_3016 details


Gene Information

Locus tag: Cthe_3016
Symbol:
ID: 4811164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3539444
End bp: 3541705
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table: 11
GC content: 44%
IMG OID: 640108437
Product: (NiFe) hydrogenase maturation protein HypF
Protein accession: YP_001039405
Protein GI: 125975495
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0068] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00143] [NiFe] hydrogenase maturation protein HypF
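
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive on both ends, and that the gene length includes one terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3539444
end_bp = 3541705
gene_length_bp = 2262
protein_length_aa = 753

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon, so the protein
# has one residue fewer than the number of codons.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under those assumptions the span equals the listed gene length, divides evenly into codons, and gives the listed protein length.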


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAAG GAAAAAGCAA AAGCATAACA AAGAGAATAA CAGTTTCAGG AATAGTTCAG 
GGGGTCGGTT TCAGACCCTT TGTCCATAAT ATTGCCAAAA AGCATGGAAT ACCGGGAACC
GTACGAAACA TCGGGGGTTT GGTGGAAATA ATATTGCAAT CCTCTGAGGA AAAATATAAT
GAATTTTTAC AGGATTTAAA GGCAAATGCG CCTGTTGGTT CTGAAATAAC CAATATCGAA
ACGGAAGATA TAAAAGAGCG GGAATTTGAC GGCTTTCGAA TCATTGAAAG CAAGGACGAT
GAAGAAATCT CCATAATCCC GCCGGACCTC CCTGTCTGCG AAAGCTGTGA AAGGGAGCTT
TTCACCGGCA CCGACAGGCG CTTTTTAAAT CCTTTTATCA GCTGCATGTC CTGCGGTGCC
CGGTACACCA TTATAGAAGA GCTTCCCTAT GACAGGCACA ACACCACCAT GAGGGATTTT
GACATGTGCC CTGCCTGCCG GGAAGAATAC ACATCCCCAA GAAACAGGCG CTTTCACGCC
CAAACCATAT CGTGCAATGA CTGCGGGCCC TATTTAATTT TCAATGACCT TACCGGTGGC
AGTGAACTTA CTGAAAAAGA TGCCTTCCAT GCTGCGGCAA ATATCATAGA ATCCGGCGGA
ATAATCGCTG TAAAAGGCAT AGGAGGCTAT CATTTTGCCT GTTCTCCTTT TTTGGAAGAT
ACGGTTTTAA GGCTTAGAGA ATTAAAGGGC CGTGAAGCAA AGCCATTTGC AATAATGTTT
GAATCGGTGG ACTCCATACG CAAATACTGC GTTGTATCGG AAAAAGAAGA AGAACTTTTA
AAGTCAAAGG CAAGGCCCAT TGTTTTACTG TATTTAAAAA ACAACTCCAT GGCACCTTCC
GCCTGTCAGG GCAGCATATA CTGCGGAGCT TTTCTGCCCT ATACTCCTCT GCACATGCTG
CTTGTCAAAA GGTGCGGACC TCTGATTATG ACAAGCGCCA ATATATCCGA CAAGCCCATA
ATAAAGGACG ATTCAGAAAT GCTGTCGTTA AAATCGCCCT TGTTAAACGG TGTTTTATAC
AATAAACGAA GAATTGTACG CTCTGTTGAT GACTCGGTAG CAAAGGTTGT GGCAAATAGC
CCCCAGCTGA TTCGAAGAAG CCGCGGTTAT GTCCCCTACC CTGTTTTTTT AAAAAACAGA
AAAAAGGATT TGCAGATTTT TGCTGCAGGA AGTGACTTAA AAGCAGCCTT CTGCCTTTAC
AAAAACGGAA ACGCGGTAAT GTCCCAATAC TTTGGAGACT TGGAAGAAAA GACGGTTTTG
GAGCGGTATA AAGCCTCGTT CAGGGACCTT TGTCATCTTT TAAAAATTAC TCCCGATATT
GCAGTTTGTG ATATGCACCC AAACTACCAT TCATCCAGGT TTGCCGAAAA CCTGGGCATT
CCGCTCACCT ATGTCCAGCA TCATCATGCA CATGTTGCAT CGGTTATGGC AGAACATCAT
TTAAAGGAAC AGGTAATCGG CGTTGCCTTT GATGGAACGG GCTATGGAAC CGACGGCAAA
ATATGGGGCG GAGAATTTTT AATCTGCGAA GGCGCAGAGT TTAAAAGAGT GGCTCACCTT
CGCTACATCC CTGTTTTGGG CGGTGATTCA TCCATGCGGG ATGCCGCGAA AACCGCCGCT
TGCTTTTTGT TAAACTTGGG TCTTGACCAA TATGTAAAAG ACGAACGCAA GGACATAATA
AAAGCCGCTT TGAAGAATAA TATCAATACC GTGCCCACAT CAAGCATGGG AAGACTGTTT
GACGCCGTTT CATCCCTTTT GGAGATTCAG TATGAAAACC GCTATGAGGG AGAATGTGCG
GCAATGCTGG AAAAAGAGGC AGTTTTGGCT CTAAGGCACA AGATAGAGCC TAAAAAGCTT
GCCTTTGAAA TCAAGCGAAA AAGTGATTTA ATAGAAATAG ATCCAAAACC GATGCTTAAA
TCCATGTGTC ATCTTCAAAA CAAAGATGAC ACAGGTTCTC TTGCATTGGG TTTTCATTAT
GCCGTTGCGG ATATGATATT GGAAGTATGC GAAATAATAC GGGCAGAACA AAAAATTAAC
ACTGTCGCCT TAAGCGGCGG CGTATTTCAA AATACTTTGC TTATGGAACG AACACTCAAG
ATTTTAAGAG ACAGACACTT TAACGTATAC TATAACATGT CGGTTCCCCC AAATGACGGT
TCAATTGGTC TGGGACAGAC GTTTATTGGA TTAGTGAGGT GA
 
Protein sequence
MDKGKSKSIT KRITVSGIVQ GVGFRPFVHN IAKKHGIPGT VRNIGGLVEI ILQSSEEKYN 
EFLQDLKANA PVGSEITNIE TEDIKEREFD GFRIIESKDD EEISIIPPDL PVCESCEREL
FTGTDRRFLN PFISCMSCGA RYTIIEELPY DRHNTTMRDF DMCPACREEY TSPRNRRFHA
QTISCNDCGP YLIFNDLTGG SELTEKDAFH AAANIIESGG IIAVKGIGGY HFACSPFLED
TVLRLRELKG REAKPFAIMF ESVDSIRKYC VVSEKEEELL KSKARPIVLL YLKNNSMAPS
ACQGSIYCGA FLPYTPLHML LVKRCGPLIM TSANISDKPI IKDDSEMLSL KSPLLNGVLY
NKRRIVRSVD DSVAKVVANS PQLIRRSRGY VPYPVFLKNR KKDLQIFAAG SDLKAAFCLY
KNGNAVMSQY FGDLEEKTVL ERYKASFRDL CHLLKITPDI AVCDMHPNYH SSRFAENLGI
PLTYVQHHHA HVASVMAEHH LKEQVIGVAF DGTGYGTDGK IWGGEFLICE GAEFKRVAHL
RYIPVLGGDS SMRDAAKTAA CFLLNLGLDQ YVKDERKDII KAALKNNINT VPTSSMGRLF
DAVSSLLEIQ YENRYEGECA AMLEKEAVLA LRHKIEPKKL AFEIKRKSDL IEIDPKPMLK
SMCHLQNKDD TGSLALGFHY AVADMILEVC EIIRAEQKIN TVALSGGVFQ NTLLMERTLK
ILRDRHFNVY YNMSVPPNDG SIGLGQTFIG LVR
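
Both sequences are printed in space-separated blocks of 10 characters, 60 per line, so the spaces must be removed before the sequence is used elsewhere. The following is a minimal sketch of that cleanup, applied to the first and last lines of the gene sequence. The helper name `join_blocks` is hypothetical, and the count of 37 full lines is an assumption based on the layout shown in this record.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping whitespace between blocks."""
    return "".join("".join(line.split()) for line in lines)

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGATAAAG GAAAAAGCAA AAGCATAACA AAGAGAATAA CAGTTTCAGG AATAGTTCAG "
last_line = "TCAATTGGTC TGGGACAGAC GTTTATTGGA TTAGTGAGGT GA"

head = join_blocks([first_line])
tail = join_blocks([last_line])

# Assumption: the gene sequence has 37 full lines of 60 bases each,
# followed by the shorter last line.
full_lines = 37
total_bp = full_lines * len(head) + len(tail)

start_codon = head[:3]
stop_codon = tail[-3:]

print(total_bp, start_codon, stop_codon)
```

The same helper can be applied to the protein sequence lines, since they use the same block layout.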