Gene Cthe_3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3014 
Symbol 
ID4811162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3538175 
End bp3539224 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content44% 
IMG OID640108435 
Producthydrogenase formation HypD protein 
Protein accessionYP_001039403 
Protein GI125975493 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACGC TTGAAGCTAT AAAAAAGGAG CTTGGAGAAT ACGAGGGCAA AAACATTAAA 
ATAATGGAAG TCTGCGGCAC CCACACTTCC AGCATTTTTA AGAACGGGAT CCGAAGCCTT
ATCTCCCCTC GTATACAGCT TATTTCGGGA CCGGGCTGCC CTGTATGCGT GACGGCCTCA
TCCTACATTG ACAGGCTTGT AGAATATTCT CTAAAGGACA ACCATTGTGT TTTGACTTTC
GGTGACATGA TGAAGGTAAA GGGACACAAA CTTTCTCTTA CCGAGGCAAA AGCAACGGGG
GGAAATGTAA AAATCCTCTA TTCTCCCCTC AGTGCGGTAA AAATTGCAAT CAAAAGCCCT
GAGACCCAAT TTATTTTTGC CGCAGTGGGG TTTGAGACCA CAGCCCCTAT TTATGCGCTG
ATGATTGAAG AAATAATTAA GAATAATATA AAAAACCTGA AACTGATGAC TTCGATTAAG
ACCATTATAC CTGCGATTTC ATATATTTGC GAAAATGAAA AAAATATAGA CGCTTTTCTC
TGCCCCGGTC ATGTCAGTGT TATCATCGGC TCCTCGGTTT ATACCGATAT GGCGTTAAAA
TACCGCAAGC CTTTTGTGAT AGGCGGTTTC GAGGGTGGGC ATATTATAGC GGCAATTTAT
GAAATAATCC GCCAGATTTC AAACAATGAA TACAGTATGA AAAACATGTA TCAAAGTGCT
GTAAGCCCTG AGGGAAATCA AAAGGCGAAA TCTTTGATTG ACAAATATTT TGAAGCTGCA
GACGACTATT GGCGGGGAAT CGGAATAATA AAAAACTCGG GGCTAAGGCT AAGAACGGAA
TACAGGGATT TTGATGCCGG GAGCACAATT TTTGAAGATG CTGACGCTGC ACCTTCGGGC
TGTAAATGTG CTGATGTAAT ACTGGGAAGA ATAACCCCTG CACAATGCCC TCTGTTCGGC
AAGGCCTGCA CTCCGTTAAA TGCCGTCGGA GCATGCATGG TTTCTTCAGA GGGCGCCTGC
GGAATCTGGT ACAGAAATTG TGAGGTGTAG
 
Protein sequence
MKTLEAIKKE LGEYEGKNIK IMEVCGTHTS SIFKNGIRSL ISPRIQLISG PGCPVCVTAS 
SYIDRLVEYS LKDNHCVLTF GDMMKVKGHK LSLTEAKATG GNVKILYSPL SAVKIAIKSP
ETQFIFAAVG FETTAPIYAL MIEEIIKNNI KNLKLMTSIK TIIPAISYIC ENEKNIDAFL
CPGHVSVIIG SSVYTDMALK YRKPFVIGGF EGGHIIAAIY EIIRQISNNE YSMKNMYQSA
VSPEGNQKAK SLIDKYFEAA DDYWRGIGII KNSGLRLRTE YRDFDAGSTI FEDADAAPSG
CKCADVILGR ITPAQCPLFG KACTPLNAVG ACMVSSEGAC GIWYRNCEV