Gene Cthe_0665 details


Gene Information

Locus tag: Cthe_0665
Symbol:
ID: 4810282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 819807
End bp: 820775
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 41%
IMG OID: 640106081
Product: HflK protein
Protein accession: YP_001037093
Protein GI: 125973183
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0330] Membrane protease subunits, stomatin/prohibitin homologs
TIGRFAM ID: [TIGR01933] HflK protein
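
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the coding sequence ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 819807
end_bp = 820775
protein_length_aa = 322

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# One codon per residue, plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp, expected_cds_bp)  # prints: 969 969
```

Both values agree with the listed gene length of 969 bp.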


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGCAA TAAACGTGGG AGGCAATTTT AGAAAAGCTG CAAAGCTTCC AGTGAAACTG 
ATTATTGGAG CAATTGTATT AGTAATCTTT GCAATTCTTT TTTTTAACTC ATTTTACACC
GTAACCGATC AGGAACAGGC TGTGGTGCTT ACTTTTGGCA AGGTTACAAG CATAGAAAGC
GCGGGAATTC ATTTTAAATT GCCATATCCG ATACAGTCGG TTATAAAAGT ACCGGTACAA
ATGACCCAAA AGCTGGAACT GGGCTACAGA GACCAAGGTG ACGGCAGGTA TGTAACTGTG
GATGAAGAGT CAAAAATGAT TACGGGAGAT TTTAATATAG TAAAGATTGA CTTCTTTATC
GAATGGAAGG TTTCCGATCC GAAAAAGTAT CTTTTTAATT CAGAGGATCC CAAAAACATA
CTCAGAGACT CAAGTCTAAG TGCCGCACGT TCTGTCGTAG GTTCATCAAC CATTGATGAT
GTGCTTACCA GCGGAAAAAT TGCAATTGAG AACGAGATTA AGGAAAAGCT GATAGCAAGC
CTTGATGCCT ATGATATCGG AATTCAGGTG CTGGATGTAA AAATACAGGA TTCGGAACCG
CCCACGGAAG AAGTGAAGCA GGCATTCAAG AACGTGGAAA ATGCAAAGCA GAGCAAGGAG
ACGGCCATGA ATGAGGCAAA CAAATACAGA AACACAGAGA TTCCAAAGGC CCAGGCGGAA
GCCGACCGTA TATTGCGCAA TGCAGAATCT CAAAAGCAGA CAAAGATAAA TGAGGCCAGG
GGAGAAGTGG CCAAGTTTTT AAAAATGTAT GAGGAATACA AGAATTATAA AGATGTCACA
AAGACAAGGC TTTATCTTGA GGCAATGGAA GAGATACTTC CGGGTATTAC GGTTTATATT
GAAGATAATT CTTCCGGTGT TCAAAAGCTT GTTCCGCTAA AGCCGTTTGA TTCAGAGGGG
GGCGAATAG
 
Protein sequence
MEAINVGGNF RKAAKLPVKL IIGAIVLVIF AILFFNSFYT VTDQEQAVVL TFGKVTSIES 
AGIHFKLPYP IQSVIKVPVQ MTQKLELGYR DQGDGRYVTV DEESKMITGD FNIVKIDFFI
EWKVSDPKKY LFNSEDPKNI LRDSSLSAAR SVVGSSTIDD VLTSGKIAIE NEIKEKLIAS
LDAYDIGIQV LDVKIQDSEP PTEEVKQAFK NVENAKQSKE TAMNEANKYR NTEIPKAQAE
ADRILRNAES QKQTKINEAR GEVAKFLKMY EEYKNYKDVT KTRLYLEAME EILPGITVYI
EDNSSGVQKL VPLKPFDSEG GE
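
The relationship between the two sequences above can be checked by translating the gene sequence codon by codon. The sketch below is a hedged illustration, not the database's own method: it builds the standard codon table by hand, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits. The function name `translate` and the two test fragments (the first 60 and the last 9 bases of the gene sequence) are chosen here for illustration.

```python
# Codon table in NCBI order (T, C, A, G at each position).
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code; only start-codon usage differs.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon reached
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases and last 9 bases of the gene sequence in this record.
first_60 = "ATGGAAGCAA TAAACGTGGG AGGCAATTTT AGAAAAGCTG CAAAGCTTCC AGTGAAACTG"
last_9 = "GGCGAATAG"

print(translate(first_60))  # prints: MEAINVGGNFRKAAKLPVKL
print(translate(last_9))    # prints: GE
```

The two outputs match the first 20 and the last 2 residues of the protein sequence listed above.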