Gene Cthe_1322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1322 
SymboldnaK 
ID4809462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1604299 
End bp1606125 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content44% 
IMG OID640106746 
Productmolecular chaperone DnaK 
Protein accessionYP_001037747 
Protein GI125973837 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAGG TTATCGGCAT AGATCTTGGA ACGACCAACT CTTGCGTTGC AGTTATGGAA 
GGCGGAGAGC CTGTTGTAAT TCCAAATTCC GAGGGTTCCA GAACCACACC TTCAGTTGTG
GCTTTTACAA AAAACAATGA AAGACTTGTG GGACAGGTGG CAAAAAGGCA GGCTATTACG
AATCCTGAAA GAACTATTAT TTCTATTAAA AGGGACATGG GAACTGACAA AAGAGTAAAA
ATTGATGATA AAAGCTATAC TCCCCAGGAG ATTTCGGCAA TGATTCTTCA AAAAATAAAA
GCGGATGCCG AGGCATATTT AGGTGAGAAA GTTACCCAGG CTGTAATAAC AGTGCCGGCA
TATTTCAGCG ACTCTCAAAG ACAGGCTACA AAGGATGCTG GCAGGATCGC AGGTTTGGAA
GTTTTAAGAA TAATCAACGA GCCTACGGCA GCGGCTTTGG CATATGGACT GGACAAGGAA
AGCGACCAGA AAATTCTTGT CTTTGACCTC GGTGGTGGTA CGTTCGACGT ATCCATACTG
GAAATAGGAG ACGGAGTTTT TGAAGTCCTT GCAACAAGCG GTAACAACAG ATTGGGTGGA
GATGACTTTG ACCAAAGAAT AATAGACTAT TTGATAGACC TCTTCAAAAA AGAGCACGGC
ATTGATTTGA GCACTGACAA GATGGCAATG CAGAGACTCA AGGAAGCGGC AGAAAAAGCT
AAAATTGAGC TTTCCGGCGT TACCACCACA AATATAAATC TTCCTTTTAT TACGGCTGAT
GCAAATGGCC CGAAACACCT CGATGTTACG TTGACAAGAG CGAAGTTTGA AGAGTTGACG
GCAGATCTTG TGGAAAAGAC AATGGAACCT ACAAGAAGGG CGTTGGAGGA CTCGGGACTT
ACTCCGGATA AAATTGACAA GATTCTTTTG GTTGGCGGTT CCACAAGAAT ACCGGCCGTA
CAGGAAGCTG TAAGGAAATT CTTTGGCAAG GAGCCCTTTA AAGGAATAAA TCCTGACGAA
TGTGTTGCCA TAGGCGCAGC TATTCAGGCG GGGGTACTTA CCGGAGAAGT AAAAGATCTT
CTGCTTCTTG ATGTAACTCC TCTTTCTCTT GGAATTGAAA CTTTGGGTGG TGTGTTCACA
AAACTTATTG AAAGAAATAC TACGATACCG ACCAAGAAAA GCCAGATTTT CTCAACTGCC
GCAGACGGTC AGACAGCTGT TACCGTAAGG GTATTCCAGG GAGAAAGAGC CATGGCTGCT
GACAACAAGC TTTTGGGAGA ATTTACTCTT GACGGTATAC CACCGGCACC GAAAGGCGTG
CCTCAGATTG AGGTAACCTT TGACATAGAT GCGAACGGTA TTGTGCATGT TTCGGCAAAG
GACCTTGGTA CAGGAAAAGA GCAGCACATA ACCATAACTG CTTCTACGAA CCTGTCCGAA
GCTGAGATAG AAAAAGCTAT AAATGAAGCC AAGAAGTATG AAGAAGAGGA CAGAAAGAGA
AAAGAAAGTG CCGAGACAAG AAACAATGCC GACTCAATGG TATTCCAGGC TGAGAAAACA
TTAAAAGATT TGGGAGATAA ACTTAGCAGT GAAGACAAAG CGAAGATTGA AGCAGAAATA
GAAAAGGTCA GGGAAGCTTT GAAGGGAACG GATACCCAGG CAATCAAGAA AGCAACGGAA
GATTTGCAGC AGGCATTCTA CAGTGTATCG GCAAAAATCT ATCAGCAAGG GCAGGCTGCA
GGTGCAAATC CCGGTGCTCA AACGACCGGC GGAGAACAGG GAAATGTGTA TGACGCCGAG
TACAAGGTAG TGGATGACGA CAAATAA
 
Protein sequence
MGKVIGIDLG TTNSCVAVME GGEPVVIPNS EGSRTTPSVV AFTKNNERLV GQVAKRQAIT 
NPERTIISIK RDMGTDKRVK IDDKSYTPQE ISAMILQKIK ADAEAYLGEK VTQAVITVPA
YFSDSQRQAT KDAGRIAGLE VLRIINEPTA AALAYGLDKE SDQKILVFDL GGGTFDVSIL
EIGDGVFEVL ATSGNNRLGG DDFDQRIIDY LIDLFKKEHG IDLSTDKMAM QRLKEAAEKA
KIELSGVTTT NINLPFITAD ANGPKHLDVT LTRAKFEELT ADLVEKTMEP TRRALEDSGL
TPDKIDKILL VGGSTRIPAV QEAVRKFFGK EPFKGINPDE CVAIGAAIQA GVLTGEVKDL
LLLDVTPLSL GIETLGGVFT KLIERNTTIP TKKSQIFSTA ADGQTAVTVR VFQGERAMAA
DNKLLGEFTL DGIPPAPKGV PQIEVTFDID ANGIVHVSAK DLGTGKEQHI TITASTNLSE
AEIEKAINEA KKYEEEDRKR KESAETRNNA DSMVFQAEKT LKDLGDKLSS EDKAKIEAEI
EKVREALKGT DTQAIKKATE DLQQAFYSVS AKIYQQGQAA GANPGAQTTG GEQGNVYDAE
YKVVDDDK