Gene SeD_A0012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0012 
SymboldnaK 
ID6873896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp11594 
End bp13510 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content52% 
IMG OID642783272 
Productmolecular chaperone DnaK 
Protein accessionYP_002213966 
Protein GI198242776 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.276722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAA TTATTGGTAT CGACCTGGGT ACTACCAACT CTTGTGTAGC GATTATGGAT 
GGAACGCAGG CACGCGTGCT GGAGAACGCC GAGGGCGATC GCACTACGCC TTCTATCATT
GCTTATACCC AGGATGGTGA AACTCTGGTT GGTCAGCCGG CTAAACGTCA GGCAGTGACA
AACCCGCAAA ACACCCTGTT TGCGATTAAA CGCCTGATTG GCCGCCGCTT CCAGGACGAA
GAAGTTCAAC GTGACGTTTC TATCATGCCG TACAAAATCA TCGGCGCCGA CAACGGCGAC
GCATGGCTTG ATGTGAAAGG TCAGAAAATG GCGCCGCCGC AGATTTCTGC CGAAGTGCTG
AAGAAAATGA AGAAAACGGC TGAAGATTAT CTGGGCGAAC CGGTAACTGA AGCGGTTATC
ACCGTACCGG CTTACTTTAA CGATGCGCAG CGTCAGGCTA CCAAAGATGC TGGTCGTATC
GCGGGGCTGG AAGTTAAACG TATCATCAAC GAACCGACTG CCGCAGCGCT GGCTTACGGT
CTGGATAAAG AAGTCGGCAA CCGTACTATC GCGGTTTACG ACCTCGGTGG TGGTACTTTC
GATATCTCTA TTATCGAAAT CGACGAAGTT GATGGCGAAA AAACCTTTGA AGTTCTGGCA
ACCAACGGTG ATACCCACCT GGGTGGTGAA GACTTCGATA CCCGCCTGAT CAACTACCTC
GTTGACGAGT TTAAGAAAGA TCAGGGCATT GACCTGCGTA ACGATCCGCT GGCCATGCAG
CGCCTGAAAG AAGCCGCAGA AAAAGCCAAA ATCGAGCTGT CTTCTGCGCA GCAGACCGAC
GTGAACCTGC CGTACATTAC CGCAGATGCC ACCGGTCCGA AACACATGAA CATCAAAGTG
ACTCGTGCAA AACTGGAAAG CCTGGTTGAA GATCTGGTGA ACCGTTCTAT CGAGCCGCTG
AAAGTCGCAC TGCAGGACGC TGGCCTGTCC GTGTCTGATA TCAACGACGT GATCCTCGTC
GGCGGTCAGA CCCGTATGCC AATGGTGCAG AAAAAAGTGG CTGAATTCTT TGGTAAAGAG
CCGCGTAAAG ACGTTAACCC GGACGAAGCT GTGGCTATCG GCGCAGCGGT ACAGGGCGGC
GTGTTGACCG GTGATGTGAA AGACGTACTG CTGCTGGACG TTACCCCGCT GTCTCTGGGT
ATCGAAACGA TGGGTGGCGT GATGACTCCG CTTATCACCA AAAACACCAC CATCCCGACC
AAGCACAGCC AGGTGTTCTC TACTGCGGAA GACAACCAGT CTGCGGTAAC CATCCATGTG
CTGCAGGGTG AGCGTAAACG TGCGTCTGAT AACAAATCTC TGGGTCAGTT CAACCTGGAT
GGCATCAACC CGGCGCCGCG CGGTATGCCG CAGATCGAAG TCACCTTCGA TATCGATGCT
GACGGTATCC TGCACGTCTC CGCGAAAGAT AAAAATAGCG GTAAAGAGCA GAAGATCACT
ATCAAGGCGT CTTCTGGTCT GAACGAGGAA GAAATTCAGA AAATGGTTCG CGATGCAGAA
GCGAACGCTG AATCCGACCG TAAGTTCGAA GAGCTGGTTC AGACCCGTAA CCAGGGCGAC
CATCTGCTGC ACAGCACCCG TAAGCAGGTT GAAGAAGCAG GCGATAAACT GCCGGCTGAT
GACAAAACCG CTATCGAGTC TGCGCTGAAC GCGCTGGAAA CTGCCCTGAA AGGCGAAGAT
AAAGCCGCTA TCGAAGCGAA AATGCAGGAA CTGGCGCAGG TTTCCCAGAA ACTGATGGAA
ATCGCTCAGC AGCAACATGC GCAGCAGCAG GCTGGCTCCG CTGACGCTTC TGCAAACAAC
GCGAAAGATG ACGACGTTGT CGACGCTGAG TTTGAAGAAG TAAAAGATAA AAAATAA
 
Protein sequence
MGKIIGIDLG TTNSCVAIMD GTQARVLENA EGDRTTPSII AYTQDGETLV GQPAKRQAVT 
NPQNTLFAIK RLIGRRFQDE EVQRDVSIMP YKIIGADNGD AWLDVKGQKM APPQISAEVL
KKMKKTAEDY LGEPVTEAVI TVPAYFNDAQ RQATKDAGRI AGLEVKRIIN EPTAAALAYG
LDKEVGNRTI AVYDLGGGTF DISIIEIDEV DGEKTFEVLA TNGDTHLGGE DFDTRLINYL
VDEFKKDQGI DLRNDPLAMQ RLKEAAEKAK IELSSAQQTD VNLPYITADA TGPKHMNIKV
TRAKLESLVE DLVNRSIEPL KVALQDAGLS VSDINDVILV GGQTRMPMVQ KKVAEFFGKE
PRKDVNPDEA VAIGAAVQGG VLTGDVKDVL LLDVTPLSLG IETMGGVMTP LITKNTTIPT
KHSQVFSTAE DNQSAVTIHV LQGERKRASD NKSLGQFNLD GINPAPRGMP QIEVTFDIDA
DGILHVSAKD KNSGKEQKIT IKASSGLNEE EIQKMVRDAE ANAESDRKFE ELVQTRNQGD
HLLHSTRKQV EEAGDKLPAD DKTAIESALN ALETALKGED KAAIEAKMQE LAQVSQKLME
IAQQQHAQQQ AGSADASANN AKDDDVVDAE FEEVKDKK