Gene HS_1194 details

Gene Information

Locus tag: HS_1194
Symbol: dnaK
ID: 4240695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1346950
End bp: 1348857
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 40%
IMG OID: 638104757
Product: molecular chaperone DnaK
Protein accession: YP_719406
Protein GI: 113461337
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
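
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The Python sketch below is an illustration only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 1346950
end_bp = 1348857

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1908 635
```

Under these assumptions the result matches the reported Gene Length (1908 bp) and Protein Length (635 aa).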


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00921212
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAAAA TTATCGGTAT TGACTTAGGA ACAACGAACT CTTGTGTGGC TGTAATGGAC 
GGCGACAAAC CACGTGTCAT TGAAAACGCT GAAGGCGAAC GCACAACCCC ATCCATTATT
GCTTATACAA ATGATAATGA AACCTTAGTA GGACAACCAG CGAAACGTCA GGCTGTGACG
AATCCGAAAA ATACATTATT TGCGATCAAA CGTTTAATTG GTCGTCGCTT TGAAGATCAA
GAAGTACAAC GTGACGTTGC GATTATGCCT TTTGAAATCA CTAAAGCCGA TAACGGCGAC
GCTTGGGTAT CGGTAAAAGG TGAAAAAATG GCACCTCCGC AAATTTCAGC TGAAGTGTTG
AAAAAAATGA AAAAAACTGC GGAAGATTTT TTAGGTGAAA CAGTAACAGA AGCTGTAATC
ACAGTGCCGG CTTATTTTAA TGACGCACAA CGCCAAGCAA CAAAAGATGC CGGACGTATT
GCCGGTCTTG AAGTCAAACG TATTATTAAC GAACCGACAG CGGCGGCTCT TGCTTATGGT
TTAGATAAAG GTAAAGGCAA CCAAACAATT GCCGTTTATG ACTTGGGTGG TGGTACATTT
GACTTATCTA TTATCGAAAT CGATGAAGTC GGTGGCGAAA AAACGTTTGA AGTTTTAGCA
ACAAACGGAG ATACTCACCT AGGAGGGGAA GACTTTGACA ACCGTGTAAT TAACTACCTA
GTTGATGAGT TCAAAAAAGA ACAGGGCGTA GATTTACGTA ACGATCCGCT CGCAATGCAA
CGTTTAAAAG AAGCCGGTGA AAAAGCAAAA ATTGAGCTTT CTTCCGCTCA ACAAACTGAT
GTCAATTTAC CTTATATCAC TGCTGATGCT ACCGGACCTA AACATTTAAA CATTAAATTG
ACTCGTGCGA AATTAGAAGC GTTAGTGGAA GATTTAGTTG CACGTTCAAT GGAACCGGTT
AAAGTTGCAC TTTCCGATGC AGGTTTAAGC GTTTCCGAAA TCAATGATGT CATTTTGGTG
GGCGGACAAA CTCGTATGCC GTTAGTACAA CAGAAAGTAG CTGAATTCTT TGGTAAAGAG
CCTCGCAGAG ACGTTAACCC AGATGAAGCG GTTGCTATTG GTGCAGCTGT ACAAGGCGGT
GTTTTAGCCG GTGATGTTAA AGATGTTTTA TTGTTAGACG TAACACCATT ATCACTCGGT
ATTGAAACAA TGGGCGGCGT GATGACAACT TTGATTGAGA AAAATACAAC AATTCCGACG
AAAAAATCGC AAGTATTTTC AACCGCTGAA GATAATCAAA GTGCGGTGAC TATTCACGTA
CTTCAAGGGG AACGTAAACA AGCCTCTGCC AATAAGTCTT TAGGTCAATT TAATCTTGAA
GGTATTAACC CTGCACCACG TGGTATGCCT CAAATTGAAG TAACCTTTGA TATTGATGCA
GATGGTATTA TCCATGTTTC AGCAAAAGAT AAAGGTACTG GTAAAGAACA AAAAATTACG
ATTAAAGCAT CTTCAGGTTT AAGTGATGAA GAAATTCAAC AAATGGTTCG TGATGCGGAA
GCTAATGCGG AAGCGGATCG TAAATTTGAA GAGTTAGTTC AAGCTCGTAA CCAAGCGGAT
CACTTAGTAC ATAGTACCCG CAAACAATTA GCAGAAGTTG GAGAGAAATT ATCAGCTGAA
GACAAAGCAC CAATTGAAAG TGCGGTCAAT GAACTTGAAA CTGCGGCAAA AGGTGAAGAT
AAAACTGAAA TTGATGCCAA AGTACAAGCA TTAATTCAAG TTTCCGAAAA ACTACTTCAA
GCAAGTCAAC AACAAGCACA AGCTGATGCC GGTGCACAAC AATCTCAAAG TACGAAAGGT
GGAGATGATG TTGTTGATGC TGAATTTGAA GAAGTAAAAG ATAAATAA
 
Protein sequence
MGKIIGIDLG TTNSCVAVMD GDKPRVIENA EGERTTPSII AYTNDNETLV GQPAKRQAVT 
NPKNTLFAIK RLIGRRFEDQ EVQRDVAIMP FEITKADNGD AWVSVKGEKM APPQISAEVL
KKMKKTAEDF LGETVTEAVI TVPAYFNDAQ RQATKDAGRI AGLEVKRIIN EPTAAALAYG
LDKGKGNQTI AVYDLGGGTF DLSIIEIDEV GGEKTFEVLA TNGDTHLGGE DFDNRVINYL
VDEFKKEQGV DLRNDPLAMQ RLKEAGEKAK IELSSAQQTD VNLPYITADA TGPKHLNIKL
TRAKLEALVE DLVARSMEPV KVALSDAGLS VSEINDVILV GGQTRMPLVQ QKVAEFFGKE
PRRDVNPDEA VAIGAAVQGG VLAGDVKDVL LLDVTPLSLG IETMGGVMTT LIEKNTTIPT
KKSQVFSTAE DNQSAVTIHV LQGERKQASA NKSLGQFNLE GINPAPRGMP QIEVTFDIDA
DGIIHVSAKD KGTGKEQKIT IKASSGLSDE EIQQMVRDAE ANAEADRKFE ELVQARNQAD
HLVHSTRKQL AEVGEKLSAE DKAPIESAVN ELETAAKGED KTEIDAKVQA LIQVSEKLLQ
ASQQQAQADA GAQQSQSTKG GDDVVDAEFE EVKDK
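
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, and the GC content is the fraction of G and C bases. The Python sketch below shows one way to reproduce both from the sequence text. The helper names (`clean`, `translate`, `gc_content`) are hypothetical and not part of any database tool. It relies on table 11 sharing its amino-acid assignments with the standard genetic code, so alternative start codons are not treated specially. It also expects only the unambiguous bases A, C, G and T.

```python
# Standard codon table in NCBI ordering (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments; it differs in start codons,
# which this sketch does not handle specially.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGGGAAAAA TTATCGGTAT TGACTTAGGA ACAACGAACT CTTGTGTGGC TGTAATGGAC"
print(translate(first_line))  # prints: MGKIIGIDLGTTNSCVAVMD
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the reported GC content.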