Gene P9211_18401 details

Gene Information

Locus tag: P9211_18401
Symbol: dnaK
ID: 5731723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1671825
End bp: 1673729
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 39%
IMG OID: 641286227
Product: molecular chaperone DnaK
Protein accession: YP_001551725
Protein GI: 159904381
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
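
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1671825, 1673729)
print(length)                  # 1905
print(protein_length(length))  # 634
```

Both values agree with the Gene Length and Protein Length fields of this record.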


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.188042
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.971493
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGGGAAGG TTGTTGGAAT TGACCTTGGT ACGACTAATA GCTGTGTAGC AGTTATGGAG 
GGAGGGAAGC CTACTGTAAT TGCCAATGCG GAGGGTTTTC GTACAACTCC TTCAGTGGTT
GCATATACAA AAAATCAAGA TCAATTGGTT GGGCAAATTG CCAAACGGCA GGCTGTAATG
AATCCGGAGA ATACTTTCTA TTCTTCAAAA CGTTTTGTAG GAAGACGTGT CGACGAGGTT
AATTCAGAAT CTAAAGAAGT CAGTTATTCG GTAGAAAAGG CAGGTTCTAA CGTGAAATTG
AAATGTCCCA TACTTGATAA GCAATTTTCA CCTGAAGAAG TTAGTGCTCA GGTTTTGAGA
AAACTGTCTG AAGATGCTGG AAAATATCTG GGTGAAAATG TTACTCAAGC AGTTATTACT
GTCCCAGCTT ATTTCAATGA TTCACAAAGA CAAGCAACTA AGGATGCTGG AAAGATAGCT
GGCTTAGAGG TTCTTAGAAT TATAAATGAG CCTACAGCGG CAGCTCTTGC TTATGGTTTA
GACAAAAAAA GTAATGAAAG AATTTTAGTA TTTGATCTTG GAGGAGGCAC ATTTGATGTT
TCTGTTCTAG AAGTTGGAGA TGGCGTTTTT GAAGTTTTAT CTACCTCTGG TGATACACAT
CTTGGAGGAG ATGATTTTGA TAAGGTGATT GTTGATCATT TAGCAAGCAC TTTTAAAAGC
AATGAAGGAA TTGATTTAAG ACAGGATAAA CAAGCCTTAC AACGTTTAAC CGAGTCTGCA
GAAAAGGCAA AAATTGAACT TTCTAATGCA ACTCAAAGCG AAATCAACCT TCCGTTTATT
ACAGCTACTC CAGAAGGGCC TAAGCATTTA GATTTAACTT TGACACGCGC AAAATTTGAA
GAACTAGCCT CAAATTTGAT TGATCGCTGT CGTGTACCAG TCGAACAGGC TCTTAAAGAT
GCAAAGTTGT CTACTGGAGA GATAGATGAA ATCGTGATGG TTGGAGGTTC CACACGTATG
CCAGCTGTTA AGGAATTGGT AAAAAGAGTG ACTGGGAAAG ACCCTAATCA AACTGTCAAT
CCTGATGAAG TTGTTGCCGT TGGTGCTGCT ATTCAGGGAG GCGTTTTGGC TGGTGAAGTC
AAAGATATTT TACTTCTTGA TGTCACCCCG CTTTCTCTAG GTGTTGAGAC GCTTGGTGGT
GTAATGACAA AAATGATTAC TAGAAATACG ACTGTACCGA CTAAAAAAGC AGAGACGTAT
TCAACAGCAG TTGATGGCCA GACAAATGTA GAGATTCATG TTCTCCAAGG TGAAAGAGAA
ATGGCCTCTG ACAATAAAAG CCTAGGTACA TTTCGCTTGG ATGGAATACC TCCTGCACCT
AGAGGAGTGC CTCAAATTGA GGTGACTTTT GATATAGATG CGAATGGAAT TCTTAGTGTG
ACTGCTAAGG ATAAAGGCAG TGGTAAAGAG CAAAGTATTT CTATTACTGG TGCTTCAACT
CTTTCAGATA ATGAAGTTGA TAAAATGGTT AAAGATGCTG AAGTTAATGC AACAGCAGAT
AAAGATAAGC GTGAAAGGAT TGATTTAAAA AACCAGGCAG AAACATTGGT TTATCAGACC
GAAAAGCAAC TTGGTGAGCT TGGAGATAAG GTTGAACCAG ATGCAAAAGT AAAAGTTGAA
GAAAAGCGTA TGAAATTGAA AGAGGCTACT GATAAGGATG ATTTTGAAAC TATGAAAACT
TTGGTTGAAG AACTACAACA AGAACTCTAT TCTTTAGGGG CTTCTGTTTA TCAACAAGCA
AATGCAGCTT CTCAAGCAGC CGAGACTTCT GGAAATGATA CCGGTAATTC TAATGGTGGG
AATAATGATG ATGTAATTGA TGCTGAATTT ACAGAATCTA AATAA
 
Protein sequence
MGKVVGIDLG TTNSCVAVME GGKPTVIANA EGFRTTPSVV AYTKNQDQLV GQIAKRQAVM 
NPENTFYSSK RFVGRRVDEV NSESKEVSYS VEKAGSNVKL KCPILDKQFS PEEVSAQVLR
KLSEDAGKYL GENVTQAVIT VPAYFNDSQR QATKDAGKIA GLEVLRIINE PTAAALAYGL
DKKSNERILV FDLGGGTFDV SVLEVGDGVF EVLSTSGDTH LGGDDFDKVI VDHLASTFKS
NEGIDLRQDK QALQRLTESA EKAKIELSNA TQSEINLPFI TATPEGPKHL DLTLTRAKFE
ELASNLIDRC RVPVEQALKD AKLSTGEIDE IVMVGGSTRM PAVKELVKRV TGKDPNQTVN
PDEVVAVGAA IQGGVLAGEV KDILLLDVTP LSLGVETLGG VMTKMITRNT TVPTKKAETY
STAVDGQTNV EIHVLQGERE MASDNKSLGT FRLDGIPPAP RGVPQIEVTF DIDANGILSV
TAKDKGSGKE QSISITGAST LSDNEVDKMV KDAEVNATAD KDKRERIDLK NQAETLVYQT
EKQLGELGDK VEPDAKVKVE EKRMKLKEAT DKDDFETMKT LVEELQQELY SLGASVYQQA
NAASQAAETS GNDTGNSNGG NNDDVIDAEF TESK
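
As a consistency check between the two sequences, the gene sequence can be translated with the record's translation table (11) and compared with the protein sequence. The following is a minimal sketch in plain Python. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon-to-amino-acid mapping is written out by hand from the NCBI bacterial code rather than taken from a library. Table 11 differs from the standard code only in its permitted start codons, which does not matter here because the gene begins with ATG.

```python
# Codon table for NCBI translation table 11, in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGGGAAGG TTGTTGGAAT TGACCTTGGT"
print(translate(first_blocks))  # MGKVVGIDLG
```

Passing the full gene sequence to `translate` gives a string that can be compared with the protein sequence once its spaces are removed, and `gc_percent` on the same input can be compared with the GC content field.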