Gene NATL1_21861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21861 
SymboldnaK 
ID4779409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1846480 
End bp1848372 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content39% 
IMG OID640085484 
Productmolecular chaperone DnaK 
Protein accessionYP_001016006 
Protein GI124026891 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.357564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.686332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAGG TTGTCGGAAT CGATCTAGGG ACTACAAATA GTTGTGTTGC TGTTATGGAG 
GGCGGTAAAC CTACAGTAAT AGCAAATGCT GAAGGATTTA GAACAACTCC ATCTGTCGTT
GCTTATACAA AAAATCAAGA CCAATTGGTT GGCCAAATCG CCAAACGACA AGCGGTGATG
AATCCTGAAA ATACTTTTTA TTCTTCAAAG AGGTTTGTTG GTCGTAGAGT TGACGAAGTA
AATGATGAAT CAAAGGAAGT TAGTTATGGA GTAGAAAAAG CAGGTTCCAA CGTCAAAATA
AAATGCCCAA TACTTGATAA GCAGTTTTCT CCTGAAGAAG TAAGTGCTCA GGTTTTAAGA
AAACTTTCTG ATGATGCAGG CAAATATTTA GGTGAAACGG TTACTCAAGC AGTTATTACA
GTTCCTGCTT ACTTCAATGA TTCACAAAGA CAAGCAACTA AAGATGCTGG AAAGATTGCA
GGCTTAGAAG TTCTTAGAAT TATTAATGAG CCAACAGCTG CTGCTTTAGC GTACGGCTTA
GATAAAAAGA GTAATGAAAG AATTTTAGTT TTCGATCTGG GTGGTGGAAC CTTTGATGTA
TCTGTTTTAG AAGTAGGAGA TGGAGTTTTC GAAGTTCTCT CTACTTCTGG AGATACTCAT
TTGGGTGGTG ATGATTTCGA TAGAGTAATT GTTGATCATT TGGCATCTAC TTTCAAAGGT
AATGAAGGCA TTGATTTACG CCAAGATAAG CAAGCACTGC AACGCTTAAC CGAAGCAGCT
GAAAAAGCTA AAATAGAATT ATCAAACGCT ACTCAAAGCG AAATAAATCT TCCATTTATA
ACTGCTACGC CTGAAGGTCC CAAGCATCTT GATTTGACTC TTACAAGAGG AAAATTTGAG
GAGTTGGCAT CAAACCTTAT TGATCGCTGC AGGGTCCCTG TAGAGCAAGC CTTGAAAGAT
GCAAAATTAT CTACTGGTGA AATAGATGAA ATTGTTATGG TTGGAGGCTC TACCCGAATG
CCAGCTGTTA AAGAGTTAGT AAAAAGAGTA ACTACTAAGG ATCCAAATCA AACAGTTAAT
CCAGATGAAG TAGTAGCGGT TGGAGCAGCT ATTCAAGGTG GAGTTCTTGC CGGAGAAGTT
AAAGATATTT TGCTACTTGA TGTAACCCCA TTGTCACTTG GGGTTGAAAC TTTGGGTGGG
GTAATGACAA AAATGATTTC AAGAAATACT ACTGTTCCCA CTAAAAAAGC TGAAACTTAC
TCGACTGCAG TCGATGGTCA AACCAATGTA GAAATTCACG TCTTGCAAGG GGAACGTGAG
ATGGCTTCTG ATAATAAAAG TTTAGGAACT TTTCGTTTAG ATGGAATTCC GCCTGCTCCT
CGGGGTGTTC CGCAAATCGA AGTAACTTTT GATATTGATG CTAATGGAAT TCTTAGCGTT
AATGCAAAAG ACAAAGGAAG CGGTAAAGAG CAGAGTATCT CTATTACTGG AGCATCAACT
TTGTCTGATA ACGAAGTTGA AAAAATGGTT AAAGACGCAG AAATGAATGC CTCTACTGAT
AAAGAAAAGC GTGAAAAAAT TGATATTAAA AATCAAGCTG AAACACTTGT TTATCAGGCT
GAAAAACAAA TTGGTGAGCT TGGCGACAAA GTTGATGAAG CAGCAAAAGC GAAAGTTGAA
GAGAAACGCG TCAAGTTAAA AGAAGCCACC GAAAAAGATG ATTATGAAAC CATGAAGAAG
CTAGTTGAGG AGCTACAACA AGAATTGTAT TCCTTAGGAG CTTCTGTATA TCAACAAGCG
AATGATGCTT CTCAAGCCGC GGCAGATTCT AATACTGACA GTAAAGTTGA TGGAGATGAT
GTTATTGATG CTGACTTTAC AGAGACAAAA TAA
 
Protein sequence
MGKVVGIDLG TTNSCVAVME GGKPTVIANA EGFRTTPSVV AYTKNQDQLV GQIAKRQAVM 
NPENTFYSSK RFVGRRVDEV NDESKEVSYG VEKAGSNVKI KCPILDKQFS PEEVSAQVLR
KLSDDAGKYL GETVTQAVIT VPAYFNDSQR QATKDAGKIA GLEVLRIINE PTAAALAYGL
DKKSNERILV FDLGGGTFDV SVLEVGDGVF EVLSTSGDTH LGGDDFDRVI VDHLASTFKG
NEGIDLRQDK QALQRLTEAA EKAKIELSNA TQSEINLPFI TATPEGPKHL DLTLTRGKFE
ELASNLIDRC RVPVEQALKD AKLSTGEIDE IVMVGGSTRM PAVKELVKRV TTKDPNQTVN
PDEVVAVGAA IQGGVLAGEV KDILLLDVTP LSLGVETLGG VMTKMISRNT TVPTKKAETY
STAVDGQTNV EIHVLQGERE MASDNKSLGT FRLDGIPPAP RGVPQIEVTF DIDANGILSV
NAKDKGSGKE QSISITGAST LSDNEVEKMV KDAEMNASTD KEKREKIDIK NQAETLVYQA
EKQIGELGDK VDEAAKAKVE EKRVKLKEAT EKDDYETMKK LVEELQQELY SLGASVYQQA
NDASQAAADS NTDSKVDGDD VIDADFTETK