Gene NATL1_09851 details

Gene Information

Locus tag: NATL1_09851
Symbol:
ID: 4779912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 903625
End bp: 905625
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 39%
IMG OID: 640084263
Product: molecular chaperone DnaK
Protein accession: YP_001014808
Protein GI: 124025692
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
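
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(903625, 905625)
print(length)                  # 2001
print(protein_length(length))  # 666
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.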


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.304248
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGAGAA TAGTTGGCAT TGACTTGGGA ACGACTAATT CCGTTGTAGC GGTGTTGGAG 
GCTGGTAGGC CTGTTGTTAT TGCTAGTGCT GAAGGTGCTA GAACTACGCC ATCAGTTGTT
GGCTTTACTA AGGAATCTGA ATTGTTAGTC GGTCAACTTG CGAGACGCCA ATTAGTTCTT
AATCCTAAAA ACACTTTTTC AAATTTAAAA AGATTTGTTG GCAGAGCATG GGATGAGCTT
GAAGAAACAA GCCTTTCAGT TCCTTATAGT GTCCGCTCAA ATGATCAGGG AAATGTTCGC
ATCACCTCTC CCATAACAAA ACGGGAATAT GCGCCTGAGG AACTTATTGG CAATATCATT
AGAAAGTTAA TAGATGACGC TGAAACATAT TTAGGAGAAA ACGTTGATGC TGCGGTAATC
ACTGTTCCAG CTTATTTCAA CGATTCACAA AGACAAGCTA CCCGTGATGC TGCGATTTTG
GCGGGCATAT CTGTTGAAAG GATTCTAAAT GAACCAACCT CCGCCGCTCT TGCTTATGGA
TTTGATAAAA GCTCATCTAG AAAAGTTTTG GTTTTTGATT TAGGTGGTGG AACATTTGAT
GTGTCTTTAA TGTCCATTTC CAATGGTGTT TTTGATGTAA AGGCAACTTC AGGTGATACA
CAATTGGGAG GTAATGATTT TGATCAAAGA ATTGTTGATT GGCTTGCTGA AGATTTTTTA
GCAAAGAATA AACTTGACCT AAGAAGAGAT AGGCAATCAT TACAAAGACT TACTGAAGCT
GCTGAGAAAG CTAAACAAGA ACTTTCTGGT GTTCAAGCCA CACCTATCTC ATTACCTTTT
ATTTCTACAG GAAAAGATGG CCCATTACAT ATAGAAACTA CCCTAAGTAG AAAAAAGTAC
GAGAGTCTTT GCAATGACCT TTTAGACAGA TTATTTGATC CTGTAAATAC TGTTATTGAT
GATTCAGGCT GGAATCCTGA GGATATTGAT GAAGTTGTTC TTGTAGGTGG AAGTACACGT
ATGCCAATGG TAAAGCAATT AGTTAAAACA TTAGTTCCAA ATCCACCTTG TCAATCTGTT
AACCCTGATG AGGTTGTTGC TATTGGTGCG GCAATTCAAG GTGGGATTCT TTCAGGAGAG
TTGAGAGACC TTTTATTGAA TGATGTCACA CCTCTCTCTT TAGGACTAGA AACTGTAGGT
GGATTAATGA AGGTCCTAAT TCCACGTAAT ACGTCCATAC CAGTTAGACA ATCAGATGTG
TTTAGCACAT CTGCTTCCAA CCAATCATCA GTTGAAATTC ATATATGGCA AGGAGAGAGG
CAAATGGCCT CAGACAACAA ATCACTGGGA AAATTTAAAT TATCTGGTAT TCCTCCTGCC
CCTCGAGGTG TTCCTCAAGT TCAGGTGGCT TTTGATATTG ATGCCAATGG TTTATTAGAA
GTCAGTGCCA CTGACCGAAC TACTGGGAGG AAACAGTCTG TAAGTGTTAC TGGCGGTTCA
AATTTGAATC AAAATGAAGT TAATAAGTTG ATTGAGGAGT CCAAAGTAAA AGCATCTGAA
GATAGAAAAA AGCGAGCTTC TATTGATCAG AAAAATAATG CTTTAACACT TGTTGCTCAA
GCTGAGAGAC GACTAAGAGA CGCTTCACTT GAGTTGGGGC CCTATGGCGC CGAGAGACAA
CAAAGATCTG TAGAGGTTGC GATGAGGGAC GTTGAAGATT TGCTTCAAGA TAATGATTTG
CAAGAACTCG AATATGCAGT TGGTTCTCTA CAAGAAGCAT TATTTGGTTT GAATCGTCGC
CTATCTGCAG AAAGAAAAAC AGATTCAAAT CCCATACAAG GAATTAAAAA TACTTTTGGG
TCATTAAAGG ACGAGTTGTT TTCAGACGAC TACTGGGATG ATGATCCTTG GGATTATTCT
CAAGGACGTC AAAATAGAAA TGGTGATAAT AATTATGGGA GAAGGGATTT AGATCCTTGG
GATAATGACT ACTACCGTTG A
 
Protein sequence
MGRIVGIDLG TTNSVVAVLE AGRPVVIASA EGARTTPSVV GFTKESELLV GQLARRQLVL 
NPKNTFSNLK RFVGRAWDEL EETSLSVPYS VRSNDQGNVR ITSPITKREY APEELIGNII
RKLIDDAETY LGENVDAAVI TVPAYFNDSQ RQATRDAAIL AGISVERILN EPTSAALAYG
FDKSSSRKVL VFDLGGGTFD VSLMSISNGV FDVKATSGDT QLGGNDFDQR IVDWLAEDFL
AKNKLDLRRD RQSLQRLTEA AEKAKQELSG VQATPISLPF ISTGKDGPLH IETTLSRKKY
ESLCNDLLDR LFDPVNTVID DSGWNPEDID EVVLVGGSTR MPMVKQLVKT LVPNPPCQSV
NPDEVVAIGA AIQGGILSGE LRDLLLNDVT PLSLGLETVG GLMKVLIPRN TSIPVRQSDV
FSTSASNQSS VEIHIWQGER QMASDNKSLG KFKLSGIPPA PRGVPQVQVA FDIDANGLLE
VSATDRTTGR KQSVSVTGGS NLNQNEVNKL IEESKVKASE DRKKRASIDQ KNNALTLVAQ
AERRLRDASL ELGPYGAERQ QRSVEVAMRD VEDLLQDNDL QELEYAVGSL QEALFGLNRR
LSAERKTDSN PIQGIKNTFG SLKDELFSDD YWDDDPWDYS QGRQNRNGDN NYGRRDLDPW
DNDYYR
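
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The sketch below is a hedged, standard-library-only illustration: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and final 21 bases of the gene sequence shown above.
print(translate("ATGGGGAGAA TAGTTGGCAT TGACTTGGGA"))  # MGRIVGIDLG
print(translate("GATAATGACT ACTACCGTTG A"))           # DNDYYR
```

Passing the full gene sequence to `translate` and `gc_content` would allow comparison against the listed protein sequence and the GC content field. Each 60-base display line starts in frame, so individual lines can also be translated on their own.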