Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_09851 |
Symbol | |
ID | 4779912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 903625 |
End bp | 905625 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640084263 |
Product | molecular chaperone DnaK |
Protein accession | YP_001014808 |
Protein GI | 124025692 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02350] chaperone protein DnaK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.304248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAGAA TAGTTGGCAT TGACTTGGGA ACGACTAATT CCGTTGTAGC GGTGTTGGAG GCTGGTAGGC CTGTTGTTAT TGCTAGTGCT GAAGGTGCTA GAACTACGCC ATCAGTTGTT GGCTTTACTA AGGAATCTGA ATTGTTAGTC GGTCAACTTG CGAGACGCCA ATTAGTTCTT AATCCTAAAA ACACTTTTTC AAATTTAAAA AGATTTGTTG GCAGAGCATG GGATGAGCTT GAAGAAACAA GCCTTTCAGT TCCTTATAGT GTCCGCTCAA ATGATCAGGG AAATGTTCGC ATCACCTCTC CCATAACAAA ACGGGAATAT GCGCCTGAGG AACTTATTGG CAATATCATT AGAAAGTTAA TAGATGACGC TGAAACATAT TTAGGAGAAA ACGTTGATGC TGCGGTAATC ACTGTTCCAG CTTATTTCAA CGATTCACAA AGACAAGCTA CCCGTGATGC TGCGATTTTG GCGGGCATAT CTGTTGAAAG GATTCTAAAT GAACCAACCT CCGCCGCTCT TGCTTATGGA TTTGATAAAA GCTCATCTAG AAAAGTTTTG GTTTTTGATT TAGGTGGTGG AACATTTGAT GTGTCTTTAA TGTCCATTTC CAATGGTGTT TTTGATGTAA AGGCAACTTC AGGTGATACA CAATTGGGAG GTAATGATTT TGATCAAAGA ATTGTTGATT GGCTTGCTGA AGATTTTTTA GCAAAGAATA AACTTGACCT AAGAAGAGAT AGGCAATCAT TACAAAGACT TACTGAAGCT GCTGAGAAAG CTAAACAAGA ACTTTCTGGT GTTCAAGCCA CACCTATCTC ATTACCTTTT ATTTCTACAG GAAAAGATGG CCCATTACAT ATAGAAACTA CCCTAAGTAG AAAAAAGTAC GAGAGTCTTT GCAATGACCT TTTAGACAGA TTATTTGATC CTGTAAATAC TGTTATTGAT GATTCAGGCT GGAATCCTGA GGATATTGAT GAAGTTGTTC TTGTAGGTGG AAGTACACGT ATGCCAATGG TAAAGCAATT AGTTAAAACA TTAGTTCCAA ATCCACCTTG TCAATCTGTT AACCCTGATG AGGTTGTTGC TATTGGTGCG GCAATTCAAG GTGGGATTCT TTCAGGAGAG TTGAGAGACC TTTTATTGAA TGATGTCACA CCTCTCTCTT TAGGACTAGA AACTGTAGGT GGATTAATGA AGGTCCTAAT TCCACGTAAT ACGTCCATAC CAGTTAGACA ATCAGATGTG TTTAGCACAT CTGCTTCCAA CCAATCATCA GTTGAAATTC ATATATGGCA AGGAGAGAGG CAAATGGCCT CAGACAACAA ATCACTGGGA AAATTTAAAT TATCTGGTAT TCCTCCTGCC CCTCGAGGTG TTCCTCAAGT TCAGGTGGCT TTTGATATTG ATGCCAATGG TTTATTAGAA GTCAGTGCCA CTGACCGAAC TACTGGGAGG AAACAGTCTG TAAGTGTTAC TGGCGGTTCA AATTTGAATC AAAATGAAGT TAATAAGTTG ATTGAGGAGT CCAAAGTAAA AGCATCTGAA GATAGAAAAA AGCGAGCTTC TATTGATCAG AAAAATAATG CTTTAACACT TGTTGCTCAA GCTGAGAGAC GACTAAGAGA CGCTTCACTT GAGTTGGGGC CCTATGGCGC CGAGAGACAA CAAAGATCTG TAGAGGTTGC GATGAGGGAC GTTGAAGATT TGCTTCAAGA TAATGATTTG CAAGAACTCG AATATGCAGT TGGTTCTCTA CAAGAAGCAT TATTTGGTTT GAATCGTCGC CTATCTGCAG AAAGAAAAAC AGATTCAAAT CCCATACAAG GAATTAAAAA TACTTTTGGG TCATTAAAGG ACGAGTTGTT TTCAGACGAC TACTGGGATG ATGATCCTTG GGATTATTCT CAAGGACGTC AAAATAGAAA TGGTGATAAT AATTATGGGA GAAGGGATTT AGATCCTTGG GATAATGACT ACTACCGTTG A
|
Protein sequence | MGRIVGIDLG TTNSVVAVLE AGRPVVIASA EGARTTPSVV GFTKESELLV GQLARRQLVL NPKNTFSNLK RFVGRAWDEL EETSLSVPYS VRSNDQGNVR ITSPITKREY APEELIGNII RKLIDDAETY LGENVDAAVI TVPAYFNDSQ RQATRDAAIL AGISVERILN EPTSAALAYG FDKSSSRKVL VFDLGGGTFD VSLMSISNGV FDVKATSGDT QLGGNDFDQR IVDWLAEDFL AKNKLDLRRD RQSLQRLTEA AEKAKQELSG VQATPISLPF ISTGKDGPLH IETTLSRKKY ESLCNDLLDR LFDPVNTVID DSGWNPEDID EVVLVGGSTR MPMVKQLVKT LVPNPPCQSV NPDEVVAIGA AIQGGILSGE LRDLLLNDVT PLSLGLETVG GLMKVLIPRN TSIPVRQSDV FSTSASNQSS VEIHIWQGER QMASDNKSLG KFKLSGIPPA PRGVPQVQVA FDIDANGLLE VSATDRTTGR KQSVSVTGGS NLNQNEVNKL IEESKVKASE DRKKRASIDQ KNNALTLVAQ AERRLRDASL ELGPYGAERQ QRSVEVAMRD VEDLLQDNDL QELEYAVGSL QEALFGLNRR LSAERKTDSN PIQGIKNTFG SLKDELFSDD YWDDDPWDYS QGRQNRNGDN NYGRRDLDPW DNDYYR
|
| |