Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_09151 |
Symbol | dnaE |
ID | 4717622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 785966 |
End bp | 789463 |
Gene Length | 3498 bp |
Protein Length | 1165 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640078628 |
Product | DNA polymerase III subunit alpha |
Protein accession | YP_001009306 |
Protein GI | 123968448 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTCG TTCCGCTTCA TAATCATAGT GACTACAGCT TACTTGATGG TGCCAGTCAA ATTTCAAAAA TTGTAGAAAG AGCTTGTGAT CTTGGGATGG ATTCTATTGC TCTCACAGAT CATGGAGTTA TGTATGGTGT TCTTGATTTG GTCAAGAAGT GTAAAGAGAA AGGTATAAAG CCAATTATTG GTAATGAAAT GTACGTTATT AATGGTTCTA TTGATGATCC TCAACCAAAA AAAGAAAAAA GATATCATTT GGTGGTGCTA GCAAAAAATT ATACTGGTTA TAAGAATCTA GTGAAGTTGA CAACAATTAG TCACCTAAAC GGGATGAGAG GTCGAGGCAT TTTTTCTAGG CCATGTATTG ATAAATCTCT TTTAAGCAAA TATAGTGATG GCCTAATAGT CTCTACAGCT TGTCTTGGTG GAGAGATACC TCAGGCTATC TTAAAAGGTA GGTTAGACGT AGCAGAGGAT ATAGCTCTTT GGTATAAAAA ATTATTTGCA GATGACTTTT ATCTAGAAAT ACAAGATCAC GGCTCTATTG AGGATAGAAT TGTTAACGTT GAATTAATAA AAATTGGGAA GAAGCACCAA ATAAAAGTCA TAGCCACCAA CGACGCCCAT TACTTATCAA GTATGGATGT TGAAGCTCAT GATGCCTTAC TTTGTGTATT AACTGGAAAA CTAATAAGTG ATGAAAAAAG ATTGAGATAT ACCGGTACAG AATATATTAA AAGTGAAAAT GAAATGCTTG AACTTTTTAA AGATCATATT GATGATAAAT CAATTATTGA TGCAGTGAAT AATACAGTAG AAATTTCTCA AAAAGTTGAG GTATTTGATT TGTTTGGTAA TTATAGAATG CCCAAATTTC CTCTTAATGA AGATAAAGAT TCATTTTCTT TCCTTAAACA ATTATCTAAT AAAGGTCTTT TAAAAAGACT TAAAAAAAAT GATCTTGATG AAGTTGATGA AAAATATAAA GAAAGACTAA CTTCTGAATT AAAAATTATA AAAGATATGG GTTTCCCAGA TTATTTTTTG GTTGTTTGGG ACTACATCAA ATTTGCTAGA GACAACTCTA TACCAGTAGG ACCAGGTAGA GGTTCTGCTG CGGGTTCACT AGTAGCTTAT GCACTTCAAA TCACAAATAT AGATCCTGTC GAGCATGGAT TGTTATTTGA GAGATTTTTA AATCCAGCAA GAAAGTCTAT GCCAGATATT GATACCGACT TTTGTATTGA TAGGAGAAAT GAAGTTATTG ATTATGTTAC TAATCGTTAT GGAGAGGATA AAGTTGCGCA AATAATTACT TTCAATAAAA TGACCTCTAA GGCGGTTTTA AAAGATGTTG CAAGGGTTCT AGATATTCCG TATGGAGAGG CTGATAAATT GGCTAAGTTA ATACCGGTTG TAAGAGGGAA ACCTTATAAA CTAAATGAAA TGATTGATAA GAATTCTCCT AGCCAAGAGT TTAGAGACAA ATATATTAAT GATAATAGGA TAAAAAAATG GGTTGATTTG GCTTTGAGAA TTGAAGGAAC TAATAAAACA TATGGAGTTC ATGCTGCTGG AGTTGTTATC GCATCAGATC CTCTCGACGA ACTTGTACCT CTTCAAAGGA ATAATGAAGG ACAAATAATA ACCCAATATT CTATGGATGA TATCGAATCA CTTGGATTAT TGAAAATGGA TTTCTTGGGT CTTAAGAATC TCACTATGAT TGAAAAGACA GTTTCTCTTG TTAATCAATC CTCCGGAAAG AAAATAAATA TCGATGAGTT ACCGCGAAAT GACAGTAAAA CCTTTGAGCT TATTGGAAGA GGAGATCTTG AAGGTATTTT TCAGCTTGAA TCTTCTGGTA TGAAACAGGT TGTTAAGGAT TTCAAACCTA ACTCTCTAGA GGATATTTCT TCCATACTGG CTCTTTATAG ACCTGGTCCT CTTGATGCGG GTCTCATTCC TAAATTTATA AATCGAAAAA ATGGGAATGA AAAGATTGAT TTTCCTCATC CTTTTATTAA GTCAATTCTT ACTGAAACCT ATGGAATTAT GGTTTATCAA GAGCAAATCA TGAAAATTGC TCAAGACCTA GCTGGCTATT CTTTAGGTGA TGCTGATTTA CTTAGAAGAG CAATGGGGAA AAAGAAAGTA TCTGAGATGG TAAAGCATAG GAATATTTTT GTAGAAGGTT CTATGAAGAA AGGTGTAAAT GAAAAATTAG CAAATGATCT TTTTGATCAA ATGGTTTTAT TCGCGGAATA TTGTTTTAAC AAAAGTCACT CAACTGCTTA CGGGGCTGTA ACTTATCAAA CTGCATTTTT AAAAGCCCAT TTTCCTGTTG CATATATGGC AGCCCTTCTA AGCGTAAACT CTGGTTCTAG CGATAAGATG CAAAGATATA TTTCTAATTG TTATTCCATG GGAATAGAAG TTATTTCACC AAGCATTAAT TTTTCTGGTG TTGATTTCAC TATTAAGAAT AATCAGATTT TATTCGGGTT ATCTGCAATT AAGAATTTAG GAGATTCTGC GATAAGAAAT ATAATTGAAA ACCGAAATAG TTTAGGAATA TTTAAGTCAC TAGCCGATTT GTGCGATCGT TTGCCTTCTA ATGTTCTTAA CAAAAGAAGT CTTGAATCTC TAATTCATTG TGGAGCACTA GATGAGTTTT CAATTGATAA TAATAGAGCT CAATTATTGT CAGATCTCGA AAATGTCATT GAGTGGGCCT CTTCAAGAAA TCGTGATAGG TTATCTGGGC AAGGCAATCT ATTTGATTCT AAAGAAGAAT TTTCTAATGT TGCTTTTTCA GATTCACAAT TAGCTAAGGT TGAGGATTAT TCACTTATTG AGAAGTTAAA GTTAGAAAAA CAGCTACTAG GTTTTTATTT ATCTGATCAT CCTCTAAAGC ATTTAACTAA GCCAGCAAAA CTTATATCTC CTATAAGCAT TTCGCATTTA GAAGAAACAA AAGATAGAAC CAAAGTCTCT TTAGTTGGAA TGATCCCTGA TTTGAAGCAA ATTACAACGA GAAAAGGAGA TAGGATGGCT ATAGTTCAGC TAGAAGATCT TTCAGGAAGT TGCGAAGCAA TAGTTTTTCC AAAAACCTAT GTAAGATTAT CAGAATTTCT TCTGACGGAT ACAAGATTAT TGGTTTGGGG AACAATAGAT AAAAAAAGTG ATAAGACTCA ATTAATAATT GATGATTGTA GAGAAATCGA TAACCTTAAA TTGCTAATTA TTAATCTTGA AAGTTCTCAA GCATCAGATG TACGCGTACA AAATACTTTG AGAAACTGTT TAATTAAATT TAAACCAGAT AAAGGTAGAT GTGGAATAAA GATTCCAGTT TTAGCTGCAG TAAGAAATAA AAATAGTGTT ACCTACGTTA AATTTGGCGA ACAATTTTGT ATTGGTGATA TTCAGGGAGC ATGCAAATTA TTAGAAGATA AATCATTTAA AGTTAACTTG AAATCTTTAG TTTCCTAG
|
Protein sequence | MAFVPLHNHS DYSLLDGASQ ISKIVERACD LGMDSIALTD HGVMYGVLDL VKKCKEKGIK PIIGNEMYVI NGSIDDPQPK KEKRYHLVVL AKNYTGYKNL VKLTTISHLN GMRGRGIFSR PCIDKSLLSK YSDGLIVSTA CLGGEIPQAI LKGRLDVAED IALWYKKLFA DDFYLEIQDH GSIEDRIVNV ELIKIGKKHQ IKVIATNDAH YLSSMDVEAH DALLCVLTGK LISDEKRLRY TGTEYIKSEN EMLELFKDHI DDKSIIDAVN NTVEISQKVE VFDLFGNYRM PKFPLNEDKD SFSFLKQLSN KGLLKRLKKN DLDEVDEKYK ERLTSELKII KDMGFPDYFL VVWDYIKFAR DNSIPVGPGR GSAAGSLVAY ALQITNIDPV EHGLLFERFL NPARKSMPDI DTDFCIDRRN EVIDYVTNRY GEDKVAQIIT FNKMTSKAVL KDVARVLDIP YGEADKLAKL IPVVRGKPYK LNEMIDKNSP SQEFRDKYIN DNRIKKWVDL ALRIEGTNKT YGVHAAGVVI ASDPLDELVP LQRNNEGQII TQYSMDDIES LGLLKMDFLG LKNLTMIEKT VSLVNQSSGK KINIDELPRN DSKTFELIGR GDLEGIFQLE SSGMKQVVKD FKPNSLEDIS SILALYRPGP LDAGLIPKFI NRKNGNEKID FPHPFIKSIL TETYGIMVYQ EQIMKIAQDL AGYSLGDADL LRRAMGKKKV SEMVKHRNIF VEGSMKKGVN EKLANDLFDQ MVLFAEYCFN KSHSTAYGAV TYQTAFLKAH FPVAYMAALL SVNSGSSDKM QRYISNCYSM GIEVISPSIN FSGVDFTIKN NQILFGLSAI KNLGDSAIRN IIENRNSLGI FKSLADLCDR LPSNVLNKRS LESLIHCGAL DEFSIDNNRA QLLSDLENVI EWASSRNRDR LSGQGNLFDS KEEFSNVAFS DSQLAKVEDY SLIEKLKLEK QLLGFYLSDH PLKHLTKPAK LISPISISHL EETKDRTKVS LVGMIPDLKQ ITTRKGDRMA IVQLEDLSGS CEAIVFPKTY VRLSEFLLTD TRLLVWGTID KKSDKTQLII DDCREIDNLK LLIINLESSQ ASDVRVQNTL RNCLIKFKPD KGRCGIKIPV LAAVRNKNSV TYVKFGEQFC IGDIQGACKL LEDKSFKVNL KSLVS
|
| |