Gene Information |
Locus tag | NATL1_09341 |
Symbol | dnaE |
ID | 4780080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 861391 |
End bp | 864909 |
Gene Length | 3519 bp |
Protein Length | 1172 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640084211 |
Product | DNA polymerase III subunit alpha |
Protein accession | YP_001014757 |
Protein GI | 124025641 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
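The coordinate, gene-length, and protein-length fields above are related by simple arithmetic. As a minimal sketch, they can be cross-checked as shown below. The helper names `gene_length` and `protein_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(861391, 864909)      # Start bp / End bp from the record
    print(length, protein_length(length))     # compare with Gene Length / Protein Length
```

With the Start bp and End bp values listed above, this gives 3519 bp and 1172 aa, which agrees with the Gene Length and Protein Length fields.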
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.146562 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTTG TTCCTATTCA TAACCATAGT GACTACAGCC TTCTTGATGG AGCCAGTCAA CTCCCTTTAA TGGTTCAACG GGCAAAGGAA TTGGGGATGC CAGCTCTGGC TCTGACTGAT CATGGAGTAA TGTATGGCGC GATCGAATTA TTGAAGTTAT GTAAGGCCGC GAATATAAAG CCAATTATTG GGAATGAGAT GTACGTTATC AATGGTTCAA TTGATGACCC TCAGCCCAAA AAGGAAAAAA GATATCATCT TGTTGTCGTA GCTAAAAACC AAATTGGTTA TGAAAATCTC GTAAAGTTAA CTACGCTTAG TCATTTAAAC GGTGTTAGAG GAAGAGGGAT TTTTTCAAGA CCTTGTATAG ATAAGTATTT ATTCAAAAAA TATAGCGAGG GGTTGATATG TTCAACAGCT TGCTTAGGTG GTGAAATTCC ACAAGCGATT TTAAAAGGAA GAATTGACGT TGCTAGAGAA GTAGCAGCTT GGTATAAAGA AGTTTTGGGT GATGATTTTT ATCTTGAAAT TCAAGACCAT GGATCAATTG AGGATAGAAT TGTTAATAGT GAAATAGTCA AAATATCCGA AGAACTTGAT ATTAAAATTA TTGCTACCAA TGATGCACAT TATTTATCAA AGAATGATAT TGAAGCTCAT GATGCATTGA TTTGTGTTTT GACTGGAAAG TTAATAAGTG ATCACAAAAG ATTGAGATAT ACAGGGACTG AATATATTAA ATCTGAGGAT GAAATGAGAA GTTTATTTAC TGATCATTTA GACAAAAATG TCATAAACAG TGCAATAGAA AATACAGTTA AACTATCAAA TAAAGTTGAA GAATATAAGA TATTAGGCAC TTATAAGATG CCTAATTTTC CTATACCTGA TGGTTATCAA CCAATTGAAT ATCTTAAAGA GATAACTATC AAAGGTTTAC TAGAAATTTT AGATATTTCT AAATTTGAAA ATCTTCCAAT CACATATAAA GAACGACTTG ATTATGAGTT GAAAGTAATA GAACAAATGG GGTTTCCTAC ATATTTCCTT GTTGTATGGG ATTATATAAG ATTTGCAAGA GAGCAAAATA TTCCTGTAGG CCCAGGTAGA GGATCAGCAG CTGGCTCTTT AGTCGCTTTT TCTCTTCATA TAACTAATAT TGACCCAGTA GAGAATGGTT TGTTATTTGA AAGATTTCTC AATCCTGAGA GAAAGTCAAT GCCTGATATT GATACTGATT TTTGTATTGA AAGACGTGGC GAAGTTATAG ATTATGTAAC TAAAAAGTAT GGTGAAGATA AAGTTGCACA GATAATTACA TTTAACAGAA TGACATCTAA GGCTGTTTTG AAAGATGTTG CTCGTGTCCT TGATATTCCC TATGGAGATG CAGACCGATT AGCGAAATTA ATTCCAGTTG TGAGGGGAAA GCCTGCGAAA TTGGCAGCTA TGATTTCTAA AGAATCGCCA AATAAAGATT TCTATGAAAA ATACAATAAT GATTCAAAAG TAAAGAAATG GGTTGATATG GCAATGAGGA TAGAAGGGAC AAATAAGACT TTTGGTGTTC ATGCAGCAGG TGTAGTTATT GCTGCTAATT CACTTGATAA TTTAGTTCCT CTTCAAAGAA ACAATGATGG ACAAATAATT ACTCAATATT TTATGGAAGA TATTGAATCA CTTGGACTTT TGAAGATGGA CTTTTTAGGA CTTAGAAATC TTACAATGAT CGAAAAGACA ATTGATTTAG TTGAGAAATC AATTGGTAAG AGATTAGATC CTGATTCTTT GCCTTTCACA 
GATGAAAAAA CATTCGAACT TCTTTCTAGG GGTGATTTAG AAGGAATTTT CCAACTTGAA TCTAGTGGAA TGAGACAAAT AGTAAAAGAT CTAAAGCCTT CATCTCTTGA GGATATTTCT TCAATTCTTG CTCTTTATCG TCCAGGTCCT CTTGATGCAG GATTGATTCC TAAATTTATA AATAGAAAAC ATGGGAAGGA GAGTATTGAT TTTCAACATC AATCACTTGA GCCAATTTTA AGTGAGACTT ATGGAATCAT GGTTTATCAA GAGCAGATCA TGAAGATTGC ACAGGATTTA GCCGGATATA CGCTTGGGCA AGCAGATTTA TTGAGAAGGG CAATGGGTAA GAAAAAAGTA TCCGAGATGC AGCGCCATAG AACGCTCTTT GTTGATGGAG CTGTTAAAAA TGGTGTCACA GATGTCATCG CTGAACAGTT ATTTGATCAA ATGGTTTTAT TTGCTGAATA CTGCTTTAAC AAAAGTCATT CAACTGCTTA TGGAGCGGTT ACTTATCAGA CTGCTTATTT AAAAGCACAT TATCCTGTCG CTTATATGGC GGCATTGCTT ACCGTTAATG CTGGATCGGC TGACAAGATT CAAAGATATA TTTCTAACTG TAATTCAATG GGCATAAACG TAATGCCTCC AAATATCAAT ACCTCTGGTG TTGATTTCAC TCCAAAAGAT AATTCAATTC TTTTTGGTTT TTCGGCTGTC AAAAATTTAG GTGATGGTGC AATTAGAAAA ATTATCACCT CTAGAGATGA AGATGGACAA TTTACTTCTT TAGCACAATT CTGTGATCGA ATTTCACTTG GTTCCGTTAA CCGAAGAGGT CTTGAGGCGT TGATACATAG TGGAGCGCTT GATTGTCTTG AAAAAAATGC AAATCGTGCT CAGCTTATTG CTGATTTGGA TTTAACTATT GAATGGGCTT CTTCTAGAGC AAAAGATAGA ACGAGCGGTC AAGGTAATCT CTTCGATTTA TCTAATTCCA CAAATAATGA ATCATCACCG AATGATGATT ATTCATCAGC TCCAAAGGCG AAAGAAGTCC AAGAGTATCT TCCTTCAGAC AAACTTAAAT TAGAAAAAGA GCATGTTGGA TTCTATCTAT CTGATCATCC TTTGAAGCAA CTTTCAGAAC CAGCAAAATT GATTGCTCCT ATCAGCCTTA GTTCTTTAGA AGAGCAAAAA GATAAGTCAA AGGTTAGTGT TATTGCAATG ATTCCAGAAA TGAGAGAAGT CACAACTAGA AAAGGTGACA GGATGGCAAT TATTCAATTA GAGGATTTAA CTGGTTCTTG TGAAGCTGTT GTTTTCCCAA AAAGCTATGA ACGATTATCA GATCATTTGA TGGTTGAAAC CAGGTTATTG ATATGGGGCA GCGTGGACAG GAGAGATGAA ACTGTTCAAT TGCTTATTGA TGATTGTCGT GAAATTGATG ACTTAAGATT TCTCTTGATT GATCTTCGTC CTGATCAAGC TACAGATATC AATATTCAGC ATAAATTAAG AGAATGCCTT TCTAAAAACA GACCTAACAG AAATGAATTA GGTGTACGTA TCCCAGTAGT AGCATGTCTA AAGGACAACA CTAATACTAG GTATGTAAGG TTGGGCGATC AATTTTGCGT TAAGGATGCA GACCTGGCTT TGGAGGCATT ATCTAAGAAT TCCTTCATTG CAAGATCAAG TAAAAGCCTC GTAATTTAA
|
Protein sequence | MAFVPIHNHS DYSLLDGASQ LPLMVQRAKE LGMPALALTD HGVMYGAIEL LKLCKAANIK PIIGNEMYVI NGSIDDPQPK KEKRYHLVVV AKNQIGYENL VKLTTLSHLN GVRGRGIFSR PCIDKYLFKK YSEGLICSTA CLGGEIPQAI LKGRIDVARE VAAWYKEVLG DDFYLEIQDH GSIEDRIVNS EIVKISEELD IKIIATNDAH YLSKNDIEAH DALICVLTGK LISDHKRLRY TGTEYIKSED EMRSLFTDHL DKNVINSAIE NTVKLSNKVE EYKILGTYKM PNFPIPDGYQ PIEYLKEITI KGLLEILDIS KFENLPITYK ERLDYELKVI EQMGFPTYFL VVWDYIRFAR EQNIPVGPGR GSAAGSLVAF SLHITNIDPV ENGLLFERFL NPERKSMPDI DTDFCIERRG EVIDYVTKKY GEDKVAQIIT FNRMTSKAVL KDVARVLDIP YGDADRLAKL IPVVRGKPAK LAAMISKESP NKDFYEKYNN DSKVKKWVDM AMRIEGTNKT FGVHAAGVVI AANSLDNLVP LQRNNDGQII TQYFMEDIES LGLLKMDFLG LRNLTMIEKT IDLVEKSIGK RLDPDSLPFT DEKTFELLSR GDLEGIFQLE SSGMRQIVKD LKPSSLEDIS SILALYRPGP LDAGLIPKFI NRKHGKESID FQHQSLEPIL SETYGIMVYQ EQIMKIAQDL AGYTLGQADL LRRAMGKKKV SEMQRHRTLF VDGAVKNGVT DVIAEQLFDQ MVLFAEYCFN KSHSTAYGAV TYQTAYLKAH YPVAYMAALL TVNAGSADKI QRYISNCNSM GINVMPPNIN TSGVDFTPKD NSILFGFSAV KNLGDGAIRK IITSRDEDGQ FTSLAQFCDR ISLGSVNRRG LEALIHSGAL DCLEKNANRA QLIADLDLTI EWASSRAKDR TSGQGNLFDL SNSTNNESSP NDDYSSAPKA KEVQEYLPSD KLKLEKEHVG FYLSDHPLKQ LSEPAKLIAP ISLSSLEEQK DKSKVSVIAM IPEMREVTTR KGDRMAIIQL EDLTGSCEAV VFPKSYERLS DHLMVETRLL IWGSVDRRDE TVQLLIDDCR EIDDLRFLLI DLRPDQATDI NIQHKLRECL SKNRPNRNEL GVRIPVVACL KDNTNTRYVR LGDQFCVKDA DLALEALSKN SFIARSSKSL VI
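The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, and it assumes the standard codon-to-amino-acid assignments, which table 11 shares with table 1 (the two differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). Whitespace in the grouped sequence is ignored.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; spaces and newlines are dropped."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon, if any
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    print(translate("ATGGCTTTTG TTCCTATTCA TAACCATAGT"))  # MAFVPIHNHS
    # Last nine bases of the gene sequence, ending in the stop codon.
    print(translate("GTAATTTAA"))  # VI*
```

The first call reproduces the first ten residues of the protein sequence above, and the second shows that the final TAA stop codon follows the last two residues (VI).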