Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_06991 |
Symbol | dnaE |
ID | 5730679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 611194 |
End bp | 614709 |
Gene Length | 3516 bp |
Protein Length | 1171 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641285062 |
Product | DNA polymerase III subunit alpha |
Protein accession | YP_001550584 |
Protein GI | 159903240 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.232183 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTTTG TACCTCTTCA TAATCACAGC GATTACAGTC TTCTTGATGG AGCTAGTCAG CTGCCAGCAA TGGTTTCCAG AGCCAAGGCG CTTGGATTTC CAGCTATTGC GCTAACTGAT CATGGAGTGA TGTATGGCGC TATTGAACTT TTAAAGCTTT GTAAGTCAGA AGGTATAAAA CCAATCATTG GGAATGAAAT GTATGTTGTT AATGGATCAA TAGAAGACCC TCAGCCGAAA AAAGAACGAA GATATCACCT TGTTGTACTT GCTAAGAATG ACGTTGGATA TAGGAATTTA GTCAAGCTAA CTACTATTAG CCATTTAAAT GGGATGAGAG GACGAGGAAT TTTTTCAAGA GCATGTATTG ATAAACAATT ACTTGAAACT TATAAAGAAG GCTTGATAAT TTCGACTGCA TGTCTTGGTG GCGAGATTCC TCAGGCTATT TTGCGAGACC GTATTGATGT GGCAAAGGAT GTTGCTAGTT GGTATAAAAA AGTTTTTGGG GATGATTTTT ATTTAGAGAT TCAGGACCAT GGATCTGTAG AAGATCGAAT AGTCAATACA GGGATATGTC GTATTGCTAA AGAATTAAAT ATTGAACTGA TTGCAACAAA TGATGCTCAC TACCTTACTA AAGATGATGT GGAAGCACAT GATGCGTTGT TATGTGTACT TACAGGGAAA TTGATAAGTG AAGAAAAACG TTTGAGATAT ACAGGAACAG AATATCTCAA ATCTGAAGAA GAGATGGGAA AGCTTTTCTC AGACCATATC GAGAGTAATA TTATACAAGA AGCAATAAAT AATACTGTTA CTGTCTCCGA AAAAGTAGAA GAATACAATA TTTTAGGAAC TTATAAAATG CCTCAGTTCC CTGTGCCTGA TGGTGGTCAG TCAACTGATT ATTTAAGGAA GGTATCTCAA GATGGATTGA TGACTAGACT TAAGCTTGAA TCACTAGAGA TGATTGAAAA TAAATATATA AATCGCCTTA ATAGTGAAAT AAAAATAATA GAACAGATGG GTTTCCCAGA CTACTTCCTT GTTGTATGGG ATTATATTCG TTTTGCAAGA GAAAATCATA TACCAGTTGG TCCAGGGAGG GGCTCTGCAG CAGGATCCCT CGTTGCATAT TCTTTGGGTA TTACAAATAT TGATCCTGTA ACAAATGGTC TTTTATTTGA AAGATTTTTA AACCCTGAAA GGAAATCTAT GCCTGATATA GATACTGACT TTTGTATTGA ACGTAGGGGT GAAGTTATTG ATTATGTTAC GAAGCGCTAT GGAGAAGATA AGGTTGCGCA AATTATTACT TTCAACCGAA TGACATCTAA AGCTGTTCTT AAAGATGTTG CAAGAGTTTT GGATATTCCA TATAGTGACG CAGATAGATT AGCAAAGTTA ATTCCTGTAG TCAGAGGTAA GCCAGCTAAA CTTTCTCAAA TGATTGGGGA TAATACTCCC AGTAAAGATT TTAGAGAAAA ATACCAAAAT GATCCTTTGG TAAAGAAGTG GTTGGATATG GCAATCAGAA TAGAAGGTAC TAATAAGACT TTTGGAGTTC ATGCAGCAGG TGTTGTTATT GCCTCGGACC CCTTGGATAA TTTAGTTCCT CTTCAAAGGA ATAATGATGG CCAAATAATT ACTCAATATT TTATGGAAGA TATTGAATCA TTAGGATTAC TTAAAATGGA TTTTTTAGGT CTTAAGAATC TTACTATGAT TGAAAAGGCA GTAACTTTAG TCGAAGATTC TTTGGGGGAA AAGCTTGATT TAGATCAATT AAATATGGAC GATACTAAAA CTTATGAGCT CTTATCAAAA GGCGATTTAG AGGGAATTTT TCAACTTGAG TCAACTGGAA TGAGACAAAT AGTTAAAGAT CTTAGGCCAT CTTCTTTGGA AGATATTTCT TCAATTCTTG CGTTGTATAG ACCAGGTCCA CTTGATGCAG GATTAATTCC AAAATTTATT AATCGAAAGC ATGGGAAAGA GCAGATTGAT TTTCCCCATG CTTCTTTAGC ACCAATACTT GGAGAGACAT ACGGCATAAT GCTTTATCAA GAGCAAATAA TGAAAATTGC CCAAGAACTG GCTGGTTATT CGTTAGGCCA GGCAGATCTT TTAAGAAGGG CTATGGGTAA GAAAAAGGTT GCGGAGATGG AAAAACATAG AAACTTTTTT CTTGAAGGCG CTAGTAAAAA TGGAATTAAT TCGAATATAG CCAATGAATT ATTTGAGCAA ATGCTCCTTT TTGCGGAGTA CTGCTTTAAC AAGAGTCACT CCACTGCTTA TGGAGCAGTT ACTTTCCAAA CAGCTTACTT AAAAGCTCAT TACCCAGTTG CATATATGGC TGCTCTGCTT ACTGTAAATG CTGGGTCTAG TGACAAAGTT CAACGCTATA TATCTAACTG TAATTCCATG GGCATAGAAG TTATGCCGCC AGATGTTAAC TCTTCGGGAA TAGATTTTAC TCCCAATGAA AATCACATTC TGTTTGGAAT GTCTGCCGTG AAAAACCTTG GTGATGGTGC TATTCGTGAA TTAATAAAAT CTCGTGAAGA AGATGGCTCT TTTATTTCCT TGGCAGATCT TTGTGATCGA ATTCCACCAA ATACCCTTAA CAGAAGAGGA TTAGAGTCTT TAATTCATTC TGGTGCGCTT GATTCCTTTG ATAAGAAGGC AAATAGGGCT CAGTTGTTAG CAGACCTTGA TCTGATTATT GAGTGGGCGA CTTCTAGAGC GCGAGATCGT ATTAGTGGTC AGGGAAACTT GTTTGATCTG GCATCTTCTT CTTCCGAGAA TCAAACATCA AACAGCCTTC ACACTGCTCC TAAAGCAGCT CCTGTAAGCG ACTATTCTCC TACAGAAAAG CTTCGCCTTG AGAAGGAACT CATTGGCTTT TATCTTTCTG ATCATCCACT TAAGCAACTC TCTGAGCCAG CCAAGCTTAT TGCTCCAATA AGCTTAGGGA CATTAGAAGA TCAACGGGAT AAGTCAAAAG TAAGTGTCAT TGCCATGATT AACGATATGA GAGTAGTAAC TACTCGCAAG GGAGATAAAA TGGCTATCCT TCAAATTGAG GATTTAACAG GCTCATGCGA AGCTGTTGTT TTCCCTAAGA GTTATCACAG ACTTTCAGAT CATCTGATTT CTGAGACACG TTTATTAGTT TGGGCATCAG TTGATAGAAG GGATGATAAT ACTCAATTAA TTGTTGATGA TTGCCGCTCA ATAGATGACA TGAGATTTGT TTTAGTTGAC TTATTGCCTG ATCAAATTTC TAATATTGAT TATCAATATC GGCTTAGAGA ATGTTTAAAT AATCATCGTC CTGCAAGAGA TGAGCTGGGC GTAAGAGTTC CTGTAGTTGC TGTAATTAGA GATGGTAGCA ATATTAAATA TATTCGTTTG GGTCATCAAT TTTGTGTTAA GGATGCAGCT GCGGCAGTTA AGTCTTTGCA GAATAGTTCT TTTAAAGCCA GTTTTAGTGA GAGTTTGGTT AACTAA
|
Protein sequence | MGFVPLHNHS DYSLLDGASQ LPAMVSRAKA LGFPAIALTD HGVMYGAIEL LKLCKSEGIK PIIGNEMYVV NGSIEDPQPK KERRYHLVVL AKNDVGYRNL VKLTTISHLN GMRGRGIFSR ACIDKQLLET YKEGLIISTA CLGGEIPQAI LRDRIDVAKD VASWYKKVFG DDFYLEIQDH GSVEDRIVNT GICRIAKELN IELIATNDAH YLTKDDVEAH DALLCVLTGK LISEEKRLRY TGTEYLKSEE EMGKLFSDHI ESNIIQEAIN NTVTVSEKVE EYNILGTYKM PQFPVPDGGQ STDYLRKVSQ DGLMTRLKLE SLEMIENKYI NRLNSEIKII EQMGFPDYFL VVWDYIRFAR ENHIPVGPGR GSAAGSLVAY SLGITNIDPV TNGLLFERFL NPERKSMPDI DTDFCIERRG EVIDYVTKRY GEDKVAQIIT FNRMTSKAVL KDVARVLDIP YSDADRLAKL IPVVRGKPAK LSQMIGDNTP SKDFREKYQN DPLVKKWLDM AIRIEGTNKT FGVHAAGVVI ASDPLDNLVP LQRNNDGQII TQYFMEDIES LGLLKMDFLG LKNLTMIEKA VTLVEDSLGE KLDLDQLNMD DTKTYELLSK GDLEGIFQLE STGMRQIVKD LRPSSLEDIS SILALYRPGP LDAGLIPKFI NRKHGKEQID FPHASLAPIL GETYGIMLYQ EQIMKIAQEL AGYSLGQADL LRRAMGKKKV AEMEKHRNFF LEGASKNGIN SNIANELFEQ MLLFAEYCFN KSHSTAYGAV TFQTAYLKAH YPVAYMAALL TVNAGSSDKV QRYISNCNSM GIEVMPPDVN SSGIDFTPNE NHILFGMSAV KNLGDGAIRE LIKSREEDGS FISLADLCDR IPPNTLNRRG LESLIHSGAL DSFDKKANRA QLLADLDLII EWATSRARDR ISGQGNLFDL ASSSSENQTS NSLHTAPKAA PVSDYSPTEK LRLEKELIGF YLSDHPLKQL SEPAKLIAPI SLGTLEDQRD KSKVSVIAMI NDMRVVTTRK GDKMAILQIE DLTGSCEAVV FPKSYHRLSD HLISETRLLV WASVDRRDDN TQLIVDDCRS IDDMRFVLVD LLPDQISNID YQYRLRECLN NHRPARDELG VRVPVVAVIR DGSNIKYIRL GHQFCVKDAA AAVKSLQNSS FKASFSESLV N
|
| |