Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_15831 |
Symbol | dnaE |
ID | 4775941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1386587 |
End bp | 1390102 |
Gene Length | 3516 bp |
Protein Length | 1171 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640087092 |
Product | DNA polymerase III subunit alpha |
Protein accession | YP_001017592 |
Protein GI | 124023285 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTCG TTCCTCTTCA CAACCACAGC GACTACAGCC TTCTGGACGG AGCTACGCAG CTCCCCCAGA TGGTGAAGCG AGCCAAGGAG CTAGGCATGC CTGCTCTGGC ACTCACGGAC CACGGTGTGA TGTATGGCGC CATTGAACTG CTGAAGCTTT GCAAGAACGC TGAGATCAAG CCGATCATCG GCAATGAGAT GTATGTGATC AACGGGTCCA TTGAGGATCC ACAACCGAAG AAGGAGCGTC GTTATCACCT GGTGGTGCTT GCTAAGAACG CTGTTGGCTA TCGCAATCTT GTGAAACTCA CCAGCATTAG CCATCTGCGC GGTATGCGCG GCCGAGGCAT CTTTGCAAGG CCATGCATTG ATAAAGAACT GCTCAAGGCC TATAGCGAGG GGTTAATTGT TGCCACCGCT TGTCTTGGTG GGGAGATTCC TCAAGCCATC TTGCGCGGTC GCATTGATGT AGCAAGGGAT GTGGCCCGTT GGTACCAGGA GGTCTTTGGC AAAGACTTCT ATCTCGAAGT TCAGGATCAC GGCTCGCCGG AAGACCGAAT CGTCAATGTG GAGATTGTGA GCATCGCCAA AGAGCTAGGG ATTGAGCTGA TTGCGACCAA TGACGCCCAT TACCTCAGCA AGAACGATGT GGAGGCCCAT GACGCCCTGC TTTGTGTGCT GACAGGCAAG TTGATCAGTG ATGAGAAGCG TCTGCGCTAC ACAGGTACTG AATACATCAA GTCTGAACAG GAGATGGGAC GGCTGTTCGC CGATCATCTC GAGCCTGATG TGCTGCAAGA GGCGATTGCC AACACAGCAG CCGTTGCCGA AAAGGTTGAG GAATACTCAA TCCTCGGTAG TTATCAGATG CCTCGTTTCC CGATTCCAGA AGGACATAGC GCCGTGAGTT ATCTACACGA GGTCTCAGAG CAGGGGCTAC GGCAAAGGTT GAAACTGGCA ACGGCAGATC CAATTGATGA CCATTACGGC GAAAGGCTCA CCTATGAGCT GGGTGTGATG GAACAGATGG GCTTCCCCAC CTATTTCCTG GTGGTATGGG ATTACATCCG CTTTGCACGT GAACAGGGCA TTCCAGTAGG ACCAGGAAGG GGGTCGGCCG CTGGCTCACT CGTGGCATAT GCCCTTGGTA TTACCAACAT TGATCCTGTT CAAAACGGAT TGTTATTTGA GCGATTCCTT AATCCTGAGC GTAAGTCGAT GCCTGATATT GACACTGATT TCTGTATTGA TCGTCGTGGT GAGGTGATCG ACTATGTCAC GCGTCGTTAC GGCGAAGACA AGGTTGCTCA AATTATCACT TTTAACCGAA TGACCTCCAA GGCCGTCTTG AAGGATGTAG CCCGGGTGCT TGATATTCCC TATGGAGATG CCGATCGACT TGCCAAGCTT ATTCCCGTAG TAAGGGGAAA GCCTGCAAAG CTAGCTGCGA TGATCGGCAG CGATTCGCCG AATGCTGAAT TCCGTGAGAA GTATCAGAAC GATCCAGTAG TTACAAAATG GGTCGATATG GCAATGCGAA TTGAAGGTAC AAATAAAACC TTTGGCGTTC ATGCCGCTGG AGTCGTCATT GCTGCTGAGC CCCTAGATAA TCTTGTACCG CTTCAGCGTA ATAACGATGG ACAGGTAATT ACTCAATACT TCATGGAGGA TGTGGAGTCG ATGGGGTTAT TGAAGATGGA TTTTCTTGGG CTCAAGAATC TCACCATGAT TGACAAAACA CTTGAGCTTG TTGAAATCAG CAATGGAGAG AGAATTGATC CTGATCAATT GCCACCAGAG GATCCTGAAA CTTTTGCCTT ACTTGCAAGA GGAGATCTTG AGGGCATCTT TCAACTTGAA TCGAGTGGGA TGAGACAGAT TGTGCGTGAC CTTCGCCCCT CATCACTTGA AGATATCTCC TCAATTTTAG CTTTGTACAG ACCAGGTCCT TTGGATGCCG GATTGATTCC AAAATTCATC AATCGGAAAC ATGGTCGAGA GGCAATTGAT TTTGCTCATG CTGCCCTTGA ACCAATCCTT AAGGAGACTT ACGGGATCAT GGTTTATCAG GAGCAGATCA TGAAGATTGC CCAGGATCTT GCTGGCTATT CTCTCGGCGA AGCTGACTTG CTGCGGCGTG CAATGGGCAA GAAAAAGGTT TCAGAGATGC AGAAACATCG CAGTATTTTT GTTGAAGGTG CAAGTCGAAG TGGTGTTGAT AAGAAGATCG CCGATGAGCT TTTCGACCAA ATGGTTTTGT TCGCCGAATA TTGCTTCAAC AAGAGTCACT CAACGGCTTA TGGCGCTGTT ACTTATCAAA CTGCCTATTT AAAGGCACAT TATCCAGTTG CCTATATGGC GTCATTACTG ACAGTCAATG CTGGCGCTAG TGACAAGGTG CAGCGCTATA TCTCGAATTG CAATGCGATG GGGATTGAAG TGATGCCGCC AGATGTGAAT GCTTCAGGGA TTGATTTCAC CCCTGCTGGT GATCGCATCT TGTTTGGTCT TTCTGCTGTG AGAAATCTTG GCGATGGTGC AATCAGGCAG CTAATTGCCA ATCGCGATGG TGATGGCCCC TTTGTCTCCC TTGCCGATCT CTGTGATCGT CTGCCCTCCA ATGTTCTGAA TCGTCGCGGG TTGGAATCTC TTATTCATTG CGGAGCCCTA GATGCCATAG ACCCTGAATC GAACCGGGCC CAGTTAATTG CCGACTTGGA GCTTCTGATC AACTGGGCTG CTTCTCGTGC CCGTGATCGA CTAAGTGGTC AGGGCAACCT ATTTGATCTT GTGGCTGGAG CAGCAGACGA GCAAACGTCT GATGAGCTGA GCACTGCACC CAAGGCAGCA CCGGTTCCCG ACTACCCACC GACTGAAAAG CTGAGACTTG AAAAAGAGTT GGTTGGTTTC TACCTTTCTG ATCACCCTCT CAAGCAGCTC ACTGCTCCAG CTCAATTGCT GGCGCCCATT GGTCTTGCCA GCCTTGAGGA TCAGCCTGAC AAGGCGAAGG TCAGTGTGAT CACGATGCTG ACGGAGATGC GCCAAGTCAC AACCCGCAAG GGCGATCGCA TGGCAGTTCT CAAGATTGAG GATCTCACCG GTGGTTGCGA AGCTGTGGTG TTCCCAAAAA GCTATGCCCG TCTATCAGAT CACCTCATGT TGGAAGCGCG ACTGCTTATC TGGGCCTCTG TTGATCGTCG CGACGACCGT ATCCAATTGA TCATTGATGA TTGCCGCGCC ATCGATGACC TACGACTGCT GTTGGTGGAG TTGATGCCTG ATGAAGCCTG TGACATCACT GTGCAGCACA AGCTTCGGGA ATGTCTCCAT CAGCATCGCC CAGCCAAGGA TGAATTTGGC GTGCGGGTAC CCGTGGTGGC AGCGGTTCGC CAGGGTCCCC AGGTACGTTA CGTATGTCTA GGTCATCAGT TCTGCGTTCG TGACGCTTCT GCTGCACTCA GTTCCCTTCA ACAGCAGGAA TTCAAAGCTC GATGCAGCGA CCGACTATTT GTCTGA
|
Protein sequence | MAFVPLHNHS DYSLLDGATQ LPQMVKRAKE LGMPALALTD HGVMYGAIEL LKLCKNAEIK PIIGNEMYVI NGSIEDPQPK KERRYHLVVL AKNAVGYRNL VKLTSISHLR GMRGRGIFAR PCIDKELLKA YSEGLIVATA CLGGEIPQAI LRGRIDVARD VARWYQEVFG KDFYLEVQDH GSPEDRIVNV EIVSIAKELG IELIATNDAH YLSKNDVEAH DALLCVLTGK LISDEKRLRY TGTEYIKSEQ EMGRLFADHL EPDVLQEAIA NTAAVAEKVE EYSILGSYQM PRFPIPEGHS AVSYLHEVSE QGLRQRLKLA TADPIDDHYG ERLTYELGVM EQMGFPTYFL VVWDYIRFAR EQGIPVGPGR GSAAGSLVAY ALGITNIDPV QNGLLFERFL NPERKSMPDI DTDFCIDRRG EVIDYVTRRY GEDKVAQIIT FNRMTSKAVL KDVARVLDIP YGDADRLAKL IPVVRGKPAK LAAMIGSDSP NAEFREKYQN DPVVTKWVDM AMRIEGTNKT FGVHAAGVVI AAEPLDNLVP LQRNNDGQVI TQYFMEDVES MGLLKMDFLG LKNLTMIDKT LELVEISNGE RIDPDQLPPE DPETFALLAR GDLEGIFQLE SSGMRQIVRD LRPSSLEDIS SILALYRPGP LDAGLIPKFI NRKHGREAID FAHAALEPIL KETYGIMVYQ EQIMKIAQDL AGYSLGEADL LRRAMGKKKV SEMQKHRSIF VEGASRSGVD KKIADELFDQ MVLFAEYCFN KSHSTAYGAV TYQTAYLKAH YPVAYMASLL TVNAGASDKV QRYISNCNAM GIEVMPPDVN ASGIDFTPAG DRILFGLSAV RNLGDGAIRQ LIANRDGDGP FVSLADLCDR LPSNVLNRRG LESLIHCGAL DAIDPESNRA QLIADLELLI NWAASRARDR LSGQGNLFDL VAGAADEQTS DELSTAPKAA PVPDYPPTEK LRLEKELVGF YLSDHPLKQL TAPAQLLAPI GLASLEDQPD KAKVSVITML TEMRQVTTRK GDRMAVLKIE DLTGGCEAVV FPKSYARLSD HLMLEARLLI WASVDRRDDR IQLIIDDCRA IDDLRLLLVE LMPDEACDIT VQHKLRECLH QHRPAKDEFG VRVPVVAAVR QGPQVRYVCL GHQFCVRDAS AALSSLQQQE FKARCSDRLF V
|
| |