Gene P9303_15831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_15831 
SymboldnaE 
ID4775941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1386587 
End bp1390102 
Gene Length3516 bp 
Protein Length1171 aa 
Translation table11 
GC content51% 
IMG OID640087092 
ProductDNA polymerase III subunit alpha 
Protein accessionYP_001017592 
Protein GI124023285 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTCG TTCCTCTTCA CAACCACAGC GACTACAGCC TTCTGGACGG AGCTACGCAG 
CTCCCCCAGA TGGTGAAGCG AGCCAAGGAG CTAGGCATGC CTGCTCTGGC ACTCACGGAC
CACGGTGTGA TGTATGGCGC CATTGAACTG CTGAAGCTTT GCAAGAACGC TGAGATCAAG
CCGATCATCG GCAATGAGAT GTATGTGATC AACGGGTCCA TTGAGGATCC ACAACCGAAG
AAGGAGCGTC GTTATCACCT GGTGGTGCTT GCTAAGAACG CTGTTGGCTA TCGCAATCTT
GTGAAACTCA CCAGCATTAG CCATCTGCGC GGTATGCGCG GCCGAGGCAT CTTTGCAAGG
CCATGCATTG ATAAAGAACT GCTCAAGGCC TATAGCGAGG GGTTAATTGT TGCCACCGCT
TGTCTTGGTG GGGAGATTCC TCAAGCCATC TTGCGCGGTC GCATTGATGT AGCAAGGGAT
GTGGCCCGTT GGTACCAGGA GGTCTTTGGC AAAGACTTCT ATCTCGAAGT TCAGGATCAC
GGCTCGCCGG AAGACCGAAT CGTCAATGTG GAGATTGTGA GCATCGCCAA AGAGCTAGGG
ATTGAGCTGA TTGCGACCAA TGACGCCCAT TACCTCAGCA AGAACGATGT GGAGGCCCAT
GACGCCCTGC TTTGTGTGCT GACAGGCAAG TTGATCAGTG ATGAGAAGCG TCTGCGCTAC
ACAGGTACTG AATACATCAA GTCTGAACAG GAGATGGGAC GGCTGTTCGC CGATCATCTC
GAGCCTGATG TGCTGCAAGA GGCGATTGCC AACACAGCAG CCGTTGCCGA AAAGGTTGAG
GAATACTCAA TCCTCGGTAG TTATCAGATG CCTCGTTTCC CGATTCCAGA AGGACATAGC
GCCGTGAGTT ATCTACACGA GGTCTCAGAG CAGGGGCTAC GGCAAAGGTT GAAACTGGCA
ACGGCAGATC CAATTGATGA CCATTACGGC GAAAGGCTCA CCTATGAGCT GGGTGTGATG
GAACAGATGG GCTTCCCCAC CTATTTCCTG GTGGTATGGG ATTACATCCG CTTTGCACGT
GAACAGGGCA TTCCAGTAGG ACCAGGAAGG GGGTCGGCCG CTGGCTCACT CGTGGCATAT
GCCCTTGGTA TTACCAACAT TGATCCTGTT CAAAACGGAT TGTTATTTGA GCGATTCCTT
AATCCTGAGC GTAAGTCGAT GCCTGATATT GACACTGATT TCTGTATTGA TCGTCGTGGT
GAGGTGATCG ACTATGTCAC GCGTCGTTAC GGCGAAGACA AGGTTGCTCA AATTATCACT
TTTAACCGAA TGACCTCCAA GGCCGTCTTG AAGGATGTAG CCCGGGTGCT TGATATTCCC
TATGGAGATG CCGATCGACT TGCCAAGCTT ATTCCCGTAG TAAGGGGAAA GCCTGCAAAG
CTAGCTGCGA TGATCGGCAG CGATTCGCCG AATGCTGAAT TCCGTGAGAA GTATCAGAAC
GATCCAGTAG TTACAAAATG GGTCGATATG GCAATGCGAA TTGAAGGTAC AAATAAAACC
TTTGGCGTTC ATGCCGCTGG AGTCGTCATT GCTGCTGAGC CCCTAGATAA TCTTGTACCG
CTTCAGCGTA ATAACGATGG ACAGGTAATT ACTCAATACT TCATGGAGGA TGTGGAGTCG
ATGGGGTTAT TGAAGATGGA TTTTCTTGGG CTCAAGAATC TCACCATGAT TGACAAAACA
CTTGAGCTTG TTGAAATCAG CAATGGAGAG AGAATTGATC CTGATCAATT GCCACCAGAG
GATCCTGAAA CTTTTGCCTT ACTTGCAAGA GGAGATCTTG AGGGCATCTT TCAACTTGAA
TCGAGTGGGA TGAGACAGAT TGTGCGTGAC CTTCGCCCCT CATCACTTGA AGATATCTCC
TCAATTTTAG CTTTGTACAG ACCAGGTCCT TTGGATGCCG GATTGATTCC AAAATTCATC
AATCGGAAAC ATGGTCGAGA GGCAATTGAT TTTGCTCATG CTGCCCTTGA ACCAATCCTT
AAGGAGACTT ACGGGATCAT GGTTTATCAG GAGCAGATCA TGAAGATTGC CCAGGATCTT
GCTGGCTATT CTCTCGGCGA AGCTGACTTG CTGCGGCGTG CAATGGGCAA GAAAAAGGTT
TCAGAGATGC AGAAACATCG CAGTATTTTT GTTGAAGGTG CAAGTCGAAG TGGTGTTGAT
AAGAAGATCG CCGATGAGCT TTTCGACCAA ATGGTTTTGT TCGCCGAATA TTGCTTCAAC
AAGAGTCACT CAACGGCTTA TGGCGCTGTT ACTTATCAAA CTGCCTATTT AAAGGCACAT
TATCCAGTTG CCTATATGGC GTCATTACTG ACAGTCAATG CTGGCGCTAG TGACAAGGTG
CAGCGCTATA TCTCGAATTG CAATGCGATG GGGATTGAAG TGATGCCGCC AGATGTGAAT
GCTTCAGGGA TTGATTTCAC CCCTGCTGGT GATCGCATCT TGTTTGGTCT TTCTGCTGTG
AGAAATCTTG GCGATGGTGC AATCAGGCAG CTAATTGCCA ATCGCGATGG TGATGGCCCC
TTTGTCTCCC TTGCCGATCT CTGTGATCGT CTGCCCTCCA ATGTTCTGAA TCGTCGCGGG
TTGGAATCTC TTATTCATTG CGGAGCCCTA GATGCCATAG ACCCTGAATC GAACCGGGCC
CAGTTAATTG CCGACTTGGA GCTTCTGATC AACTGGGCTG CTTCTCGTGC CCGTGATCGA
CTAAGTGGTC AGGGCAACCT ATTTGATCTT GTGGCTGGAG CAGCAGACGA GCAAACGTCT
GATGAGCTGA GCACTGCACC CAAGGCAGCA CCGGTTCCCG ACTACCCACC GACTGAAAAG
CTGAGACTTG AAAAAGAGTT GGTTGGTTTC TACCTTTCTG ATCACCCTCT CAAGCAGCTC
ACTGCTCCAG CTCAATTGCT GGCGCCCATT GGTCTTGCCA GCCTTGAGGA TCAGCCTGAC
AAGGCGAAGG TCAGTGTGAT CACGATGCTG ACGGAGATGC GCCAAGTCAC AACCCGCAAG
GGCGATCGCA TGGCAGTTCT CAAGATTGAG GATCTCACCG GTGGTTGCGA AGCTGTGGTG
TTCCCAAAAA GCTATGCCCG TCTATCAGAT CACCTCATGT TGGAAGCGCG ACTGCTTATC
TGGGCCTCTG TTGATCGTCG CGACGACCGT ATCCAATTGA TCATTGATGA TTGCCGCGCC
ATCGATGACC TACGACTGCT GTTGGTGGAG TTGATGCCTG ATGAAGCCTG TGACATCACT
GTGCAGCACA AGCTTCGGGA ATGTCTCCAT CAGCATCGCC CAGCCAAGGA TGAATTTGGC
GTGCGGGTAC CCGTGGTGGC AGCGGTTCGC CAGGGTCCCC AGGTACGTTA CGTATGTCTA
GGTCATCAGT TCTGCGTTCG TGACGCTTCT GCTGCACTCA GTTCCCTTCA ACAGCAGGAA
TTCAAAGCTC GATGCAGCGA CCGACTATTT GTCTGA
 
Protein sequence
MAFVPLHNHS DYSLLDGATQ LPQMVKRAKE LGMPALALTD HGVMYGAIEL LKLCKNAEIK 
PIIGNEMYVI NGSIEDPQPK KERRYHLVVL AKNAVGYRNL VKLTSISHLR GMRGRGIFAR
PCIDKELLKA YSEGLIVATA CLGGEIPQAI LRGRIDVARD VARWYQEVFG KDFYLEVQDH
GSPEDRIVNV EIVSIAKELG IELIATNDAH YLSKNDVEAH DALLCVLTGK LISDEKRLRY
TGTEYIKSEQ EMGRLFADHL EPDVLQEAIA NTAAVAEKVE EYSILGSYQM PRFPIPEGHS
AVSYLHEVSE QGLRQRLKLA TADPIDDHYG ERLTYELGVM EQMGFPTYFL VVWDYIRFAR
EQGIPVGPGR GSAAGSLVAY ALGITNIDPV QNGLLFERFL NPERKSMPDI DTDFCIDRRG
EVIDYVTRRY GEDKVAQIIT FNRMTSKAVL KDVARVLDIP YGDADRLAKL IPVVRGKPAK
LAAMIGSDSP NAEFREKYQN DPVVTKWVDM AMRIEGTNKT FGVHAAGVVI AAEPLDNLVP
LQRNNDGQVI TQYFMEDVES MGLLKMDFLG LKNLTMIDKT LELVEISNGE RIDPDQLPPE
DPETFALLAR GDLEGIFQLE SSGMRQIVRD LRPSSLEDIS SILALYRPGP LDAGLIPKFI
NRKHGREAID FAHAALEPIL KETYGIMVYQ EQIMKIAQDL AGYSLGEADL LRRAMGKKKV
SEMQKHRSIF VEGASRSGVD KKIADELFDQ MVLFAEYCFN KSHSTAYGAV TYQTAYLKAH
YPVAYMASLL TVNAGASDKV QRYISNCNAM GIEVMPPDVN ASGIDFTPAG DRILFGLSAV
RNLGDGAIRQ LIANRDGDGP FVSLADLCDR LPSNVLNRRG LESLIHCGAL DAIDPESNRA
QLIADLELLI NWAASRARDR LSGQGNLFDL VAGAADEQTS DELSTAPKAA PVPDYPPTEK
LRLEKELVGF YLSDHPLKQL TAPAQLLAPI GLASLEDQPD KAKVSVITML TEMRQVTTRK
GDRMAVLKIE DLTGGCEAVV FPKSYARLSD HLMLEARLLI WASVDRRDDR IQLIIDDCRA
IDDLRLLLVE LMPDEACDIT VQHKLRECLH QHRPAKDEFG VRVPVVAAVR QGPQVRYVCL
GHQFCVRDAS AALSSLQQQE FKARCSDRLF V