Gene P9211_06991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_06991 
SymboldnaE 
ID5730679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp611194 
End bp614709 
Gene Length3516 bp 
Protein Length1171 aa 
Translation table11 
GC content37% 
IMG OID641285062 
ProductDNA polymerase III subunit alpha 
Protein accessionYP_001550584 
Protein GI159903240 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.232183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTTG TACCTCTTCA TAATCACAGC GATTACAGTC TTCTTGATGG AGCTAGTCAG 
CTGCCAGCAA TGGTTTCCAG AGCCAAGGCG CTTGGATTTC CAGCTATTGC GCTAACTGAT
CATGGAGTGA TGTATGGCGC TATTGAACTT TTAAAGCTTT GTAAGTCAGA AGGTATAAAA
CCAATCATTG GGAATGAAAT GTATGTTGTT AATGGATCAA TAGAAGACCC TCAGCCGAAA
AAAGAACGAA GATATCACCT TGTTGTACTT GCTAAGAATG ACGTTGGATA TAGGAATTTA
GTCAAGCTAA CTACTATTAG CCATTTAAAT GGGATGAGAG GACGAGGAAT TTTTTCAAGA
GCATGTATTG ATAAACAATT ACTTGAAACT TATAAAGAAG GCTTGATAAT TTCGACTGCA
TGTCTTGGTG GCGAGATTCC TCAGGCTATT TTGCGAGACC GTATTGATGT GGCAAAGGAT
GTTGCTAGTT GGTATAAAAA AGTTTTTGGG GATGATTTTT ATTTAGAGAT TCAGGACCAT
GGATCTGTAG AAGATCGAAT AGTCAATACA GGGATATGTC GTATTGCTAA AGAATTAAAT
ATTGAACTGA TTGCAACAAA TGATGCTCAC TACCTTACTA AAGATGATGT GGAAGCACAT
GATGCGTTGT TATGTGTACT TACAGGGAAA TTGATAAGTG AAGAAAAACG TTTGAGATAT
ACAGGAACAG AATATCTCAA ATCTGAAGAA GAGATGGGAA AGCTTTTCTC AGACCATATC
GAGAGTAATA TTATACAAGA AGCAATAAAT AATACTGTTA CTGTCTCCGA AAAAGTAGAA
GAATACAATA TTTTAGGAAC TTATAAAATG CCTCAGTTCC CTGTGCCTGA TGGTGGTCAG
TCAACTGATT ATTTAAGGAA GGTATCTCAA GATGGATTGA TGACTAGACT TAAGCTTGAA
TCACTAGAGA TGATTGAAAA TAAATATATA AATCGCCTTA ATAGTGAAAT AAAAATAATA
GAACAGATGG GTTTCCCAGA CTACTTCCTT GTTGTATGGG ATTATATTCG TTTTGCAAGA
GAAAATCATA TACCAGTTGG TCCAGGGAGG GGCTCTGCAG CAGGATCCCT CGTTGCATAT
TCTTTGGGTA TTACAAATAT TGATCCTGTA ACAAATGGTC TTTTATTTGA AAGATTTTTA
AACCCTGAAA GGAAATCTAT GCCTGATATA GATACTGACT TTTGTATTGA ACGTAGGGGT
GAAGTTATTG ATTATGTTAC GAAGCGCTAT GGAGAAGATA AGGTTGCGCA AATTATTACT
TTCAACCGAA TGACATCTAA AGCTGTTCTT AAAGATGTTG CAAGAGTTTT GGATATTCCA
TATAGTGACG CAGATAGATT AGCAAAGTTA ATTCCTGTAG TCAGAGGTAA GCCAGCTAAA
CTTTCTCAAA TGATTGGGGA TAATACTCCC AGTAAAGATT TTAGAGAAAA ATACCAAAAT
GATCCTTTGG TAAAGAAGTG GTTGGATATG GCAATCAGAA TAGAAGGTAC TAATAAGACT
TTTGGAGTTC ATGCAGCAGG TGTTGTTATT GCCTCGGACC CCTTGGATAA TTTAGTTCCT
CTTCAAAGGA ATAATGATGG CCAAATAATT ACTCAATATT TTATGGAAGA TATTGAATCA
TTAGGATTAC TTAAAATGGA TTTTTTAGGT CTTAAGAATC TTACTATGAT TGAAAAGGCA
GTAACTTTAG TCGAAGATTC TTTGGGGGAA AAGCTTGATT TAGATCAATT AAATATGGAC
GATACTAAAA CTTATGAGCT CTTATCAAAA GGCGATTTAG AGGGAATTTT TCAACTTGAG
TCAACTGGAA TGAGACAAAT AGTTAAAGAT CTTAGGCCAT CTTCTTTGGA AGATATTTCT
TCAATTCTTG CGTTGTATAG ACCAGGTCCA CTTGATGCAG GATTAATTCC AAAATTTATT
AATCGAAAGC ATGGGAAAGA GCAGATTGAT TTTCCCCATG CTTCTTTAGC ACCAATACTT
GGAGAGACAT ACGGCATAAT GCTTTATCAA GAGCAAATAA TGAAAATTGC CCAAGAACTG
GCTGGTTATT CGTTAGGCCA GGCAGATCTT TTAAGAAGGG CTATGGGTAA GAAAAAGGTT
GCGGAGATGG AAAAACATAG AAACTTTTTT CTTGAAGGCG CTAGTAAAAA TGGAATTAAT
TCGAATATAG CCAATGAATT ATTTGAGCAA ATGCTCCTTT TTGCGGAGTA CTGCTTTAAC
AAGAGTCACT CCACTGCTTA TGGAGCAGTT ACTTTCCAAA CAGCTTACTT AAAAGCTCAT
TACCCAGTTG CATATATGGC TGCTCTGCTT ACTGTAAATG CTGGGTCTAG TGACAAAGTT
CAACGCTATA TATCTAACTG TAATTCCATG GGCATAGAAG TTATGCCGCC AGATGTTAAC
TCTTCGGGAA TAGATTTTAC TCCCAATGAA AATCACATTC TGTTTGGAAT GTCTGCCGTG
AAAAACCTTG GTGATGGTGC TATTCGTGAA TTAATAAAAT CTCGTGAAGA AGATGGCTCT
TTTATTTCCT TGGCAGATCT TTGTGATCGA ATTCCACCAA ATACCCTTAA CAGAAGAGGA
TTAGAGTCTT TAATTCATTC TGGTGCGCTT GATTCCTTTG ATAAGAAGGC AAATAGGGCT
CAGTTGTTAG CAGACCTTGA TCTGATTATT GAGTGGGCGA CTTCTAGAGC GCGAGATCGT
ATTAGTGGTC AGGGAAACTT GTTTGATCTG GCATCTTCTT CTTCCGAGAA TCAAACATCA
AACAGCCTTC ACACTGCTCC TAAAGCAGCT CCTGTAAGCG ACTATTCTCC TACAGAAAAG
CTTCGCCTTG AGAAGGAACT CATTGGCTTT TATCTTTCTG ATCATCCACT TAAGCAACTC
TCTGAGCCAG CCAAGCTTAT TGCTCCAATA AGCTTAGGGA CATTAGAAGA TCAACGGGAT
AAGTCAAAAG TAAGTGTCAT TGCCATGATT AACGATATGA GAGTAGTAAC TACTCGCAAG
GGAGATAAAA TGGCTATCCT TCAAATTGAG GATTTAACAG GCTCATGCGA AGCTGTTGTT
TTCCCTAAGA GTTATCACAG ACTTTCAGAT CATCTGATTT CTGAGACACG TTTATTAGTT
TGGGCATCAG TTGATAGAAG GGATGATAAT ACTCAATTAA TTGTTGATGA TTGCCGCTCA
ATAGATGACA TGAGATTTGT TTTAGTTGAC TTATTGCCTG ATCAAATTTC TAATATTGAT
TATCAATATC GGCTTAGAGA ATGTTTAAAT AATCATCGTC CTGCAAGAGA TGAGCTGGGC
GTAAGAGTTC CTGTAGTTGC TGTAATTAGA GATGGTAGCA ATATTAAATA TATTCGTTTG
GGTCATCAAT TTTGTGTTAA GGATGCAGCT GCGGCAGTTA AGTCTTTGCA GAATAGTTCT
TTTAAAGCCA GTTTTAGTGA GAGTTTGGTT AACTAA
 
Protein sequence
MGFVPLHNHS DYSLLDGASQ LPAMVSRAKA LGFPAIALTD HGVMYGAIEL LKLCKSEGIK 
PIIGNEMYVV NGSIEDPQPK KERRYHLVVL AKNDVGYRNL VKLTTISHLN GMRGRGIFSR
ACIDKQLLET YKEGLIISTA CLGGEIPQAI LRDRIDVAKD VASWYKKVFG DDFYLEIQDH
GSVEDRIVNT GICRIAKELN IELIATNDAH YLTKDDVEAH DALLCVLTGK LISEEKRLRY
TGTEYLKSEE EMGKLFSDHI ESNIIQEAIN NTVTVSEKVE EYNILGTYKM PQFPVPDGGQ
STDYLRKVSQ DGLMTRLKLE SLEMIENKYI NRLNSEIKII EQMGFPDYFL VVWDYIRFAR
ENHIPVGPGR GSAAGSLVAY SLGITNIDPV TNGLLFERFL NPERKSMPDI DTDFCIERRG
EVIDYVTKRY GEDKVAQIIT FNRMTSKAVL KDVARVLDIP YSDADRLAKL IPVVRGKPAK
LSQMIGDNTP SKDFREKYQN DPLVKKWLDM AIRIEGTNKT FGVHAAGVVI ASDPLDNLVP
LQRNNDGQII TQYFMEDIES LGLLKMDFLG LKNLTMIEKA VTLVEDSLGE KLDLDQLNMD
DTKTYELLSK GDLEGIFQLE STGMRQIVKD LRPSSLEDIS SILALYRPGP LDAGLIPKFI
NRKHGKEQID FPHASLAPIL GETYGIMLYQ EQIMKIAQEL AGYSLGQADL LRRAMGKKKV
AEMEKHRNFF LEGASKNGIN SNIANELFEQ MLLFAEYCFN KSHSTAYGAV TFQTAYLKAH
YPVAYMAALL TVNAGSSDKV QRYISNCNSM GIEVMPPDVN SSGIDFTPNE NHILFGMSAV
KNLGDGAIRE LIKSREEDGS FISLADLCDR IPPNTLNRRG LESLIHSGAL DSFDKKANRA
QLLADLDLII EWATSRARDR ISGQGNLFDL ASSSSENQTS NSLHTAPKAA PVSDYSPTEK
LRLEKELIGF YLSDHPLKQL SEPAKLIAPI SLGTLEDQRD KSKVSVIAMI NDMRVVTTRK
GDKMAILQIE DLTGSCEAVV FPKSYHRLSD HLISETRLLV WASVDRRDDN TQLIVDDCRS
IDDMRFVLVD LLPDQISNID YQYRLRECLN NHRPARDELG VRVPVVAVIR DGSNIKYIRL
GHQFCVKDAA AAVKSLQNSS FKASFSESLV N