Gene A9601_09151 details


Gene Information

Locus tag: A9601_09151
Symbol: dnaE
ID: 4717622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 785966
End bp: 789463
Gene Length: 3498 bp
Protein Length: 1165 aa
Translation table: 11
GC content: 33%
IMG OID: 640078628
Product: DNA polymerase III subunit alpha
Protein accession: YP_001009306
Protein GI: 123968448
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
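
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 785966
end_bp = 789463

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 3498, matching "Gene Length"
print(protein_length)  # 1165, matching "Protein Length"
```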


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTTCG TTCCGCTTCA TAATCATAGT GACTACAGCT TACTTGATGG TGCCAGTCAA 
ATTTCAAAAA TTGTAGAAAG AGCTTGTGAT CTTGGGATGG ATTCTATTGC TCTCACAGAT
CATGGAGTTA TGTATGGTGT TCTTGATTTG GTCAAGAAGT GTAAAGAGAA AGGTATAAAG
CCAATTATTG GTAATGAAAT GTACGTTATT AATGGTTCTA TTGATGATCC TCAACCAAAA
AAAGAAAAAA GATATCATTT GGTGGTGCTA GCAAAAAATT ATACTGGTTA TAAGAATCTA
GTGAAGTTGA CAACAATTAG TCACCTAAAC GGGATGAGAG GTCGAGGCAT TTTTTCTAGG
CCATGTATTG ATAAATCTCT TTTAAGCAAA TATAGTGATG GCCTAATAGT CTCTACAGCT
TGTCTTGGTG GAGAGATACC TCAGGCTATC TTAAAAGGTA GGTTAGACGT AGCAGAGGAT
ATAGCTCTTT GGTATAAAAA ATTATTTGCA GATGACTTTT ATCTAGAAAT ACAAGATCAC
GGCTCTATTG AGGATAGAAT TGTTAACGTT GAATTAATAA AAATTGGGAA GAAGCACCAA
ATAAAAGTCA TAGCCACCAA CGACGCCCAT TACTTATCAA GTATGGATGT TGAAGCTCAT
GATGCCTTAC TTTGTGTATT AACTGGAAAA CTAATAAGTG ATGAAAAAAG ATTGAGATAT
ACCGGTACAG AATATATTAA AAGTGAAAAT GAAATGCTTG AACTTTTTAA AGATCATATT
GATGATAAAT CAATTATTGA TGCAGTGAAT AATACAGTAG AAATTTCTCA AAAAGTTGAG
GTATTTGATT TGTTTGGTAA TTATAGAATG CCCAAATTTC CTCTTAATGA AGATAAAGAT
TCATTTTCTT TCCTTAAACA ATTATCTAAT AAAGGTCTTT TAAAAAGACT TAAAAAAAAT
GATCTTGATG AAGTTGATGA AAAATATAAA GAAAGACTAA CTTCTGAATT AAAAATTATA
AAAGATATGG GTTTCCCAGA TTATTTTTTG GTTGTTTGGG ACTACATCAA ATTTGCTAGA
GACAACTCTA TACCAGTAGG ACCAGGTAGA GGTTCTGCTG CGGGTTCACT AGTAGCTTAT
GCACTTCAAA TCACAAATAT AGATCCTGTC GAGCATGGAT TGTTATTTGA GAGATTTTTA
AATCCAGCAA GAAAGTCTAT GCCAGATATT GATACCGACT TTTGTATTGA TAGGAGAAAT
GAAGTTATTG ATTATGTTAC TAATCGTTAT GGAGAGGATA AAGTTGCGCA AATAATTACT
TTCAATAAAA TGACCTCTAA GGCGGTTTTA AAAGATGTTG CAAGGGTTCT AGATATTCCG
TATGGAGAGG CTGATAAATT GGCTAAGTTA ATACCGGTTG TAAGAGGGAA ACCTTATAAA
CTAAATGAAA TGATTGATAA GAATTCTCCT AGCCAAGAGT TTAGAGACAA ATATATTAAT
GATAATAGGA TAAAAAAATG GGTTGATTTG GCTTTGAGAA TTGAAGGAAC TAATAAAACA
TATGGAGTTC ATGCTGCTGG AGTTGTTATC GCATCAGATC CTCTCGACGA ACTTGTACCT
CTTCAAAGGA ATAATGAAGG ACAAATAATA ACCCAATATT CTATGGATGA TATCGAATCA
CTTGGATTAT TGAAAATGGA TTTCTTGGGT CTTAAGAATC TCACTATGAT TGAAAAGACA
GTTTCTCTTG TTAATCAATC CTCCGGAAAG AAAATAAATA TCGATGAGTT ACCGCGAAAT
GACAGTAAAA CCTTTGAGCT TATTGGAAGA GGAGATCTTG AAGGTATTTT TCAGCTTGAA
TCTTCTGGTA TGAAACAGGT TGTTAAGGAT TTCAAACCTA ACTCTCTAGA GGATATTTCT
TCCATACTGG CTCTTTATAG ACCTGGTCCT CTTGATGCGG GTCTCATTCC TAAATTTATA
AATCGAAAAA ATGGGAATGA AAAGATTGAT TTTCCTCATC CTTTTATTAA GTCAATTCTT
ACTGAAACCT ATGGAATTAT GGTTTATCAA GAGCAAATCA TGAAAATTGC TCAAGACCTA
GCTGGCTATT CTTTAGGTGA TGCTGATTTA CTTAGAAGAG CAATGGGGAA AAAGAAAGTA
TCTGAGATGG TAAAGCATAG GAATATTTTT GTAGAAGGTT CTATGAAGAA AGGTGTAAAT
GAAAAATTAG CAAATGATCT TTTTGATCAA ATGGTTTTAT TCGCGGAATA TTGTTTTAAC
AAAAGTCACT CAACTGCTTA CGGGGCTGTA ACTTATCAAA CTGCATTTTT AAAAGCCCAT
TTTCCTGTTG CATATATGGC AGCCCTTCTA AGCGTAAACT CTGGTTCTAG CGATAAGATG
CAAAGATATA TTTCTAATTG TTATTCCATG GGAATAGAAG TTATTTCACC AAGCATTAAT
TTTTCTGGTG TTGATTTCAC TATTAAGAAT AATCAGATTT TATTCGGGTT ATCTGCAATT
AAGAATTTAG GAGATTCTGC GATAAGAAAT ATAATTGAAA ACCGAAATAG TTTAGGAATA
TTTAAGTCAC TAGCCGATTT GTGCGATCGT TTGCCTTCTA ATGTTCTTAA CAAAAGAAGT
CTTGAATCTC TAATTCATTG TGGAGCACTA GATGAGTTTT CAATTGATAA TAATAGAGCT
CAATTATTGT CAGATCTCGA AAATGTCATT GAGTGGGCCT CTTCAAGAAA TCGTGATAGG
TTATCTGGGC AAGGCAATCT ATTTGATTCT AAAGAAGAAT TTTCTAATGT TGCTTTTTCA
GATTCACAAT TAGCTAAGGT TGAGGATTAT TCACTTATTG AGAAGTTAAA GTTAGAAAAA
CAGCTACTAG GTTTTTATTT ATCTGATCAT CCTCTAAAGC ATTTAACTAA GCCAGCAAAA
CTTATATCTC CTATAAGCAT TTCGCATTTA GAAGAAACAA AAGATAGAAC CAAAGTCTCT
TTAGTTGGAA TGATCCCTGA TTTGAAGCAA ATTACAACGA GAAAAGGAGA TAGGATGGCT
ATAGTTCAGC TAGAAGATCT TTCAGGAAGT TGCGAAGCAA TAGTTTTTCC AAAAACCTAT
GTAAGATTAT CAGAATTTCT TCTGACGGAT ACAAGATTAT TGGTTTGGGG AACAATAGAT
AAAAAAAGTG ATAAGACTCA ATTAATAATT GATGATTGTA GAGAAATCGA TAACCTTAAA
TTGCTAATTA TTAATCTTGA AAGTTCTCAA GCATCAGATG TACGCGTACA AAATACTTTG
AGAAACTGTT TAATTAAATT TAAACCAGAT AAAGGTAGAT GTGGAATAAA GATTCCAGTT
TTAGCTGCAG TAAGAAATAA AAATAGTGTT ACCTACGTTA AATTTGGCGA ACAATTTTGT
ATTGGTGATA TTCAGGGAGC ATGCAAATTA TTAGAAGATA AATCATTTAA AGTTAACTTG
AAATCTTTAG TTTCCTAG
 
Protein sequence
MAFVPLHNHS DYSLLDGASQ ISKIVERACD LGMDSIALTD HGVMYGVLDL VKKCKEKGIK 
PIIGNEMYVI NGSIDDPQPK KEKRYHLVVL AKNYTGYKNL VKLTTISHLN GMRGRGIFSR
PCIDKSLLSK YSDGLIVSTA CLGGEIPQAI LKGRLDVAED IALWYKKLFA DDFYLEIQDH
GSIEDRIVNV ELIKIGKKHQ IKVIATNDAH YLSSMDVEAH DALLCVLTGK LISDEKRLRY
TGTEYIKSEN EMLELFKDHI DDKSIIDAVN NTVEISQKVE VFDLFGNYRM PKFPLNEDKD
SFSFLKQLSN KGLLKRLKKN DLDEVDEKYK ERLTSELKII KDMGFPDYFL VVWDYIKFAR
DNSIPVGPGR GSAAGSLVAY ALQITNIDPV EHGLLFERFL NPARKSMPDI DTDFCIDRRN
EVIDYVTNRY GEDKVAQIIT FNKMTSKAVL KDVARVLDIP YGEADKLAKL IPVVRGKPYK
LNEMIDKNSP SQEFRDKYIN DNRIKKWVDL ALRIEGTNKT YGVHAAGVVI ASDPLDELVP
LQRNNEGQII TQYSMDDIES LGLLKMDFLG LKNLTMIEKT VSLVNQSSGK KINIDELPRN
DSKTFELIGR GDLEGIFQLE SSGMKQVVKD FKPNSLEDIS SILALYRPGP LDAGLIPKFI
NRKNGNEKID FPHPFIKSIL TETYGIMVYQ EQIMKIAQDL AGYSLGDADL LRRAMGKKKV
SEMVKHRNIF VEGSMKKGVN EKLANDLFDQ MVLFAEYCFN KSHSTAYGAV TYQTAFLKAH
FPVAYMAALL SVNSGSSDKM QRYISNCYSM GIEVISPSIN FSGVDFTIKN NQILFGLSAI
KNLGDSAIRN IIENRNSLGI FKSLADLCDR LPSNVLNKRS LESLIHCGAL DEFSIDNNRA
QLLSDLENVI EWASSRNRDR LSGQGNLFDS KEEFSNVAFS DSQLAKVEDY SLIEKLKLEK
QLLGFYLSDH PLKHLTKPAK LISPISISHL EETKDRTKVS LVGMIPDLKQ ITTRKGDRMA
IVQLEDLSGS CEAIVFPKTY VRLSEFLLTD TRLLVWGTID KKSDKTQLII DDCREIDNLK
LLIINLESSQ ASDVRVQNTL RNCLIKFKPD KGRCGIKIPV LAAVRNKNSV TYVKFGEQFC
IGDIQGACKL LEDKSFKVNL KSLVS
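
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own procedure. It uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is removed first.
    Codons containing ambiguous bases would raise a KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence shown in the record.
first_line = "ATGGCTTTCG TTCCGCTTCA TAATCATAGT GACTACAGCT TACTTGATGG TGCCAGTCAA"
last_line = "AAATCTTTAG TTTCCTAG"

print(translate(first_line))  # MAFVPLHNHSDYSLLDGASQ
print(translate(last_line))   # KSLVS
```

The two results match the first 20 and last 5 residues of the protein sequence. The last line begins on a codon boundary because each full line holds 60 bases, a multiple of three.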