Gene PCC7424_4231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_4231 
Symbol 
ID7108152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp4692746 
End bp4695646 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content35% 
IMG OID643482455 
ProductDNA polymerase I 
Protein accessionYP_002379469 
Protein GI218441140 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCAG ACACAGCCCC TTTATTGATT TTAATCGATG GTCACTCTTT GGCTTTTCGG 
GCTTATCATG CCTTTGCTCA CACCAAACAA GGCCCTTTAC GTACCTCTAC AGGAATTCCC
ACTAGCGTCT GTTTTGGGTT TCTCAACTCT TTATTACAAG TAATCGAGTC TCAGCAGCCC
CAATGTGTGA TAATTGCTTT TGACCGTAAA GAACCTTCCT TTCGTCATCA ACTCGATCCT
AATTATAAAG GCGATCGCAA GGAAACCCCA GAAGAGTTTA TTCCGGACTT AGAAAATCTC
AAATTGTTAC TTTCTGCTTT AAATTTACAA ATTGTCACGG TTGCAGGGTA TGAAGCGGAT
GATATTTTAG GAACTTTAGC CCTCAAAGCG TCTCAGGCTA ATTATAAAGT TAAAATTGTG
ACTGGCGATC GAGATTTATT TCAATTAGTA GATGCTCAAA AAAAGATTAG TGTTCTTTAT
TTAGAAAAGA ATGCCTTTAA AGCCTCTTCT CCCAATGGAT ATACAGAAGT TAACCCGGCA
GAGGTAGAAC AAAAGTTAGG GGTAAAACCT AATCAAGTGG TTGATTATAA AGCCTTATGT
GGAGATAAAT CTGATAGTAT TCCAGGAATA TTAGGAATAG GGGAAAAAAC CGCCGTTACT
CTTCTGAAAG AGTATGGGAC TTTAGAAGGA ATTTACCAAA ATTTAGAAAG CATTAAAGGG
GCACTTAAAA AGAAATTAGA AACGGGAGAA GAAAATGCTA AACACTCTCG AATTTTAGCC
CAATTAGCTT TAGATGTGCC AGTTGAGTTT GATTTCAACA CTTGTCAATT AAAAGGATTT
GAACTAGAGA CAATTCGTCC TTTATTAGAA AAATTAGAAC TGAAGAAATT TATTCAAAAT
ATTAATCGGT TACAAGAAAA ATTTGGAGGA GTTGTATCCT TACCCTCTCA ATCTAACGAA
TCTCAACAAC TTTCTTTATT TCCCGTTTCT GGATCAGATT CTATTGAACA AGTTAACCAA
ACCGAGTCAA TAACTAAGTT AAATTTTATT GAACCCCAAC TAATTAATAC TTCAGAAAAA
CTGACTCAAT TAGTCAAACT ATTAAAACAG TATACTAACC CCGCTCAACC CGTTGCTTGG
GATACAGAAA CCACTTCCCT AGAACCCAAA GATACAACCT TAGTAGGAAT AGGATGTTGT
TGGGGAGAGC AACCGACAGA GGTTGCTTAT ATTCCCTTAA ATCATACGGA AGGAGAACAG
TTACCCCAAG AAGAGGTTTT ATCTGCCTTA AGTGTTATCT TAGAAAGCGA AAATTATCCC
AAGGTTTTTC AGAATACTAA ATTTGATCGA ATTGTTTTAC TCAATAAGGG AATTAAATTA
GCCGGTGTGG TTTTTGATAC CATGTTAGCA AGTTATGTTT TACGTCCTGA ATTGAGTCAT
AAATTGAGTG ATTTATGTGA GCGGTATTTA GAAAATATTA AAGCCTTAAA TTATCGAGAT
TTAGAAATCC CTAAAACTCA AACCATTGCT CATTTAAGTC TAGAAAAAGT CGCTCATTAT
TGCGGAATGG ATGCTTATGC TACTTTTATG TTAGTCCCTA AATTAATTGC CGAACTTAAG
CAAGCTCCCG ACTTATATGA GCTATTATTA AAAGTTGAGC AACCGTTAGA ACCCGTTCTA
GCTGAGATGG AAAATACAGG GGTTTGTATT GATACCGCTT ATCTTAACCA GCTTTCTCAA
CAATTAGAGC AAGATTTACA AATCCTAGAA ACAAAAGCTT ATGAAGCAGC CGGAGAAAGT
TTTAATTTAG GTTCTCCTAA ACAATTAAGT GAGATTTTAT TTGAAAAATT AGGGTTAAAT
AAGAGAAAAT CTCGCAAACT TAAAACCGGT TATTCAACAG ATCATGCTAC CTTAGAAAAA
TTGCAAGGAG ATCACCCTAT CATCGATTAT ATTTTAGAAC ATCGAACCCT TGCTAAATTA
AAATCTACCT ATGTGGATGC TTTACCGGCT TTAGTTCATC CTCAAACTGG ACGAGTACAT
ACTGATTTTA ATCAAGCAGT AACCACCACA GGAAGATTAT CTTCATCGAA TCCTAATTTA
CAAAATATTC CCATTAGAAC CGAATTTTCT CGTCAAATTC GTAAGGCATT TATTACCCAA
GATGATTGGT TATTAGTCTC AGCAGATTAT TCTCAAATTG AATTACGAAT TTTGGCTCAT
TTAAGTCAAG AACCGGTTTT ATTAGAAGCT TATCAAAATT ATCAAGATGT TCACCGAGTA
ACGGCACAAT TGTTATTTGA TAAAGAAGTG ATTACTTCAG AAGAACGGAG TATCGGTAAA
ACGATTAATT TTGGGGTAAT CTATGGAATG GGAGCGCAAA GATTTGCCCG ATCGATGGGG
TTAAGTTTTC AAGAAGGCAA AGATTTTATT GATAAATATC ATCAAAAATA TGCTAGGGTT
TTTGAGTATT TAGAAAGAGT GAAAAAAGAA GCGATCGCCA AAGGATTTGT CACCACGATT
AAAGGAAGAC GACGGTATTT TGAATTTTTT GATGATAAGT TAAATCATTT ACGGGGAGAG
AAACCAGAAA ATCTAGATTT AGACAAACTC AATTTAAATT ATTCTGATGC TCAATTATTG
AGGGCGGCGG CTAATGCTCC TATTCAAGGA TCAAGTGCTG ATATTATTAA AATAGCGATG
GTGCAACTCC ATGAAATTTT ACAGCACTAT CAAGCTAGAT TATTATTACA AGTTCATGAT
GAGTTAGTCT TTGAAATTCC CCCCGATGAA TGGGAAGATT TACAAGTTAA AATAAAAGAT
ACTATGGAAA ATGCGGTTAA GTTAACCGTT CCTTTGGTGG TTGATATTCG TTCAGGTAAA
AATTGGATGG AAGCTAAATG A
 
Protein sequence
MTADTAPLLI LIDGHSLAFR AYHAFAHTKQ GPLRTSTGIP TSVCFGFLNS LLQVIESQQP 
QCVIIAFDRK EPSFRHQLDP NYKGDRKETP EEFIPDLENL KLLLSALNLQ IVTVAGYEAD
DILGTLALKA SQANYKVKIV TGDRDLFQLV DAQKKISVLY LEKNAFKASS PNGYTEVNPA
EVEQKLGVKP NQVVDYKALC GDKSDSIPGI LGIGEKTAVT LLKEYGTLEG IYQNLESIKG
ALKKKLETGE ENAKHSRILA QLALDVPVEF DFNTCQLKGF ELETIRPLLE KLELKKFIQN
INRLQEKFGG VVSLPSQSNE SQQLSLFPVS GSDSIEQVNQ TESITKLNFI EPQLINTSEK
LTQLVKLLKQ YTNPAQPVAW DTETTSLEPK DTTLVGIGCC WGEQPTEVAY IPLNHTEGEQ
LPQEEVLSAL SVILESENYP KVFQNTKFDR IVLLNKGIKL AGVVFDTMLA SYVLRPELSH
KLSDLCERYL ENIKALNYRD LEIPKTQTIA HLSLEKVAHY CGMDAYATFM LVPKLIAELK
QAPDLYELLL KVEQPLEPVL AEMENTGVCI DTAYLNQLSQ QLEQDLQILE TKAYEAAGES
FNLGSPKQLS EILFEKLGLN KRKSRKLKTG YSTDHATLEK LQGDHPIIDY ILEHRTLAKL
KSTYVDALPA LVHPQTGRVH TDFNQAVTTT GRLSSSNPNL QNIPIRTEFS RQIRKAFITQ
DDWLLVSADY SQIELRILAH LSQEPVLLEA YQNYQDVHRV TAQLLFDKEV ITSEERSIGK
TINFGVIYGM GAQRFARSMG LSFQEGKDFI DKYHQKYARV FEYLERVKKE AIAKGFVTTI
KGRRRYFEFF DDKLNHLRGE KPENLDLDKL NLNYSDAQLL RAAANAPIQG SSADIIKIAM
VQLHEILQHY QARLLLQVHD ELVFEIPPDE WEDLQVKIKD TMENAVKLTV PLVVDIRSGK
NWMEAK