Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_4231 |
Symbol | |
ID | 7108152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | + |
Start bp | 4692746 |
End bp | 4695646 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643482455 |
Product | DNA polymerase I |
Protein accession | YP_002379469 |
Protein GI | 218441140 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCAG ACACAGCCCC TTTATTGATT TTAATCGATG GTCACTCTTT GGCTTTTCGG GCTTATCATG CCTTTGCTCA CACCAAACAA GGCCCTTTAC GTACCTCTAC AGGAATTCCC ACTAGCGTCT GTTTTGGGTT TCTCAACTCT TTATTACAAG TAATCGAGTC TCAGCAGCCC CAATGTGTGA TAATTGCTTT TGACCGTAAA GAACCTTCCT TTCGTCATCA ACTCGATCCT AATTATAAAG GCGATCGCAA GGAAACCCCA GAAGAGTTTA TTCCGGACTT AGAAAATCTC AAATTGTTAC TTTCTGCTTT AAATTTACAA ATTGTCACGG TTGCAGGGTA TGAAGCGGAT GATATTTTAG GAACTTTAGC CCTCAAAGCG TCTCAGGCTA ATTATAAAGT TAAAATTGTG ACTGGCGATC GAGATTTATT TCAATTAGTA GATGCTCAAA AAAAGATTAG TGTTCTTTAT TTAGAAAAGA ATGCCTTTAA AGCCTCTTCT CCCAATGGAT ATACAGAAGT TAACCCGGCA GAGGTAGAAC AAAAGTTAGG GGTAAAACCT AATCAAGTGG TTGATTATAA AGCCTTATGT GGAGATAAAT CTGATAGTAT TCCAGGAATA TTAGGAATAG GGGAAAAAAC CGCCGTTACT CTTCTGAAAG AGTATGGGAC TTTAGAAGGA ATTTACCAAA ATTTAGAAAG CATTAAAGGG GCACTTAAAA AGAAATTAGA AACGGGAGAA GAAAATGCTA AACACTCTCG AATTTTAGCC CAATTAGCTT TAGATGTGCC AGTTGAGTTT GATTTCAACA CTTGTCAATT AAAAGGATTT GAACTAGAGA CAATTCGTCC TTTATTAGAA AAATTAGAAC TGAAGAAATT TATTCAAAAT ATTAATCGGT TACAAGAAAA ATTTGGAGGA GTTGTATCCT TACCCTCTCA ATCTAACGAA TCTCAACAAC TTTCTTTATT TCCCGTTTCT GGATCAGATT CTATTGAACA AGTTAACCAA ACCGAGTCAA TAACTAAGTT AAATTTTATT GAACCCCAAC TAATTAATAC TTCAGAAAAA CTGACTCAAT TAGTCAAACT ATTAAAACAG TATACTAACC CCGCTCAACC CGTTGCTTGG GATACAGAAA CCACTTCCCT AGAACCCAAA GATACAACCT TAGTAGGAAT AGGATGTTGT TGGGGAGAGC AACCGACAGA GGTTGCTTAT ATTCCCTTAA ATCATACGGA AGGAGAACAG TTACCCCAAG AAGAGGTTTT ATCTGCCTTA AGTGTTATCT TAGAAAGCGA AAATTATCCC AAGGTTTTTC AGAATACTAA ATTTGATCGA ATTGTTTTAC TCAATAAGGG AATTAAATTA GCCGGTGTGG TTTTTGATAC CATGTTAGCA AGTTATGTTT TACGTCCTGA ATTGAGTCAT AAATTGAGTG ATTTATGTGA GCGGTATTTA GAAAATATTA AAGCCTTAAA TTATCGAGAT TTAGAAATCC CTAAAACTCA AACCATTGCT CATTTAAGTC TAGAAAAAGT CGCTCATTAT TGCGGAATGG ATGCTTATGC TACTTTTATG TTAGTCCCTA AATTAATTGC CGAACTTAAG CAAGCTCCCG ACTTATATGA GCTATTATTA AAAGTTGAGC AACCGTTAGA ACCCGTTCTA GCTGAGATGG AAAATACAGG GGTTTGTATT GATACCGCTT ATCTTAACCA GCTTTCTCAA CAATTAGAGC AAGATTTACA AATCCTAGAA ACAAAAGCTT ATGAAGCAGC CGGAGAAAGT TTTAATTTAG GTTCTCCTAA ACAATTAAGT GAGATTTTAT TTGAAAAATT AGGGTTAAAT AAGAGAAAAT CTCGCAAACT TAAAACCGGT TATTCAACAG ATCATGCTAC CTTAGAAAAA TTGCAAGGAG ATCACCCTAT CATCGATTAT ATTTTAGAAC ATCGAACCCT TGCTAAATTA AAATCTACCT ATGTGGATGC TTTACCGGCT TTAGTTCATC CTCAAACTGG ACGAGTACAT ACTGATTTTA ATCAAGCAGT AACCACCACA GGAAGATTAT CTTCATCGAA TCCTAATTTA CAAAATATTC CCATTAGAAC CGAATTTTCT CGTCAAATTC GTAAGGCATT TATTACCCAA GATGATTGGT TATTAGTCTC AGCAGATTAT TCTCAAATTG AATTACGAAT TTTGGCTCAT TTAAGTCAAG AACCGGTTTT ATTAGAAGCT TATCAAAATT ATCAAGATGT TCACCGAGTA ACGGCACAAT TGTTATTTGA TAAAGAAGTG ATTACTTCAG AAGAACGGAG TATCGGTAAA ACGATTAATT TTGGGGTAAT CTATGGAATG GGAGCGCAAA GATTTGCCCG ATCGATGGGG TTAAGTTTTC AAGAAGGCAA AGATTTTATT GATAAATATC ATCAAAAATA TGCTAGGGTT TTTGAGTATT TAGAAAGAGT GAAAAAAGAA GCGATCGCCA AAGGATTTGT CACCACGATT AAAGGAAGAC GACGGTATTT TGAATTTTTT GATGATAAGT TAAATCATTT ACGGGGAGAG AAACCAGAAA ATCTAGATTT AGACAAACTC AATTTAAATT ATTCTGATGC TCAATTATTG AGGGCGGCGG CTAATGCTCC TATTCAAGGA TCAAGTGCTG ATATTATTAA AATAGCGATG GTGCAACTCC ATGAAATTTT ACAGCACTAT CAAGCTAGAT TATTATTACA AGTTCATGAT GAGTTAGTCT TTGAAATTCC CCCCGATGAA TGGGAAGATT TACAAGTTAA AATAAAAGAT ACTATGGAAA ATGCGGTTAA GTTAACCGTT CCTTTGGTGG TTGATATTCG TTCAGGTAAA AATTGGATGG AAGCTAAATG A
|
Protein sequence | MTADTAPLLI LIDGHSLAFR AYHAFAHTKQ GPLRTSTGIP TSVCFGFLNS LLQVIESQQP QCVIIAFDRK EPSFRHQLDP NYKGDRKETP EEFIPDLENL KLLLSALNLQ IVTVAGYEAD DILGTLALKA SQANYKVKIV TGDRDLFQLV DAQKKISVLY LEKNAFKASS PNGYTEVNPA EVEQKLGVKP NQVVDYKALC GDKSDSIPGI LGIGEKTAVT LLKEYGTLEG IYQNLESIKG ALKKKLETGE ENAKHSRILA QLALDVPVEF DFNTCQLKGF ELETIRPLLE KLELKKFIQN INRLQEKFGG VVSLPSQSNE SQQLSLFPVS GSDSIEQVNQ TESITKLNFI EPQLINTSEK LTQLVKLLKQ YTNPAQPVAW DTETTSLEPK DTTLVGIGCC WGEQPTEVAY IPLNHTEGEQ LPQEEVLSAL SVILESENYP KVFQNTKFDR IVLLNKGIKL AGVVFDTMLA SYVLRPELSH KLSDLCERYL ENIKALNYRD LEIPKTQTIA HLSLEKVAHY CGMDAYATFM LVPKLIAELK QAPDLYELLL KVEQPLEPVL AEMENTGVCI DTAYLNQLSQ QLEQDLQILE TKAYEAAGES FNLGSPKQLS EILFEKLGLN KRKSRKLKTG YSTDHATLEK LQGDHPIIDY ILEHRTLAKL KSTYVDALPA LVHPQTGRVH TDFNQAVTTT GRLSSSNPNL QNIPIRTEFS RQIRKAFITQ DDWLLVSADY SQIELRILAH LSQEPVLLEA YQNYQDVHRV TAQLLFDKEV ITSEERSIGK TINFGVIYGM GAQRFARSMG LSFQEGKDFI DKYHQKYARV FEYLERVKKE AIAKGFVTTI KGRRRYFEFF DDKLNHLRGE KPENLDLDKL NLNYSDAQLL RAAANAPIQG SSADIIKIAM VQLHEILQHY QARLLLQVHD ELVFEIPPDE WEDLQVKIKD TMENAVKLTV PLVVDIRSGK NWMEAK
|
| |