Gene Information |
Locus tag | Synpcc7942_0194 |
Symbol | |
ID | 3775802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 192655 |
End bp | 195516 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637798600 |
Product | DNA polymerase I |
Protein accession | YP_399213 |
Protein GI | 81299005 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
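
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 192655
end_bp = 195516
gene_length_bp = 2862
protein_length_aa = 953


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under these assumptions the record is internally consistent: 195516 - 192655 + 1 = 2862 bp, and 2862 / 3 - 1 = 953 aa.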
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTAG ACTCTCCCCT CTTGCTTCTC GTGGATGGCC ACTCCTTGGC CTTCCGCAGT TACTACGCTT TTGCCAAAGG TCGCGATGGT GGGTTACGCA CCCGCACCGG TATTCCCACC AGTGTCTGTT TTGGCTTCCT CAAAGCGCTG CTGGAGGTGA TGGAGCAACA GCAACCCAAA GCGTTGGCGA TCGCCTTCGA TTTGGGTGGG CCAACCTTCC GCCACGAAGC AGACGAGAAC TACAAGGCCA ACCGCGACGA AGCCCCCGAA GACTTCAAGA TCGATACGGA TAACCTCGTC GCGCTGCTGC AAACCCTCAA TCTGCCAATT CTGGTGGAGC CAGGCTATGA GGCAGATGAC CTACTCGGCA CAGTCGCCCA ACGCGGAGCC GAAGCGGGCT ATCGAGTACG GATTCTCAGC GGCGATCGCG ATCTCTTCCA GCTCGTTGAC CCCGAGGGTG CGATTCGCGT GCTCTACCTC GGCAATACCT TTGGCCGCAG CGCTAATCGA GAAGCCGCCC GTGAGATTGA TCCGGCCGCG GTGATCGATA AGCTGGGTGT ACCGCCTGAG CAAGTAATTG ATTTCAAAGC GCTTTGCGGT GATAGCTCTG ATAACATCCC TGGCGTCAAA GGCATTGGTC CCAAAACAGC CGTGGATCTG CTGCAGGCTT GGGGTGATCT CGATCGCATT TACGACAATC TGGAAGCGAT CAAACCCGCC GTTCGCAAGA AATTAGAGAG CGATCGCGAG GCAGCCTATC ACTCCCGCAA ACTGGCGCAA ATCGTCACGG ATATTCCCCT AGCGATCGAC TGGGATCACT ATGCTCTGAC CGGCTTTGAT GAACAGGAAG TCTTGCCTTG GCTGGAAAAA CTGGAACTGC AAGCCTTCCG GCGACAGGTC GATCGCCTGC AACAGCTCTT CGGGGGCCAG CCACCAACTG TCGAAGCACT CAGTGATGAA TCCCTCGACT TTTGGACCGC CGAAGAAACT GCTGCGCAGC AACCGCGCTG GCCACAACTG CAGCCTCAGA TCATCACCAC GGCCGCAGCG CTGACTGACC TCGTGACTTT GCTGGAACAG CGCGATAGTC CTGAAGCGAT CGTGGCTTGG GACACGGAAA CAACCGACCT CGATCCGCGC TTGGCGCAAC TGGTGGGTAT TGGCTGCGCT TGGGGAGAAG AGCCAGATCA GCTTGCCTAC ATTCCCCTCG GGCATGAGGA AGGCGAGCAA TTGCCGCTCC AGACAGTCCT CACCGCGCTA CGACCGATTT TGGAAAGCGA TCGCCATCCC AAGGCGCTGC AAAATGCCAA GTTCGATCGC CTGATTCTGC GTCACCAAGG CATTGAGCTG GCGGGTGTGG TCTTTGACAC AATGCTGGCT AGCTACCTAC TCAACCCCAG TCTTGGCCAC AGCTTAGATG CTTTAGCCGA TCGCTGGTTA AAGCTGCAAA CGCGCAGCTA CAGCGACCTC GTCCCCAAGG GCAAAACCAT CGCCCAAGTC GCGATCGCGG CAGTTGCACA ATACTGCGGC AGCGATGTCC ATGTGGTGCA GCGGCTGATT CCGTTGCTGA AAGCTGGGAT CGCCGAATCT CCGGCGCTTC AATTCCTGCT GGAAACCGTC GAGCTGCCGC TCGAAGCCGT GCTTGCCGAG ATGGAAGATC GCGGCATTCG CATTGATGAA GGATATCTAG CAGAACTGTC GGAGCATCTC AAGGGCGAGC TCGATCGCTT GGAAGGAGCA GCGCATACCC TAGCGGGCGA TCGCTTTAAT CTCGGGTCGC CCAAGCAGCT GAGCGAACTG 
CTGTTCGAGA AGCTAGGGCT GAATGTCAAA AAGTCCCGCA AGACCAAAAC GGGCTACTCC ACCGATGCAG CCGTGCTCGA AAAACTCCAG GGTGATCACC CGATCATCGA CCTGATTCTG GAGCACCGCA CCCTCGCCAA ACTGAAGTCG ACCTATGTGG ATGCGCTGCC GAGTCTGGTT GCTGCCGATG GACGCATTCA TACTGATTTC AACCAAGCGG TGACGGCGAC GGGACGGCTG TCCTCTTCTA ATCCCAACCT GCAGAACATT CCGATTCGCA CCGAATTCAG CCGTCAAATT CGCAAGGCTT TTCTGCCCCG TGAAGGCTGG CTACTAGCCG CAGCAGATTA CTCGCAAATT GAGCTGCGCA TCCTCGCTCA CCTCAGCCAA GAGCCAGTGC TGCTGGAAGC CTACCGGCAG GGCGATGATG TACACCGACT TACGGCCAGT CTGCTCTTCG ATCGCGAGGA GATCACGTCC GAGGAACGAC GCATTGGCAA AATCATCAAC TTTGGTGTGA TTTACGGCAT GGGTGCCCAG CGCTTTGCCC GCGAAACTGG CAGCAGCACC AAGGAAGCCC AAGGCTTTAT CGATCGCTTC TACGATCGCT ATCCTCGGGT GTTTACCTAC CTGCAAAGCC TGGAACGCCA AGCGATCGCC CGCGGTTATG TGGAAACAGT CTTGGGGCGG CGGCGTTACT TTGACTTTGA GGACACTGGC CTCCAGAAGC TACGCGGGAG CGATCCCGAG AGCATCGATC TCGACAAGAT TCGTCCCAGC CGCTTCGAGG CGCAATTGCT GCGAGCCGCC GCCAATGCGC CCATTCAGGG GTCAAGTGCC GACATCATCA AAGTGGCGAT GGTGCAGTTG CAGGCGCTGT TGCAGTCCTA TCAAGCGCGG ATGCTGTTGC AAGTCCATGA CGAACTCGTC CTAGAACTGC CGCCGGAGGA ATGGGACAGC CTCGCACCCC AAATCCAGCA GACGATGGAG CAGGCAGTTC AGCTGACCGT GCCGCTGGCT GTGGAACTGC ATGCAGGCCA CAACTGGATG GAGGCAAAGT AA
|
Protein sequence | MSVDSPLLLL VDGHSLAFRS YYAFAKGRDG GLRTRTGIPT SVCFGFLKAL LEVMEQQQPK ALAIAFDLGG PTFRHEADEN YKANRDEAPE DFKIDTDNLV ALLQTLNLPI LVEPGYEADD LLGTVAQRGA EAGYRVRILS GDRDLFQLVD PEGAIRVLYL GNTFGRSANR EAAREIDPAA VIDKLGVPPE QVIDFKALCG DSSDNIPGVK GIGPKTAVDL LQAWGDLDRI YDNLEAIKPA VRKKLESDRE AAYHSRKLAQ IVTDIPLAID WDHYALTGFD EQEVLPWLEK LELQAFRRQV DRLQQLFGGQ PPTVEALSDE SLDFWTAEET AAQQPRWPQL QPQIITTAAA LTDLVTLLEQ RDSPEAIVAW DTETTDLDPR LAQLVGIGCA WGEEPDQLAY IPLGHEEGEQ LPLQTVLTAL RPILESDRHP KALQNAKFDR LILRHQGIEL AGVVFDTMLA SYLLNPSLGH SLDALADRWL KLQTRSYSDL VPKGKTIAQV AIAAVAQYCG SDVHVVQRLI PLLKAGIAES PALQFLLETV ELPLEAVLAE MEDRGIRIDE GYLAELSEHL KGELDRLEGA AHTLAGDRFN LGSPKQLSEL LFEKLGLNVK KSRKTKTGYS TDAAVLEKLQ GDHPIIDLIL EHRTLAKLKS TYVDALPSLV AADGRIHTDF NQAVTATGRL SSSNPNLQNI PIRTEFSRQI RKAFLPREGW LLAAADYSQI ELRILAHLSQ EPVLLEAYRQ GDDVHRLTAS LLFDREEITS EERRIGKIIN FGVIYGMGAQ RFARETGSST KEAQGFIDRF YDRYPRVFTY LQSLERQAIA RGYVETVLGR RRYFDFEDTG LQKLRGSDPE SIDLDKIRPS RFEAQLLRAA ANAPIQGSSA DIIKVAMVQL QALLQSYQAR MLLQVHDELV LELPPEEWDS LAPQIQQTME QAVQLTVPLA VELHAGHNWM EAK
|
| |
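
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and summarized by GC fraction. It assumes the gene sequence is shown 5' to 3' in coding orientation (it begins with ATG and ends with TAA, consistent with the protein sequence). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only short excerpts copied from the two ends of the gene sequence are used here.

```python
# Amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard table;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the start and end of the gene sequence.
head = "ATGTCTGTAG ACTCTCCCCT CTTGCTTCTC"
tail = "GAGGCAAAGT AA"
print(translate(head))  # MSVDSPLLLL
print(translate(tail))  # EAK
```

The two excerpts translate to the first ten and last three residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give the value to compare against the GC content field.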