Gene Information |
Locus tag | Syncc9902_0693 |
Symbol | |
ID | 3743976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | - |
Start bp | 701399 |
End bp | 704395 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637770865 |
Product | DNA polymerase I |
Protein accession | YP_376705 |
Protein GI | 78184270 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
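
The Gene Length and Protein Length rows follow from the Start bp and End bp rows. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues: codon count minus the terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(701399, 704395)
print(length)                     # 2997
print(protein_length_aa(length))  # 998
```

Both values agree with the record: 2997 bp and 998 aa.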
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.433628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAGG CCGTCACCAA GCCCCTACTC CTGCTTGTGG ATGGCCACTC ACTGGCTTTT CGCAGTTTTT ACGCCTTCAG CAAAGGCGGC GAAGGGGGCC TTGCCACAAA AGATGGACGG CCAACGAGTG TCACTTATGG CTTTTTGAAG TCGCTCCTGG ACACCGGCAA AACGTTCAAG CCCCAGGGCG TCGCGATTGC CTTCGACACC GCGGAACCCA CATTCCGCCA CAAAGCGGAC ACGAATTACA AAGCCCACCG GGACGTGGCA CCAGAGGTTT TCTTCCAAGA CCTCGAACAG TTGCAGCAGA TCCTTAGCGA TCAAATGAAG TTGCCCCTGT GCATGGCACC GGGATACGAA GCCGATGACG TGCTCGGCAC CCTTGCCAAC CGTGCCGCCG ACGCCGGCTG GGGGGTTCGG ATTCTCTCGG GAGACCGTGA TCTTTTTCAG CTCGTCGACA ACAGCCGCGA CATCGCGGTG CTGTACATGG GTGGCGGGCC CTATGCCAAG GGAAGCGGGC CAACGCTGAT TCAAGAGGAC GGCGTTGTGA GCAAGTTGGG AGTCATGCCC GACAAGGTGG TGGACCTCAA GGCCCTCACA GGAGACAGCT CCGACAACAT CCCAGGGGTG CGGGGCGTTG GTCCCAAAAC CGCCATCAAC CTGCTGAAAG ACAACATCGA CCTGGACGCG GTGTACGCGA CCTTGGCGGA GGTGGAAGCT GAAGGTCCGA AGGCCAGTCG CGGGGCCATC AAAGGGGCCT TGGTCGGCAA GCTGCGCTCC GATCGCGACA ACGCCTATCT GTCGAGAATG CTGGCGGAAA TCCTCGTGGA CATTCCCCTC CCGAAGGATC CAGAACTCGG CCTACAGCAG GTTGATGCTG AAGGGTTGAG CGATCGCCTC GAAGACCTCG AGCTCAATAG CCTCGTCCGT CAAGTCGGTG GGTTTGTGGC GACCTTTTCC ACCGGTGGGT ATGGCGCCAA TGCCAGTAGC GAAGCTGAGA AGACCCCTAA ACGCTCCCCA GCGAAAACGA ACCCTGATTC CACCAGTTCA ATCGGGAGTG CAGCCGCTCC GCAGCAAGGT TCAGGGGAAG CGGAGGTTGG AGCAATCCCT CCCCTCCAAC CGCAGCTGAT CGAGAGCAAC GAAGCCCTGC AAGCCCTTGT GCAACGGCTG ATGGACTGCA CCGATCCCAA AGCGCCAATT GCCGTCGACA CCGAAACCAC AGATCTCAAC CCATTCAAGG CCGAACTCGT GGGGGTAGGG GTGTGCTGGG GCGAGGCCAA CGATGCCCTC GCCTACATCC CCATCGGCCA CAAGCCCCCC AGCGAACTCT CCGAGGCAAC ACCACCCCAG CAGCTTCCGC TGGAAACGGT GCTAATGGCG CTCTCGCCCT GGCTTGCGAG TCCGCAGCAC CCCAAAGCCC TACAGAACGC CAAATACGAC CGACTCATCC TGCTACGCCA TGGGCTCACG CTCAACGGCG TCGTGATCGA CACCCTGCTG GCGGATTACC TGCGGGATGC CGCGGCCAAG CACGGTTTGG ACCTGATGAG CGAACGGGAG TTTGGCTTCC GCCCCACCAC CTACGGCGAT CTTGTTGGCA AGAAACAAAC CTTTGCCGAT GTGGCAATCG AACCTGCCAG CCTCTATTGC GGCATGGACG TGCACGTCAC CAGGCGATTG GCCCTGCAGC TCCGCCAAAC CTTGCAAGCG ATGGGCCCCC AATTACTGCC TCTCCTGGAG GGCGTTGAAC AACCGCTGGA ACCCGTTCTG GCCCAAATGG AGGCCACCGG CATCCGCATC 
GATGTGCCTT ACCTCAAAAC ACTGTCGGAC GAATTAGGCA GCACCCTGAA CCGGTTAGAA ACCGACGCCA AACAGGTCGC TGAGGTGGAC TTCAACCTCG CCTCTCCAAA GCAACTTGGG GAATTGTTGT TCGACACGCT CGGACTGGAT CGCAAGAAGT CGAGGCGCAC GAAAACCGGA TTCAGCACTG ATGCCACCGT TCTCGAAAAA CTCGAAAACG ACCATCCCGT CGTTCCTCTC GTGCTGGAGC ACCGCGTCTT GAGCAAGCTC AAGAGCACCT ACGTTGATGC TCTGCCCCAA CTCGTCGAAG CGGAAACCGG CCGCGTCCAC ACCGACTTCA ACCAAGCCGT AACAGCGACG GGCCGCTTGA GCAGCAGCAA CCCAAATCTG CAAAACATTC CCGTTCGCAC GGAATACAGC CGTCGCATCC GCAAAGCGTT CCTCCCCCAA GAGGGCTGGA CACTGCTCAG CGCCGACTAC TCCCAAATCG AACTCCGAAT CCTCACCCAC CTCTCCGGGG AAGAGGTGCT GCAGGAGGCC TACAGCACGG GCGACGACGT GCACGCACTC ACCGCACGCT TACTGCTGGA TAAAGACGAC GTGAGCGCCG ATGAACGTCG CCTCGGAAAA ACGATCAACT TCGGGGTGAT TTACGGCATG GGCGCCCAAC GCTTTGCGCG GGAAACAGGG GTGAGCTCGG CCGAAGCGAA GGAGTTCCTC ACCAAATACA AACAGCGCTA CCCCAAAGTG TTTGCCTTCC TTGAGCTTCA GGAGCGGCTC GCCCTAAGCC GCGGCTACGT GGAAACAATC TTGGGTCGTC GTCGCCCATT TCATTTCGAT CGCAACGGCC TCGGCCGCTT ATTGGGAAAA GATCCCCTCG AAATTGATCT CGATGTGGCA CGGCGAGGTG GGATGGAAGC ACAACAACTG CGCGCCGCCG CCAACGCCCC CATTCAGGGC TCCAGCGCCG ACATCATCAA GGTGGCGATG GTGCAATTAC AAGCGGTGCT TCTTAGCCAA GGGATCCCCG CCCGCCTACT CCTGCAGGTG CATGACGAAC TGGTCCTCGA AGTGGCGCCA GACGCATTGG ACACCACGCG AAACCTTGTG GTGAACACCA TGGAAAACGC CGTCAAGCTC AGCGTGCCTC TCGTGGTGGA AACCGGCGTT GGTCGCGACT GGATGGAAGC GAAATAA
|
Protein sequence | MPEAVTKPLL LLVDGHSLAF RSFYAFSKGG EGGLATKDGR PTSVTYGFLK SLLDTGKTFK PQGVAIAFDT AEPTFRHKAD TNYKAHRDVA PEVFFQDLEQ LQQILSDQMK LPLCMAPGYE ADDVLGTLAN RAADAGWGVR ILSGDRDLFQ LVDNSRDIAV LYMGGGPYAK GSGPTLIQED GVVSKLGVMP DKVVDLKALT GDSSDNIPGV RGVGPKTAIN LLKDNIDLDA VYATLAEVEA EGPKASRGAI KGALVGKLRS DRDNAYLSRM LAEILVDIPL PKDPELGLQQ VDAEGLSDRL EDLELNSLVR QVGGFVATFS TGGYGANASS EAEKTPKRSP AKTNPDSTSS IGSAAAPQQG SGEAEVGAIP PLQPQLIESN EALQALVQRL MDCTDPKAPI AVDTETTDLN PFKAELVGVG VCWGEANDAL AYIPIGHKPP SELSEATPPQ QLPLETVLMA LSPWLASPQH PKALQNAKYD RLILLRHGLT LNGVVIDTLL ADYLRDAAAK HGLDLMSERE FGFRPTTYGD LVGKKQTFAD VAIEPASLYC GMDVHVTRRL ALQLRQTLQA MGPQLLPLLE GVEQPLEPVL AQMEATGIRI DVPYLKTLSD ELGSTLNRLE TDAKQVAEVD FNLASPKQLG ELLFDTLGLD RKKSRRTKTG FSTDATVLEK LENDHPVVPL VLEHRVLSKL KSTYVDALPQ LVEAETGRVH TDFNQAVTAT GRLSSSNPNL QNIPVRTEYS RRIRKAFLPQ EGWTLLSADY SQIELRILTH LSGEEVLQEA YSTGDDVHAL TARLLLDKDD VSADERRLGK TINFGVIYGM GAQRFARETG VSSAEAKEFL TKYKQRYPKV FAFLELQERL ALSRGYVETI LGRRRPFHFD RNGLGRLLGK DPLEIDLDVA RRGGMEAQQL RAAANAPIQG SSADIIKVAM VQLQAVLLSQ GIPARLLLQV HDELVLEVAP DALDTTRNLV VNTMENAVKL SVPLVVETGV GRDWMEAK
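
The gene sequence above is shown in blocks of ten bases. It begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the block spacing, compute GC content, and translate the sequence. It assumes the codon assignments of translation table 11, which match the standard code for elongation codons. It does not handle the alternative start codons that table 11 allows, which is fine here because the gene starts with ATG. The helper names are illustrative, and only short excerpts of the sequence are used.

```python
# Codon table in the conventional TCAG ordering used for NCBI genetic codes.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
excerpt = clean_sequence("ATGCCTGAGG CCGTCACCAA GCCCCTACTC")
print(translate(excerpt))             # MPEAVTKPLL
print(f"{gc_fraction(excerpt):.3f}")  # 0.633 (excerpt only, not the whole gene)
```

The translated excerpt matches the first ten residues of the protein sequence. Running the same functions on the full gene sequence would be the way to check the reported protein against the DNA.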