Gene Information |
Locus tag | Syncc9605_1966 |
Symbol | |
ID | 3735658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9605 |
Kingdom | Bacteria |
Replicon accession | NC_007516 |
Strand | + |
Start bp | 1785070 |
End bp | 1788030 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637776551 |
Product | DNA polymerase I |
Protein accession | YP_382262 |
Protein GI | 78213483 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
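The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions that happen to agree with the numbers in this record, not something the record states. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1785070
end_bp = 1788030
gene_length_bp = 2961
protein_length_aa = 986


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

With these assumptions, 1788030 - 1785070 + 1 gives 2961 bp, and 2961 / 3 - 1 gives 986 aa, matching the Gene Length and Protein Length fields.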
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.378529 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGG CCACCGCCAA GCCACTTCTG CTGCTTGTGG ATGGCCATTC CCTGGCATTC CGCAGCTTTT ACGCCTTCAG CAAGGGTGGA GAAGGGGGTC TGGCCACCAA AGACGGCCGG CCCACCAGCG TGACCTACGG CTTTCTCAAA GCCCTGCTGG ACAACAGCAA AACCCTGCAG CCAGAAGGCG TTGCCATTGC CTTCGACACA GCCGAGCCCA CCTTTCGCCA CAAGGCAGAC GCCAACTACA AGGCCCACCG GGATGTGGCC CCTGAGGTGT TCTTTCAGGA CCTTGAGCAG CTGCAGCAGA TCCTCGAAAC GCACCTGCAG CTCCCTCTCT GCATGGCCCC CGGCTTCGAA GCCGACGATG TGCTGGGAAC CCTGGCGAAC CGAGCGGCCG ACTCGGGATG GGGCGTACGA ATCCTTTCGG GAGACCGCGA CCTTTTCCAG CTGGTGGATG ACAGCCGCGA CATCGCGGTG CTTTACATGG GTGGAGGCCC CTACGCCAAA AGCAGTGGCC CAACACTGAT CCGCGAAGAA GGGGTGCTCG GCAAGCTCGG CGTGATACCC GACAAGGTGG TGGACCTCAA GGCCCTCACC GGTGACAGCT CCGACAACAT CCCCGGCGTG CGTGGTGTTG GCCCCAAAAC AGCAATCAAC CTGCTGAAAG ACAACAGCGA TCTCGACGCG GTGTACGCCA CCCTTGAGGA AGTGGAAGCC GAAGGGCCGA AAGCCAGCCG GGGCGCCATC AAAGGAGCTC TGAAAGAGAA GCTGCGCGCC GACCGCGACA ACGCTTACCT CTCGCGCAAG CTGGCCGAGA TTCTCGTGGA TGTTCCCCTG CCCAAGGAAC CCAGCCTGCC GTTGTCGTCG GTGGATGCTG ATGGCCTGAG CAGCTGCCTG GAAGACCTCG AGCTCAACAG CCTGCTGCGC CAGGTGGGGG GCTTTGTGGC GGCCTTTTCC GAAGGGGGCT ACGGGGCCAA TGCAGAGGCC GCTGCAGCCA AGCCCCCCCA CCGCTCAGCC ACCACAGAAC CGGCAGCTGC CGAAACCGCA ACCGAACCGG TCACCAACGA CGACGTGGGG GTGCCGGCCC TGAAACCGAA GCTGATCCAG ACCGAAACGG CCCTGGACGC CCTGGTGCAG AGGCTGATGG CCTGCACCGA CAGCAGCCTG CCTGTGGCCT TCGACACCGA GACCACCGAC CTCAACCCGT TCCGCGCCGA ACTGGTGGGC ATCGGCATCT GCTGGGGAGA GGACCTGGAC GCACTTGCCT ACATCCCGCT GGGCCACAAA GGCAGCGAAG ACAGCATCCC GGAGCAGCTG TCCTTGGAAA CCGTGCTCAC CGCCCTCGCC CCTTGGCTGG CCAGCAGCAA CCACCCCAAG ACCCTTCAGA ACGCCAAGTA CGACCGCCTG ATCCTGCTGC GGCATGGCAT TGCCCTCGAG GGGGTCGTGA TCGACACACT GCTGGCTGAT TACCTGCGGG ATGCCTCGGC CAAGCATGGT CTGGAGCTCA TGGCGGAGCG GGAATTCGGC TTCCAGCCCA CCTCATTCAC CGATCTGGTG GGCAAGAAAC AAACCTTCGC CGACGTGCCG CTGGAGCCCG CCAGCCTGTA TTGCGGCATG GATGTGCACG TCACCCGGCG CCTCGCCCTG CTGCTGCGCC ATCAGCTGGA GACCATGGGC CCGCAGCTGC TGCCGCTGCT GGAGCAGGTG GAGCAGCCAC TGGAACCAGT GCTGGCCCGG ATGGAATCCA CCGGCATCCG CATTGATGTG CCCTATCTCC AGGGCCTTTC CGAAGAAATG 
GGCTCCACCC TTCAACAGCT GGAGTCCGAC GCCAAAGCGG CCGCAGGAGT GGACTTCAAC CTGGCCTCAC CCAAGCAACT CGGAGAGCTC CTGTTCGACA CCCTCGGCCT GGATCGCAAG AAATCCCGGC GCACCAAAAC TGGCTTCAGC ACCGACGCCA CCGTGCTGGA GAAACTCGGC AACGACCACC CGGTGGTGCC TCTGGTGCTG GACCACCGGG TGCTCAGCAA GCTCAAGAGC ACGTACATCG ACGCCTTGCC GCAGCTGGTG GAGGCGGAAA CCGGGCGAGT CCACACCGAT TTCAACCAGG CGGTTACGGC AACGGGTCGG CTGAGCAGCA GCAACCCCAA CCTGCAGAAC ATTCCGGTGC GCACCGAATA CAGCCGTCGG ATCCGCAAGG CCTTCCTGCC TCAGGAAGGC TGGACCCTGC TCAGCGCTGA TTACTCACAG ATCGAGCTGC GCATCCTCAC CCACCTCTCG GGCGAAAAGG TGCTGCAGGA GGCCTACCGC GGCGGCGATG ACGTGCACGC GCTGACCGCC AGGCTGCTGC TGGACAAGGA CGAGGTCAGT CCGGACGAGC GTCGGCTGGG CAAAACGATC AACTTCGGTG TGATCTATGG CATGGGAGCC CAGCGCTTCG CCCGGGAAAC CGGCGTAAGC CAAAGCGAAG CCAAGGAGTT CCTGGTCAAG TACAAGCAGC GCTACCCGAA GGTGTTCGCC TTTCTCGAGC TGCAGGAGCG GCTCGCCCTC AGCCGCGGTT ACGTGGAAAC GATTCTCGGC CGCCGGCGGC CGTTCCACTT CGATCGCAAC GGCCTGGGGC GGCTGCTGGG CAAGGACCCC TTGGAGATCG ACCTGGATGT GGCCCGCCGC GGTGGCATGG AGGCGCAGCA ACTGCGGGCA GCTGCCAATG CCCCGATTCA GGGGTCGAGT GCCGACATCA TCAAAGTGGC GATGGTGCAG CTGCAAGACG CCCTGCTGCG TCAGGGCCTA CCGGCCCAGC TGCTGCTGCA GGTGCACGAC GAACTAGTGC TCGAAGTGGC GCCGGACGCC CTCGAGACCA CCCGAGACCT GGTGGTGCGA ACCATGGAAC AGGCCTTTGA ACTGAGTGTT CCCCTAGTGG TGGAGACCGG CGTCGGCGCG AACTGGATGG AGGCGAAATA A
Protein sequence | MPEATAKPLL LLVDGHSLAF RSFYAFSKGG EGGLATKDGR PTSVTYGFLK ALLDNSKTLQ PEGVAIAFDT AEPTFRHKAD ANYKAHRDVA PEVFFQDLEQ LQQILETHLQ LPLCMAPGFE ADDVLGTLAN RAADSGWGVR ILSGDRDLFQ LVDDSRDIAV LYMGGGPYAK SSGPTLIREE GVLGKLGVIP DKVVDLKALT GDSSDNIPGV RGVGPKTAIN LLKDNSDLDA VYATLEEVEA EGPKASRGAI KGALKEKLRA DRDNAYLSRK LAEILVDVPL PKEPSLPLSS VDADGLSSCL EDLELNSLLR QVGGFVAAFS EGGYGANAEA AAAKPPHRSA TTEPAAAETA TEPVTNDDVG VPALKPKLIQ TETALDALVQ RLMACTDSSL PVAFDTETTD LNPFRAELVG IGICWGEDLD ALAYIPLGHK GSEDSIPEQL SLETVLTALA PWLASSNHPK TLQNAKYDRL ILLRHGIALE GVVIDTLLAD YLRDASAKHG LELMAEREFG FQPTSFTDLV GKKQTFADVP LEPASLYCGM DVHVTRRLAL LLRHQLETMG PQLLPLLEQV EQPLEPVLAR MESTGIRIDV PYLQGLSEEM GSTLQQLESD AKAAAGVDFN LASPKQLGEL LFDTLGLDRK KSRRTKTGFS TDATVLEKLG NDHPVVPLVL DHRVLSKLKS TYIDALPQLV EAETGRVHTD FNQAVTATGR LSSSNPNLQN IPVRTEYSRR IRKAFLPQEG WTLLSADYSQ IELRILTHLS GEKVLQEAYR GGDDVHALTA RLLLDKDEVS PDERRLGKTI NFGVIYGMGA QRFARETGVS QSEAKEFLVK YKQRYPKVFA FLELQERLAL SRGYVETILG RRRPFHFDRN GLGRLLGKDP LEIDLDVARR GGMEAQQLRA AANAPIQGSS ADIIKVAMVQ LQDALLRQGL PAQLLLQVHD ELVLEVAPDA LETTRDLVVR TMEQAFELSV PLVVETGVGA NWMEAK
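As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first three 10-base blocks of the gene sequence and compares the result with the first ten residues of the protein. It uses only the standard codon assignments, which translation table 11 shares with the standard code for internal codons. Special handling of alternative start codons is left out as a simplification, since this gene starts with ATG. The helper names are hypothetical.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_start = "ATGCCTGAGG CCACCGCCAA GCCACTTCTG"
protein_start = "MPEATAKPLL"

print(translate(gene_start))                    # MPEATAKPLL
print(translate(gene_start) == protein_start)   # True
```

Passing the full gene sequence to `translate` in the same way should give the full protein sequence, provided the sequence text is copied without loss.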