Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_0200 |
Symbol | |
ID | 5082139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 188763 |
End bp | 191576 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640481755 |
Product | DNA polymerase I |
Protein accession | YP_001166415 |
Protein GI | 146276256 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.932691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTCG GCAAGGGCCA CCATCTGCAT CTCATCGACG GCTCGGCCTT CATCTTCCGC GCCTATCACG CGCTGCCTCC GCTGACGCGG AAATCCGACG GCCTGCCGGT GGGCGCCGTC GCGGGCTTCT GCAACATGCT GTTCCGCTAT GTCGAGAACA ACAAGGGCCC CGACGCGCCG ACCCATGTCG CGGTGATCTT CGACCATTCC TCGAAGACCT TCCGCAACGA GATCTACCCC CTTTACAAGG CCCAACGCCC CGAGCCGCCC GAGGATCTGC GCCCGCAGTT TCCCCTGACG CGCGAGGCGA CGCGGGCCTT CAACATCTCC TGCATCGAGA CCGAGGGCTT CGAGGCCGAC GACATCATCG CGGCGCTCGC CTGCCGTGCG CGCGAGGCGG GCGGCACCTG CACGATCCTG TCCTCGGACA AGGACCTGAT GCAGCTGGTG GGCGACGGGG TCGAGATGCT CGACCCGATG AAGAACAAGC GCATCGGCCG CGACGAGGTG ATCGAGAAGT TCGGCGTCCC GCCTGAAAGG GTGGTGGACG TGCAGGCGCT GGCGGGCGAT TCCGTGGACA ACGTGCCGGG CGCGCCGGGC ATCGGGATCA AGACCGCGGC GCTTCTGATC CAGGAATATG GCGACCTCGA AGGCCTGCTT GCGCGGGCGG GCGAGATCAA GCAGCCCAAG CGCCGCGCCG CCATCGAGGA CAATGCCGAG CAGATCCGCA TCTCGAAGCG GCTGGTGGAA CTGGACTGCA ACACGCCGCT CGACTTTGCG CTCGAGGATC TCGAGGTGCG GCTGCCGGTG GCGGACGAGC TTCTGGGCTT CCTCAACCGG ATGGAGTTCC GCACGCTGAC GACCCGTGTG GCCGCGAGCC TGAAGGTCGA GCCGCCCCCT GCCCCCACGG CCTTCGTGGT CCAGAACGAG AGCGTGCCCG AGATCGTCGA GCAGGTCGAG GCGGCGCTGC CCTTCGACAG CGCCTCCTAT GCCTGCGTGC GCGACGCCGA GGCGCTGGCC GCATGGATCG CGCGCATCCG CGAGCGGGGC CATGTCGCCA TCGACACCGA GACCACCAGC CTCGACGAGA TGCGCGCCGA ACTGGTGGGC ATCTCGCTCT GCGTCGAGGC GGGATCGGCC TGCTACATCC CGCTCGGCCA CCGCGCGGGC GGGGGTGATC TGTTCGGCAG CTTGGATCTC GTGGCCGACC AGATCCCGCT CGATCTGGCG CTCTCGATGC TGAAGCCGGT GCTGGAGGAT GACGCGATCC TCAAGATCGG CCAGAACATG AAATACGATG CCAAGATCTT CGCGCGGCAT GGCATCCGGG TCGCGCCGAT CGACGACACG ATGCTGATGT CCTACGCGAT GCACGCCGGC CGCCACGGCC ACGGCATGGA CGAGCTGTGC GATACCTACC TCGGCCACAA GCCCATCGCG ATCAAGACGC TTCTGGGGTC CGGCAAGAGC CAGATCACCT TCGACCGGGT GCCGGTGGAT CAGGCGGTCT GCTACGCCGC CGAGGATGCC GATGTCACGC TGCGGCTGTG GCGGCTGTTC AAGCCGCAGC TGCACCGCGC GCGGGTGACG ACGGTCTACG AGACGCTGGA GCGCCCCCTT GTCCCGGTTC TGGCCGAGAT GGAGATGGCG GGGGTGCAGG TCGACCGCGA TACGCTCTCG CGGATGTCGA ACGCCTTCGC ACAGAAGATG GCCGCGCTCG AGGCCGAGAT CCACGGCCTC GCCGGCGGCC CCTTCAACGT CGGCAGTCCC AAGCAACTGG GCGAGATCCT GTTCGAGCGG ATGAGCCTTC AGGGCGGGGT CAAGGGCAAG AACGGCGCCT GGGGCACCGG CGCCGACGTG CTGGAGGATC TGGCCGCCGA GGGCCACGAC CTGCCCGCCC GCGTGCTGGA CTGGCGACAG CTCTCCAAGC TGAAATCCAC CTATACCGAC GCGCTGCAGG AGCATATCCA TCCCGAGACG GGACGCGTTC ACACCTCCTA TTCCATTGCG GGAGCGAACA CCGGCCGCCT CGCCTCGACC GATCCGAACC TGCAGAACAT CCCCGTCCGC ACCGAGGAGG GCCGCCGCAT CCGGGAGGCC TTCGTGGCGC CGAAGGGCAA GCTTCTGGTC AGCGTCGACT ATTCCCAGAT CGAGCTGCGC ATCCTGGCCC ATATCGCCGA CATCCCGGCG CTGAAGCAGG CCTTCCGCGA AGGGCACGAC ATCCACGCCA TGACGGCCGC GGAGATGTTC AACGTGCCGC TGGAGGGGAT GGACCCGATG GTGCGCCGGC AAGCCAAGGC GATCAACTTC GGGGTGATCT ACGGCATCTC GGGCTTTGGG CTGGCGCGCA ACCTGCGCAT CCCGCGCGCC GACGCCCAGG GCTTCATCGA CCGCTATTTC GAGCGCTTCC CCGGCATCCG CGCCTACATG GACGAGACGG TGGCCTTTGC GAAGGCCCAC GGTTTCGTCC AGACCCTGTT CGGCCGCCGC ATCAACACGC CCGAGATCAA CGCCAAGGGC CCCGGTGCGG GCTTTGCCCG CCGCGCGGCG ATCAACGCGC CGATCCAGGG AACCGCGGCC GATGTGATCC GCCGCGCCAT GATCCGCATG CCGAAGGCGC TCGACGGGCT GCCGGCGAAG ATGCTGCTGC AGGTCCATGA CGAACTGCTG TTCGAGGTGG ACGAGGCGGC GGCGGACCCG GTGATCGCGC GGGTCCGCGA GGTGATGGAA AGCGCCTCCG AGCCGGCGGT GAAACTGTCG GTGCCGCTGA CGGTGGATGC GGGCCGCGGC GCCAACTGGG CCGAGGCCCA CTGA
|
Protein sequence | MTFGKGHHLH LIDGSAFIFR AYHALPPLTR KSDGLPVGAV AGFCNMLFRY VENNKGPDAP THVAVIFDHS SKTFRNEIYP LYKAQRPEPP EDLRPQFPLT REATRAFNIS CIETEGFEAD DIIAALACRA REAGGTCTIL SSDKDLMQLV GDGVEMLDPM KNKRIGRDEV IEKFGVPPER VVDVQALAGD SVDNVPGAPG IGIKTAALLI QEYGDLEGLL ARAGEIKQPK RRAAIEDNAE QIRISKRLVE LDCNTPLDFA LEDLEVRLPV ADELLGFLNR MEFRTLTTRV AASLKVEPPP APTAFVVQNE SVPEIVEQVE AALPFDSASY ACVRDAEALA AWIARIRERG HVAIDTETTS LDEMRAELVG ISLCVEAGSA CYIPLGHRAG GGDLFGSLDL VADQIPLDLA LSMLKPVLED DAILKIGQNM KYDAKIFARH GIRVAPIDDT MLMSYAMHAG RHGHGMDELC DTYLGHKPIA IKTLLGSGKS QITFDRVPVD QAVCYAAEDA DVTLRLWRLF KPQLHRARVT TVYETLERPL VPVLAEMEMA GVQVDRDTLS RMSNAFAQKM AALEAEIHGL AGGPFNVGSP KQLGEILFER MSLQGGVKGK NGAWGTGADV LEDLAAEGHD LPARVLDWRQ LSKLKSTYTD ALQEHIHPET GRVHTSYSIA GANTGRLAST DPNLQNIPVR TEEGRRIREA FVAPKGKLLV SVDYSQIELR ILAHIADIPA LKQAFREGHD IHAMTAAEMF NVPLEGMDPM VRRQAKAINF GVIYGISGFG LARNLRIPRA DAQGFIDRYF ERFPGIRAYM DETVAFAKAH GFVQTLFGRR INTPEINAKG PGAGFARRAA INAPIQGTAA DVIRRAMIRM PKALDGLPAK MLLQVHDELL FEVDEAAADP VIARVREVME SASEPAVKLS VPLTVDAGRG ANWAEAH
|
| |