Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2689 |
Symbol | |
ID | 4896469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2837611 |
End bp | 2840424 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640113290 |
Product | DNA polymerase I |
Protein accession | YP_001044563 |
Protein GI | 126463449 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.283059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.26219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTCG GCAAGGGCCA CCATCTGCAT CTCATCGACG GTTCGGCCTT CATCTTCCGG GCCTACCATG CGCTGCCGCC GCTGACGCGG AAGTCGGACG GCCTGCCCGT GGGCGCGGTC GCGGGCTTCT GCAACATGCT GTTCCGCTAT GTCGAGAACA ACAAGGGCCC GGACGCGCCG ACCCATGTGG CGGTGATCTT CGACCATTCC TCGAAGACCT TCCGCAACGA GATCTACCCG CTCTACAAGG CGCAGCGCCC CGAGCCGCCC GAGGATCTGC GCCCGCAGTT CCCGCTCACG CGCGAGGCGA CACGGGCCTT CAACATCGCC TGCATCGAGA CCGAAGGGTT CGAGGCCGAC GACATCATCG CGGCCCTCTC GTGCAAGGCG CGCGACGCGG GCGGCTCGGT CACGATCCTG TCCTCGGACA AGGATCTGAT GCAGCTGGTG GGCGACGGCG TCGAGATGCT CGACCCGATG AAGAACAAGC GCATCGGCCG CGACGAGGTG TTCGAGAAAT TCGGCGTCTA CCCCGAGCGC GTGGTGGATG TGCAGGCGCT GGCGGGCGAT TCCATCGACA ACGTGCCGGG TGCGCCGGGC ATCGGCATCA AGACGGCCGC GCTGCTGATC CAGGAATACG GCGATCTCGA GAGCCTGCTC GCGCGGGCGG GCGAGATCAA GCAGCCCAAG CGCCGCGCCG CCATCGAGGA GAATGCCGAG CAGATCCGCA TCTCGAAGCG GCTGGTGGCG CTCGACTGCA ACACCCCGCT TACGTTCTCG CTCGAGGATC TCGAGGTGCG TCTGCCGGTG GCCGACGACC TTCTGGGCTT TCTCAACCGG ATGGAATTCC GCACCCTGAC GACCCGTGTG GCCGCGAGCC TGAAGGTCGA GCCGCCTCCC GCCCCCACCG CCGCCGTCCT CCGGAACGAG GGCGTGCCCG AGATCGTCGA GCAGGTCGAG GCGGCGCTGC CCTTCGACAG CGCCTCCTAT GCCTGCGTCC GCGACGCCGA GGCGCTGGCC GCCTGGATCG CGCGCATCCG CGATCTGGGC CATGTCGCCA TCGACACCGA GACCACCAGC CTCGACGAGA TGCGCGCCGA ACTCGTGGGG ATCTCGCTCT GCGTCGAGGC GGGGGCCGCC TGCTACATCC CCCTCGGCCA CCGGGCTGGC GGCGGCGATC TCTTCGGCGC CTCGGAGCTC GTGGCCGACC AGATGCCGCT GGGTCTGGCG CTGTCGATGC TGAAGCCCGT GCTCGAGGAT GAGTCCATCC TCAAGATCGG CCAGAACATG AAGTACGACG CGAAGATCCT CGCGCGTCAC GGCATCCGGG TGGCGCCGAT CGACGATACG ATGCTCATGT CCTACGCGAT GCACGCGGGC CGCCACGGCC ACGGGATGGA CGAGCTCTGC GACACCTACC TCGGCCACAA ACCCATCGCG ATCAAGACGC TGCTGGGTTC GGGCAAGAGC CAGATCACCT TCGACCGGGT GCCGGTCGAG CAGGCCGTCT GCTACGCGGC CGAGGATGCG GATGTGACCT TCCGGCTCTG GAAGCTGTTC AAGCCGCAGC TGCACCGCGC CCGGGTGACG ACGGTCTACG AGACGCTCGA GAGGCCGCTG GTGCCGGTGC TGGCCGAGAT GGAGATGGCG GGCGTTCAGG TCGACCGCGA CACGCTCTCG CGCATGTCGA ACGCCTTCGC GCAGAAGATG GCGGGACTCG AGGCCGAGAT CCACGCGCTG GCGGGCGGCC CGTTCAACGT GGGCAGCCCC AAGCAGCTGG GCGAGATCCT GTTCGAGAAG ATGGGCCTGC CCGGGGGTCA GAAGGGCAAG AACGGCGCCT GGGGCACCGG GGCCGACGTG CTCGAGGATC TGGCGGCCGA GGGGCACGAC CTGCCCGCGC GCGTGCTCGA TTGGCGCCAG CTCTCCAAGC TGAAATCGAC CTACACGGAT GCGCTGCAGG AGCATATCCA CCCCGAGACG GGCCGCGTCC ACACCTCCTA TTCCATCGCC GGAGCGAACA CGGGCCGCCT CGCCTCGACC GATCCGAACC TCCAGAACAT CCCCGTGCGC AGCGAGGAGG GCCGCCGCAT CCGCGAGGCC TTCGTGGCGC CGCCGGGCAA GCTTCTGGTC AGCCTCGACT ACAGCCAGAT CGAGCTGCGC ATCCTCGCCC ATATCGCCGA CATTCCGGCG CTGAAGCAGG CCTTCCGCGA GGGGCACGAC ATCCATGCGA TGACCGCCTC CGAGATGTTC AACGTGCCGC TCGAAGGCAT GGATCCGATG ATCCGGCGGC AGGCCAAGGC CATCAATTTC GGCGTGATCT ACGGGATCTC GGGCTTCGGC CTCGCACGCA ACCTGCGCAT CCCGCGGGCT GATGCGCAGG GCTTCATCGA CCGCTATTTC GAACGCTTCC CCGGCATCCG CGGCTATATG GACGAGACCA TCTCCTTCGC CAAGGCGAAC GGTCATGTGG AAACCCTGTT CGGACGCAGG ATCAACACGC CCGAGATCAA TGCCAAGGGG CCCGGCGCGG GCTTCGCGCG CCGCGCGGCC ATCAACGCGC CGATCCAGGG CACGGCGGCC GACGTGATCC GCCGGGCGAT GATCCGGATG CCGAAGGCGA TCCGGGGCAT GCCCGCGACC ATGCTGCTGC AGGTCCATGA CGAACTGCTG TTCGAGGTGG AGGAAGAGGC CGCTGACGCG CTCATCGAGC GCGTGCGCGA GGTGATGGAA GGTGCGGCCG ACCCGGCGGT GAAGCTCACC GTGCCGCTGA CGGTGGAGGC GGGCCGCGGC CTGAACTGGG CCGCCGCCCA CTGA
|
Protein sequence | MTFGKGHHLH LIDGSAFIFR AYHALPPLTR KSDGLPVGAV AGFCNMLFRY VENNKGPDAP THVAVIFDHS SKTFRNEIYP LYKAQRPEPP EDLRPQFPLT REATRAFNIA CIETEGFEAD DIIAALSCKA RDAGGSVTIL SSDKDLMQLV GDGVEMLDPM KNKRIGRDEV FEKFGVYPER VVDVQALAGD SIDNVPGAPG IGIKTAALLI QEYGDLESLL ARAGEIKQPK RRAAIEENAE QIRISKRLVA LDCNTPLTFS LEDLEVRLPV ADDLLGFLNR MEFRTLTTRV AASLKVEPPP APTAAVLRNE GVPEIVEQVE AALPFDSASY ACVRDAEALA AWIARIRDLG HVAIDTETTS LDEMRAELVG ISLCVEAGAA CYIPLGHRAG GGDLFGASEL VADQMPLGLA LSMLKPVLED ESILKIGQNM KYDAKILARH GIRVAPIDDT MLMSYAMHAG RHGHGMDELC DTYLGHKPIA IKTLLGSGKS QITFDRVPVE QAVCYAAEDA DVTFRLWKLF KPQLHRARVT TVYETLERPL VPVLAEMEMA GVQVDRDTLS RMSNAFAQKM AGLEAEIHAL AGGPFNVGSP KQLGEILFEK MGLPGGQKGK NGAWGTGADV LEDLAAEGHD LPARVLDWRQ LSKLKSTYTD ALQEHIHPET GRVHTSYSIA GANTGRLAST DPNLQNIPVR SEEGRRIREA FVAPPGKLLV SLDYSQIELR ILAHIADIPA LKQAFREGHD IHAMTASEMF NVPLEGMDPM IRRQAKAINF GVIYGISGFG LARNLRIPRA DAQGFIDRYF ERFPGIRGYM DETISFAKAN GHVETLFGRR INTPEINAKG PGAGFARRAA INAPIQGTAA DVIRRAMIRM PKAIRGMPAT MLLQVHDELL FEVEEEAADA LIERVREVME GAADPAVKLT VPLTVEAGRG LNWAAAH
|
| |