Gene Information |
Locus tag | RSP_1028 |
Symbol | |
ID | 3720996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 2790472 |
End bp | 2793285 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640072257 |
Product | DNA polymerase I |
Protein accession | YP_354113 |
Protein GI | 77464609 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.447207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACGTTCG GCAAGGGCCA CCATCTGCAT CTCATCGACG GTTCGGCCTT CATCTTCCGG GCCTACCATG CGCTGCCGCC GCTGACGCGG AAGTCGGACG GCCTGCCCGT GGGCGCGGTC GCGGGCTTCT GCAACATGCT GTTCCGCTAT GTCGAGAACA ACAAGGGCCC CGACGCGCCG ACCCATGTGG CGGTGATCTT CGACCATTCC TCGAAGACCT TCCGCAACGA GATCTACCCG CTCTACAAGG CGCAGCGCCC CGAGCCGCCC GAGGATCTGC GCCCGCAGTT CCCGCTCACG CGCGAGGCGA CGCGGGCCTT CAACATTGCC TGCATCGAGA CCGAAGGGTT CGAGGCCGAC GACATCATCG CGGCCCTCTC GTGCAAGGCG CGCGACGCGG GCGGCTCGGT CACGATCCTG TCCTCGGACA AGGATCTGAT GCAGCTGGTG GGCGACGGGG TCGAGATGCT CGACCCGATG AAGAACAAGC GGATCGGCCG CGACGAGGTG TTCGAGAAAT TCGGCGTCTA CCCCGAGCGC GTCGTGGATG TGCAGGCGCT GGCGGGCGAT TCCATCGACA ACGTGCCGGG CGCGCCGGGC ATCGGCATCA AGACGGCCGC GCTGCTGATC CAGGAATACG GCGATCTCGA GAGCCTGCTC GCGCGGGCGG GCGAGATCAA GCAGCCCAAG CGCCGCGCCG CCATCGAGGA GAATGCCGAA CAGATCCGCA TCTCGAAGCG GCTGGTGGCG CTTGACTGCA ACACCCCGCT TACGTTCTCG CTCGAGGATC TCGAGGTGCG TCTGCCGGTG GCCGACGACC TTCTGGGCTT TCTCAACCGG ATGGAATTCC GCACCCTGAC GACCCGTGTG GCCGCGAGCC TGAAGGTCGA GCCGCCTCCC GCCCCCACCG CCGCCGTCCT CCGGAACGAG GGCGTGCCCG AGATCGTCGA GCAGGTCGAG GCGGCGCTGC CTTTCGACAG CGCCTCCTAT GCCTGCATCC GCGATGCCGA GGCGCTGGCC GCCTGGATCG CGCGCATCCG CGACCTGGGC CATGTCGCCA TCGACACCGA GACCACCAGC CTCGACGAGA TGCGTGCCGA ACTGGTGGGG ATCTCGCTCT GCGTCGAGGC GGGGGCCGCC TGCTACATCC CGCTCGGCCA CCGGGCTGGC GGCGGCGATC TCTTCGGCGC CTCGGAGCTC GTGGCCGACC AGATGCCGCT GGGTCTGGCG CTGTCGATGC TGAAGCCGGT GCTCGAGGAT GAGTCCATCC TCAAGATCGG CCAGAACATG AAGTACGACG CGAAGATCCT CGCGCGTCAC GGCATCCGGG TGGCGCCGAT CGACGATACG ATGCTCATGT CCTACGCGAT GCACGCGGGC CGCCACGGCC ACGGGATGGA CGAGCTCTGC GACACCTACC TTGGCCACAA ACCCATCGCG ATCAAGACGC TCCTGGGTTC GGGCAAGAGC CAGATCACCT TCGACCGGGT GCCGGTCGAG CAGGCCGTCT GCTATGCGGC CGAGGATGCG GATGTGACCT TCCGGCTCTG GAAGCTGTTC AAGCCGCAGC TGCACCGCGC CCGGGTGACG ACGGTCTACG AGACGCTCGA GCGGCCGCTG GTGCCGGTGC TGGCCGAGAT GGAGATGGCG GGCGTTCAGG TCGACCGCGA CACGCTCTCG CGGATGTCGA ACGCCTTCGC GCAGAAGATG GCGGGGCTCG AGGCCGAGAT CCACGCGCTG GCGGGCGGCC CGTTCAACGT GGGCAGCCCC AAGCAGCTGG GCGAGATCCT GTTCGAGAAG 
ATGGGCCTGC CCGGGGGCCA GAAGGGCAAG AACGGCGCCT GGGGCACCGG GGCCGACGTG CTCGAGGATC TGGCGGCCGA GGGGCACGAC CTGCCCGCGC GCGTGCTCGA CTGGCGCCAG CTCTCCAAGC TGAAATCGAC CTACACCGAC GCGCTGCAGG AGCATATCCA CCCCGAGACG GGCCGCGTCC ACACCTCCTA TTCCATCGCC GGAGCGAACA CGGGCCGCCT CGCCTCGACC GATCCGAACC TCCAGAACAT CCCCGTGCGC AGCGAGGAGG GCCGCCGCAT CCGCGAGGCC TTCGTGGCGC CGCCGGGCAA GCTTCTGGTC AGCCTCGACT ACAGCCAGAT CGAGCTGCGC ATCCTCGCCC ATATCGCCGA CATTCCGGCG CTGAAGCAGG CCTTCCGCGA GGGGCACGAC ATCCATGCGA TGACCGCCTC CGAGATGTTC AACGTGCCGC TCGAAGGCAT GGATCCGATG ATCCGGCGGC AGGCCAAGGC CATCAACTTC GGCGTGATCT ACGGGATCTC GGGCTTCGGC CTCGCACGCA ACCTGCGCAT CCCGCGGGCC GATGCGCAGG GCTTCATCGA CCGCTATTTC GAGCGTTTCC CCGGCATCCG CGGCTATATG GACGAGACCA TCGCCTTCGC CAAGGCGAAC GGTCATGTGG AAACCCTGTT CGGACGCAGG ATCAACACGC CCGAGATCAA TGCCAAGGGG CCCGGCGCGG GCTTCGCGCG CCGCGCGGCC ATCAACGCGC CGATCCAGGG CACGGCGGCC GACGTGATCC GCCGGGCGAT GATCCGGATG CCGAAGGCGA TCCGGGGCAT GCCCGCGACC ATGCTGCTGC AGGTCCATGA CGAACTGCTG TTCGAGGTGG AGGAAGAGGC CGCTGACGCG CTCATCGAGC GCGTGCGCGA GGTGATGGAA GGTGCGGCCG ACCCGGCGGT GAAGCTCACC GTGCCGCTGA CGGTGGAGGC GGGCCGCGGC CTGAACTGGG CCGCCGCCCA CTGA
Protein sequence | MTFGKGHHLH LIDGSAFIFR AYHALPPLTR KSDGLPVGAV AGFCNMLFRY VENNKGPDAP THVAVIFDHS SKTFRNEIYP LYKAQRPEPP EDLRPQFPLT REATRAFNIA CIETEGFEAD DIIAALSCKA RDAGGSVTIL SSDKDLMQLV GDGVEMLDPM KNKRIGRDEV FEKFGVYPER VVDVQALAGD SIDNVPGAPG IGIKTAALLI QEYGDLESLL ARAGEIKQPK RRAAIEENAE QIRISKRLVA LDCNTPLTFS LEDLEVRLPV ADDLLGFLNR MEFRTLTTRV AASLKVEPPP APTAAVLRNE GVPEIVEQVE AALPFDSASY ACIRDAEALA AWIARIRDLG HVAIDTETTS LDEMRAELVG ISLCVEAGAA CYIPLGHRAG GGDLFGASEL VADQMPLGLA LSMLKPVLED ESILKIGQNM KYDAKILARH GIRVAPIDDT MLMSYAMHAG RHGHGMDELC DTYLGHKPIA IKTLLGSGKS QITFDRVPVE QAVCYAAEDA DVTFRLWKLF KPQLHRARVT TVYETLERPL VPVLAEMEMA GVQVDRDTLS RMSNAFAQKM AGLEAEIHAL AGGPFNVGSP KQLGEILFEK MGLPGGQKGK NGAWGTGADV LEDLAAEGHD LPARVLDWRQ LSKLKSTYTD ALQEHIHPET GRVHTSYSIA GANTGRLAST DPNLQNIPVR SEEGRRIREA FVAPPGKLLV SLDYSQIELR ILAHIADIPA LKQAFREGHD IHAMTASEMF NVPLEGMDPM IRRQAKAINF GVIYGISGFG LARNLRIPRA DAQGFIDRYF ERFPGIRGYM DETIAFAKAN GHVETLFGRR INTPEINAKG PGAGFARRAA INAPIQGTAA DVIRRAMIRM PKAIRGMPAT MLLQVHDELL FEVEEEAADA LIERVREVME GAADPAVKLT VPLTVEAGRG LNWAAAH
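The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon, which is consistent with the listed 2814 bp and 937 aa. The function names (`gene_length`, `protein_length`, `clean_sequence`, `gc_fraction`) are hypothetical helpers introduced for this example.

```python
# Values copied from the record above.
START_BP = 2790472
END_BP = 2793285


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, ASSUMING one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(display_text: str) -> str:
    """Drop the spaces and line breaks that split the displayed sequence into blocks."""
    return "".join(display_text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide string."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n)                  # 2814, matches "Gene Length"
    print(protein_length(n))  # 937, matches "Protein Length"
    # First three displayed blocks of the gene sequence, joined into one string.
    print(clean_sequence("ATGACGTTCG GCAAGGGCCA CCATCTGCAT"))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the listed GC content. The result is not asserted here.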