Gene Rsph17025_0200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0200 
Symbol 
ID5082139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp188763 
End bp191576 
Gene Length2814 bp 
Protein Length937 aa 
Translation table11 
GC content68% 
IMG OID640481755 
ProductDNA polymerase I 
Protein accessionYP_001166415 
Protein GI146276256 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.932691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTCG GCAAGGGCCA CCATCTGCAT CTCATCGACG GCTCGGCCTT CATCTTCCGC 
GCCTATCACG CGCTGCCTCC GCTGACGCGG AAATCCGACG GCCTGCCGGT GGGCGCCGTC
GCGGGCTTCT GCAACATGCT GTTCCGCTAT GTCGAGAACA ACAAGGGCCC CGACGCGCCG
ACCCATGTCG CGGTGATCTT CGACCATTCC TCGAAGACCT TCCGCAACGA GATCTACCCC
CTTTACAAGG CCCAACGCCC CGAGCCGCCC GAGGATCTGC GCCCGCAGTT TCCCCTGACG
CGCGAGGCGA CGCGGGCCTT CAACATCTCC TGCATCGAGA CCGAGGGCTT CGAGGCCGAC
GACATCATCG CGGCGCTCGC CTGCCGTGCG CGCGAGGCGG GCGGCACCTG CACGATCCTG
TCCTCGGACA AGGACCTGAT GCAGCTGGTG GGCGACGGGG TCGAGATGCT CGACCCGATG
AAGAACAAGC GCATCGGCCG CGACGAGGTG ATCGAGAAGT TCGGCGTCCC GCCTGAAAGG
GTGGTGGACG TGCAGGCGCT GGCGGGCGAT TCCGTGGACA ACGTGCCGGG CGCGCCGGGC
ATCGGGATCA AGACCGCGGC GCTTCTGATC CAGGAATATG GCGACCTCGA AGGCCTGCTT
GCGCGGGCGG GCGAGATCAA GCAGCCCAAG CGCCGCGCCG CCATCGAGGA CAATGCCGAG
CAGATCCGCA TCTCGAAGCG GCTGGTGGAA CTGGACTGCA ACACGCCGCT CGACTTTGCG
CTCGAGGATC TCGAGGTGCG GCTGCCGGTG GCGGACGAGC TTCTGGGCTT CCTCAACCGG
ATGGAGTTCC GCACGCTGAC GACCCGTGTG GCCGCGAGCC TGAAGGTCGA GCCGCCCCCT
GCCCCCACGG CCTTCGTGGT CCAGAACGAG AGCGTGCCCG AGATCGTCGA GCAGGTCGAG
GCGGCGCTGC CCTTCGACAG CGCCTCCTAT GCCTGCGTGC GCGACGCCGA GGCGCTGGCC
GCATGGATCG CGCGCATCCG CGAGCGGGGC CATGTCGCCA TCGACACCGA GACCACCAGC
CTCGACGAGA TGCGCGCCGA ACTGGTGGGC ATCTCGCTCT GCGTCGAGGC GGGATCGGCC
TGCTACATCC CGCTCGGCCA CCGCGCGGGC GGGGGTGATC TGTTCGGCAG CTTGGATCTC
GTGGCCGACC AGATCCCGCT CGATCTGGCG CTCTCGATGC TGAAGCCGGT GCTGGAGGAT
GACGCGATCC TCAAGATCGG CCAGAACATG AAATACGATG CCAAGATCTT CGCGCGGCAT
GGCATCCGGG TCGCGCCGAT CGACGACACG ATGCTGATGT CCTACGCGAT GCACGCCGGC
CGCCACGGCC ACGGCATGGA CGAGCTGTGC GATACCTACC TCGGCCACAA GCCCATCGCG
ATCAAGACGC TTCTGGGGTC CGGCAAGAGC CAGATCACCT TCGACCGGGT GCCGGTGGAT
CAGGCGGTCT GCTACGCCGC CGAGGATGCC GATGTCACGC TGCGGCTGTG GCGGCTGTTC
AAGCCGCAGC TGCACCGCGC GCGGGTGACG ACGGTCTACG AGACGCTGGA GCGCCCCCTT
GTCCCGGTTC TGGCCGAGAT GGAGATGGCG GGGGTGCAGG TCGACCGCGA TACGCTCTCG
CGGATGTCGA ACGCCTTCGC ACAGAAGATG GCCGCGCTCG AGGCCGAGAT CCACGGCCTC
GCCGGCGGCC CCTTCAACGT CGGCAGTCCC AAGCAACTGG GCGAGATCCT GTTCGAGCGG
ATGAGCCTTC AGGGCGGGGT CAAGGGCAAG AACGGCGCCT GGGGCACCGG CGCCGACGTG
CTGGAGGATC TGGCCGCCGA GGGCCACGAC CTGCCCGCCC GCGTGCTGGA CTGGCGACAG
CTCTCCAAGC TGAAATCCAC CTATACCGAC GCGCTGCAGG AGCATATCCA TCCCGAGACG
GGACGCGTTC ACACCTCCTA TTCCATTGCG GGAGCGAACA CCGGCCGCCT CGCCTCGACC
GATCCGAACC TGCAGAACAT CCCCGTCCGC ACCGAGGAGG GCCGCCGCAT CCGGGAGGCC
TTCGTGGCGC CGAAGGGCAA GCTTCTGGTC AGCGTCGACT ATTCCCAGAT CGAGCTGCGC
ATCCTGGCCC ATATCGCCGA CATCCCGGCG CTGAAGCAGG CCTTCCGCGA AGGGCACGAC
ATCCACGCCA TGACGGCCGC GGAGATGTTC AACGTGCCGC TGGAGGGGAT GGACCCGATG
GTGCGCCGGC AAGCCAAGGC GATCAACTTC GGGGTGATCT ACGGCATCTC GGGCTTTGGG
CTGGCGCGCA ACCTGCGCAT CCCGCGCGCC GACGCCCAGG GCTTCATCGA CCGCTATTTC
GAGCGCTTCC CCGGCATCCG CGCCTACATG GACGAGACGG TGGCCTTTGC GAAGGCCCAC
GGTTTCGTCC AGACCCTGTT CGGCCGCCGC ATCAACACGC CCGAGATCAA CGCCAAGGGC
CCCGGTGCGG GCTTTGCCCG CCGCGCGGCG ATCAACGCGC CGATCCAGGG AACCGCGGCC
GATGTGATCC GCCGCGCCAT GATCCGCATG CCGAAGGCGC TCGACGGGCT GCCGGCGAAG
ATGCTGCTGC AGGTCCATGA CGAACTGCTG TTCGAGGTGG ACGAGGCGGC GGCGGACCCG
GTGATCGCGC GGGTCCGCGA GGTGATGGAA AGCGCCTCCG AGCCGGCGGT GAAACTGTCG
GTGCCGCTGA CGGTGGATGC GGGCCGCGGC GCCAACTGGG CCGAGGCCCA CTGA
 
Protein sequence
MTFGKGHHLH LIDGSAFIFR AYHALPPLTR KSDGLPVGAV AGFCNMLFRY VENNKGPDAP 
THVAVIFDHS SKTFRNEIYP LYKAQRPEPP EDLRPQFPLT REATRAFNIS CIETEGFEAD
DIIAALACRA REAGGTCTIL SSDKDLMQLV GDGVEMLDPM KNKRIGRDEV IEKFGVPPER
VVDVQALAGD SVDNVPGAPG IGIKTAALLI QEYGDLEGLL ARAGEIKQPK RRAAIEDNAE
QIRISKRLVE LDCNTPLDFA LEDLEVRLPV ADELLGFLNR MEFRTLTTRV AASLKVEPPP
APTAFVVQNE SVPEIVEQVE AALPFDSASY ACVRDAEALA AWIARIRERG HVAIDTETTS
LDEMRAELVG ISLCVEAGSA CYIPLGHRAG GGDLFGSLDL VADQIPLDLA LSMLKPVLED
DAILKIGQNM KYDAKIFARH GIRVAPIDDT MLMSYAMHAG RHGHGMDELC DTYLGHKPIA
IKTLLGSGKS QITFDRVPVD QAVCYAAEDA DVTLRLWRLF KPQLHRARVT TVYETLERPL
VPVLAEMEMA GVQVDRDTLS RMSNAFAQKM AALEAEIHGL AGGPFNVGSP KQLGEILFER
MSLQGGVKGK NGAWGTGADV LEDLAAEGHD LPARVLDWRQ LSKLKSTYTD ALQEHIHPET
GRVHTSYSIA GANTGRLAST DPNLQNIPVR TEEGRRIREA FVAPKGKLLV SVDYSQIELR
ILAHIADIPA LKQAFREGHD IHAMTAAEMF NVPLEGMDPM VRRQAKAINF GVIYGISGFG
LARNLRIPRA DAQGFIDRYF ERFPGIRAYM DETVAFAKAH GFVQTLFGRR INTPEINAKG
PGAGFARRAA INAPIQGTAA DVIRRAMIRM PKALDGLPAK MLLQVHDELL FEVDEAAADP
VIARVREVME SASEPAVKLS VPLTVDAGRG ANWAEAH