Gene Rsph17029_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2689 
Symbol 
ID4896469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2837611 
End bp2840424 
Gene Length2814 bp 
Protein Length937 aa 
Translation table11 
GC content68% 
IMG OID640113290 
ProductDNA polymerase I 
Protein accessionYP_001044563 
Protein GI126463449 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.283059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.26219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTCG GCAAGGGCCA CCATCTGCAT CTCATCGACG GTTCGGCCTT CATCTTCCGG 
GCCTACCATG CGCTGCCGCC GCTGACGCGG AAGTCGGACG GCCTGCCCGT GGGCGCGGTC
GCGGGCTTCT GCAACATGCT GTTCCGCTAT GTCGAGAACA ACAAGGGCCC GGACGCGCCG
ACCCATGTGG CGGTGATCTT CGACCATTCC TCGAAGACCT TCCGCAACGA GATCTACCCG
CTCTACAAGG CGCAGCGCCC CGAGCCGCCC GAGGATCTGC GCCCGCAGTT CCCGCTCACG
CGCGAGGCGA CACGGGCCTT CAACATCGCC TGCATCGAGA CCGAAGGGTT CGAGGCCGAC
GACATCATCG CGGCCCTCTC GTGCAAGGCG CGCGACGCGG GCGGCTCGGT CACGATCCTG
TCCTCGGACA AGGATCTGAT GCAGCTGGTG GGCGACGGCG TCGAGATGCT CGACCCGATG
AAGAACAAGC GCATCGGCCG CGACGAGGTG TTCGAGAAAT TCGGCGTCTA CCCCGAGCGC
GTGGTGGATG TGCAGGCGCT GGCGGGCGAT TCCATCGACA ACGTGCCGGG TGCGCCGGGC
ATCGGCATCA AGACGGCCGC GCTGCTGATC CAGGAATACG GCGATCTCGA GAGCCTGCTC
GCGCGGGCGG GCGAGATCAA GCAGCCCAAG CGCCGCGCCG CCATCGAGGA GAATGCCGAG
CAGATCCGCA TCTCGAAGCG GCTGGTGGCG CTCGACTGCA ACACCCCGCT TACGTTCTCG
CTCGAGGATC TCGAGGTGCG TCTGCCGGTG GCCGACGACC TTCTGGGCTT TCTCAACCGG
ATGGAATTCC GCACCCTGAC GACCCGTGTG GCCGCGAGCC TGAAGGTCGA GCCGCCTCCC
GCCCCCACCG CCGCCGTCCT CCGGAACGAG GGCGTGCCCG AGATCGTCGA GCAGGTCGAG
GCGGCGCTGC CCTTCGACAG CGCCTCCTAT GCCTGCGTCC GCGACGCCGA GGCGCTGGCC
GCCTGGATCG CGCGCATCCG CGATCTGGGC CATGTCGCCA TCGACACCGA GACCACCAGC
CTCGACGAGA TGCGCGCCGA ACTCGTGGGG ATCTCGCTCT GCGTCGAGGC GGGGGCCGCC
TGCTACATCC CCCTCGGCCA CCGGGCTGGC GGCGGCGATC TCTTCGGCGC CTCGGAGCTC
GTGGCCGACC AGATGCCGCT GGGTCTGGCG CTGTCGATGC TGAAGCCCGT GCTCGAGGAT
GAGTCCATCC TCAAGATCGG CCAGAACATG AAGTACGACG CGAAGATCCT CGCGCGTCAC
GGCATCCGGG TGGCGCCGAT CGACGATACG ATGCTCATGT CCTACGCGAT GCACGCGGGC
CGCCACGGCC ACGGGATGGA CGAGCTCTGC GACACCTACC TCGGCCACAA ACCCATCGCG
ATCAAGACGC TGCTGGGTTC GGGCAAGAGC CAGATCACCT TCGACCGGGT GCCGGTCGAG
CAGGCCGTCT GCTACGCGGC CGAGGATGCG GATGTGACCT TCCGGCTCTG GAAGCTGTTC
AAGCCGCAGC TGCACCGCGC CCGGGTGACG ACGGTCTACG AGACGCTCGA GAGGCCGCTG
GTGCCGGTGC TGGCCGAGAT GGAGATGGCG GGCGTTCAGG TCGACCGCGA CACGCTCTCG
CGCATGTCGA ACGCCTTCGC GCAGAAGATG GCGGGACTCG AGGCCGAGAT CCACGCGCTG
GCGGGCGGCC CGTTCAACGT GGGCAGCCCC AAGCAGCTGG GCGAGATCCT GTTCGAGAAG
ATGGGCCTGC CCGGGGGTCA GAAGGGCAAG AACGGCGCCT GGGGCACCGG GGCCGACGTG
CTCGAGGATC TGGCGGCCGA GGGGCACGAC CTGCCCGCGC GCGTGCTCGA TTGGCGCCAG
CTCTCCAAGC TGAAATCGAC CTACACGGAT GCGCTGCAGG AGCATATCCA CCCCGAGACG
GGCCGCGTCC ACACCTCCTA TTCCATCGCC GGAGCGAACA CGGGCCGCCT CGCCTCGACC
GATCCGAACC TCCAGAACAT CCCCGTGCGC AGCGAGGAGG GCCGCCGCAT CCGCGAGGCC
TTCGTGGCGC CGCCGGGCAA GCTTCTGGTC AGCCTCGACT ACAGCCAGAT CGAGCTGCGC
ATCCTCGCCC ATATCGCCGA CATTCCGGCG CTGAAGCAGG CCTTCCGCGA GGGGCACGAC
ATCCATGCGA TGACCGCCTC CGAGATGTTC AACGTGCCGC TCGAAGGCAT GGATCCGATG
ATCCGGCGGC AGGCCAAGGC CATCAATTTC GGCGTGATCT ACGGGATCTC GGGCTTCGGC
CTCGCACGCA ACCTGCGCAT CCCGCGGGCT GATGCGCAGG GCTTCATCGA CCGCTATTTC
GAACGCTTCC CCGGCATCCG CGGCTATATG GACGAGACCA TCTCCTTCGC CAAGGCGAAC
GGTCATGTGG AAACCCTGTT CGGACGCAGG ATCAACACGC CCGAGATCAA TGCCAAGGGG
CCCGGCGCGG GCTTCGCGCG CCGCGCGGCC ATCAACGCGC CGATCCAGGG CACGGCGGCC
GACGTGATCC GCCGGGCGAT GATCCGGATG CCGAAGGCGA TCCGGGGCAT GCCCGCGACC
ATGCTGCTGC AGGTCCATGA CGAACTGCTG TTCGAGGTGG AGGAAGAGGC CGCTGACGCG
CTCATCGAGC GCGTGCGCGA GGTGATGGAA GGTGCGGCCG ACCCGGCGGT GAAGCTCACC
GTGCCGCTGA CGGTGGAGGC GGGCCGCGGC CTGAACTGGG CCGCCGCCCA CTGA
 
Protein sequence
MTFGKGHHLH LIDGSAFIFR AYHALPPLTR KSDGLPVGAV AGFCNMLFRY VENNKGPDAP 
THVAVIFDHS SKTFRNEIYP LYKAQRPEPP EDLRPQFPLT REATRAFNIA CIETEGFEAD
DIIAALSCKA RDAGGSVTIL SSDKDLMQLV GDGVEMLDPM KNKRIGRDEV FEKFGVYPER
VVDVQALAGD SIDNVPGAPG IGIKTAALLI QEYGDLESLL ARAGEIKQPK RRAAIEENAE
QIRISKRLVA LDCNTPLTFS LEDLEVRLPV ADDLLGFLNR MEFRTLTTRV AASLKVEPPP
APTAAVLRNE GVPEIVEQVE AALPFDSASY ACVRDAEALA AWIARIRDLG HVAIDTETTS
LDEMRAELVG ISLCVEAGAA CYIPLGHRAG GGDLFGASEL VADQMPLGLA LSMLKPVLED
ESILKIGQNM KYDAKILARH GIRVAPIDDT MLMSYAMHAG RHGHGMDELC DTYLGHKPIA
IKTLLGSGKS QITFDRVPVE QAVCYAAEDA DVTFRLWKLF KPQLHRARVT TVYETLERPL
VPVLAEMEMA GVQVDRDTLS RMSNAFAQKM AGLEAEIHAL AGGPFNVGSP KQLGEILFEK
MGLPGGQKGK NGAWGTGADV LEDLAAEGHD LPARVLDWRQ LSKLKSTYTD ALQEHIHPET
GRVHTSYSIA GANTGRLAST DPNLQNIPVR SEEGRRIREA FVAPPGKLLV SLDYSQIELR
ILAHIADIPA LKQAFREGHD IHAMTASEMF NVPLEGMDPM IRRQAKAINF GVIYGISGFG
LARNLRIPRA DAQGFIDRYF ERFPGIRGYM DETISFAKAN GHVETLFGRR INTPEINAKG
PGAGFARRAA INAPIQGTAA DVIRRAMIRM PKAIRGMPAT MLLQVHDELL FEVEEEAADA
LIERVREVME GAADPAVKLT VPLTVEAGRG LNWAAAH