Gene RSP_1028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_1028 
Symbol 
ID3720996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp2790472 
End bp2793285 
Gene Length2814 bp 
Protein Length937 aa 
Translation table11 
GC content68% 
IMG OID640072257 
ProductDNA polymerase I 
Protein accessionYP_354113 
Protein GI77464609 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.447207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTTCG GCAAGGGCCA CCATCTGCAT CTCATCGACG GTTCGGCCTT CATCTTCCGG 
GCCTACCATG CGCTGCCGCC GCTGACGCGG AAGTCGGACG GCCTGCCCGT GGGCGCGGTC
GCGGGCTTCT GCAACATGCT GTTCCGCTAT GTCGAGAACA ACAAGGGCCC CGACGCGCCG
ACCCATGTGG CGGTGATCTT CGACCATTCC TCGAAGACCT TCCGCAACGA GATCTACCCG
CTCTACAAGG CGCAGCGCCC CGAGCCGCCC GAGGATCTGC GCCCGCAGTT CCCGCTCACG
CGCGAGGCGA CGCGGGCCTT CAACATTGCC TGCATCGAGA CCGAAGGGTT CGAGGCCGAC
GACATCATCG CGGCCCTCTC GTGCAAGGCG CGCGACGCGG GCGGCTCGGT CACGATCCTG
TCCTCGGACA AGGATCTGAT GCAGCTGGTG GGCGACGGGG TCGAGATGCT CGACCCGATG
AAGAACAAGC GGATCGGCCG CGACGAGGTG TTCGAGAAAT TCGGCGTCTA CCCCGAGCGC
GTCGTGGATG TGCAGGCGCT GGCGGGCGAT TCCATCGACA ACGTGCCGGG CGCGCCGGGC
ATCGGCATCA AGACGGCCGC GCTGCTGATC CAGGAATACG GCGATCTCGA GAGCCTGCTC
GCGCGGGCGG GCGAGATCAA GCAGCCCAAG CGCCGCGCCG CCATCGAGGA GAATGCCGAA
CAGATCCGCA TCTCGAAGCG GCTGGTGGCG CTTGACTGCA ACACCCCGCT TACGTTCTCG
CTCGAGGATC TCGAGGTGCG TCTGCCGGTG GCCGACGACC TTCTGGGCTT TCTCAACCGG
ATGGAATTCC GCACCCTGAC GACCCGTGTG GCCGCGAGCC TGAAGGTCGA GCCGCCTCCC
GCCCCCACCG CCGCCGTCCT CCGGAACGAG GGCGTGCCCG AGATCGTCGA GCAGGTCGAG
GCGGCGCTGC CTTTCGACAG CGCCTCCTAT GCCTGCATCC GCGATGCCGA GGCGCTGGCC
GCCTGGATCG CGCGCATCCG CGACCTGGGC CATGTCGCCA TCGACACCGA GACCACCAGC
CTCGACGAGA TGCGTGCCGA ACTGGTGGGG ATCTCGCTCT GCGTCGAGGC GGGGGCCGCC
TGCTACATCC CGCTCGGCCA CCGGGCTGGC GGCGGCGATC TCTTCGGCGC CTCGGAGCTC
GTGGCCGACC AGATGCCGCT GGGTCTGGCG CTGTCGATGC TGAAGCCGGT GCTCGAGGAT
GAGTCCATCC TCAAGATCGG CCAGAACATG AAGTACGACG CGAAGATCCT CGCGCGTCAC
GGCATCCGGG TGGCGCCGAT CGACGATACG ATGCTCATGT CCTACGCGAT GCACGCGGGC
CGCCACGGCC ACGGGATGGA CGAGCTCTGC GACACCTACC TTGGCCACAA ACCCATCGCG
ATCAAGACGC TCCTGGGTTC GGGCAAGAGC CAGATCACCT TCGACCGGGT GCCGGTCGAG
CAGGCCGTCT GCTATGCGGC CGAGGATGCG GATGTGACCT TCCGGCTCTG GAAGCTGTTC
AAGCCGCAGC TGCACCGCGC CCGGGTGACG ACGGTCTACG AGACGCTCGA GCGGCCGCTG
GTGCCGGTGC TGGCCGAGAT GGAGATGGCG GGCGTTCAGG TCGACCGCGA CACGCTCTCG
CGGATGTCGA ACGCCTTCGC GCAGAAGATG GCGGGGCTCG AGGCCGAGAT CCACGCGCTG
GCGGGCGGCC CGTTCAACGT GGGCAGCCCC AAGCAGCTGG GCGAGATCCT GTTCGAGAAG
ATGGGCCTGC CCGGGGGCCA GAAGGGCAAG AACGGCGCCT GGGGCACCGG GGCCGACGTG
CTCGAGGATC TGGCGGCCGA GGGGCACGAC CTGCCCGCGC GCGTGCTCGA CTGGCGCCAG
CTCTCCAAGC TGAAATCGAC CTACACCGAC GCGCTGCAGG AGCATATCCA CCCCGAGACG
GGCCGCGTCC ACACCTCCTA TTCCATCGCC GGAGCGAACA CGGGCCGCCT CGCCTCGACC
GATCCGAACC TCCAGAACAT CCCCGTGCGC AGCGAGGAGG GCCGCCGCAT CCGCGAGGCC
TTCGTGGCGC CGCCGGGCAA GCTTCTGGTC AGCCTCGACT ACAGCCAGAT CGAGCTGCGC
ATCCTCGCCC ATATCGCCGA CATTCCGGCG CTGAAGCAGG CCTTCCGCGA GGGGCACGAC
ATCCATGCGA TGACCGCCTC CGAGATGTTC AACGTGCCGC TCGAAGGCAT GGATCCGATG
ATCCGGCGGC AGGCCAAGGC CATCAACTTC GGCGTGATCT ACGGGATCTC GGGCTTCGGC
CTCGCACGCA ACCTGCGCAT CCCGCGGGCC GATGCGCAGG GCTTCATCGA CCGCTATTTC
GAGCGTTTCC CCGGCATCCG CGGCTATATG GACGAGACCA TCGCCTTCGC CAAGGCGAAC
GGTCATGTGG AAACCCTGTT CGGACGCAGG ATCAACACGC CCGAGATCAA TGCCAAGGGG
CCCGGCGCGG GCTTCGCGCG CCGCGCGGCC ATCAACGCGC CGATCCAGGG CACGGCGGCC
GACGTGATCC GCCGGGCGAT GATCCGGATG CCGAAGGCGA TCCGGGGCAT GCCCGCGACC
ATGCTGCTGC AGGTCCATGA CGAACTGCTG TTCGAGGTGG AGGAAGAGGC CGCTGACGCG
CTCATCGAGC GCGTGCGCGA GGTGATGGAA GGTGCGGCCG ACCCGGCGGT GAAGCTCACC
GTGCCGCTGA CGGTGGAGGC GGGCCGCGGC CTGAACTGGG CCGCCGCCCA CTGA
 
Protein sequence
MTFGKGHHLH LIDGSAFIFR AYHALPPLTR KSDGLPVGAV AGFCNMLFRY VENNKGPDAP 
THVAVIFDHS SKTFRNEIYP LYKAQRPEPP EDLRPQFPLT REATRAFNIA CIETEGFEAD
DIIAALSCKA RDAGGSVTIL SSDKDLMQLV GDGVEMLDPM KNKRIGRDEV FEKFGVYPER
VVDVQALAGD SIDNVPGAPG IGIKTAALLI QEYGDLESLL ARAGEIKQPK RRAAIEENAE
QIRISKRLVA LDCNTPLTFS LEDLEVRLPV ADDLLGFLNR MEFRTLTTRV AASLKVEPPP
APTAAVLRNE GVPEIVEQVE AALPFDSASY ACIRDAEALA AWIARIRDLG HVAIDTETTS
LDEMRAELVG ISLCVEAGAA CYIPLGHRAG GGDLFGASEL VADQMPLGLA LSMLKPVLED
ESILKIGQNM KYDAKILARH GIRVAPIDDT MLMSYAMHAG RHGHGMDELC DTYLGHKPIA
IKTLLGSGKS QITFDRVPVE QAVCYAAEDA DVTFRLWKLF KPQLHRARVT TVYETLERPL
VPVLAEMEMA GVQVDRDTLS RMSNAFAQKM AGLEAEIHAL AGGPFNVGSP KQLGEILFEK
MGLPGGQKGK NGAWGTGADV LEDLAAEGHD LPARVLDWRQ LSKLKSTYTD ALQEHIHPET
GRVHTSYSIA GANTGRLAST DPNLQNIPVR SEEGRRIREA FVAPPGKLLV SLDYSQIELR
ILAHIADIPA LKQAFREGHD IHAMTASEMF NVPLEGMDPM IRRQAKAINF GVIYGISGFG
LARNLRIPRA DAQGFIDRYF ERFPGIRGYM DETIAFAKAN GHVETLFGRR INTPEINAKG
PGAGFARRAA INAPIQGTAA DVIRRAMIRM PKAIRGMPAT MLLQVHDELL FEVEEEAADA
LIERVREVME GAADPAVKLT VPLTVEAGRG LNWAAAH