Gene Rsph17025_2952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2952 
Symbol 
ID5085155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3014446 
End bp3017448 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content74% 
IMG OID640484523 
Productexonuclease-like protein 
Protein accessionYP_001169143 
Protein GI146278984 
COG category[L] Replication, recombination and repair 
COG ID[COG3893] Inactivated superfamily I helicase 
TIGRFAM ID[TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.034558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0831314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGACG CGTCTTCCCT GTTCGAAGGG CCGGGCCCCC GCCTCTTCGG GCTGCCGCCC 
GGCGTGGATT TCCCGGCGGC GCTGGTGCGG GGTCTGCGCG CCCGGATGGC CGGCGCCCCG
CCCGAGGCGA TGGCGCGGGT CGAGCTTTAC GTCAACACGC AGCGGATGCG GCGGCGGGTC
GTCGAGCTGA TGACGGCCGA GGGCGCGGGC TTCCTGCCGC GCATCCGCCT CGTGACCGAG
CTGCCGCCGG TGCCCGGCCT GCCCGCCCCC GTTCCGCCGC TGCGCCGGCG GCTGGAGCTT
GCGCAGCTCG TGGCGCGGCT GATCGAGGCG CAACCCGACA TCGCGCCCCG CTCGGCGCTC
TTTGACCTCT CGGACAGCCT TGCCGACCTG ATCGACGAGA TGCAGGGCGA GGGCGTGCCG
CCCGAGGCCA TCGCGCGGCT CGACGTGGCG GACCATTCGG CGCACTGGCA GCGGACGCAG
GCCTTCATGG CGATCGTGGC GCCGATGTTC GGGGCGGACG CACCCGACGC GCAGGCGCTG
GCGCGGATGA CGGTCGAGCG GATCGCCGCC CGCTGGGCCG AGGCGCCCCC CGATCATCCG
GTGATCGTCG CGGGCTCCAC CGGCTCGCGC GGGACGACGG CGCTCTTCAT GCAGGCGGTG
GCGCGGCTGC CGCAGGGCGC GCTGGTGCTG CCCGGCTTCG ACTTCGACCT GCCCGCCGAG
GTCTGGGACG GGCTGGGCGA TGCGCTGACC GCCGAGGACC ATCCCCAGTT CCGCTTTCAC
CGGCTGATGG GTCTCGTGGG GGCTGCGCCT TCTCAGGTGC ACCGCTGGAC CGACGAGGAC
CCGCCGAGCC CGGCGCGGAA CCGGCTGATC TCGCTCTCGC TCCGGCCCGC GCCGGTGACG
GACCAGTGGC TGACCGAGGG GCAGCGCCTG ACCGATCTCG CGCTTGCGGC CGAGGGCATG
GCGCTGATCG AGGCTGCAGG GCCGCGGGCC GAGGCGCTGG CCGTGGCGAT CATCCTGCGC
AAGGCCGCCG AAGACGGCCG GCGCGCGGCT CTCATCACCT CGGACCGGGG GCTCACGCGG
CAGGTGGCGG CGGCGCTCGA CCGCTGGGGG ATCGTGCCGG ACGATTCCGC GGGCCGCCCG
CTCGCCCTCT CGGCGCCGGG ACGGTTCCTG CGGCATGTGG CGCGTCTCTT CGGCCAGCGG
CTGACCGGCG AGGCGCTGCT CACGCTGCTC AAGCATCCGC TGACGGCCAC CGGCTCGGAC
CGCGGCAACC ACCTGCGCTG GACGCGGGAC CTGGAGTTGC ACCTGCGGCG CAAGGGGCCT
CCGTTCCCGA CGGGCGGCGA TCTGGATCTC TGGGCGGGGG CTCGGCCCGA CGACGGGGTG
GCGGACTGGG CGCGCTGGCT GGGCGGGCTG ATCGAGGGCC TCGATGCGGT GGGCAACCGC
CCCCTGTCCG ATCATGTCGC CGCCCATCTG GCGCTGGCGG AGGCTTTGGC CGCGGGACCA
GCCGGCACGG GCACGGGCGA GCTGTGGCTG AAGGAGGCGG GTGAGGCCGC GCGCGCCGCC
GTCGAGGAGT TGCGCCGCGA GGCACCGCAC GGGGGCGAGC TGACCACCGC GGACTACACC
GACCTCTTCG ACGCGATCCT CGCCCGCGGC GAGGTGCGCG AGGCGGTGCA GGCCCATCCC
GGCCTGATGA TCTGGGGCAC GCTCGAGGCG CGCGTGCAGG GGGCCGATCT CGTGATCCTC
GGCGGCCTCA ATGACGGGAC ATGGCCGCAA CTGCCGCCGC CCGATCCGTG GCTCAACCGG
CAGATGCGGC TCCGGGCGGG GCTTCTTCTC CCCGAACGGC GGATCGGCCT CTCGGCGCAC
GACTACAGCC AGGCGGTCGC CGCGCCCGAG GTGGTGCTGA CCCGCGCCAC CCGCAACGCC
GAGGCCGAAA CGGTGCCGTC GCGCTGGCTG AACCGGCTGA TGAACCTGAT GAGCGGGCTG
AAGGCGCAGG GCGGGCCCGA GGCGCTGGAG GCGATGCGCG CGCGGGGACG CGGCTGGCTG
GCGCTCGCCT CGGCGCTGGA GCAGCCCGAG GCGCCGGTGC CGCTGGCCGC GCGGCCCGCG
CCGCAGCCGC CCGTTTCGGC GCGGCCCGAC CGGCTGGCCG TGACCGGCAT CCGCACGCTG
ATCCGCGACC CTTACGCAAT CTATGCCCGC CACATCCTGC GACTCTATCC GCTCGATCCG
CTGCATCGGG CGCCGGATGC CCGGCTGCGC GGCTCGATCC TGCACCGCAT CCTCGAGGAG
TTCGTGAAGG ACCGCGCGCC GGGGACCGAC CGCGCCGCCG AGCGCGCGCG CCTGATGCGG
ATCGCCGAAA CGGTGCTGAC GGACGAGGTG CCATGGCCCG CCGCCCGCGC ACTGTGGCTT
GCCCGGCTCG ACCGGGCGGC GGATTTCTTC CTCGAGACCG AGGCGGCGCA TGGCGGAACC
CCGGTCGTGC TGGAAGAGGA GGGCCGCGTG GACCTCGCGC CCCTGCGCTT CACGCTGACG
GCCAAGCCGG ACCGGATCGA CAGGCTGCCC GATGGCCGGC TGCATATCCT CGACTACAAG
ACCGGAACCC CGCCCACCAG GAAGCAGCAG GAGCAGTTCG ACAAGCAGCT TCTGCTCGAG
GCGGCGATGG CCGAGCACGG CGGCTTTCGC AAACTGGGGC CCACGGACGT GGCGCGGATC
AGCTACATCG GCCTCGGATC GAGCCCCAAG GTGGAGAGTG TCGAGACCGA CGCGGCCCTT
CTGGGGCAGG TCTGGGAGGG GCTTCACGCC CTTGTCGGCC GCTACCTGCG GCGCGAGCAG
GGCTATGTCT CGCGCCGGGC CATGTTCGGC GAGCGGTTCC CCGGCGACTA CGACCATCTC
GCGCGGTTCG GCGAGTGGGA GATGAGCGAC AGCCCCGTGC CGGTGCCGGT GGGGGAAGAG
GCGGGCGCGC CGTCCGGCGA GGCCCGCCCG CGCGACAGGA CGCGCCCGGA GGATGCCGCA
TGA
 
Protein sequence
MLDASSLFEG PGPRLFGLPP GVDFPAALVR GLRARMAGAP PEAMARVELY VNTQRMRRRV 
VELMTAEGAG FLPRIRLVTE LPPVPGLPAP VPPLRRRLEL AQLVARLIEA QPDIAPRSAL
FDLSDSLADL IDEMQGEGVP PEAIARLDVA DHSAHWQRTQ AFMAIVAPMF GADAPDAQAL
ARMTVERIAA RWAEAPPDHP VIVAGSTGSR GTTALFMQAV ARLPQGALVL PGFDFDLPAE
VWDGLGDALT AEDHPQFRFH RLMGLVGAAP SQVHRWTDED PPSPARNRLI SLSLRPAPVT
DQWLTEGQRL TDLALAAEGM ALIEAAGPRA EALAVAIILR KAAEDGRRAA LITSDRGLTR
QVAAALDRWG IVPDDSAGRP LALSAPGRFL RHVARLFGQR LTGEALLTLL KHPLTATGSD
RGNHLRWTRD LELHLRRKGP PFPTGGDLDL WAGARPDDGV ADWARWLGGL IEGLDAVGNR
PLSDHVAAHL ALAEALAAGP AGTGTGELWL KEAGEAARAA VEELRREAPH GGELTTADYT
DLFDAILARG EVREAVQAHP GLMIWGTLEA RVQGADLVIL GGLNDGTWPQ LPPPDPWLNR
QMRLRAGLLL PERRIGLSAH DYSQAVAAPE VVLTRATRNA EAETVPSRWL NRLMNLMSGL
KAQGGPEALE AMRARGRGWL ALASALEQPE APVPLAARPA PQPPVSARPD RLAVTGIRTL
IRDPYAIYAR HILRLYPLDP LHRAPDARLR GSILHRILEE FVKDRAPGTD RAAERARLMR
IAETVLTDEV PWPAARALWL ARLDRAADFF LETEAAHGGT PVVLEEEGRV DLAPLRFTLT
AKPDRIDRLP DGRLHILDYK TGTPPTRKQQ EQFDKQLLLE AAMAEHGGFR KLGPTDVARI
SYIGLGSSPK VESVETDAAL LGQVWEGLHA LVGRYLRREQ GYVSRRAMFG ERFPGDYDHL
ARFGEWEMSD SPVPVPVGEE AGAPSGEARP RDRTRPEDAA