Gene Rsph17029_2059 details


Gene Information

Locus tag: Rsph17029_2059
Symbol:
ID: 4896171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2182390
End bp: 2184141
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 75%
IMG OID: 640112652
Product: single-stranded-DNA-specific exonuclease RecJ
Protein accession: YP_001043934
Protein GI: 126462820
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
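
The start, end, and length values in this record agree with each other, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2182390
end_bp = 2184141

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1752
print(protein_length)  # 583
```

Both results match the Gene Length and Protein Length fields listed above.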


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGCC CGGCGGCGCG CGCCTTCCTG GGGGTCGAGG CCTCCGCGAC CGGGCGCCGC 
TGGGTTGGCC CCTCCCCCGA GGAGGACCGG CTGGCCGAGG CGATGGCGCA GACCACGCGC
CTGCCGCTCG CGCTCTGCCG CACGCTGGTG CGGCGGGGCG TCCCGGCCTC CGATGCCGAG
AGCTTCCTCG CGCCCGCGCT GCGCGACCTC CTGCCCGACC CGCTGACGCT GAAGGACATG
GCGCCCGCCG CCGCGCGGCT GCTGCAGGCC GTGGCCCGGC GCGAGCGGAT CGCGATCTTC
GCCGACTACG ACGTGGACGG CGGCTCCTCC GCCGCGCTGC TGATCGTCTG GCTCCGGGCG
CTCGGGCGCT CGGCCACGCT CTACATCCCC GACCGGATCG ACGAGGGCTA CGGCCCGAAC
GTGCCCGCCA TGCAGGCGCT GGCCGAGGCG CATGACCTGA TCCTCTGCGT CGATTGCGGC
ACCCTCTCGC ACGAGCCCAT CGCGGCGGCG AAGGGCGCGG ATGTGGTGGT GCTCGATCAC
CACCTCGGCG CCGAGACGCT GCCGCCCGCG CTCGCCGTGG TGAACCCCAA CCGGCAGGAC
GAGGACGGGG CGCTGGGCCA TCTCTGTGCG GCCTCCGTCG TGTTCCTGCT TCTCGTCGAG
GCCAACCGCC GCCTGCGCGC GGAAGGGGTG CAGGGCCCGG ACCTGATGGC GATGCTCGAC
CTCGTGGCGC TGGCGACCGT GGCCGATGTC GCGCCGTTGA CAGGTGTCAA CCGCGCGCTC
GTGCGGCAGG GCCTCAAGGT CATGGCGCGG CGCGAGCGGC CGGGCCTCGT GGCGCTGGCC
GATGTGGCAC GGATGAAGAC CGCGCCCAAC AGCTATGCGC TGGGCTTCCT TCTCGGCCCG
CGCGTCAATG CGGGCGGGCG GATCGGGCAG GCGGACCTCG GCGCACGGCT GCTGTCCACC
GAGGATCCGC GCGAGGCGCA GGCCTTGGCG GAGCGGCTCG ACCAGCTCAA CACCGAGCGG
CGCGAGATCG AGGCCCGCGT CCGCGAGGAG GCTCTGGCTC AGGCCGAGGC GCGCGGCATC
TCGGGCCCGC TCGTCTGGGC CGCGGCCGAG GGCTGGCACC CCGGCGTGGT GGGCATCGTG
GCCGCCCGGC TGAAGGAGGC CACCAACCGC CCCGCCGTGG TGATCGGCTT CGAGAACGGC
ATCGGCAAGG GCTCGGGCCG CTCGATCGCC GGTGTGGATC TGGGCGCTTC CATCCACCGC
GTGGCGCATG AGGGGCTGCT TCTGAAGGGC GGCGGGCACC GGATGGCTGC GGGCCTCACC
GTCGAGGAAG GCCGCCTCGA GGCCGCGATG GAGCGGCTGG GCGAGCTTCT GGCGCGGCAG
GGGGCGGCCG AGGCGGGTCC GGCGGACCTG CGGCTCGACG GGCTCCTGAT GCCCTCGGCG
GCGACGGCGG AGTTCGTCGA GCAGATCGAC TCCGCGGGAC CCTACGGGGC GGGCGCCGCC
GCGCCGCGCT TCGCCCTGGC AGATCAGCGC GTGACCTGCC GGCGCATGGG CGACAAGCAC
ATGCGGCTGA CGATCGGCGG CGCCGAGGGC CGGCTCGAGG CGGTGGCCTT CGGCTGCTTC
GACGGCCCGC TCGGCCCGCT TCTGGAAGGC GGCGGCGCAG GCCGCTTCCA CCTCGCCGGC
CGGCTCGAGA TCGACACCTG GGGCGGCAGC GCGCGCGTGC AGATGCGCCT CGAAGACGCG
GCGCGCGCAT GA
 
Protein sequence
MTGPAARAFL GVEASATGRR WVGPSPEEDR LAEAMAQTTR LPLALCRTLV RRGVPASDAE 
SFLAPALRDL LPDPLTLKDM APAAARLLQA VARRERIAIF ADYDVDGGSS AALLIVWLRA
LGRSATLYIP DRIDEGYGPN VPAMQALAEA HDLILCVDCG TLSHEPIAAA KGADVVVLDH
HLGAETLPPA LAVVNPNRQD EDGALGHLCA ASVVFLLLVE ANRRLRAEGV QGPDLMAMLD
LVALATVADV APLTGVNRAL VRQGLKVMAR RERPGLVALA DVARMKTAPN SYALGFLLGP
RVNAGGRIGQ ADLGARLLST EDPREAQALA ERLDQLNTER REIEARVREE ALAQAEARGI
SGPLVWAAAE GWHPGVVGIV AARLKEATNR PAVVIGFENG IGKGSGRSIA GVDLGASIHR
VAHEGLLLKG GGHRMAAGLT VEEGRLEAAM ERLGELLARQ GAAEAGPADL RLDGLLMPSA
ATAEFVEQID SAGPYGAGAA APRFALADQR VTCRRMGDKH MRLTIGGAEG RLEAVAFGCF
DGPLGPLLEG GGAGRFHLAG RLEIDTWGGS ARVQMRLEDA ARA
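
A GC content figure like the one reported in the gene information can be recomputed from a nucleotide sequence written in the 10-base blocks used above. The following is a minimal sketch, not the database's own method: the function name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G, and T plus whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the GC fraction of a nucleotide sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used to group bases into blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two blocks of the gene sequence, used only as a small example.
example = "ATGACGGGCC CGGCGGCGCG"
print(f"{gc_content(example):.0%}")
```

Passing the full gene sequence as a single string would give the value for the whole gene.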