Gene Rsph17029_2442 details

Gene Information

Locus tag: Rsph17029_2442
Symbol:
ID: 4895219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2576852
End bp: 2577871
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 67%
IMG OID: 640113040
Product: endonuclease/exonuclease/phosphatase
Protein accession: YP_001044316
Protein GI: 126463202
COG category: [R] General function prediction only
COG ID: [COG2374] Predicted extracellular nuclease
TIGRFAM ID:
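
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2576852
end_bp = 2577871
protein_length_aa = 339

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is
# assumed to be the stop codon and is not counted in the protein.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(gene_length_bp)   # 1020
print(coding_codons)    # 339
```

Under these assumptions the computed values agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields.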


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.23952
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00360154
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCCCGCC CGCTGCGCAT CGCCACCTAC AATGTCGAAT GGTTCAACGG GCTCTTCGAC 
GACCACGGGC GGCTCAGGAC CGACAACGAA CTGTCGGGCC GCTACGAGAT CACCCGCCGC
AACCAGATCG AATCGCTGGG CATCGTCTTC ACCGCCCTCG ATGCCGACGC GATCATGGTC
ATCGAGGCGC CGAACCAGAG CCGGCGCCGC TCGACGGTGA AGGCGCTCGA GACCTTCGCG
CGCACCTTCG GCCTGCGCGC CTCGCACGCC GTCATGGGCT TTCCGAGTGA GACCGAGCAG
GAGATCGCGC TCCTCTACGA CCCCTCGCGC ATCGAGGCCC ATCACGATCC GCAGTCGAGC
GCCAAGGCGC CTAGGTTCGA CGATGTGTTC CGCTTCGACA TCGACGTGGA TGCCACGCCC
GAGGCCATCC GCTTCTCCAA GCCGCCGCTC GAGCTTGCGC TCAGGGCCGA CGGCCATCCG
CTGCGGGTGA TCGGCGTCCA TGCCAAGTCC AAGGCCCCGC ACGGCGCGCG CAACCCGGCC
GAGGCGGTGC GGATCGGCAT TCAGAACCGC CGTCAGCAGC TGGCCGAATG CGTCTGGCTG
CGCCGGCGGG TGGCGGGCCT CCTCGCGCGG CACCAGAGCG TGATGGTGAT GGGCGATTTC
AACGACGGCC CCGGTCTCGA CGAATATGAG AAGCTCTTCG GCCGGTCGGG GATCGAGATC
GTCCTCGGGC TCGAGGAGCC TCCCGAGTTG CGCCTGCACG AGCCCCATGC GCGCATGGCG
CTCACGCAGA AGGTGGGCAT CCAGCCCAGC TCGGCCCGCT TCTGGCTCGC CCCGGAACAG
CAGTATTTCG AGGCCCTGCT CGACTTCATC ATGGTCTCGG CCGATCTGGC GGCGAAATCG
CCCCGCTGGC GGATCTGGCA TCCGCTGAAC GACCCGAACT GCTTTCGCAC CCCCGAGTTG
CAGCAGGCCC TCCTCGCGGC CTCGGACCAT TTTCCGGTCA CGCTCGACAT CGACCTCTGA
 
Protein sequence
MARPLRIATY NVEWFNGLFD DHGRLRTDNE LSGRYEITRR NQIESLGIVF TALDADAIMV 
IEAPNQSRRR STVKALETFA RTFGLRASHA VMGFPSETEQ EIALLYDPSR IEAHHDPQSS
AKAPRFDDVF RFDIDVDATP EAIRFSKPPL ELALRADGHP LRVIGVHAKS KAPHGARNPA
EAVRIGIQNR RQQLAECVWL RRRVAGLLAR HQSVMVMGDF NDGPGLDEYE KLFGRSGIEI
VLGLEEPPEL RLHEPHARMA LTQKVGIQPS SARFWLAPEQ QYFEALLDFI MVSADLAAKS
PRWRIWHPLN DPNCFRTPEL QQALLAASDH FPVTLDIDL
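
The sequences above are displayed in space-separated groups of 10 residues, 60 per line, so they need to be cleaned before any downstream use. The following is a small sketch of one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool associated with this record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGCCCGCC CGCTGCGCAT CGCCACCTAC AATGTCGAAT GGTTCAACGG GCTCTTCGAC"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Applying `clean_sequence` to the full gene sequence block and then `gc_fraction` gives a value that can be compared with the GC content field in the record.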