Gene RPC_2031 details


Gene Information

Locus tag: RPC_2031
Symbol:
ID: 3973931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2212904
End bp: 2214349
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 65%
IMG OID: 637925140
Product: radical SAM family protein
Protein accession: YP_531905
Protein GI: 90423535
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
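
The coordinates and lengths listed above are consistent with each other, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2212904
end_bp = 2214349

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1446
print(protein_length)   # 481
```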


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCTG AATCTGATGA AGCGTCGCCA ATCATTCACA TCGTTCCGTT GGGAGATCCA 
CGGCGTGCCG CCGCGATCCA TCCCGGCAGT GGGGCCTGGG TTCTGATCGA AGATGCGGAC
ATGCCGGGGC TCGGACTTCG GGGTGATACG GTGAATCCGG TCGAACTAGC ACCGACGCAT
CCGCTCTATG CGGGCGTGCT TGACGCCACC CTGGGCTATC GCTCCGCCGC GTTGCAGCGC
TGCGAGCCGA AGTTGGACAC CCTGATCCTG AAGGTGACCA ACCGTTGCAA CGAGGCCTGC
AGACATTGCT ACGACGCCGG CGGGCGGGCC GAGATGGACG CCGCGATCGA CACGTTGTTC
GACGCGACGG ACGAGGCGCT GGTGCTATGC GGACCGACGT TGAACCTTTT GTTCCACGGC
GGCGAGCCGT TTCTCCGGAT CGATGTTCTC GATCGCGTCG CCGCTCACGC GCGCGATCAG
GCTGCCGGTC TCGGCAAGCA GGTCGGCGTG TTCGTGCAGA CCAACGCCTC GATCCTGAAT
GACAGGATCA TCGGCATTCT GCAGAAGCAT CACTTCGGCG TCGGCGTGTC GCTCGACGGC
TGGGCCGAGC TGCACGATCA GATGCGGGTG ATGGCCGACG GCACCGGCAC CTACCGATTG
TTCGAACGCT CCTACCGCCG CTACGCGCAT TACCTGACGG CCCATGCGGG AATCATGACG
ACGGTGATGG CCTGCAATGT CGGGGCGTTG CCGGAAATCG TTCGGCATGT CCGTGACCTC
GGCTTCCGGA CATGGGACGC CACGCTGTTC GATCTCAGCG GAAAAGGCGC GCTGTATCCG
CAGCTCGCCG TGGGCGGCGA GGCGTATAGC GCGGCGCTGG AGCCGATCCT CGACCTGATC
GAAGCCGGGG AGTGCGACGA GATTGCGATC AAGCCGGTGC TGCGCCGACT CGACAATCTG
CTGTCGCCGC GTCGCGACGA TATGTGCCTG CCTGGAAACG GTCCCTGCGG CGCCGGCGGC
CGGCTGTTGT CGCTGTCGGC CGAAAACATC GTGCATGGCT GCGACATCAT CGACCGTGCC
TCGCTGCGCC TGGGGGTGTT TCCGGCCACC ACGTTCGGCG CGGCCCTGGC GTCACCTCAG
GCGGCTATCA TGCGAAGCAG GCCGTCGCGA CTTGCGGCCT GCCACCGATG CACCTGGTTC
GGCCTGTGCG GCGGCACCTG CCTTGCCAGA GGCTCGCTGA ACGCACCGGA TTCCACAGAG
TGCCTGGTGT CGAAGCGGAT CAACCACAGC CTACTGCGGC GCATCGCACG GAGCGACCGG
CTGCTCGACT GGTATGAACG TTACCCGCCG GATCGCCGCC GCGCGTCGAT CATCTCCGAA
ACCGCAAACC GTGCGGCCGC ATCGCATCCG GTCAACACGG CCCAACGACC CAGGAGCGTC
AGTTGA
 
Protein sequence
MSSESDEASP IIHIVPLGDP RRAAAIHPGS GAWVLIEDAD MPGLGLRGDT VNPVELAPTH 
PLYAGVLDAT LGYRSAALQR CEPKLDTLIL KVTNRCNEAC RHCYDAGGRA EMDAAIDTLF
DATDEALVLC GPTLNLLFHG GEPFLRIDVL DRVAAHARDQ AAGLGKQVGV FVQTNASILN
DRIIGILQKH HFGVGVSLDG WAELHDQMRV MADGTGTYRL FERSYRRYAH YLTAHAGIMT
TVMACNVGAL PEIVRHVRDL GFRTWDATLF DLSGKGALYP QLAVGGEAYS AALEPILDLI
EAGECDEIAI KPVLRRLDNL LSPRRDDMCL PGNGPCGAGG RLLSLSAENI VHGCDIIDRA
SLRLGVFPAT TFGAALASPQ AAIMRSRPSR LAACHRCTWF GLCGGTCLAR GSLNAPDSTE
CLVSKRINHS LLRRIARSDR LLDWYERYPP DRRRASIISE TANRAAASHP VNTAQRPRSV
S
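
As an illustration of how the protein sequence relates to the gene sequence, the following minimal sketch translates the first 60 bases and the final 6 bases of the gene. The `translate` helper and the table-building code are hypothetical and are not part of the source record. The sketch uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons), and it assumes the input is already in the correct reading frame.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence as listed above.
first_60 = "ATGTCGTCTG AATCTGATGA AGCGTCGCCA ATCATTCACA TCGTTCCGTT GGGAGATCCA"
print(translate(first_60))   # MSSESDEASPIIHIVPLGDP

# Last line of the gene sequence: one serine codon followed by a TGA stop.
print(translate("AGTTGA"))   # S
```

Both results match the corresponding start and end of the protein sequence listed above.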