Gene Rsph17025_3025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3025 
Symbol 
ID5084364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3096355 
End bp3097863 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content71% 
IMG OID640484596 
Producthypothetical protein 
Protein accessionYP_001169214 
Protein GI146279055 
COG category[C] Energy production and conversion 
COG ID[COG3488] Predicted thiol oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0498706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGTC TGCCTGCCCT GCTGGTGATC ATCGCCTTTC CCGCCGCGGC CGAGCCTCTG 
GATCCGCCGA ACCTGACCAT CCTGCCCCGC ACGGCCGCCG AGACCGCCCG GATTGCGGCC
GTTCTCGCAC CGCCCGTCGA TTTCTCGAAG CCCGAGCCCT TCGAGGCGCT GCCCGGCGGC
GCGGCCAGCG TGCGCGCCCG CGACACGGCC GACGCCTTCT CGCAATCCTC GGGCAACATG
CCGTTCGAGC GCGAGATGGA TTTCAAGCTC GGCAACGGGC TGTTCCGCAA GCTGTGGGTG
GCGGCCCCCT CCTCGACGCA GGCCTCGGAC GGGCTCGGCC CGCTCTACAA CGCCCGCGGT
TGCCAGAACT GCCACCTGAA GGACGGCCGC GGCCATGTGC CCGAGGGGCC CGAGGATGAG
GCGGTCTCGA TGTTCCTGCG CCTTTCCGTC CCCGGCGGTC CCTCTCCCGA GGGCATCGAG
GAGTGGATCG CCACCAGCGC CGAACCCACC TATGGCGGGC AGCTTCAGGA CTTTGCCGCC
CCCGGTCTGG CACCCGAGGG CCGGATGCGG ATCGACTGGC AGGAGCTGCC CGTCACGCTC
GATGACGGCA CGGTGGTGAC GCTGCGCAAG CCCGACTACT CGGTCGAGGA TCTGAATTAC
GGCCCGATGG CAAGGGATGT GATGCTCTCG CCCCGCGTCA CGCCGCAGAT GATCGGGCTG
GGGCTGCTCG AGGCCATTCC GGCCGCCGAC ATCCTGGCCC ACGCCGACCC CGAGGATCGG
GACGGCGACG GCATCTCGGG CCGCCCCAGC ATCACCTGGT CGGCCGAAGC GGATGCGCCG
ATGCTCGGCC GGTTCGGCCT CAAGGCGGGG ACGCCCACGG TGCTGCAGCA GTCCGCCTCG
GCCTTCGCCG GTGACATGGG GATCGCGAAC GCCCTCTTCC CCGAGCCCTG GGGCGAATGC
ACCGAGGCGC AGACCGCCTG CCGCGCCGCG GTCCACGGGA TCGAGCCAGG CAAGCGCGAC
GGTCTCGAGA TCGACCGGCA GGGGCTCGAA CTGACGACGT TCTACGCCCG CAACCTCGCC
GTGCCCGAGA GGCGCCGGGT GGACGATCCG CAGGTGCTGC GCGGCAAGCA ACTCTTCCAC
GAGGCGGGCT GTCCCGCCTG CCATGTGCCC AAGTTCGTGA CCCACCGGCT GAAGGACCAG
CCCGAGCAGA GCTTCCAGCT GATCTGGCCC TACACCGATC TGCTGCTGCA CGACATGGGC
GAGGGGCTGG CGGACGGCCG CCCCGAGGGT CGGGCCACGG GTCGCGAGTG GCGCACCGCG
CCGCTCTGGG GCATCGGCCT GACCGAGCAG GTGAGCGGCC ACGCCAACTT CCTGCACGAT
GGCCGTGCGC GGACGATCCT CGAGGCAATC CTCTGGCACG GCGGCGAAGC CGAGGCCGCC
CGCGCGCGCG TCATGGCCCT GCCCGCCCCC GACCGCGCGG CCCTCATCGC CTTCGTGGAG
GATCTCTGA
 
Protein sequence
MSRLPALLVI IAFPAAAEPL DPPNLTILPR TAAETARIAA VLAPPVDFSK PEPFEALPGG 
AASVRARDTA DAFSQSSGNM PFEREMDFKL GNGLFRKLWV AAPSSTQASD GLGPLYNARG
CQNCHLKDGR GHVPEGPEDE AVSMFLRLSV PGGPSPEGIE EWIATSAEPT YGGQLQDFAA
PGLAPEGRMR IDWQELPVTL DDGTVVTLRK PDYSVEDLNY GPMARDVMLS PRVTPQMIGL
GLLEAIPAAD ILAHADPEDR DGDGISGRPS ITWSAEADAP MLGRFGLKAG TPTVLQQSAS
AFAGDMGIAN ALFPEPWGEC TEAQTACRAA VHGIEPGKRD GLEIDRQGLE LTTFYARNLA
VPERRRVDDP QVLRGKQLFH EAGCPACHVP KFVTHRLKDQ PEQSFQLIWP YTDLLLHDMG
EGLADGRPEG RATGREWRTA PLWGIGLTEQ VSGHANFLHD GRARTILEAI LWHGGEAEAA
RARVMALPAP DRAALIAFVE DL