Gene Rsph17025_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0206 
Symbol 
ID5082145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp197387 
End bp199453 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content71% 
IMG OID640481761 
Producthypothetical protein 
Protein accessionYP_001166421 
Protein GI146276262 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.277086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.243086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGAA GCGTCGTCTT CGACCCGATC CTGCCCTGGG CCGTCATCTG GACCCTGGCC 
GCCCTCGGAG CCGTCATGGT GGTGCTCGCA CTCTGGCGCG GTCTCTCGGG CTGGTGGCTG
CGCGGGCTGG CGCTCGGCGT GCTGCTCTTG GCGCTGGCCA ATCCTGCGCT GCAGGAAGAG
GATCGCGCGC CGCTCTCCGA CATCGTGATC GCGGTGGTGG ACGAAAGCGC GAGCCAGCGG
ATCGGTGACC GCCAGGCCCA GAGCGCGGCG GCGCTGGCGG CGGTCGAGGC CGAGATCCGG
GCCCTGCCCG ATACCGAACT GCGCGTGGTG CGGGTGGGCG ATGGCGAAGG CGACGAGGGC
TCTCTCGTGA TGACCGCCTT GGCCGAGGCG CTGGCCGAAG AACCGCGGGC GCGCATCGCG
GGGGCCATCC TGATCACCGA CGGGCAGGTC CATGACCTCG AGCTTGCGCC GCAGATGCCC
GCGCCGCTCC ATGTGCTGCT GACCGGCCAC GAGGAGGACT GGGACCGCCG GCTGGTGATC
CGCAACGCGC CGGCCTTTGC GATCCTGGGC GAGCCGGTCT CGCTCGTGCT GCGGATCGAG
GATCAGGGGC GGGTGCCCGC CTCGGCCGGA ACCTCGGCCG ACCTCACCAT CTCGATCGAC
GGAGGCGAGC CGCAGACGGT GCGCGTGCCG GTGGGCGAGG ATCTGGAACT GCCCGTGACG
CTGCCGCACG GGGGCATGAA CGTCCTCCAG TTCCAGGTGG CGGCCTCGCC GGACGAGTTG
ACAGACCGCA ACAATTCCGC CGTGGTGCAG ATCAACGGCG TGCGCGACCG GCTGCGGGTG
CTGCTGGTCT CGGGCGAGCC CCATGCGGGC GAGCGCGTCT GGCGCAACCT CCTGAAGTCG
GACGCTTCGG TGGATCTGGT GCATTTCACC ATCCTGCGCC CGCCCGAGAA GCAGGACGGC
ATCCCGGTCT CGGAGCTGTC GCTGATCGCT TTCCCGACCC GCGAACTGTT CGTCGAGAAG
ATCGAGGAGT TCGACCTCAT CATCTTCGAC CGCTACCGGC TGCGGGGGAT TCTGCCGACC
TCCTACCTCG AGAATGTGCG GGACTATGTC CGCAACGGCG GCACGGTTCT GGTGGCCGCG
GGGCCCGAGT TCGGCTCGGC CGACAGCCTC TGGCGTTCGC CGCTGGCGGA CGTGATGCCG
GTGCAGGCCA CCAGCCGCGT GACCGAGGGC GGCTTCCGCC CGACCCTGAC CGACGTGGGC
CGCAAGCATC CGGTGACTCA GGGGCTCGAG GCGCAGGCCC CAGAGGGCGG CTGGGGCCGC
TGGTTCCGCC AGATCGAACT GTCGGCCACC TCGGGTCAGG TGGTGATGAA CGGGGCCGGC
GACCGGCCGC TCCTCGTGCT CGACCGTGTG GACGAGGGTC GGATCGCCGT GCTCGCCTCG
GATCAGATCT GGCTCTGGGG GCGGGGCTAC GAGGGGGGCG GGCCGCAGCT CGAACTGCTG
CGGCGGCTGG CGCACTGGAT GATGAAGGAG CCCGACCTCG AGGAAGAGGC GCTGATCGCC
GCGGGCGAGG GGGCGCGGAT GACCATCACG CGCCGCACGA TCGGCGAGGA TCCGGGCGAG
GTGACGATCA CCGGCCCCGA TGGCGCGGAG ACGACGCTTT CCATGCAGGA GACGGCGCCC
GGCCGCTGGA GCGTCGTATG GGAGGCGCCC GAGATGGGGG TCTACCGGCT GGCCCAGGGC
GAGCAGAGGG CGGTGATCGC CGTCGGGCCC TCGGCCCCGC GCGAGTTCGA GGAAACGATT
GCCAGCGGCG ACAGGCTCGC GCCGGTGATC GGGCCGACGA ATGGGGGCAC GCTCCGGCTT
GAGGAGGGGG CGCCGGACAT CCGTGCGGTC CGCGAAGGAC GGGTGGCGGC GGGGCGGGGC
TGGATCGGGA TCACCCCGCG CGGCGCCCAT GTCACGCAGG ATGTGCGGGT GGCGGCGCTG
CTGCCCGGCT GGCTCTACCT GCTGCTGGCC GCGAGTCTGG CCCTCGGTGC CTGGCTGCGC
GAGGGCCGCT TTGGCCGCAG GGCCTGA
 
Protein sequence
MTGSVVFDPI LPWAVIWTLA ALGAVMVVLA LWRGLSGWWL RGLALGVLLL ALANPALQEE 
DRAPLSDIVI AVVDESASQR IGDRQAQSAA ALAAVEAEIR ALPDTELRVV RVGDGEGDEG
SLVMTALAEA LAEEPRARIA GAILITDGQV HDLELAPQMP APLHVLLTGH EEDWDRRLVI
RNAPAFAILG EPVSLVLRIE DQGRVPASAG TSADLTISID GGEPQTVRVP VGEDLELPVT
LPHGGMNVLQ FQVAASPDEL TDRNNSAVVQ INGVRDRLRV LLVSGEPHAG ERVWRNLLKS
DASVDLVHFT ILRPPEKQDG IPVSELSLIA FPTRELFVEK IEEFDLIIFD RYRLRGILPT
SYLENVRDYV RNGGTVLVAA GPEFGSADSL WRSPLADVMP VQATSRVTEG GFRPTLTDVG
RKHPVTQGLE AQAPEGGWGR WFRQIELSAT SGQVVMNGAG DRPLLVLDRV DEGRIAVLAS
DQIWLWGRGY EGGGPQLELL RRLAHWMMKE PDLEEEALIA AGEGARMTIT RRTIGEDPGE
VTITGPDGAE TTLSMQETAP GRWSVVWEAP EMGVYRLAQG EQRAVIAVGP SAPREFEETI
ASGDRLAPVI GPTNGGTLRL EEGAPDIRAV REGRVAAGRG WIGITPRGAH VTQDVRVAAL
LPGWLYLLLA ASLALGAWLR EGRFGRRA