Gene Rsph17029_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4090 
Symbol 
ID4894990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009040 
Strand
Start bp31952 
End bp33253 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content74% 
IMG OID640110492 
Producthypothetical protein 
Protein accessionYP_001041804 
Protein GI126464828 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones107 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value0.150292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACG CCTCGCCGAT CCTGTCGCTT CCCTACATCC TGCCCTCTCA GGCGCAGAAA 
CATGTGACCC ACAACGAGGC GCTGCAGCGG CTCGATGTGC TGGTCCAGCC CGCCGTGCTC
GACCGCGACC GCTCCGCGCC GCCCGCCGCC CCGGCCGCGG GGGCGCGGCA TCTGGTGGGC
CCGGGCGCCG AAGGGGCCTG GGCAGGGCGG GAGGAGGCCT TTGCGGTCTG GGACGCGGAG
GCGGCGGTCT GGCGTTTCCT CGCCCCGCAG CCGGGCTGGC AGACCTTCGT GCTGGCCGAG
GGGGCGGGGC TCGTCTTCAC TGCCCAGGGC TGGCGCACGC TGATCGGCCT TCTGCCGGAA
TTTCCCTCGC TGGGCATCGC CACCCCGGCC GATGCCACCA ACCGCCTCGC GGTGGCGGGC
CCCGCCACGC TCTTCACCCA TGCGGGCGCG GGCCACCGGA TCAAGGTCAA CAAGGCCGCG
GAGGCCGAGA CGGCGAGCCT CCTGTTCCAG TCCGACTGGT CGGGCCGGGC CGAGATCGGG
CTTGCGGGCA GCGACGACTT CGCGCTGAAG GTCAGCCCGG ACGGCACTTC CTTCCGCACC
GCGCTCAGCG CCGACCGGGC GAGCGGGCGG GTGGCGCTGC CGCAGGGGGC GGTGGTGACG
GGCAGCCTCA CCGGAAGCGC GGTGCAGGCC TCGGCCGCCG ATGCGACCCC GGGCCGGCTC
CTGACGGTGG GGGCCTTCGG GCTGGGGGCG CCGGCGCCGC TCGTCGGCAA TGCCGGGGCG
GTGGACGGCG CGCTCGCCCC GGGCTTTTAC GGCTACGACA GCGCGCAGGG CAGCAGCGGT
GGCCCTGCGG GCGTGCAGGC GGGCCTTCTC CTTCATCAGA GCCGGGGGGC GGGCGAGGTG
CAGCTCTTTC TCGTGGAGGC GGGGGGCGGG GGCCTCATGC CGGGCATCCT CTTCTCGCGC
GCCCGCGGCG AGGGCGCCTG GTCGCCCTGG GTCGCGGGCG GGATCGTCGA GAGCGCGGGC
AACGCCAACG GCCGCTACAT CCGCCATCAG GACGGGACGC AGAGCTGCTG GCAGAAGGTG
ACCACCTCGG CCTCCGCCGA TGTGGTGGCC CCCTTTCCCG CCGCCTTCTC CACCGCCACG
GGCCTCGTCA CGGTCTCGAG CGTGGTCTCG AACGGAGCCC AGGCGCTCAG CCCGCGGCTG
ACCGGGCGGA CGACGACCAG CGTCGGCGTC TCGGTCTTCA GCGCCACGAA CACGCGCCTT
GCCGCGCAGG TCGAGCTGAT CTCGATGGGC CGCTGGTATT GA
 
Protein sequence
MSDASPILSL PYILPSQAQK HVTHNEALQR LDVLVQPAVL DRDRSAPPAA PAAGARHLVG 
PGAEGAWAGR EEAFAVWDAE AAVWRFLAPQ PGWQTFVLAE GAGLVFTAQG WRTLIGLLPE
FPSLGIATPA DATNRLAVAG PATLFTHAGA GHRIKVNKAA EAETASLLFQ SDWSGRAEIG
LAGSDDFALK VSPDGTSFRT ALSADRASGR VALPQGAVVT GSLTGSAVQA SAADATPGRL
LTVGAFGLGA PAPLVGNAGA VDGALAPGFY GYDSAQGSSG GPAGVQAGLL LHQSRGAGEV
QLFLVEAGGG GLMPGILFSR ARGEGAWSPW VAGGIVESAG NANGRYIRHQ DGTQSCWQKV
TTSASADVVA PFPAAFSTAT GLVTVSSVVS NGAQALSPRL TGRTTTSVGV SVFSATNTRL
AAQVELISMG RWY