Gene Rsph17029_3160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3160 
Symbol 
ID4898924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp184269 
End bp185306 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content69% 
IMG OID640113762 
Productextracellular solute-binding protein 
Protein accessionYP_001045032 
Protein GI126463919 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00659585 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATGGATC GGGTTTGCAG GCTGTCGCTC GCGGCGCTGC TCCTGGGCTC GGGCGCCGCC 
TATGCGGACG GCTCGACGCT GACGGTCGCC TATTACGGCG GCAACTGGGG CGAGGCCTTC
GACGCCTGCG TGGCGCAGCC CTTCACCGAG AAGACGGGCA TCCGGGTGGT GGCCGAGATC
GGCAACTCGA CCACCACGCT TGCCAAGCTG CAGCAGCAGT CGGGCGATGC GGTGATCGAC
GTGGCCTATA TGGACGGCGG GATCAGCGAG CTCGCCCAAG AGGCGGGCGT GCTGGCCCCC
ATCGATCTCG CCGCGATCCC GAACGCGGCC AGCTACCTGC CGCAGGCCGT CTACGAGGCG
GGCGACGAGG TCTTCGCGGT CAGCGCGGGC TATTACTCGC TCGGCCTCAT CTACAACACG
TCGGAAGTGA CCGAGACGCC GGACAGCTGG CTTTCGCTCT GGGACGAGAG GTATGCGGGC
GCCGTGGCCC TGCCCTCGCC CAGCAACTCC TCGGGCGTGC CCTTCGTCCT GTTCCTCGCC
CGCTCGGTGC TGGGCGATAC CTCGGAATCC CTCGACCCGA CCTTCGCCAA GCTGAAGGAG
CTGGACACGG GGCTTCTCTT CGACAGTTCG GGCGCGGCCT CGAACGCCTT CCAGAGCAGC
GAGGTCATCA TCGGCGCCCA TTTCAACGTC GGCGCCTGGG ACCTGACCGA CGGCGGCCTG
CCCATCGGCT TCAGCGTGCC GAAGGAGGGC GTCTGGGCCA CCGATGCGCG GATGCATGTG
GTGAAGGGCA CGAAGAACCC CGAGGGCGCG GCGCAGTATC TCGACATGGC GGCCTCGCCC
GAGGCGGCGG CCTGCCTGGC CGAACGGCTC TATCTCGGCC CGCCCGTCAC CGGCGTCACG
CTCGCGCCCG ATGTCGAGCG CAAGCTGCCC TGGGGCGAGG GCGGATCGGT CGAGAAGCTG
CATCTGTCGG ACTGGACCGA GGTGAATGCG CGCCGCGCCG CCATCGTCGA ACGCTGGAAC
CGCGAGATCG CGAACTGA
 
Protein sequence
MMDRVCRLSL AALLLGSGAA YADGSTLTVA YYGGNWGEAF DACVAQPFTE KTGIRVVAEI 
GNSTTTLAKL QQQSGDAVID VAYMDGGISE LAQEAGVLAP IDLAAIPNAA SYLPQAVYEA
GDEVFAVSAG YYSLGLIYNT SEVTETPDSW LSLWDERYAG AVALPSPSNS SGVPFVLFLA
RSVLGDTSES LDPTFAKLKE LDTGLLFDSS GAASNAFQSS EVIIGAHFNV GAWDLTDGGL
PIGFSVPKEG VWATDARMHV VKGTKNPEGA AQYLDMAASP EAAACLAERL YLGPPVTGVT
LAPDVERKLP WGEGGSVEKL HLSDWTEVNA RRAAIVERWN REIAN