Gene Rsph17029_3409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3409 
Symbol 
ID4898961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp476359 
End bp477573 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content65% 
IMG OID640114006 
Productextracellular solute-binding protein 
Protein accessionYP_001045274 
Protein GI126464161 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.280364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.614723 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA CCATCAGCGC GCTGGCCCTT CTGGCCGGCC TCGCGCCGGG GATCGCCTCG 
GCCGACAGCA CCGTCCGCTT CTGGTATCAC TTCGACAATC CGGAAAACCC GATGTCGGAT
CTGGTGGCCA AGTTCGAGGA AGCGAACCCG GGCATCAGGA TCGACGCGCA GAACGTGCCC
TGGAACAGCT ATTACGACAA TCTCTACACG GCCATCGTTG GCGGCAACGC GCCCGACGCG
GCCATGCTGA AGATGTTCGC CCTGCCGCGC CTTCTCGAGA TGGAGGCGCT CGAGCCGCTC
GACGAGATGA TCGCGGGCTG GGAGGGCCGC GACGACATCC TCGACAATCT CTTCGACCTG
ACCGAGGCCG AGGATGGCAA GCACTATTAC CTGCCGGTGC AATATGTGGC GCTCTACCTC
TATTACCGCG CCGACATGTT CGAGGAGCTG GGCCTCGCCC CGCCCGAGAC CTGCGACCAG
TTCCGCGAAG CCGCGATCAA GCTCACCCGC GACACGAACA ACGACGGCAA GATCGACACC
TACGGGTTCG GCTTCCGCGG CGGCAAGTCC GGGCACGAAC ATTGGGGCGC CTTCACCCTC
GGCCGCGAGG GCGTGGCGCT CGATGACAGC CTCACCTCCG AGGCCGGCGT GGCGGGCACG
CAGTTCGTGG TGGATCTCTT CCAGAAGGAC AAGGTCTTCC CGCCCTCGGC CCCGAACGAC
GGCTTTCAGG AGATCATCGG CGCCTTCAAG ACCGGCGTGA CCGCGATGAC GATCCATCAT
GTCGGCTCCT CGAACGATCT GGTGGCGGCG CTGGGCGACA AGGTCGCGGC CGTGCCGGTG
CCGGAATGCG GCGGCGGGCG CTGGACCACC TTCGGCGACG AATCCACCGG CGTCTTCAGC
AATGCCAGCG ACAAGGAAGC CGCCTGGAAG TGGATCGCGT TCCTTTCGTC GGAGGGCAAC
AACGCGCTCT TCAACAGCGC CACCGGCCAG CTTCCGGTGA CCAAGAGCGA CAGCGCCACC
TGGGACCAGC ACGAGAAGCG CTTCGTCGAC GCGACCCAGG CCTCGCTCCC CTTCGCCCAT
CTGCTGCCCG CCTCCTCGGC CACGCCCGAG TTCGTGAACA CCGTCTGGCC CACGAACATG
CAGCGCGCGC TGAACGGGGA GATCACCGCC GCCCAGATGA ACGAAGCCAT CGCCAAGCTC
TTCGCCGAAG AGTGA
 
Protein sequence
MKRTISALAL LAGLAPGIAS ADSTVRFWYH FDNPENPMSD LVAKFEEANP GIRIDAQNVP 
WNSYYDNLYT AIVGGNAPDA AMLKMFALPR LLEMEALEPL DEMIAGWEGR DDILDNLFDL
TEAEDGKHYY LPVQYVALYL YYRADMFEEL GLAPPETCDQ FREAAIKLTR DTNNDGKIDT
YGFGFRGGKS GHEHWGAFTL GREGVALDDS LTSEAGVAGT QFVVDLFQKD KVFPPSAPND
GFQEIIGAFK TGVTAMTIHH VGSSNDLVAA LGDKVAAVPV PECGGGRWTT FGDESTGVFS
NASDKEAAWK WIAFLSSEGN NALFNSATGQ LPVTKSDSAT WDQHEKRFVD ATQASLPFAH
LLPASSATPE FVNTVWPTNM QRALNGEITA AQMNEAIAKL FAEE