Gene RSP_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_4037 
Symbol 
ID3711799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007488 
Strand
Start bp3642 
End bp4931 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content68% 
IMG OID640069302 
ProductABC sugar transporter, periplasmic lignad binding protein 
Protein accessionYP_345169 
Protein GI77404595 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0204263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACAGT TCATCATCGG CGCCGCCGTG GCGCTCGTTG CCGTGCCGGC CGCAGCGCAG 
GAGTTCGACT GGCGCAAGCA CGAGGGCGAG ACGATCAACG TCATGCTGAA CAACCTCGCC
TGGACGCAGC TGATGCGCGA CCGGATCGAG GCCTTCACCG AGGCCACCGG CATCAGGGTG
CGCGCCGAGA CCTTCAGCGA AGAGCAGTAC CGCACCCGCC TGACCACGCT TCTTCAGGGC
GGCTCGAGCG AGCTCGACGT CTTCATGACC CTGCCCTCGC GCGAGGCGCC GCTCTTCGCC
TCGAACGGCT GGTATGCCGA TCTCGCGCCG CTCCTGAAGG GCGAGGCGAC CGATCCGGCC
TACGATTACG ACGATTTCAG CGCGGCCCTG CGCCAGAGCG GCGTGGTGGG CGAGACCATC
ACCAGCGTGC CGATCAACGT CGAGGGCCCG CTCTTCTACT GGCGCCGCGA CATCTTCGAG
AAATGCAACG TCGAGAAGCC CGAATATCTC GAGGATCTGC CCGCCGCGGC CGAGAAGATC
CGCGCCTGCG ACAGCGCGAT CACGCCCTGG GCCGCCCGCG GCCTGCGCGG CACCGTGGGC
TACCCGCTCG GCGCCTTCGT CTACAACATG GGCGGCGACT TCATGGATGC GGACGGCAAG
GCCTCGCTCT GCCTGCCGGG CACGATCAAG GGCCTCGACC TCTACGGCTC GATGCTGCGC
GACTACGGCC CGCCGGGCGC CACCAACCAC ACCTTCACGC AGGTGATGGA CCTGCTGGGT
CAGGGCCGCG TCGCCATGAC CAACGAATCC TCGAACGAAT TCTCGACCCT GATGAAGCAT
GAGGGCCGGG CCGAGGACAT CGGCGTGGAT GTGCTGCCCG GCGGGCGCGA GTCCGGCACC
TCGAAACCCG TGGTCATCAA CTGGAGCCTC GCCGTCTCGG GCCTCTCCGA GAACAAGGAA
GCCGCCTGGT ATTTCGTCCA GTGGGCCACC GGCGCCGAGA ACCAGGAGGC GCTCGCCACG
CAGGGCATCG CCCCCTCGCG CGTCTCGGTC TTCAACGGCG AAGGCTTCCG CAACTGGGCC
AGCGAAAGCC GCCCGCGCGG CGAATGGCTC GAGGCGCTGC TCGAGATCTC GCAGACCGGC
TCCTCGCTCT ACCAGACCCC CTCGCTGACC CGGACGCCCG AGGCGCGCGA GATCCTGTCG
AACGTGGTGC AGCAGATCGT GCTGGGCCAG ACCGACGCCG AAACCGCCGC CTGCGCCGTG
ACCGACGAGG TCCAGGCCCT GCAGAACTGA
 
Protein sequence
MRQFIIGAAV ALVAVPAAAQ EFDWRKHEGE TINVMLNNLA WTQLMRDRIE AFTEATGIRV 
RAETFSEEQY RTRLTTLLQG GSSELDVFMT LPSREAPLFA SNGWYADLAP LLKGEATDPA
YDYDDFSAAL RQSGVVGETI TSVPINVEGP LFYWRRDIFE KCNVEKPEYL EDLPAAAEKI
RACDSAITPW AARGLRGTVG YPLGAFVYNM GGDFMDADGK ASLCLPGTIK GLDLYGSMLR
DYGPPGATNH TFTQVMDLLG QGRVAMTNES SNEFSTLMKH EGRAEDIGVD VLPGGRESGT
SKPVVINWSL AVSGLSENKE AAWYFVQWAT GAENQEALAT QGIAPSRVSV FNGEGFRNWA
SESRPRGEWL EALLEISQTG SSLYQTPSLT RTPEAREILS NVVQQIVLGQ TDAETAACAV
TDEVQALQN