Gene RSP_3701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3701 
Symbol 
ID3722191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp822302 
End bp823351 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID640073375 
ProductABC sugar transporter, periplasmic binding protein 
Protein accessionYP_355212 
Protein GI77465709 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGCA GAACCTTCGC CCTGGCTGCG GCCTCGGGCC TCGTGGCCGC CCTTTTCGGC 
GCCGCCGCTT CCGCGCAGGA GGCCGCCACC GTGGCCTTCC TGATGCCCGA CCAGGCATCC
ACCCGCTACG AGGAGCACGA CTTCCCCGGC TTCCAGAAGT CGATGGGCGA GCTCTGCGCC
GACTGCACGG TGATCTACCA GAACGCCAAC GGCGACGTGG CGCTCCAGCA GCAGCAGTTC
AACTCGGTGA TCGCGCAGGG CGCCAAGATC GTCGTGCTCG ATCCGGTCGA TTCGGCCGCC
GCCGCCTCGA TGGTCGAGAT CGCCCATTCG CAGGATGTGA AGGTGATCGC CTATGACCGG
CCGATCCCGG CCACGCCCGC GGATTACTAC GTCTCCTTCG ATAACAAGGG CATCGGCCAG
GCCATCGCCC AGTCGCTCGT CGATCATCTG AAGGCCACCG GCGTGCCGGA CGGCGCGGGC
GTCCTGCAGA TCAACGGCTC GCCCACCGAT GCGGCCGCGG GCCTCATTCG CGACGGGATC
GACGCGGCGC TCGACGCATC GAGCTACAAG ACGCTGGCCG AGTTCGACAC GCCGGACTGG
GCCCCGCCGA AGGCGCAGGA ATGGGCCGCG GGCCAGATCA CCCGCTTCGG CGACGAGATC
AAGGGCGTGG TCGCGGCCAA TGACGGCACC GCCGGCGGCG CCATCGCGGC CTTCAAGGCG
GCGGGCGTGG ATCCGGTTCC GCCGGTCACC GGCAACGACG CCACCATCGC GGCGCTGCAG
CTCATCATCT CGGGCGACCA GTACAACACC ATCTCGAAAC CCTCCGAGAT CGTGGCCGAG
GCCGCGGCGA AGGTGGTCGT GACCTTCCTC AAGGGCGAGA CCCCCGAGGC CAAGACCACG
CTCTACGACA CGCCGGCCGA GCTCTTCGTG CCTGCGGTGG TGACGGCCGA GAACATCAAG
GCCGAGATCT TCGACAAGGG CATCCAGACC GCGGCGGAAG TCTGCACCGG CGAATATGCC
GAAGGCTGCG CCAAGCTCGG CATCCAGTGA
 
Protein sequence
MTSRTFALAA ASGLVAALFG AAASAQEAAT VAFLMPDQAS TRYEEHDFPG FQKSMGELCA 
DCTVIYQNAN GDVALQQQQF NSVIAQGAKI VVLDPVDSAA AASMVEIAHS QDVKVIAYDR
PIPATPADYY VSFDNKGIGQ AIAQSLVDHL KATGVPDGAG VLQINGSPTD AAAGLIRDGI
DAALDASSYK TLAEFDTPDW APPKAQEWAA GQITRFGDEI KGVVAANDGT AGGAIAAFKA
AGVDPVPPVT GNDATIAALQ LIISGDQYNT ISKPSEIVAE AAAKVVVTFL KGETPEAKTT
LYDTPAELFV PAVVTAENIK AEIFDKGIQT AAEVCTGEYA EGCAKLGIQ