Gene Rsph17025_0794 details

Gene Information

Locus tag: Rsph17025_0794
Symbol:
ID: 5083485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 808418
End bp: 809425
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 68%
IMG OID: 640482352
Product: sulfate ABC transporter, periplasmic sulfate-binding protein
Protein accession: YP_001167005
Protein GI: 146276846
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
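
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length counts one terminal stop codon; the function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
start_bp = 808418
end_bp = 809425
gene_length_bp = 1008
protein_length_aa = 335


def span_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def coding_residues(length_bp):
    """Residues encoded by a CDS that ends in a single stop codon (assumed)."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(start_bp, end_bp))    # 1008
print(coding_residues(gene_length_bp))  # 335
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.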


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAGA CCTTTCGCAG CCTCGGACGC GGGCTTGCCA TCGCGGGCGT GGCGCTGGCC 
GCGGCGCTTG GCGCGCCAGC CGGGGCCGAG ACACTGCTGA ACGTGAGCTA CGACCCGACG
CGCGAGCTTT ACCGCGACGT GAACGAGGCT TTCGCGAAAC ACTGGCAATC GCAGGGCAAC
CCGGCCCCGA CGCTCGAGTC CAGCCATGGC GGCTCGGGCG CGCAGGCCCG CGCGGTGATC
GACGGGCTGA ACGCGCAGGT GGTGACGCTG GCGCTCGCCT CGGACATCGA CGCGATTGCG
GCGAAGTCGG GCAGGATCCC GGCCGACTGG CAGGCGAAGC GCCCGCACAA CTCCTCGCCC
TATACCTCGA CCATCGTTTT CCTGGTGCGC GAGGGCAATC CGAAGGGCAT CGGGGACTGG
GGCGACCTGG TGAAGGAGGG CGTGCAGGTC ATCACGCCCA ACCCCAAGAC CAGCGGAGGC
GCGCGCTGGA ACTATCTGGC CGCCTGGGCC TGGGCCGAGA AGAACGGCCA GGATCCGAAG
GCCTTCCTGC ACGACCTTTT TGCCAATGTG CCTGTGCTCG ACACCGGCGC GCGCGGCGCC
ACCACGACCT TCGCGCAGCG GGGTCTTGGC GATGTGCTGC TGGCCTGGGA GAACGAGGCC
TGGCTCGCGC TTGAGGAACT GGGCGAGGAC CGCTTCGACA TCGTCGTCCC CTCGGTCTCG
GTGCTGGCCG AGCCGCCGGT GACGGTGGTC GAGGGCAACA TCGCCTCGGA GGCGCAGCGC
GAGCTGGCGA ATGCCTATCT CGACTTCCTC TACACGCCCG AGGGCCAGGC GCTCGCCTTC
AAGCACTACT ACCGCGCCTG GGATGCGTCA AAAGCCGATC CGGCCGACGT GAAGCGCTTC
CCCGAGCTGG AACTGGTCGA CATCGGCCAC TTCGGCGGCT GGGCCAAGGC GCAGCCCGAA
CATTTCGGCG ACGGCGGCAT CTTCGACCAG ATCTACGAGG CGAAGTGA
 
Protein sequence
MTQTFRSLGR GLAIAGVALA AALGAPAGAE TLLNVSYDPT RELYRDVNEA FAKHWQSQGN 
PAPTLESSHG GSGAQARAVI DGLNAQVVTL ALASDIDAIA AKSGRIPADW QAKRPHNSSP
YTSTIVFLVR EGNPKGIGDW GDLVKEGVQV ITPNPKTSGG ARWNYLAAWA WAEKNGQDPK
AFLHDLFANV PVLDTGARGA TTTFAQRGLG DVLLAWENEA WLALEELGED RFDIVVPSVS
VLAEPPVTVV EGNIASEAQR ELANAYLDFL YTPEGQALAF KHYYRAWDAS KADPADVKRF
PELELVDIGH FGGWAKAQPE HFGDGGIFDQ IYEAK
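
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so it would not correctly handle a gene that begins with an alternative start codon. The names `CODON_TABLE` and `translate` are hypothetical, and only the first three 10-base blocks and the last three codons of the gene are used as input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used in the record)
    is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGACGCAGA CCTTTCGCAG CCTCGGACGC"  # start of the gene sequence
last_codons = "GCGAAGTGA"                          # end of the gene sequence

print(translate(first_blocks))  # MTQTFRSLGR
print(translate(last_codons))   # AK
```

The two outputs match the first ten and last two residues of the protein sequence listed in the record.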