Gene Rsph17029_4179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4179 
Symbol 
ID4895050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009040 
Strand
Start bp114519 
End bp115523 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content69% 
IMG OID640110570 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_001041882 
Protein GI126464906 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones83 
Plasmid unclonability p-value0.0354274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value0.159883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAGT TCTTCCCCCG CCTTCCACGC CTTGCCACAG CCGGCCTCGC GCTGGCCCTG 
GGGCTCGGGG CCCCCGCCTC GGCCGAAACC CTGCTGAACG TGAGCTACGA CCCCACGCGC
GAGCTCTACC GCGACGTGAA CGAGGCCTTC GCCAAGGCTT GGGAGGCCGA GGGACACGAG
GCGCCCGAGA TCGAGACCAG CCACGGCGGC TCGGGCGCGC AGGCGCGGGC GGTGATCGAC
GGGCTCGATG CGCAGGTGGT GACGCTGGCG CTCGCCTCCG ACATCGATGC GGTCGCGGCG
AAATCGGGCA AGATCCCCGC CGACTGGCAG AGCCGGCTGC CGCACAATTC CTCGCCCTAT
ACCTCGACCA TCGTCTTCCT CGTGCGGTCG GGCAATCCCA AGAAGATCGC CGACTGGGGC
GATCTGGTGA AGGAGGATGT GCAGGTCATC ACGCCCAATC CCAAGACCAG CGGCGGGGCG
CGCTGGAACT ACCTCGCGGC CTGGGCCTGG GCCGAGAAGA ACGGGCAGGA TCCGAAGGCC
TTCCTCCACG ACCTCTATGC CCATGTGCCG GTGCTCGACA CCGGCGCGCG GGGGGCCACC
ACGACCTTCG TGCAGCGCGG CCTCGGCGAC GTGCTGCTGG CCTGGGAGAA CGAGGCCTGG
CTCGCGCTGG AGGAACTCGG GCCCGACCGG TTCGAGATCG TCGTGCCCTC GCTCTCGATC
CGGGCCGAGC CGCCGGTGGC GCTGGTCGAG GGCAATCTCG CCTCGGACGG GCAGCGCGAA
CTGGCGACGG CCTATCTCGA TTTCCTCTAC ACGCCCGAGG GGCAGGCGCT GGCCTACAAG
CACTATTACC GCGCCTGGGA CAGGTCGGCC GCCGATCCTG CGGATGTGAA GCGCTTCCCC
GATCTCGAGC TGGTCGACAT CGGCCACTTC GGCGGATGGG GCAAGGCGCA GCCCGATCAT
TTCGGCGACG GCGGCACCTT CGACCAGATC TACGAGGCGA AATAG
 
Protein sequence
MTQFFPRLPR LATAGLALAL GLGAPASAET LLNVSYDPTR ELYRDVNEAF AKAWEAEGHE 
APEIETSHGG SGAQARAVID GLDAQVVTLA LASDIDAVAA KSGKIPADWQ SRLPHNSSPY
TSTIVFLVRS GNPKKIADWG DLVKEDVQVI TPNPKTSGGA RWNYLAAWAW AEKNGQDPKA
FLHDLYAHVP VLDTGARGAT TTFVQRGLGD VLLAWENEAW LALEELGPDR FEIVVPSLSI
RAEPPVALVE GNLASDGQRE LATAYLDFLY TPEGQALAYK HYYRAWDRSA ADPADVKRFP
DLELVDIGHF GGWGKAQPDH FGDGGTFDQI YEAK