Gene Rsph17029_4143 details


Gene Information

Locus tag: Rsph17029_4143
Symbol:
ID: 4894956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 79619
End bp: 81109
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 68%
IMG OID: 640110535
Product: sulphate transporter
Protein accession: YP_001041847
Protein GI: 126464871
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
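
The length fields above follow from the coordinates. As a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed numbers), the gene and protein lengths can be recomputed as follows. The function names are illustrative and not part of any database API.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 79619
END_BP = 81109


def gene_length(start_bp, end_bp):
    """Length in bp of an inclusive, 1-based coordinate range (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1491 496
```

Both results agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields.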


Plasmid Coverage information

Num covering plasmid clones: 81
Plasmid unclonability p-value: 0.508442
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 95
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTCAT TCGCCACCTA TCGCCAGCAA TGGCTGGGCC ATGTGCGGGG CGACCTGCTC 
TCGGGGCTCG TCGTGGCGCT GGCCCTCATT CCCGAGGCCA TCGCCTTCTC GATCATCGCG
GGCGTCGATC CGAAGGTCGG GCTCTATGCC TCCTTCTCGA TCGCCGTCGT CACCGCCATC
GCGGGCGGAC GGCCCGGGAT GATCTCGGCC GCCACCGCAG CGACCGCCGT GCTGATGGTG
ACCCTCGTGC GCGACCACGG GCTCCAGTAT CTGCTGGCCG CCACCGTGCT CGCGGGGCTG
ATCCAGATCG CGCTCGGGCT CCTGAAGCTC GGCTTCGTCA TGCGCTACGT CTCGCGCTCG
GTGATGACGG GCTTCGTCAA TGCGCTGGCG ATCCTGATCT TTCTCGCGCA ATTGCCCGAG
CTCGACCCGC GGACCGTGCC GCCGCTGACC TATCTCCTCG TGGCGGCGGG CCTGGCCATC
ATCTATCTCT TCCCGCGCCT CACCCGCGCC GTGCCCTCGC CGCTCGTCAC CATCGTCGTG
CTGACGGCGC TGACGCTCGG CCTCGGGCTC GACGTGCGGA CGGTGGGCGA CATGGGCGTG
CTGCCCGACA CGCTGCCCGT CTTCCTGATC CCGGACATTC CCCTGACCTT CGAGACACTG
CGGATCATCC TGCCCCCGGC CACAGCCGTG GCGGTGGTGG GGCTTCTGGA AAGCCTGATG
ACGCAGACCC TCGTCGACGA GCTGACCGAC ACCCGCTCGA GCCGCAATCA GGAATGTATC
GGGCAGGGGC TGGCCAACGC CGCCACCGGC TTCATCGGCG GCATGGCGGG CTGCGCCATG
ATCGGCCAGT CGATGATCAA CGTGAAGTCG GGCGGGCGCG GGCGGCTGTC CTGCTTCGTG
GCGGGCGTGT TCCTGCTGAT CCTCGTCGTG GGGCTCGGCG ATGTCGTCAG CCGGATCCCG
ATGGCCGCGC TCGTGGCCAT CATGATCATG GTCTCGATCG GCACCTTCTC CTGGTCGTCC
CTCAAGGCGC TGCGCACCCA TCCCCGGTCC TCCTCCGTGG TGATGCTGGC GACGGTGGCG
ACCGTGGTCT GGACCCACAA TCTGGCCCTG GGCGTCCTCG TGGGCGTGCT GCTCTCGGGG
ATCTTCTTCG CCGCCAAGAT TGCGCAGCTC TTCGCGGTCA GCTCCGAACT CTCGGCCTGC
GGGCGCGCGC GGACCTACCG GGTCGAGGGC CAGCTCTTCT ACGGCTCGGT CGAGGATTTC
ATGGCCGCCT TCGACTTCCG CGAACCGCTC GAGCGCGTCA CCATCGACGT GAGCCGCGCC
CATATCTGGG ACATCTCCTC GGTGCAGGCG CTCGACATGG CGGTGCTGAA GTTCCGCCGC
GAGGGGGCCG AGGTGCGGAT CGTGGGCATG AACGAGGCCT CCGAGACTCT CGTCGACCGG
CTGGCCCTGC ACGACAGGCC GGGAGCCATG GACCGGCTCA CGGCCCATTG A
 
Protein sequence
MISFATYRQQ WLGHVRGDLL SGLVVALALI PEAIAFSIIA GVDPKVGLYA SFSIAVVTAI 
AGGRPGMISA ATAATAVLMV TLVRDHGLQY LLAATVLAGL IQIALGLLKL GFVMRYVSRS
VMTGFVNALA ILIFLAQLPE LDPRTVPPLT YLLVAAGLAI IYLFPRLTRA VPSPLVTIVV
LTALTLGLGL DVRTVGDMGV LPDTLPVFLI PDIPLTFETL RIILPPATAV AVVGLLESLM
TQTLVDELTD TRSSRNQECI GQGLANAATG FIGGMAGCAM IGQSMINVKS GGRGRLSCFV
AGVFLLILVV GLGDVVSRIP MAALVAIMIM VSIGTFSWSS LKALRTHPRS SSVVMLATVA
TVVWTHNLAL GVLVGVLLSG IFFAAKIAQL FAVSSELSAC GRARTYRVEG QLFYGSVEDF
MAAFDFREPL ERVTIDVSRA HIWDISSVQA LDMAVLKFRR EGAEVRIVGM NEASETLVDR
LALHDRPGAM DRLTAH
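
The sequences above are printed in blocks of ten characters, so they need cleaning before any computation. The following is a rough sketch of how the blocked gene sequence could be joined, checked for GC content, and translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard assignment, which translation table 11 shares for elongation codons; the alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGATTTCAT TCGCCACCTA TCGCCAGCAA TGGCTGGGCC ATGTGCGGGG CGACCTGCTC"
print(translate(clean_sequence(first_line)))  # prints: MISFATYRQQWLGHVRGDLL
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence through the same functions should allow the GC content and Protein Length fields to be checked against the record.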