Gene Sala_2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2784 
Symbol 
ID4080365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2945610 
End bp2947094 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content64% 
IMG OID638011168 
Productsulphate transporter 
Protein accessionYP_617822 
Protein GI103488261 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.88404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.423283 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAGCC TTTCATCGCT TCGCCGCGAT TGGTTGTCCA ATCCGCGCGG GGATATATTG 
GCGGGGATCG TCGTCGCGCT GGCGCTGATT CCCGAAGCCA TCGGTTTTTC GATCATCGCC
GGGATCGATC CCAGGGTCGG GCTCTACGCC TCCTTCTCGA TCGCCGCCAT CATCGCGCTC
GTCGGCGGGC GGCCCGGCAT GATCTCCGCT GCGACCGCCG CGATCGCCGT GTTGATCGTC
CCGCTCGTCA AGGCGCATGG CGTCGAATAT CTCTTCGCCG CGACGATCCT GATGGGTGTG
CTCCAGTTCA TCGGCGGCCT GCTCCGCCTC GACCTCCTCA TGCAGTTCGT CTCCAAGTCG
GTCGTCACCG GCTTCGTCAA CGCGCTCGCG ATCCTGATCT TCATGGCGCA GCTTCCGCAG
CTAATCGGCG TCGGCTGGAT GACCTATCCG CTCGTCGCCG CGGCGCTGGC GATCATCTAT
CTTTTGCCGC GCCTGACCAG GGCGATCCCG TCGCCGCTGG TCGCGATCGT CGTCCTGACC
GCGGCCGCGA TCTACTGGAA TCTCGACGTC AACCGCGTCG GCGACATGGG CGAGCTTCCC
TCCGCCCTGC CCTTCTTCGC GCTTCCGCAA GTGCCGCTGA CCTTCGAGAC GCTCGCGATC
ATCTTCCCCT ATTCGCTGGC GATGGCGGCG GTCGGCCTGC TCGAAAGCCT GCTCACCGCG
CAGATCGTCG ACGATCTGAC CGACACGCCG AGCGACAAGC CGCGCGAGCT GAAGGGTCAG
GGCATCGCCA ATTTCGTCAC CGGCTTTTTC GGCGGCATGG GCGGCTGTGC GATGATCGGC
CAGTCGGTGA TCAATGTGAA ATCGGGCGGC GACGGACGGC TTTCGTCTTT CGTGGCGGGC
ACCTTCCTGC TCTTTCTGAT CGTCGTGCTC GGCCCGCTCG TCGCGCAAAT CCCGATGCCC
GCGCTCGTCG CCGTGATGAT CATGGTGTCG ATCGGCACCT TCAGCTGGCG ATCGGTCAAG
GAATTGCGCA CCAACCCCTG GCACAGTTCG GTCGTGATGG CCGCCACCGT GGTCGCGGTC
GTCGCGACGC ACGACCTTGC CAAGGGCGTG CTGGTCGGCG TGCTGTTGTC GGGCATCTTT
TTCGCAAGCA AGGTGCGCGC CCTCTTTGCG GTCGATACCA GCCTGTCGAC CGACGGCTCG
ACGCGCACCT ATCGTTTCAC CGGCCAGATT TTCTTCGCCT CGGTCGAGCG CTTCCTGGCC
GCGTTCGATT TCCGCGAAGT GATCGAAAAG GTCGTGATCG ACGTTCGCGA CGCGCATTTC
TGGGATATTT CGGCGGTCGC CGCGCTCGAC AAGGCCGTGA TCAAGCTGCG CCGCGAAGGC
ACCACGGTCG AAGTGCTGGG GCTCAACGAA GCCAGCGCGA CGATGATCGA CCGCTTCGGC
ATTTCGGACA AGCCCGACGC CGAAGCGCGC CTCGCCGCGC ATTGA
 
Protein sequence
MPSLSSLRRD WLSNPRGDIL AGIVVALALI PEAIGFSIIA GIDPRVGLYA SFSIAAIIAL 
VGGRPGMISA ATAAIAVLIV PLVKAHGVEY LFAATILMGV LQFIGGLLRL DLLMQFVSKS
VVTGFVNALA ILIFMAQLPQ LIGVGWMTYP LVAAALAIIY LLPRLTRAIP SPLVAIVVLT
AAAIYWNLDV NRVGDMGELP SALPFFALPQ VPLTFETLAI IFPYSLAMAA VGLLESLLTA
QIVDDLTDTP SDKPRELKGQ GIANFVTGFF GGMGGCAMIG QSVINVKSGG DGRLSSFVAG
TFLLFLIVVL GPLVAQIPMP ALVAVMIMVS IGTFSWRSVK ELRTNPWHSS VVMAATVVAV
VATHDLAKGV LVGVLLSGIF FASKVRALFA VDTSLSTDGS TRTYRFTGQI FFASVERFLA
AFDFREVIEK VVIDVRDAHF WDISAVAALD KAVIKLRREG TTVEVLGLNE ASATMIDRFG
ISDKPDAEAR LAAH