Gene Sala_2538 details

Gene Information

Locus tag: Sala_2538
Symbol:
ID: 4081512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2678373
End bp: 2679464
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 65%
IMG OID: 638010915
Product: bile acid:sodium symporter
Protein accession: YP_617577
Protein GI: 103488016
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
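
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates (an assumption, although it is consistent with the listed values) and a single terminal stop codon. The function names are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2678373, 2679464)
print(length, protein_length(length))  # 1092 363
```

Under these assumptions the coordinates reproduce the listed gene length of 1092 bp, and 1092 / 3 = 364 codons minus one stop codon gives the listed 363 aa.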


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.344052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG GAATCGCTGC GGCTGCCCCT GCCGAAACGC AGAACCGACC CGGTATCGGC 
TTCTTCGAGC GTTATCTGAC GCTTTGGGTC GCACTGTGCA TCGTCGCCGG CATTGCGCTG
GGATCATGGC TGCCGGCGCT GTTCGCGACG ATCGCTTCGG CCGAGATCGC CCGCGTCAAT
CTCGTCGTCG CGGTGCTGAT CTGGCTGATG ATCGTGCCGA TGCTCTTGAA GATCGATTTC
GGCGCACTCG GCTCGGTCAG GCAGCACTGG AAGGGCGTCG GCGTCACGCT GTTCATCAAC
TGGGCGGTCA AGCCCTTCTC GATGGCGCTG CTCGGCACGC TGTTTATCGG TTGGTTGTTC
GCGCCGCTGC TGCCGCAGGG CGAGATTTCC TCCTACATCG CCGGGTTGAT CCTGCTCGCG
GCCGCGCCCT GCACGGCGAT GGTGTTCGTC TGGTCGAACC TTTGCGAGGG CGAGCCCAAC
TACACGCTCA GCCAGGTTGC CTTGAACGAC CTCATTATGG TGTTCGCCTT TGCGCCGATC
GTCGGCCTCT TGCTCGGGGT CGCTTCGATC ACCGTACCGT GGGAAACGCT GCTGCTCTCC
GTCGCGCTCT ATATCGTAGT GCCGGTGATG GTCGCGCAGG TCATCCGGCG GGCGGTTCTC
GCCCGCGGCG GTGCGGACGC GCTGCAGACA CTGCTCGACC GTCTCGGTCC GGTCTCGCTG
CTGGCGCTGC TCACCACGCT GGTGCTGCTG TTCGGCTTTC AGGGCGAGCA GATCCTTGCC
CGGCCACTCG TCATCGCTCT GCTCGCGGTG CCGATCCTGG TCCAGGTCTA TTTCAATGCA
GGGCTTGCCT ACTGGCTGAG CAAACGATTC GGCGTCGCAT GGTGCGTGGC CGCGCCGGCT
GCGCTGATCG GCGCCTCGAA CTTTTTCGAG TTGGCTGTCG CCGCCGCCAT CAGCCTGTTT
GGCCTCAACT CGGGGGCGGC GCTCGCGACT GTGGTCGGCG TGCTGGTGGA GGTGCCGGTG
ATGCTCTCGG TCGTGGCGAT CGTGAAGCGC ACCCGGAGTT GGTACGAGAA CCGCCCAGCG
AGCCTCGCCT AA
 
Protein sequence
MSDGIAAAAP AETQNRPGIG FFERYLTLWV ALCIVAGIAL GSWLPALFAT IASAEIARVN 
LVVAVLIWLM IVPMLLKIDF GALGSVRQHW KGVGVTLFIN WAVKPFSMAL LGTLFIGWLF
APLLPQGEIS SYIAGLILLA AAPCTAMVFV WSNLCEGEPN YTLSQVALND LIMVFAFAPI
VGLLLGVASI TVPWETLLLS VALYIVVPVM VAQVIRRAVL ARGGADALQT LLDRLGPVSL
LALLTTLVLL FGFQGEQILA RPLVIALLAV PILVQVYFNA GLAYWLSKRF GVAWCVAAPA
ALIGASNFFE LAVAAAISLF GLNSGAALAT VVGVLVEVPV MLSVVAIVKR TRSWYENRPA
SLA
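
Both listings above are printed in space-separated blocks of ten characters, and the protein listing should be the translation of the gene listing under translation table 11. The following is a minimal sketch of how the two could be checked against each other. It assumes the amino-acid assignments of table 11 match the standard code (the tables differ in which start codons are permitted) and that translation stops at the first stop codon. All helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(listing: str) -> str:
    """Join a listing printed in space-separated blocks into one string."""
    return "".join(listing.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last two blocks of the gene sequence listing above.
head = clean("ATGAGCGACG GAATCGCTGC GGCTGCCCCT")
tail = clean("AGCCTCGCCT AA")
print(translate(head))  # MSDGIAAAAP
print(translate(tail))  # SLA
```

The two fragments reproduce the first ten and last three residues of the protein listing. Passing the full cleaned gene listing to `translate` and `gc_percent` would allow the same comparison against the whole protein sequence and the GC content field.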