Gene Sala_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2083 
Symbol 
ID4080057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2188781 
End bp2190370 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content67% 
IMG OID638010458 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_617125 
Protein GI103487564 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.438087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.890804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCCC GCTCCCTCCC CCGCCGTCCT GCTGCCGCGG CGCTCCCCGC TGACGATATG 
ATCCCGCTCC ACGAACGCGT GCGCTACCGC GGCCTGCTCA CCGTCGCCGT CATGGGCGCG
TCGATCATGC AGATCCTCGA CACGACGATC GCCAATGTCG CCATCCCGCA CATGCAATCG
GCCTTGGGCG CGACGAGCGA AACGGTGACG TGGGTGCTCA CCAGCTATAT CCTCGCAAGC
GCGATCGCGA TGCCGATCAC CGGCTGGCTC GCCGACCGCA TCGGGCGGCG CGAGCTGTTC
CTTGCCGCGA TCGCGGGGTT CATCGTCGCG TCGATGGCGT GCGGCGCGGC GCAATCGCTC
GAACAGATGG TCGCCTTCCG CTTTTTGCAG GGTATTTTCG CGGCCTTCAT CGGCCCGCTG
TCGCAGTCGG TGATGCTCGA CATCAACCCG CCCGAGCGCC ATGCGCGCGC GATGTCGATC
TGGGGCATGG GGATCATGAT CGGACCCATT CTCGGCCCCG TGCTCGGCGG CTGGCTGACC
GAAAGCGCGA ACTGGCGCTG GGTCTTTTAC GTCAACCTGC CCGTCGGGCT GGTCACGCTC
GCGCTGATGT GGGCGCTGCT GCCCGCGATG CGCCGGACAA GCCGCAAGTT CGATCTCTTC
GGCTTTTCGA TGCTCGCGCT GGGGCTCGCC GCGCTGCAAC TGATGCTCGA CCGCGGCGCG
CATCTCGACT GGTTCGACAG CATCGAAATC TGGATCGAAC TCGGCGTCGC GACGGCCTGC
CTCTGGATGT TCTTCGTCCA TCTCTTCACC GCGCGCGCGC CGCTGTTCAG CCGCGCGATG
CTCGCCGACC GCAACCTCGT CACCGCGATG GGCTTCATGA TCGTCATCGG CATCGTGATG
TTCGCGTCGA TGGCGCTGCT GCCGCCGATG CTGCAAAATC TCTTCGGCTG GCCGGTGATC
GACACGGGGC TGGTGCTCGC GGTGCGCGGC GTCGGCATCC TCGCGAGCAT GTGGGTCGCG
GGGCAGTTGC TGGGCAAGGT CGACGCGCGC TGGCTCGTCG GCACCGGCCT TGCCATCGCC
GCCTATTCGC TGTGGCAGAT GAGCCACTGG TCGCTCGCCA TGGGGATGCA GCCGGTGATC
GTCAGCGGGC TGGTGCAGGG GCTGGGCATG GGGCTGATCT TCATTCCGCT CAACACCATG
GCCTTTGCAA CGATCGCGCC GCAGCACCGC ACCGACGGGT CGAGCCTGCT GAACCTGCTC
CGCAGCCTCG GCGCCTCGGT CGGCATTTCG GTGGTGACCA CCCTGCTCGG CATCAATATC
CAGACCAGCC ATCAGGATCT CGCCGCGCAT GTCACCAACA GCTCGGTCGC GCTCATCGAC
CCCTCGACCG CCGACCGCTT CGGCGTCGTC GGCGACACCG CGCTGGCGAT GGTCAACGCC
GAGATCAACC GGCAGGCGGC GATGGTCGCC TATATCGACG ATTTCTGGCT GATGATGTGG
GTGACTTTGC TTTCGGTCCC GCTCGTCCTC CTGCTCCGCC CGCCCAGGGC CGGCGCGCCC
AAGGCCTCGG CGGCCGACAT GGGGCATTGA
 
Protein sequence
MASRSLPRRP AAAALPADDM IPLHERVRYR GLLTVAVMGA SIMQILDTTI ANVAIPHMQS 
ALGATSETVT WVLTSYILAS AIAMPITGWL ADRIGRRELF LAAIAGFIVA SMACGAAQSL
EQMVAFRFLQ GIFAAFIGPL SQSVMLDINP PERHARAMSI WGMGIMIGPI LGPVLGGWLT
ESANWRWVFY VNLPVGLVTL ALMWALLPAM RRTSRKFDLF GFSMLALGLA ALQLMLDRGA
HLDWFDSIEI WIELGVATAC LWMFFVHLFT ARAPLFSRAM LADRNLVTAM GFMIVIGIVM
FASMALLPPM LQNLFGWPVI DTGLVLAVRG VGILASMWVA GQLLGKVDAR WLVGTGLAIA
AYSLWQMSHW SLAMGMQPVI VSGLVQGLGM GLIFIPLNTM AFATIAPQHR TDGSSLLNLL
RSLGASVGIS VVTTLLGINI QTSHQDLAAH VTNSSVALID PSTADRFGVV GDTALAMVNA
EINRQAAMVA YIDDFWLMMW VTLLSVPLVL LLRPPRAGAP KASAADMGH