Gene Sala_1946 details


Gene Information

Locus tag: Sala_1946
Symbol:
ID: 4082895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2050771
End bp: 2052396
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 63%
IMG OID: 638010323
Product: major facilitator transporter
Protein accession: YP_616991
Protein GI: 103487430
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
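
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2050771
END_BP = 2052396


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1626 541
```

Both results agree with the Gene Length (1626 bp) and Protein Length (541 aa) fields.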


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACG CAGAGGCAGC GGCACCTATC GTCACCCGCC AGAGCGAACG CAAGGTCATC 
CTCGCCTCGT CGCTCGGTAC GGTGTTCGAA TGGTATGATT TCTACCTCTA CGGCCTGCTC
GCGACGGTCA TTTCGGCGCA GTTCTTCTCT GGCGTCAACG AAACCACGGG CTTCATCTTT
GCGCTTGCCG CCTTTGCCGC GGGCTTTGCG GTGCGGCCCT TTGGCGCAGT TGTCTTCGGG
CGCATAGGCG ACCTCGTCGG GCGCAAGAAT ACGTTTCTGG TGACCATGGG GCTGATGGGC
GCCTCGACGT TCCTCGTCGG CCTGCTGCCC AGCTATGCGT CGATCGGGGT CGCGGCGCCG
ATCCTGCTCG TTGTCTTGCG CCTTGTGCAA GGACTCGCGC TTGGCGGCGA ATATGGCGGC
GCTGCCACTT ATGTTGCCGA ACATGCGCCC GAGGGGAAGC GCGGGCTGTT CACCAGCTTC
ATCCAGACCA CCGCGACGCT CGGCCTGTTC GCGGCGCTCG GCGTCGTCAT CGTCATCCGC
TCGGCGATGG GCGAGGCGGC GTTTGCCGAC TGGGGCTGGC GCATCCCCTT CCTCATTTCA
ATCGTCCTGC TCGGCATTTC GCTCTGGATC CGCCTCCAGC TCGAGGAAAG CCCGGTCTTC
AGGCAGATGA AGGCAGAGGG GACGACGTCG AAGGCGCCGT TGACCGAAGC CTTCGGCCGC
TGGGTAAATC TCAAATGGGT GCTCGTTGCG CTGTTCGGCG CCGTCGCGGG GCAGGCGGTC
GTCTGGTACA CGGGGCAATT CTATGCGCTA TTCTTCCTCG AAAAGACGCT GAAGGTCGAT
GGCGCGGCGG CGAACATATT GATCGCCATC GCGCTTGCAC TCGGCACACC CTTTTTCATC
CTCTTCGGCT GGCTCAGCGA CAAGGTGGGG CGCAAACCGA TCATCCTCAC GGGCTGTGCC
CTCGCGGCGC TGACCTATTT CCCGGTGTTC CAGATGCTGA CCGCTGCGGC CAATCCGGCG
CTCGCCGAGG CACAGGCGCG GGCGCCTGTT GTCGTGTTCG CCCATCCCGA GACATGCTCG
TTCCAGTTTG ACCCTATTGG CCGCACCTCG TTCGACCGGA CGGGCTGCGA CATCGCCAAG
TCAGCGCTCG CCAGAGCGGG GATCCCTTAT GCCAATGAGG ATGGACCATA TGATGGTCGC
GCGCTTGTCC GCGTCGGCGA TGAAGACATC CCGGTGATCG ACAAGGCCCA ATTCGGCGCT
GCGCTTACAC CCGCGCTCGC CGCGGCCGGC TATCCACCGA AAGCCGATCC CGCCGCGATC
GACAAGCCGC TCGTCGTCGC GATCCTCTTC TACCTCGTGC TGCTCGTGAC GATGGTTTAT
GGCCCGATCG CCGCTTTGCT CGTGGAGCTG TTTCCCAGCC GCATCCGTTA CACCGCCATG
TCGTTGCCCT ATCATATCGG CAACGGCTGG TTCGGCGGAT TTCTGCCGAC GACGGCGTTC
GCGATGGTCG CCGCGACGGG CAATATCTAT TACGGCCTCT GGTATCCGGT CATCGTCGCG
GTGATCACGC TGGTGGTCGG CCTGTTGTTC CTGCCCGAAA CCTTCCGCCG GTCGATCCAT
CAATAG
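
A GC content figure like the one in the Gene Information section can be computed from a sequence laid out in whitespace-separated blocks of ten, as above. This is a minimal sketch, not the database's own method; the function name is hypothetical, and the result for the full sequence is not stated here because it was not recomputed.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Toy input, not taken from the record: two of four bases are G or C.
print(gc_content("AT GC"))  # 50.0
```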
 
Protein sequence
MIDAEAAAPI VTRQSERKVI LASSLGTVFE WYDFYLYGLL ATVISAQFFS GVNETTGFIF 
ALAAFAAGFA VRPFGAVVFG RIGDLVGRKN TFLVTMGLMG ASTFLVGLLP SYASIGVAAP
ILLVVLRLVQ GLALGGEYGG AATYVAEHAP EGKRGLFTSF IQTTATLGLF AALGVVIVIR
SAMGEAAFAD WGWRIPFLIS IVLLGISLWI RLQLEESPVF RQMKAEGTTS KAPLTEAFGR
WVNLKWVLVA LFGAVAGQAV VWYTGQFYAL FFLEKTLKVD GAAANILIAI ALALGTPFFI
LFGWLSDKVG RKPIILTGCA LAALTYFPVF QMLTAAANPA LAEAQARAPV VVFAHPETCS
FQFDPIGRTS FDRTGCDIAK SALARAGIPY ANEDGPYDGR ALVRVGDEDI PVIDKAQFGA
ALTPALAAAG YPPKADPAAI DKPLVVAILF YLVLLVTMVY GPIAALLVEL FPSRIRYTAM
SLPYHIGNGW FGGFLPTTAF AMVAATGNIY YGLWYPVIVA VITLVVGLLF LPETFRRSIH
Q
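
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual TCAG-ordered amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA into protein, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 6 bases of the gene sequence above.
print(translate("ATGATCGACG CAGAGGCAGC GGCACCTATC"))  # MIDAEAAAPI
print(translate("CAATAG"))  # Q
```

The two outputs match the first ten residues and the final residue of the protein sequence.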