Gene Sala_1509 details


Gene Information

Locus tag: Sala_1509
Symbol:
ID: 4080022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1572460
End bp: 1573788
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 66%
IMG OID: 638009876
Product: general substrate transporter
Protein accession: YP_616555
Protein GI: 103486994
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
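
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is an illustration only, not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1572460
END_BP = 1573788


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # prints: 1329 442
```

Under those assumptions the result agrees with the Gene Length (1329 bp) and Protein Length (442 aa) fields.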


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.386131
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCCG ACACCGTCCC GACGACCGAG GCCGAACGCG ACGCTCGCGC CCTCCACGAC 
CATGACGGCC ACCGCATCGA CCCCGCCGAA ATCGCCATCG GCGTCATCAT CGGCCGGACC
TCAGAGTTTT TCGACTTCTT CGTCTATGCG ATCGCTTCGG TGCTGGTATT TCCCAAGCTC
GTCTTTCCGC ACCTCGATCC GCTGGCGGGC ACACTCTGGT CCTTCGCGAT CTTTGCCCTC
GCCTTTGTCG CGCGCCCGGT CGGAACGGTC ATCTTCACCG CGATCGACCG CGGTTATGGC
CGTGGCGCCA AGCTCACCAT TGCGCTGTTC CTGCTTGGCG GATCGACCGC GGCGATCGCC
TTCCTGCCCG GCTATGAATC GATCGGCATC GGCGCCGCGC TGCTGCTCGC GCTGTTCCGC
ATGGGCCAGG GCGTCGCGCT CGGCGGCTCA TGGGACGGCC TCGCCTCGCT GCTCGCATTG
AACGCCCCCG AATCGAAACG CGGCTGGTAT GCGATGATCC CGCAGCTCGG CGCGCCGCTT
GGCCTCATCG TCGCCAGCCT GCTCTTCATG TTCCTGATCT CCGCGCTCCC GGCCGAAGAC
TTTCTCGCCT GGGGTTGGCG CTATCCTTTC TTCGTCGCCT TTGCGATCAA CGTCGTCGCG
CTGTTCGCGC GGCTGCGCAT CGTCGTGACC CCCGACTATG CCGAGCTGTT CGAAAACCGC
GCGCTTCAGC CCGCGCCGCT CCTCGAAACG GTACGGTCGG AATGGAAAAC CATCGTCACC
GGCGCCTTCG CCCCGCTCGC CAGCTTCGCG ATGTTCCACA TGGTCACTGT CTATCCGCTG
TCGTGGGTGT TCCTGTTCAC CGACGAAACC CCCGCACGCT TCCTGATGAT CGAGGCGATC
GCTGCGGTCG GCGGCGTGAT CGCGATCATC GCCTCGGGCT ATCTTGCCGA CCGCTTCGGG
CGCCGCACCG TGCTTGCCGC GACGGCGGCG GCGATCGCGG CGTTCAGCGG CTTTGCCCCG
CAATTGCTCG ACGCGGGCCA GGCGGGTGAG GCGAGCTTCA TGATCCTCGG CTTCCTCCTG
CTCGGCCTGT CGTTCGGGCA ATCGTCGGGC GCGCTCTCAT CGAACTTCAC GCCGCGCCAC
CGCTACACCG GGTCGGCCTT CACCGCCGAC CTCGCCTGGC TGTTCGGTGC TGGTTTCGCA
CCGATGGTGG CGCTCTGGCT GTCGAGCGAA TTCGGGCTGA TCGCCGCGGG TGCCTATCTG
CTGTCGGGCG CGATCGTTAC GCTCGTCGCG CTGTGGCTCA ACCGCGAACT TGCACGCACG
ATCGATTGA
 
Protein sequence
MAADTVPTTE AERDARALHD HDGHRIDPAE IAIGVIIGRT SEFFDFFVYA IASVLVFPKL 
VFPHLDPLAG TLWSFAIFAL AFVARPVGTV IFTAIDRGYG RGAKLTIALF LLGGSTAAIA
FLPGYESIGI GAALLLALFR MGQGVALGGS WDGLASLLAL NAPESKRGWY AMIPQLGAPL
GLIVASLLFM FLISALPAED FLAWGWRYPF FVAFAINVVA LFARLRIVVT PDYAELFENR
ALQPAPLLET VRSEWKTIVT GAFAPLASFA MFHMVTVYPL SWVFLFTDET PARFLMIEAI
AAVGGVIAII ASGYLADRFG RRTVLAATAA AIAAFSGFAP QLLDAGQAGE ASFMILGFLL
LGLSFGQSSG ALSSNFTPRH RYTGSAFTAD LAWLFGAGFA PMVALWLSSE FGLIAAGAYL
LSGAIVTLVA LWLNRELART ID
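
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11 differs in permitting additional start codons, which does not matter here because this gene starts with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = (
    "ATGGCTGCCG ACACCGTCCC GACGACCGAG GCCGAACGCG ACGCTCGCGC CCTCCACGAC"
)
print(translate(first_line))  # prints: MAADTVPTTEAERDARALHD
```

The printed residues match the first 20 residues of the protein sequence (MAADTVPTTE AERDARALHD). Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.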