Gene Sala_1605 details


Gene Information

Locus tag: Sala_1605
Symbol:
ID: 4082757
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1681887
End bp: 1683209
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 69%
IMG OID: 638009974
Product: major facilitator transporter
Protein accession: YP_616651
Protein GI: 103487090
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
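
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
# Values copied from the Gene Information table.
start_bp = 1681887
end_bp = 1683209
gene_length_bp = 1323
protein_length_aa = 440


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks agree with the table under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)       # True
print(coding_codons(gene_length_bp) == protein_length_aa)    # True
```

Under these assumptions, 1683209 - 1681887 + 1 gives 1323 bp, and 1323 / 3 - 1 gives 440 amino acids, matching the listed gene and protein lengths.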


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.983244
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.202214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACAG CATCGCACTC CATCCCCTCC GACCGCATGG CGGTGCTCTT CGCGGTGATG 
CTCGTCGCGG CGGCAGGCAA CACCGCCATG CAGTCGATCC TTCCGGCTAT CGGCGCCAAA
CTGCGCATCC CCGACGTGTG GGTCAGTCTG GCCTTCAGCT GGTCGGCGCT GCTCTGGGTG
CTCACCGCGC CGCACTGGGC GCGCCAGTCG GACAAGCGCG GGCGCAAGGC GCTGATGGCG
CTGGGGGTGA TCGGTTTCCT GTCGTCGATG GCGCTGTGCG GTCTCGTCCT GTGGTCGGGG
CTGCAGGGCT GGTTCGCGGC GGGCATGACC TTCGTCCTTT TCGCGCTCTT CCGCAGCCTT
TATGGCGGGC TGGGTTCTGC CGCGCCGCCC GCGGTGCAGG CCTATGTCGC CGCGCGCACC
GACCCCGACC AGCGGACGCA GGCGCTGTCG CTCGTCTCCT CCTCCTTCGG GCTCGGCACG
GTGATCGGTC CCGCGATCGC GCCCTTCTTC ATCCTTCCCG TCGTCGGCCT CGCCGGGCCA
CTGCTCGTCT TCGCTCTCAT CGGGCTTGCC GTGCTCATCG CGCTGCGCTG GCGCCTTCCC
AACGACGTGC CGCGCTTTGC CGCGCGCGGT TCGATCATGT CCTATCCCAC CACCGGCGCG
TCGTCGCAGG CCGCCACGGA CGAAGATGAC GAACAAACCT CCGATCGGAG CGCTTCCGAA
CCGCAGCCGC TGCGCTGGAC CGACACGCGC GTGCGCCCTT GGCTTTTTGC CGGGTTCCTC
GGCGGACAGG CGCAGGCGAT GATGCTCGGC GTGATCGGTT TTTTGATCCT CGACCGGCTG
AACCTGCGGC TGCGTCCCGA CGAGGGCGCG GCGATGACCG GCATCGTGCT GATGGCGGGC
GCCTTTGCGA CCCTGCTCTC GCAATGGGGA CTGATCCCGC TCCTCAAAAT GTCGCCGCGC
ACCGCGGTGC TCGGCGGCGC GGCGCTCGGC GGCGCGGGGA CGCTCCTCAC CGGCCTGTCT
TACGACTTTC ACGGTATCGT CATCGGTTTC GCCCTCGCCT CGCTCGGCTT CGGCCTGTTC
CGTCCGGGGT TCACCGCCGG CGCCTCGCTC GCCGTGCCGC GCCGCGACCA GGGCGGGGTC
GCGGGGATGA CCGCGTCGAT CAACGGATCG GCCTATATCG TCTCCCCCGC GATCGGCGTC
CTCGTTTATA ACTGGCACCC CATCGTGGCC TACGGCCTGA TGGCGGGATT CTGCGGCTGG
CTCGTGCTCT GGGGCTGGGC GGCGCTGCGG CTGGACCAGC CTGTTAGGGA CAGGACCCAT
TAA
 
Protein sequence
MATASHSIPS DRMAVLFAVM LVAAAGNTAM QSILPAIGAK LRIPDVWVSL AFSWSALLWV 
LTAPHWARQS DKRGRKALMA LGVIGFLSSM ALCGLVLWSG LQGWFAAGMT FVLFALFRSL
YGGLGSAAPP AVQAYVAART DPDQRTQALS LVSSSFGLGT VIGPAIAPFF ILPVVGLAGP
LLVFALIGLA VLIALRWRLP NDVPRFAARG SIMSYPTTGA SSQAATDEDD EQTSDRSASE
PQPLRWTDTR VRPWLFAGFL GGQAQAMMLG VIGFLILDRL NLRLRPDEGA AMTGIVLMAG
AFATLLSQWG LIPLLKMSPR TAVLGGAALG GAGTLLTGLS YDFHGIVIGF ALASLGFGLF
RPGFTAGASL AVPRRDQGGV AGMTASINGS AYIVSPAIGV LVYNWHPIVA YGLMAGFCGW
LVLWGWAALR LDQPVRDRTH
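
The relationship between the gene sequence and the protein sequence above can be checked by translating the DNA codon by codon. The sketch below is an illustration only, not the method used by the source database. It assumes the sequence is given 5' to 3' on the coding strand and starts in frame. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so a standard codon table is used here. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 uses the same
# amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein.
first_line = "ATGGCGACAG CATCGCACTC CATCCCCTCC GACCGCATGG CGGTGCTCTT CGCGGTGATG"
first_residues = "MATASHSIPS DRMAVLFAVM"
print(translate(first_line) == clean(first_residues))  # True
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence extends the same check to the whole record.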