Gene Sala_0887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0887 
Symbol 
ID4082787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp894248 
End bp895498 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID638009248 
Productmajor facilitator transporter 
Protein accessionYP_615938 
Protein GI103486377 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.160597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGG ATACAGGCAG TCGGCGCGGC ATTACCCTGG CGCTGCTCGC CGTGGTGGCG 
ATCGTTTCCT ATGTGGACCG GCAGGTGTTC ACGCTGTTCC AGGACGACAT CAAGGTCGAA
TTGGGACTGA CCGACGGGCA GCTCGGTCTG CTTACAGGGA TCGCCTTTGC CGCCTTTTAT
GCGCTCGCGG CCTTTCCCAT CGCGCGTTAC GCGGATCGCG GGGATCGCCG GTTGGTCATC
GCGCTATGCG TGTCTTTCTG GAGTCTGGCG ACGATTGCCT GCGGCATGGC GCAGAATTTC
TGGCAGATGA TGCTGGCGCG CATCGGCCTG GCGGCCGGGG AGGCCGGGGC GGGACCGGCG
GGCAATTCGC TGCTGGTCGA AATCTTTCCG AAGGAACGCC GTACCACCGT CATCTCGACC
ATGCTGGCGG CCAATGCGAT CGGCCTGTCA GGCGGCCTCG CCCTCGCGGG GTGGCTCGGC
CAATGGTATG ACTGGCGCGC CGTTTTCGTG ATTGTCGGCG CACCCGGTAT CTTGCTGGGC
CTCGTCGTCT GGATGTTTGC CGCCGAACCG CGGCGGCGCA GCGGCCCCGC GCCCGAGATG
CCGCCGCAAA CCAGCCTTGG CGAGGTATTG CGGACAATCG CCGGCAATAG GTCGCTGCGC
TGGGTTGGCC TGTTGCTGTC GATGGTTCCG GTTGCCGGTT TCGGCTTCAT TTTGTGGGGA
GCATCCTTCT TCCGCCGCGT CCACGAAATG GACCGGGCGG AAACCGGATT CTGGCTGGGC
GGAGCAATGG CGATCGGACT GGTCGTCGGC AATTTGTTCG CAGGCTGGTT CAGCGATCGC
TATGGCAAGG CGAACCCGCG CTTCAACGGC GGGTTCGCGG GCATTGGCTT GCTGATTTCC
TTTCCGTTCG GCTTGACCTT CGCTTTGACC GACAGCGCCT ATCTCGCGCT CGCCTGCTTC
GTCGTCGTCA AATTCATGAT GACATTGCAC CTCGGCCCGA TCATCGCGCT CAGCTTCGCG
CAGGTGCCGG GCCATATGCG GGCGATGATG TCGGCCACGA TCAACATGAT GATCGGCCTG
GCTGGAGTCG GGTTGGGCGG TACGGTTGCC GGGCTGCTGA GCGAATATTT CATGCCCGAA
TATGGCGATC TGTCGCTCCA GCCGGCGCTG GCTGTCCTGT CGGTTTGCCT GCTGGTGGGC
GGCGTAGCTG CGATCATGGC GGGCCGGACG GCGAAACCGA TCGAAGAATA G
 
Protein sequence
MKADTGSRRG ITLALLAVVA IVSYVDRQVF TLFQDDIKVE LGLTDGQLGL LTGIAFAAFY 
ALAAFPIARY ADRGDRRLVI ALCVSFWSLA TIACGMAQNF WQMMLARIGL AAGEAGAGPA
GNSLLVEIFP KERRTTVIST MLAANAIGLS GGLALAGWLG QWYDWRAVFV IVGAPGILLG
LVVWMFAAEP RRRSGPAPEM PPQTSLGEVL RTIAGNRSLR WVGLLLSMVP VAGFGFILWG
ASFFRRVHEM DRAETGFWLG GAMAIGLVVG NLFAGWFSDR YGKANPRFNG GFAGIGLLIS
FPFGLTFALT DSAYLALACF VVVKFMMTLH LGPIIALSFA QVPGHMRAMM SATINMMIGL
AGVGLGGTVA GLLSEYFMPE YGDLSLQPAL AVLSVCLLVG GVAAIMAGRT AKPIEE