Gene PICST_37347 details

Gene Information

Locus tag: PICST_37347
Symbol: SGE1.3
ID: 4851558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2131033
End bp: 2132742
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table:
GC content: 42%
IMG OID: 640393266
Product: suppressor of gal11 null
Protein accession: XP_001388034
Protein GI: 126274852
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
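
The coordinate and length fields in this record are mutually consistent, which a short sketch can check. It assumes that the start and end coordinates are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


gene_length = span_length(2131033, 2132742)
print(gene_length)                  # 1710
print(protein_length(gene_length))  # 569
```

Both results match the Gene Length and Protein Length fields reported for this gene.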


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.315077
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAGGATA TCAAGGTACA AACGTCAAGG GTTATAGATG TAAATGATTC ACAAGTAATA 
GAGAAGCAAG CCTTGGATGA CAGGCTAGGA GATGCCCATC TCAATATTCT TCCTACAAAA
AAAATCATAG TCTGTCTTGC AGCCTTATCG CTAGGTCTAT TTGCATCGTT CGCGGACCAG
ACAAGTATAA CCATAGCATT ACCAGCCATA GCCAAGGATT TAAGAGCTGA AACCACTATC
AACTGGGCGG GAACAGCCGC CTTATTGGCC AACTGTGTTT GCCAAGTTCT CTTTGGAAGA
CTAGCAGACA TCTTTGGCAG AAAGAATATT TTGCTCTTTT CTCTTGGTAC ACAAGCAGTA
GCAGACATTG GATGTGCAGT TTCTCGGACT GGGGTGGAAT TCTATATCTT TAGAGCGATT
GCTGGTATTG GATGTGGAGG TACTCAATCG TTGACCATGG TGATGTTGAG CGATATTTGT
ACCTTGAAGC AAAGGGGAAA GTATCAGGGT ATATTAGGTG CTCAGGTCGG TTTAGCGAAT
GCTTTGGGTC CCTTCATTAT GGCAGCTTTC GTAGAACACA CAACTTGGAG AGACTTCTAC
TACATGATGA TTCCCTTGGT TATAAGCGTG ATGGTTACCA TCTACTTTTT GATTGATGGT
AAGAAAAATG CTAGTCAACT CAACAACGTT TTGTCCAGAA AAGAGAAATT CAAGAAGATT
GACTACTTGG GGATGTTTTT CAGTACTGCA AGTCTTACAT TGTTGCTCAT TCCCATCAGT
GGCGGTGGTT CATCTTACCC TTGGAACAGT CCTCTCATTA TTGGTATGTT CGTATCAGGT
GGGTTGAGTT TCTTTGTTTT TATCTACATC GAATGGAAGC TTGCTGAACT TCCAATGATT
CCTTTGAGAA TTTTCGCCAG TCCCTCCCTA TCTCTTATCT TGGGTTCCAA TTTCCTATAC
GGAATGGCTT ACTACGGATT TACGTATTAC TTGCCATACT ACTTGCAAAT CGTTCGAGGA
CTCGATTCGA TCCATGCCTC GATTATTTTG TTACCATTAG TGCTTACGCA ATCTATAGCT
TCCATCATTG GAGGAACCTT GATAAGTTAT TTTGGCCACT ACAAGAATAT TATTCTTATG
GGATATGGGC TCTGGACAGT TAGCTGTGGG CTCTTGTATA TCTTCAACAC GCAGACCAAC
TGGGGAGTCA TAGTTGTCAT TTTGTTAGTT ATGGGAGTAG GCGTTGGGTG GACTTTCCAG
CCTACAATGG TTGCTGCTCA GAGTCAAGCC AAAAAATCAG ACAGAGCGAT TGTTATCAGT
GCCAGAAACG TTTTGAGATC CTTTGGTGGT TCAGTAGGCA TTTCTATTGC TTCCATGATT
GTCAGCAATA GTTTGTTAAG GGAAATCAGA AGAGAATCCA AGAATGAAGG TAGCATATTG
GACGGTTATT TGGACTACTT GAAGGATCAC ATCTACAGCA GAGTTGATAC ATCCAAGCTT
AACCACGCCC AACAAGTGGT AGTTAGAGAG ATGTACATGA AAGCCATCAA GAACTATTTC
TACATCTGCT TGCCTCTCAT TGCAGTTTGT TTTATCTCTA CCATCTTCGT GGTAGACCGA
GGCTTGCAAT GTATTGACGA GGAGCCAGAA CAAAAGAACA AGGACAAGGA ATCGGATATA
GATACAAGCA GCAACAGCTC AAGACAGTAA
 
Protein sequence
MEDIKVQTSR VIDVNDSQVI EKQALDDRLG DAHLNILPTK KIIVCLAALS LGLFASFADQ 
TSITIALPAI AKDLRAETTI NWAGTAALLA NCVCQVLFGR LADIFGRKNI LLFSLGTQAV
ADIGCAVSRT GVEFYIFRAI AGIGCGGTQS LTMVMLSDIC TLKQRGKYQG ILGAQVGLAN
ALGPFIMAAF VEHTTWRDFY YMMIPLVISV MVTIYFLIDG KKNASQLNNV LSRKEKFKKI
DYLGMFFSTA SLTLLLIPIS GGGSSYPWNS PLIIGMFVSG GLSFFVFIYI EWKLAELPMI
PLRIFASPSL SLILGSNFLY GMAYYGFTYY LPYYLQIVRG LDSIHASIIL LPLVLTQSIA
SIIGGTLISY FGHYKNIILM GYGLWTVSCG LLYIFNTQTN WGVIVVILLV MGVGVGWTFQ
PTMVAAQSQA KKSDRAIVIS ARNVLRSFGG SVGISIASMI VSNSLLREIR RESKNEGSIL
DGYLDYLKDH IYSRVDTSKL NHAQQVVVRE MYMKAIKNYF YICLPLIAVC FISTIFVVDR
GLQCIDEEPE QKNKDKESDI DTSSNSSRQ