Gene PICST_28802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28802 
SymbolSGE1.4 
ID4851551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2113737 
End bp2115434 
Gene Length1698 bp 
Protein Length565 aa 
Translation table 
GC content41% 
IMG OID640393259 
ProductMFS efflux transporter 
Protein accessionXP_001387647 
Protein GI126274828 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.211393 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.613875 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCAA GCAACAAGAA AGAGAAAGAC CAAGTGCATA TAGTAGAGAA AGTCAAGCGT 
CCTCATAGAG ATCTGGACCT TTCTGATCAG CTAGGTGACG CTCACTTCAA TATTCTTCCA
CAAAGGAGAA TTATTACTAT CTTGTTGATA CTTGCATTGG CCAATCTCGT TTCATTTGCA
GATCAAACTG GTATAACAGT AGGCTTGTCT GCTATCGCTT CAGATTTGAA TTCAGAAAAG
ACTATCAATT GGGCTGGTAC AGCTTCCTTG TTAGCAAATT GTGTTTGCCA AATTTTGTTC
GGACGTCTCT CTGACATATT TGGAAGGAAG AATGTGTTGA TGTGGTGTTT AGCCATCTTA
ATCATTGGAC AGGTTGCATG TTCAACAGCT CGAAATGGTC CTGAGTTCTA TGTATTTAGA
GCATTAGCCG GGATTGGTAA TGGCGGTGTC TCTTCTCTTG GAATGGTTAT TTTAAGTGAT
ATTGTCTCTT TGAAAGATAG AGGTAAATAC CAAGGTATAC TTGGTGCCAG TGTTGGATTA
GGAAATATTG TTGGACCTTT CGTCATGTCT GCATTCGTAA AGCACTATTC CTGGAGGGGC
TTCTATTATT TATTTGCACC CTTAGGCTGC CTTGTTAATG TTGCAATTTA TTTTACCATT
GATGGAAATA CTAAACTTGA CGGTGTCTTG TCTAAGAAAG AAAAGTTCAA GAAGATAGAT
TATTTGGGCA TTATAATTGC TTCGATTGCC TTAACTTGCC TTCTTGTTGC AATCAGTGGT
AGTGGTACTT CATTTCCTTG GGATAGTAAG CTCACTATTA CCTTGTTTTG TGTTGGTGGT
ATTGCATTCA TTGTATTCTT CTTAGTTGAG TGGAAGATTC CGGAGTTGCC AATGATCCCA
TTAAGACTAT TCAAGAGCCC TTCTATGTGC TTGTTGTTTG CTTCCACTTT TTTGTTCGGC
GCTACATACT TCAGTTTGTT GTATTACTTA CCATACTATT TCCAGATTGT CAGGTCTAAA
AGCGAAATAC AAACTTCGGT ATTTATTGTT CCTTTGGTGG CTGCCCAAGC CCTTATGTCT
ATTGTTGGAG GGCAAATCAT CACTCTTACA GGACATTATT TTTTTGTTGT ATGTGGAGGT
TACGCACTTT GGTTAACTGG TTGTGGCTTA TTGATCATCT GGAATGAGCA CACGAGTGAT
GGAGTTCTAG TTGTGGTAAT GTTGATCATT GGCACTGGTG TTGGTTTTTC ATTTCAACCT
TCAATGGTAG CCATCCAAGC CAATTGTAAG AAGGCTGAAA GGGCGGTGGC AATCTCTACC
AGAAATGTTT TGCGTTCCTT TGGGGGTGCC ATTGGTATTG CATCCGGATC TACTATGATT
AGTAACACGT TATTGAACCA TCTCGCACAA ATTCAAGGTC AATCAGACTT AACTGAGGCT
ACGATCGACT ACTTGAAGGA CCATATATAT TCGAAGATAA ACCTTGCTGG GGTTTCGGCA
TCCGAGATTA CGACAATTTC ACAATTGTAC ATCTCGGCAT TGAGATACTA CTACTATCTC
ATCGTCACCT TCATGGGAAT ATGCTTGGTG TGCTCCATTT TCGTTAAGGA TCGGGGCTTA
CAATGCACGG ACGAACACCC TGTCCGTACG AGAAAGGATC TTGAGTCATC TGCATCCTCG
TACACAGTCA ATAGTTAA
 
Protein sequence
MDPSNKKEKD QVHIVEKVKR PHRDLDLSDQ LGDAHFNILP QRRIITILLI LALANLVSFA 
DQTGITVGLS AIASDLNSEK TINWAGTASL LANCVCQILF GRLSDIFGRK NVLMWCLAIL
IIGQVACSTA RNGPEFYVFR ALAGIGNGGV SSLGMVILSD IVSLKDRGKY QGILGASVGL
GNIVGPFVMS AFVKHYSWRG FYYLFAPLGC LVNVAIYFTI DGNTKLDGVL SKKEKFKKID
YLGIIIASIA LTCLLVAISG SGTSFPWDSK LTITLFCVGG IAFIVFFLVE WKIPELPMIP
LRLFKSPSMC LLFASTFLFG ATYFSLLYYL PYYFQIVRSK SEIQTSVFIV PLVAAQALMS
IVGGQIITLT GHYFFVVCGG YALWLTGCGL LIIWNEHTSD GVLVVVMLII GTGVGFSFQP
SMVAIQANCK KAERAVAIST RNVLRSFGGA IGIASGSTMI SNTLLNHLAQ IQGQSDLTEA
TIDYLKDHIY SKINLAGVSA SEITTISQLY ISALRYYYYL IVTFMGICLV CSIFVKDRGL
QCTDEHPVRT RKDLESSASS YTVNS