Gene PICST_33078 details

Gene Information

Locus tag: PICST_33078
Symbol: XUT5
ID: 4840252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1380077
End bp: 1381564
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 12
GC content: 37%
IMG OID: 640391567
Product: putative xylose transporter
Protein accession: XP_001385962
Protein GI: 150866384
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00879] MFS transporter, sugar porter (SP) family
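
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS length that includes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the CDS includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1380077, 1381564)
print(length_bp, protein_length(length_bp))
```

With the values listed in this record, the sketch gives 1488 bp and 495 aa, matching the Gene Length and Protein Length fields.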


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.442314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAAA GAAGCATTGG ACCTTTAATC CCCAGAAATA AGCACTTATT CTATGGATCC 
GTATTATTGA TGAGTATTGT TCACCCAACT ATCATGGGAT ACGATTCCAT GATGGTTGGT
AGTATTCTTA ATCTAGATGC ATATGTAAAT TATTTCCACT TAACGGCTGC TACCACTGGA
CTCAATACTG CTGCAGTATG GCTTGGGCAA GTAATTGCCA CATTGACAGT TATTCTGTAT
TTCAATGACA AATTTGGTAG AAGAAGTTCA GTTTGTATAA GTATTGCAAT CAGTTTGGTT
GGGGTTGCAT TGCAATCAGC AGCCCAAAAC ATTGAGATGT TTATTATCGG AAGAATAGTT
ATTGGTTTTG GAATATCTAT TGGTTTTGTC TCATCTACCA TTTTGGTAAG TGAACTAGCC
CCTCCAGACA AAAGAGGATT TATTCTTGGA TTGAGTTTTA CAAGCTTTCT AGTAGGAAGT
TTAATTGCAG CAGGTGTCAC ATATGGAACA AGAAATGCTC CTGGAGACTG GTGTTGGAGA
ATCCCATCAA TTATTCAAGG GGCTCCAGAT ATTGTTGCTA TTATTAACAT ACTCTTTATT
TCAGAATCAC CAAGGTGGTT GATTGCAAAG GAAAGATTCA GCGAAGCTCG TGAAATTATT
TCTATCATTA GTGATGTTCC TATTGAAGAT GCACATGAAG AATGTGAAAA GATACATGCC
CATATTCAAA CTGAGAAGAC TGCTTTCCCT GGCAATAAGT GGAAACAAAT GGTGAGCTCC
AAGAGCAATA CAAGAAGAGT TATTATCTTG TTCACACAGG CCATAGTTAC TGAAATGGCC
GGTTCTTCAG TTGGATCGTA CTATTTTTCA ATTATATTAA CTCAAGCTGG GGTCAAAGAT
TCGAATGATA GACTAAGAGT AAATATTGTG ATGAGTTCGT GGTCATTGGT AATTGCTCTT
TCCGGATGTC TAATGTTTGA CAGAATTGGA AGAAAAATGC AATCGCTCAT TTCGTTATCA
GGTATGATCA TATGCTTTAT AGTTTTAGGT GTTTTGGTTA AAGAATATGG CGATGGTCAT
AGCAAGAGCG GAAGTTACGC AGCTGTCGCC ATGATGTTTT TATTCACAGG ATTTTACTCA
TTCACTTTCA CTCCATTGAA CTCTTTGTAC CCTCCAGAAT TGTTCCCCTA CGTGTTGAGA
AGTACAGGAG TTACACTCTT TAATATTTTC AACGGCTGCT GGGGACTTTT CGCAAGTTTC
ATTTTACCCA TTGCAATGAA TGGAATTGGC TGGAAATTTT ACATCATTAA TGCTTGCTAT
GACGTCATAT TCCTTCCAAT AATAATGTTC TGTTGGATTG AGACAAAGGG AATTAATTTG
GATACAATTA GTGAAGTATT GCACGGAAGA GGACCTGAAG ATGAAGAAAG CATTGAAGAA
AGTCACAGCC TAATCAGACA AGGTTTTGTT GTTAATACAA AGAAGTAA
 
Protein sequence
MTERSIGPLI PRNKHLFYGS VLLMSIVHPT IMGYDSMMVG SILNLDAYVN YFHLTAATTG 
LNTAAVWLGQ VIATLTVISY FNDKFGRRSS VCISIAISLV GVALQSAAQN IEMFIIGRIV
IGFGISIGFV SSTILVSELA PPDKRGFILG LSFTSFLVGS LIAAGVTYGT RNAPGDWCWR
IPSIIQGAPD IVAIINILFI SESPRWLIAK ERFSEAREII SIISDVPIED AHEECEKIHA
HIQTEKTAFP GNKWKQMVSS KSNTRRVIIL FTQAIVTEMA GSSVGSYYFS IILTQAGVKD
SNDRLRVNIV MSSWSLVIAL SGCLMFDRIG RKMQSLISLS GMIICFIVLG VLVKEYGDGH
SKSGSYAAVA MMFLFTGFYS FTFTPLNSLY PPELFPYVLR STGVTLFNIF NGCWGLFASF
ILPIAMNGIG WKFYIINACY DVIFLPIIMF CWIETKGINL DTISEVLHGR GPEDEESIEE
SHSLIRQGFV VNTKK
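
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below shows how the protein sequence can be derived from the gene sequence under that assumption. It builds the codon table by hand from the standard code with a single CTG override. The names `codon_table`, `TABLE_12`, and `translate` are illustrative and do not come from any particular library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order under the standard code (table 1).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(overrides=None):
    """Map each codon to an amino acid, optionally overriding standard assignments."""
    table = {"".join(codon): aa
             for codon, aa in zip(product(BASES, repeat=3), STANDARD)}
    table.update(overrides or {})
    return table


STANDARD_TABLE = codon_table()
# Assumption: table 12 differs from the standard code only by CTG -> Ser
# as far as codon-to-residue assignments are concerned.
TABLE_12 = codon_table({"CTG": "S"})


def translate(dna, table=TABLE_12):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces and line breaks used for display
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# In-frame fragment of the gene sequence above that contains a CTG codon.
fragment = "GTAATTGCCA CATTGACAGT TATTCTGTAT"
print(translate(fragment))                  # table 12
print(translate(fragment, STANDARD_TABLE))  # standard code, for comparison
```

Under table 12 the fragment translates to VIATLTVISY, which matches the protein sequence above. Under the standard code the same fragment would give VIATLTVILY.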