Gene PICST_16856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_16856 
SymbolXUT4 
ID4840896 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp628073 
End bp629701 
Gene Length1629 bp 
Protein Length529 aa 
Translation table12 
GC content42% 
IMG OID640392211 
Producthigh affinity xylose transporter (putative) (HGT3) 
Protein accessionXP_001386715 
Protein GI126140386 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0752816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.401981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCGT TATTGACTAA CGAATACTTC AAAGACTACT ACCACAACCC GACTCCTGTT 
GAAGTGGGTA CTATGATTGC TATCTTAGAG ATCGGCGCAC TTTTTTCCTC CTTCATAGCT
GGAAGAGTAG GTGACATCGT TGGCAGAAGA AGAACCATTA GATACGGGTC TTTCATTTTT
GTAGTAGGCG GTCTTGTACA AGCTACTTCG GTCAATATTG TCAATCTCTC ACTAGGAAGA
TTGATTGCCG GTATTGCCAT TGGCTTTTTG ACAACCATCA TCCCATGCTA CCAGTCTGAA
ATCAGCCCCC CAGACGATAG AGGTTTCTAT GCCTGTTTGG AGTTCACCGG AAATATCATT
GGATATGCTA GTAGTATTTG GGTAGACTAC GGGTTTTCAT TTTTAGACAA TGATTTCAGC
TGGAGGAGCC CATTGTATGT TCAGGTTGTT ATTGGCTCCA TGTTATTTAT TGGTTCATTC
CTTATTGTAG AAACCCCTAG ATGGCTCTTG GATCACAACC ATGATATCGA AGGCATGATT
GTCATTTCTG ACTTGTATGC AGATGGTGAT GTGGAAGACG ATGATGCTAT TGCTGAGTAC
AGAAACATAA AGGAAAGTGT CTTGATAGCC AGAGTTGAAG GCGGAGAGAG ATCGTACCAG
TATTTGTTCA CCAAATATAC CAAGAGACTT TCTGTGGCAT GCTTTTCGCA AATGTTTGCC
CAGATGAATG GTATAAACAT GGTATCTTAC TATGCTCCTA TGATCTTCGA ATCTGCTGGC
TGGGTTGGTA GACAAGCTAT CTTGATGACT GGTATCAACT CCATTATCTA CATCTTTAGT
ACCATTCCTC CATGGTACTT AGTTGATTCT TGGGGCAGAA AACCTTTGCT TTTATCTGGA
TCTGTGCTCA TGGGTGTTCC GCTCTTAACC ATTGCTTGTT CGTTATTCTT AAACAACACA
TACACACCCG GGGTTGTGGT TGGCAGTGTA ATCGTATTCA ATGCTGCTTT TGGATACAGT
TGGGGTCCAA TTCCTTGGCT CATGAGCGAA GTGTTCCCTA ACTCAGTTAG ATCAAAAGGT
GCTGCCATGT CTACTGCAAC CAACTGGCTC TTTAACTTTA TTGTTGGAGA GATGACACCT
ATTTTGTTGG ATACAATTAC CTGGAGAACT TACTTGATCC CGGCAACTTC GTGTGTATTA
TCGTTTTTTG CTGTTGGATT TTTATTTCCA GAGACCAAGG GTTTAGCATT GGAGGATATG
GGCTCCGTAT TCGATGATAA TTCGTCAATA TTTTCATATC ACTCAACTCC TTCCACTGGG
TATGGTGCGA CCGAGTCTAA CAGTAATGCC AGGAGAGCAA GTGTCATCTC TTCAGAAAAC
TACCAGGATA GTTTGCATCA GACAGCGGCT TCATTGGCTA GAAATCCTTC AAGCATGAGG
CCTGATTACG ATGGCATAAT CACAGGAGCT GCTACCCTTT CGCCAGTACC ACCATTAAAA
CCAATAAAGT CTGATGCGTC AGTCCATTCA GTCGATGCCA TAATTCCAAG CATTTCCAGC
AATATTCCGC AGGAAATTGA ACCACCAACC TTTGATGAAA TCTTTAAGTA CAAGTTGAAT
GAGATGGAA
 
Protein sequence
MSSLLTNEYF KDYYHNPTPV EVGTMIAILE IGALFSSFIA GRVGDIVGRR RTIRYGSFIF 
VVGGLVQATS VNIVNLSLGR LIAGIAIGFL TTIIPCYQSE ISPPDDRGFY ACLEFTGNII
GYASSIWVDY GFSFLDNDFS WRSPLYVQVV IGSMLFIGSF LIVETPRWLL DHNHDIEGMI
VISDLYADGD VEDDDAIAEY RNIKESVLIA RVEGGERSYQ YLFTKYTKRL SVACFSQMFA
QMNGINMVSY YAPMIFESAG WVGRQAILMT GINSIIYIFS TIPPWYLVDS WGRKPLLLSG
SVLMGVPLLT IACSLFLNNT YTPGVVVGSV IVFNAAFGYS WGPIPWLMSE VFPNSVRSKG
AAMSTATNWL FNFIVGEMTP ILLDTITWRT YLIPATSCVL SFFAVGFLFP ETKGLALEDM
GSVFDDNSSI FSYHSTPSTG YGATESNSNA RRASVISSEN YQDSLHQTAA SLARNPSSMR
PDYDGIITGA ATLSPVPPLK PINISSNIPQ EIEPPTFDEI FKYKLNEME