Gene PICST_29041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29041 
SymbolHXT2.1 
ID4851777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2801503 
End bp2802984 
Gene Length1482 bp 
Protein Length493 aa 
Translation table 
GC content40% 
IMG OID640393485 
Producthexose transporter (tentative) 
Protein accessionXP_001386873 
Protein GI126275571 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.773835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCATA TTTTTGTGTT TTTATGTACT CTTTCTTGTA CTACTAACGG TTACGATGGT 
TCTATGTTGA ACGGTTTGCA AGCACTTGAC TCCTGGCAGG ATGCAATGGG TCACCCAGAA
GGCTATAAGC TTGGTTCCCT TGCAAATGGT ACAATCTTTG GTTCAGTTCT CAGTGTTTCT
GTTGCAGCAT GGCTCAGTGA CAAGGTCGGT AGAAGAGTCG CTATTATTAT TGGTTCTGGT
ATAGCCGTTG TTGGTGCTAT TTTACAAGGT GCTTCTACTA ATTTCGCTTT CTTTTTAGTT
TCCAGAATTT TGCTTGGTTT CGGTGTTGGA ATTGGAGCTA TTGCTTCACC CGCATTGATT
GCAGAAATTT CTTACCCAAC TTTCAGACCA ACGTGTACTA CCCTCTACAA TACGTTATGG
TATTTGGGTG CTGTTATTGC TGCTTGGGTC ACTTTCGGTA CTCAACACTT GAAAGGAAGT
GCTAGTTGGA GAGTTCCATC GTATATCCAG GCATTCTTAC CAGCAGTGCA ATTTGTCAGT
CTTTGGTGGT GCCCCGAATC CCCAAGATGG ATGATTGCCA AAGGCAGAGA AGATGAAGCC
AGACAAATCC TCTTCAAATA TCATACTGGT GGGGACCAAG ATGATAGAGC AGTAAGATTG
GTTGAGTTTG AAATAAAAGA AATCAAGGCT GCTTTGGAGA TGGAAAAGAT TTGCTCCAAC
TCTAAGTACA GTGACTTCTT GACAATTCCT TCTTACAGAA AGAGATTATT TTTGCTTTCA
TTTACAGCTA TCATCATGCA ATTATCTGGT AATGGGTTAG TTTCTTACTA TCTCAGTAAG
GTTTTGACTT CAATTGGTAT TAAATCTGCT AACGAGCAGT TGATCATCAA TGGTTGTCTT
ATGATTTACA ATATGGTTAT TGCTCTGTCT GTTGCATTCG TCGTTTACTT ATTTAGAAGA
AGAACTTTGT TCTTAACGTC CATTTCAGGT ATGTTATTCA GTTACATTAT CTGGACAGCC
CTTTCTGCAG TTAATCAACA GAGAGACTTC AAGGACAAAT CATTGGGCAA GGGCGTGCTT
GCAATGATCT TCTTCTACTA TTTGTCCTAC GATATTGGTG CAAATGGATT GCCATTCTTG
TATGTGACAG AAATCTTACC TTACACCCAC AGAGCCAAGG GCCTTAACGT CATGTACGGG
GTTCAAATGA CTACTTTAGT GTACAATGGT TACGTCAACC CTATAGCTAT GGACGCACTT
GACTGGAAAT ACTACATTGT GTGGTGTTGT TTCTTGGCCT TTGAATTGCT CATTGTCTAC
TTCTTCTTTG TGGAAACATA TGGATACTCT TTGGAAGAAG TTGCAAAGGT TTTCGGTGAC
GATCCAAACT CTTCCCTCAT TCAATCAACT TCTAGCAACG AAAAAGCTTC CATTGAGCAT
TTAGAAGATA CTTCTTCCGC AGAGATCGGA AGAGTCGTCT GA
 
Protein sequence
MLHIFVFLCT LSCTTNGYDG SMLNGLQALD SWQDAMGHPE GYKLGSLANG TIFGSVLSVS 
VAAWLSDKVG RRVAIIIGSG IAVVGAILQG ASTNFAFFLV SRILLGFGVG IGAIASPALI
AEISYPTFRP TCTTLYNTLW YLGAVIAAWV TFGTQHLKGS ASWRVPSYIQ AFLPAVQFVS
LWWCPESPRW MIAKGREDEA RQILFKYHTG GDQDDRAVRL VEFEIKEIKA ALEMEKICSN
SKYSDFLTIP SYRKRLFLLS FTAIIMQLSG NGLVSYYLSK VLTSIGIKSA NEQLIINGCL
MIYNMVIALS VAFVVYLFRR RTLFLTSISG MLFSYIIWTA LSAVNQQRDF KDKSLGKGVL
AMIFFYYLSY DIGANGLPFL YVTEILPYTH RAKGLNVMYG VQMTTLVYNG YVNPIAMDAL
DWKYYIVWCC FLAFELLIVY FFFVETYGYS LEEVAKVFGD DPNSSLIQST SSNEKASIEH
LEDTSSAEIG RVV