Gene PICST_35662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_35662 
SymbolMFS5 
ID4837861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1524223 
End bp1525878 
Gene Length1656 bp 
Protein Length551 aa 
Translation table12 
GC content38% 
IMG OID640389176 
Producthexose transporter 
Protein accessionXP_001383911 
Protein GI150864906 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.908779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.396773 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGGAA CAAAAGATAT TAAAGAAAAT AACATTATAA CAGTCAGCTC TAAAGGCTAT 
GAGAATGGCT CACTTGGGCA AGAAGAACAA AGAGTTCTTA CTTTGGAAGA CGTCACCCCA
AAATTAGACA AATGGTGGTT CCAGTACCCT CATTTGTTAA AATTGAACGT ATTGTTAGGA
TGTGCGCTGC TTGCTCAGGT TACTTTAGGT TATGATGGTA GTATGATGGG TAATTTGCAA
ACATTACCAA GTTGGTCAGC ATATTTCAAT AAACCACATG GTCAAATTTT GAGTACTATG
ACGAATGGTA TCACCATTGG AGCAATGGCT GTTACTCCAT TTTTGACTGC TAGTGGTGAC
AGAATGAAAA GAAAACATGT ACTTGCAGGT GCAATTCTAT TGACTATTCT TGGTGCTGCA
TTGCAGTCTG CTTCAGTTAA TTTTGGTATG TTTCTCGCTG CACGTATGAT TCTTGGTTTT
GGTTCTGGTG GAATGCAAGT TTCTGCTGCT CCATTGTTGG CAGAGACAGC TTACCCATCT
CAGAGACCTA TTTTGACAAG TATGTTGCAG GCTGCCTTTC CAGTAGGTTC ATTCATGGCA
GCACTTTTTA CTTGGGGTCC TTATCAATCT AGTATGAAGT ACAACAACTG GTCATGGCGC
TTACCTTCGT TGCTTCAAGC TGTTGTTCCA TTAATCCAAC TAGTATTAGT TTATTTCTGT
CCAGAATCTC CTAGATGGCT AATTGCACAC GGGAAAGAAG ACGAAGCTTT TGAAATTTTG
ACTAAATACC ATGCTGGTGG TGACAGAAAC AGTGAATTAG TTAAATTCGA AATGGCTGAA
ATTAGTGCTG CAATCTCCAG AGAAAAGATC GGCAAAAAGG TTTCTTGGTT GACATGGTTT
TCATCAAAAG CAAATATGCA TAGATTATTT ATCACGCTTG CATTGCCATC GATTCTTCAA
TTGTGTGGTT CCTCGCTTAT TGCTTATTAT TTCTCAATTG TTCTTCAGGA TATTGGTATT
ACCGAGCCAA GTGACAAGTT AAAGATTAAT ATTGGTTTAA CACTTTTTGG TGGTGTTTGT
GCACTTGTTT TTGCTATTTA TTCTGCTAAG TGGAGACGGA AAGTTGTAAT GCAAGTTTCT
CTTGGTGGAA TGTGCATAAT ATTCATTATT TGGATAGTGT TGGCAGACAT AAACGAAAAA
ACACAATTCA AGAATAAGGC ATTGGGTCGG GGAATTATTG CACTCCTCTA TTTCGATATG
GGATTTTATC ATGCTTTCTC TCCAATTGGA AACACATATG TTATGGAAAT TGTCCCTTAT
ACTTTGAGAG GAAAAGCTTC TATATTGTAT TCCTTGGCAA GTCAAGTTTG GATTTTTTTC
AACAACTACG TGAACAATCT TGGAATGGAT TCCATTGGAT GGAAATACTA TATTGTTTAC
TGTATTTTAT TGTTCACGCA TATGGTCGTG ATTCAATTCA CCTTCCCAGA AACTAGAGGA
TTGGGATTGG AAGAGGTTGC AAAGATCTTC GGAGAAGATA TTTCTGACTT GATGTTCAAA
GCTCAACAAG CTATTGTTGA ACCAGAATCA CCAACAAAGA GGAAGGCTGA AGTGGAGCAC
ATTGAAGATA CTAAATCTAA GACATCGGAA ACATAA
 
Protein sequence
MIGTKDIKEN NIITVSSKGY ENGSLGQEEQ RVLTLEDVTP KLDKWWFQYP HLLKLNVLLG 
CASLAQVTLG YDGSMMGNLQ TLPSWSAYFN KPHGQILSTM TNGITIGAMA VTPFLTASGD
RMKRKHVLAG AILLTILGAA LQSASVNFGM FLAARMILGF GSGGMQVSAA PLLAETAYPS
QRPILTSMLQ AAFPVGSFMA ALFTWGPYQS SMKYNNWSWR LPSLLQAVVP LIQLVLVYFC
PESPRWLIAH GKEDEAFEIL TKYHAGGDRN SELVKFEMAE ISAAISREKI GKKVSWLTWF
SSKANMHRLF ITLALPSILQ LCGSSLIAYY FSIVLQDIGI TEPSDKLKIN IGLTLFGGVC
ALVFAIYSAK WRRKVVMQVS LGGMCIIFII WIVLADINEK TQFKNKALGR GIIALLYFDM
GFYHAFSPIG NTYVMEIVPY TLRGKASILY SLASQVWIFF NNYVNNLGMD SIGWKYYIVY
CILLFTHMVV IQFTFPETRG LGLEEVAKIF GEDISDLMFK AQQAIVEPES PTKRKAEVEH
IEDTKSKTSE T