Gene PICST_50606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50606 
SymbolSGE1.2 
ID4840829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp17184 
End bp18602 
Gene Length1419 bp 
Protein Length472 aa 
Translation table12 
GC content39% 
IMG OID640392144 
Producthypothetical protein 
Protein accessionXP_001386606 
Protein GI150866868 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.862046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACTTGGCTT CGTGGATTAC TACTTCTTAT TTAATCACTT CAACTGCATT TCAGCCTTTG 
TATGGCTCCT TTTCGGATGT TATTGGTAGA AGGAAATGCT GCTTCTTTGC TCTGGCGTCA
TTCGCACTTG GTTGTCTAGG TTGCTCTATG GCAACTGATA TCATCACTTT CAATTTGATG
AGAGCCTTAA CAGGTATCGG AGGTGGAGGG TTAATCACAT TATCTACAAT CGTCAACTCT
GATGTCATCA CTAAGAGGAA GAGAGGATTA TTTCAAGCCG TGCAGAATTT GTTGCTAGGA
TTTGGTGCAG TGTGTGGTGC ATCTTTTGGT GGTTCCATTG CTTCATATTT TGGTTGGAGA
TGGTGTTTCA TCTTTCAAGT GTTTCCTTCC ATTTGGAGTC TTATAATTGG ATACAAGTAT
ATCACAAATC AACCTGGATT TGATGAATCC CAGCAGCATT TATCCAGCTC TGTGTTAGAA
AGAATAGACT ACAAGGGTTC AATTATCCTA GTGAGTGCAC TTACATGTCA ACTATTTGTT
CTCACATTAG GAGGAAATGA ATTACTGTGG CTGGACTCGA GATTAATAAC ACTTGCAATT
TTAGGAGCAA CATTATTATT CTACTTCATC TATATCGAAC TTCATACCAA AGCTAATCCA
ATCATTCCCG TAAGGAGATT CAATAGCTTA TTCACAGTCT TGCTACTTGC CCAGAACTTC
TTATTGGGAT TATGTGCTTA TGCTTATCTT TTTGCATTGC CATTATTATT TCAGATCGTC
TTGGGCGACA CTCCGTCAAA AGCAGGGTTG AGGTTGGCAG TTCCTTCTTT ATCTACTCCT
ATTGGTAGTG TGATAACTGG AGTAATGATG AATAAATATG GAGTGTTAAA AGGATTGTTG
TACGTTGGAA CTATGACAAT GGCAATTGGA AACTTTTTGA CCTTGTTAGT CAGTCCCAGT
ACTCCTAGTT GGCTCTTGAA TATTTTGCTA ATGCCAGCAA ATATCGGTCA AGGAATGGCA
TACCCGAGCT CGCTATTTAC ATTCATATTT GCTTACGGAA CAACTCACCA GGCAACTTCT
ACTTCGACAA TATATTTACT GAGAAGCATA GGGGGAGTAT TTGGTGTATC CAGTGTTTCA
GCGATTATCC AAGCATATTT GAAATTCAAA GTGAGAAAGG ATTTGAGTGC CCTACCAGAA
TTATCCCATA AAGAAATCCA TAAGATTGTT ATTGCAATTT CAAAATCTTC TGATGCCATA
TACAAGTACC CTGACACCAT CAAGTCTATT ATTCTCCTTG ATTACGAAAG AGCCATAAGA
CTTGCACAAT TGTTTTCCAG TATTTGTTGT GCTACAGCCT TTATTCTCTG TTTGATGAGA
GATATAACAA GATCAAAGCC AGACACATCT GTTGCATGA
 
Protein sequence
NLASWITTSY LITSTAFQPL YGSFSDVIGR RKCCFFASAS FALGCLGCSM ATDIITFNLM 
RALTGIGGGG LITLSTIVNS DVITKRKRGL FQAVQNLLLG FGAVCGASFG GSIASYFGWR
WCFIFQVFPS IWSLIIGYKY ITNQPGFDES QQHLSSSVLE RIDYKGSIIL VSALTCQLFV
LTLGGNELSW SDSRLITLAI LGATLLFYFI YIELHTKANP IIPVRRFNSL FTVLLLAQNF
LLGLCAYAYL FALPLLFQIV LGDTPSKAGL RLAVPSLSTP IGSVITGVMM NKYGVLKGLL
YVGTMTMAIG NFLTLLVSPS TPSWLLNILL MPANIGQGMA YPSSLFTFIF AYGTTHQATS
TSTIYLSRSI GGVFGVSSVS AIIQAYLKFK VRKDLSALPE LSHKEIHKIV IAISKSSDAI
YKYPDTIKSI ILLDYERAIR LAQLFSSICC ATAFILCLMR DITRSKPDTS VA