Gene PICST_30899 details

Gene Information

Locus tag: PICST_30899
Symbol:
ID: 4838255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1225168
End bp: 1226484
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 12
GC content: 39%
IMG OID: 640389570
Product: predicted protein
Protein accession: XP_001383864
Protein GI: 126134679
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1275] Tellurite resistance protein and related permeases
TIGRFAM ID:
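
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1225168
END_BP = 1226484
GENE_LENGTH_BP = 1317
PROTEIN_LENGTH_AA = 438


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1317
    print(expected_protein_length(GENE_LENGTH_BP))  # 438
```

Under these assumptions both computed values agree with the reported gene length (1317 bp) and protein length (438 aa).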


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.341718
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.766697
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAACT CTATAGACCT ATTTGAGAAC CAGGATTACG AGGAGAGGAT ACAGCGTAAT 
CGAAAGTACG AAGAAATTGA AGAAGATGAA GAGGAGGAAT CGTCTAGTTC AACATCAGAA
CAAGGAGATT TCCCGACAAG ATTGAAGTCT CTTCTCAAGC TTGAACTAGT TGAAAAATTC
GATACAGCCT ATTTTGCTAT TATCATAGGA TTTGCAATCT CGGCTAACAT CTGCTATGAA
TTCCCATACC CTTCTCGTTG GCTCAGAATA TGCGGTATAA TATTCTTTGC TTTCGCTGTA
TTTTTCTTTC TTGCAAATTT GAGTTTATTC ATAATAGGGT GCTGCTATCA CCGAGAGAGA
ATATATGCTT ATCATACCGA CCGAAGAAAG TCGTCATTTA TCGGCGCTTT TGTCATGGGA
TTCATCACGA TAATAACATT TATTCACCAT CTTGTAGGTG AGAAACATGC CATCTTTGTG
TGGACTTTAT GGTGGATTGC AGTCTTCTTG TCAATGTACT GTTATGTCAT ATTCTACCTA
TCCTATTTCT CAAAATTAAG TAAGAAGTAC ACCTTACAAG ACGTAAATTT TACTGCAATA
TTATTGCCCG TTACGTTCGT TGTCGTTGCT TCGAATGGAG AAACAATTGC TCCAAGTTTA
CCTACTTTGG AGTTAAGAAT TGTAACAGAG TTGGTCAGTT TGGCACTCTG GAGTATTTGC
TGCTGTTTTG GTTTAATCTT AATTCCAATA GTTATGCTCC GAATGATGAT ATACAAAATT
CCCGATTCCG CCTCAGTGTT TGCCCTCTTT CTTCCAATTG GTTACGTTGG CCAGCTAAGT
TATTTCATTA TGCTTTTCGG AAAGAATATG TCGCAAATGA TTCCTAACCA CAATATTGCC
GAGTCATTCA CAGTAGCTTG CGGTCTTGTT TCGTTCGTAT TAATGTCATT TGGATACCTT
TTGACTCTAC TAGCAATCGG TTCTGTGTTA TCCAAGATAA AACCCTTCGC CAAAACTCCA
AATCCAACGT ACTCTTCTTC AAAAACCGGC CTATTAGTAT GGCACAAGTC GTTTTGGTCC
ATGAATTTCC CTATCGGAAC GATGTCGTTG GCCAACATAG AGCATTCCAG GGGAACTGTT
GGAGGCTACA AAATAGAGTT TTTCAGAGTT GTAAGTGCCA TATATGGAGT AGCTTTGATC
CTAATTACAA TTTGTAACTC GCTTGGAATG TTGCACTACA TCTACAAGAG ATGCAGCAGC
CTATTTGAGA TCAGAAAGAA CAAGTATAAA TATCCTTCAC ATGAGCAACA AGAATAG
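
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as displayed here (blocks of ten bases separated by spaces and line breaks). The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds the reported percentage is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence shown above.
    print(gc_percent("ATGAACAACT CTATAGACCT"))  # 35.0
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.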
 
Protein sequence
MNNSIDLFEN QDYEERIQRN RKYEEIEEDE EEESSSSTSE QGDFPTRLKS LLKLELVEKF 
DTAYFAIIIG FAISANICYE FPYPSRWLRI CGIIFFAFAV FFFLANLSLF IIGCCYHRER
IYAYHTDRRK SSFIGAFVMG FITIITFIHH LVGEKHAIFV WTLWWIAVFL SMYCYVIFYL
SYFSKLSKKY TLQDVNFTAI LLPVTFVVVA SNGETIAPSL PTLELRIVTE LVSLALWSIC
CCFGLILIPI VMLRMMIYKI PDSASVFALF LPIGYVGQLS YFIMLFGKNM SQMIPNHNIA
ESFTVACGLV SFVLMSFGYL LTLLAIGSVL SKIKPFAKTP NPTYSSSKTG LLVWHKSFWS
MNFPIGTMSL ANIEHSRGTV GGYKIEFFRV VSAIYGVALI LITICNSLGM LHYIYKRCSS
LFEIRKNKYK YPSHEQQE
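
The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code in reading CTG as serine rather than leucine. The sketch below shows one way to translate the gene sequence under that table, using the compact codon ordering (bases in the order T, C, A, G) from the NCBI genetic code listings. The function name `translate_table12` is illustrative, and this is not necessarily how the database derived the protein sequence.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG under table 12.
# Identical to the standard code except CTG, which encodes S instead of L.
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate_table12(raw: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = TABLE_12[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the gene sequence in this record.
    print(translate_table12("ATGAACAACT CTATAGACCT ATTTGAGAAC"))  # MNNSIDLFEN
```

The printed residues match the first ten residues of the protein sequence above. Any base other than A, C, G, or T raises a `ValueError` in this sketch.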