Gene PICST_83849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_83849 
SymbolGRR1 
ID4839325 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp987688 
End bp990737 
Gene Length3050 bp 
Protein Length725 aa 
Translation table12 
GC content43% 
IMG OID640390640 
Productprotein required for glucose repression and for glucose and cation transport 
Protein accessionXP_001385202 
Protein GI150865826 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CATTTCCCGC CTTCCGCAGC TCGTCCACAC TGTCCAATCC CTGCCTCTAG GACGTTTCTC 
ATTCCCGGAC TTCTTTCCTG TGGTTCTCTA ATTGTTTGTG GATTCTCACT ATCCAGGGTG
GATAAGTATC ATCACTAACG CCAGTTCCAA GCGTCAACAC ACTGAACCTC AGCAATAAAA
CAAAACTTCT GGAATTTCCA CAGATCATAT TTCCAGACCC GGAACGCTAT CATTGATATC
GCCATTGGTA TTGTTATCGT TGTTAGCATT TCATCGTTGT GTTATCTGTG ACATACTCTC
ATTATCAGAG GACAGATTCT GTATATCCAA AAGTGCCGTA TAGTATCGTT GTGAACAAAC
ATAACCAGCA CATCAAAACA ACAAGCAAAT CCATAATCAA TCCTATCCTA CAGTTCTGAC
TGCATCAACT TAGTGTCAAT TTCGAAAACA TTGTTATCAT CGCATTACTA TTGATCTTCA
TTATCATTAA TCTATACAAT CATAAGTTCA ACTATCATTA TCATTAAGAT CATTATTAAT
ATCGATATCA TATAACATCA ATAGCCTGAT CTCATTTACA TTGTCTTAAT CCATTAACCT
ATATTCAAAA TGAGCTCCCC GAACAGCATA GAAGACGACT CTAATGGCTC TGCTTCCAAC
TCCATCAGAC CGGCTCTGCC TTCAAAATCC AGAGATGACT CGAGAAGCGA TCCCATCTCG
AAAAACAACG CCAAAAACGA TAACTCTACC TACTCCCATC CTTATCGCCG ACACCATGAC
TCGGAGTTTC TCATGGATAA CGAATACACT GACTTCCGTG AAGTGCGCGA CGCCATTATC
ACCAACCGCA GACTTCTGGA AGCTGCCCAT AGCCACCTGA TCGAAACTAT TATTGACAAG
ACGTCGTTGT TACGTTTACC TACCGAAGTT CTTCTTCAGG TGTTTCACCA TCTTGATCGC
AAAGACCTCT TCAATCTCTT GACCGTATGT CAGGAATTTG CTGATCTCAT TATCGAGATT
CTTTGGTTCA GACCCAACAT GCAAAATGAT TCTTCATTCA AGAAAATCAA GGATATAATG
CAGCTTCCTT CAAGTAAAAC TCATTGGGAT TATCGTCAGT TCATCAAGAG ATTGAACCTC
TCGTTTATGA CTAAGCTCGT AGATGACGAA TTACTTCTGC TCTTCATCGG CTGTCCCAAA
TTGGAACGGC TCACTTTGGT CAACTGCACA AAATTGACAA GAAATCCCAT AACTCAGGTG
CTTCATAACT GTGAGAAATT GCAATCCATA GATTTAACAG GAGTGACAGA CATCCACGAC
GACATCATTA ATGCTTTAGC TCGGAACTGT GTTCGTTTGC AAGGCTTGTA TGCTCCTGGC
TGTGGGAACG TCTCCGAGGA AGCCATTTTG AACTTGCTTG AATCGTGTCC CATGTTGAAA
CGAGTCAAGT TCAACAACTC CAACAACATC TCTGACGAAA GCATTCTCAA GATGTACGAC
AACTGTAAAT CTTTAGTAGA AATCGACTTG CACAACTGTC CCAAGGTCAC CGACAAGTAC
TTGAAGAAAA TCTTCTTGGA CTTATCACAA CTCAGAGAGT TTAGAATCAG TAACGCTCCC
GGAATCACAG ACAAGTTGTT TGAGTTGTTG CCGGAGGGCT TCTACCTTGA AAAATTAAGA
ATAATAGACA TCTCCGGCTG CAATGCCATT ACTGACAAGT TGGTGGAGAA ATTGGTTTTG
TGTGCACCAA GACTAAGAAA CGTTGTGCTA TCGAAATGTA TCCAGATCTC TGATGCTTCT
TTGCGGGCGT TAAGCCAGTT GGGAAGAAGT TTGCATTACA TCCATCTAGG CCATTGCGGG
CTTATCACCG ACTTTGGCGT AGCTTCATTG GTGCGTGCCT GTCACAGAAT ACAGTATATA
GATTTGGCGT GTTGTTCACA GCTTACTGAC TGGACGTTGG TAGAGTTGGC CAATTTGCCC
AAGCTTCGTA GAATAGGCTT GGTGAAGTGC AGTCTTATCA CAGACAGCGG CATTTTGGAG
TTAGTCAGAC GTCGTGGAGA ACAGGACTGT TTGGAACGAG TCCATTTGTC GTACTGTACA
AACTTGACGA TTGGTCCCAT TTACTTGCTT CTAAAGAGTT GTCCAAAGTT GACTCATTTG
CTGTTAACGG GAATATCATC GTTTTTGCGT AGAGAAATCA CACAATACTG TCGTGACCCT
CCTCCGGACT TTACAGAGAT GCAGAAAGCC CAGTTCTGCG TATTTTCTGG TAATGGAGTC
AACCAGTTGC GCAACTACCT TAACCAAGTA ATGGAAGAAA GAGCATATCT GATTGACCAG
GGCGAGATTC AGGCACTCTT TATGGAACGA AGAAGAAGGC AGATCAATGC CGACGTAGAC
ATGGATGACG AGGAAATGAA CATATGGGTC CGTAGGGGTC TTGGCTTGCT CCAGCAAGAT
GCTACCGAGC CGCAGAATCC GGAAATGGTG GAAATTAATA GAGAAATCTT TCGCGAGCTC
AATGAAGGCA ACATGACTCC GGAAGAAATG CGAGACTATT TCATGAGATT GATCCGGAAC
CGTCACCACA CACGAATCTT CGAACATCAA CAACAAAGAA TAGAACAACA GGTACAGCAG
CTTCAGCAAC AGGGTCAAAT GCCGCAACAG GTACAACCTA TACGTCCTCC ACGGAATCCG
CAGAATGCCC CACAGATCAC GCAGCCACCG GTATTTCCCA GCGACGACCA GAATATTCCA
TCTCTTGCTG ACGACGAAGA CGATGTAGAC ATGGAACCGC TTTTCCCGCG GAATCCACCA
GCTTAACTGG TTCAAGTTGG TGATAGCGGT GGTGCGGAGG CAGAATTTGC TTGACCCACT
AGGTACATGC TTCCTAATTT ATTACGCTAA AATAATAGAT GGTATTCTAT GTCTCATATG
ATGTAGTGTA TTGTTAATAG TGTGATAGTA TTATGGTGTT AGTCTTATGG TGTTAGAGTT
TATAGTTAGC TATCATCTTG GCATTATAGT AGTATCACAG AATCAAAGTT
 
Protein sequence
MSSPNSIEDD SNGSASNSIR PASPSKSRDD SRSDPISKNN AKNDNSTYSH PYRRHHDSEF 
LMDNEYTDFR EVRDAIITNR RLSEAAHSHS IETIIDKTSL LRLPTEVLLQ VFHHLDRKDL
FNLLTVCQEF ADLIIEILWF RPNMQNDSSF KKIKDIMQLP SSKTHWDYRQ FIKRLNLSFM
TKLVDDELLS LFIGCPKLER LTLVNCTKLT RNPITQVLHN CEKLQSIDLT GVTDIHDDII
NALARNCVRL QGLYAPGCGN VSEEAILNLL ESCPMLKRVK FNNSNNISDE SILKMYDNCK
SLVEIDLHNC PKVTDKYLKK IFLDLSQLRE FRISNAPGIT DKLFELLPEG FYLEKLRIID
ISGCNAITDK LVEKLVLCAP RLRNVVLSKC IQISDASLRA LSQLGRSLHY IHLGHCGLIT
DFGVASLVRA CHRIQYIDLA CCSQLTDWTL VELANLPKLR RIGLVKCSLI TDSGILELVR
RRGEQDCLER VHLSYCTNLT IGPIYLLLKS CPKLTHLSLT GISSFLRREI TQYCRDPPPD
FTEMQKAQFC VFSGNGVNQL RNYLNQVMEE RAYSIDQGEI QALFMERRRR QINADVDMDD
EEMNIWVRRG LGLLQQDATE PQNPEMVEIN REIFRELNEG NMTPEEMRDY FMRLIRNRHH
TRIFEHQQQR IEQQVQPIRP PRNPQNAPQI TQPPVFPSDD QNIPSLADDE DDVDMEPLFP
RNPPA