Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_83849 |
Symbol | GRR1 |
ID | 4839325 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 987688 |
End bp | 990737 |
Gene Length | 3050 bp |
Protein Length | 725 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390640 |
Product | protein required for glucose repression and for glucose and cation transport |
Protein accession | XP_001385202 |
Protein GI | 150865826 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CATTTCCCGC CTTCCGCAGC TCGTCCACAC TGTCCAATCC CTGCCTCTAG GACGTTTCTC ATTCCCGGAC TTCTTTCCTG TGGTTCTCTA ATTGTTTGTG GATTCTCACT ATCCAGGGTG GATAAGTATC ATCACTAACG CCAGTTCCAA GCGTCAACAC ACTGAACCTC AGCAATAAAA CAAAACTTCT GGAATTTCCA CAGATCATAT TTCCAGACCC GGAACGCTAT CATTGATATC GCCATTGGTA TTGTTATCGT TGTTAGCATT TCATCGTTGT GTTATCTGTG ACATACTCTC ATTATCAGAG GACAGATTCT GTATATCCAA AAGTGCCGTA TAGTATCGTT GTGAACAAAC ATAACCAGCA CATCAAAACA ACAAGCAAAT CCATAATCAA TCCTATCCTA CAGTTCTGAC TGCATCAACT TAGTGTCAAT TTCGAAAACA TTGTTATCAT CGCATTACTA TTGATCTTCA TTATCATTAA TCTATACAAT CATAAGTTCA ACTATCATTA TCATTAAGAT CATTATTAAT ATCGATATCA TATAACATCA ATAGCCTGAT CTCATTTACA TTGTCTTAAT CCATTAACCT ATATTCAAAA TGAGCTCCCC GAACAGCATA GAAGACGACT CTAATGGCTC TGCTTCCAAC TCCATCAGAC CGGCTCTGCC TTCAAAATCC AGAGATGACT CGAGAAGCGA TCCCATCTCG AAAAACAACG CCAAAAACGA TAACTCTACC TACTCCCATC CTTATCGCCG ACACCATGAC TCGGAGTTTC TCATGGATAA CGAATACACT GACTTCCGTG AAGTGCGCGA CGCCATTATC ACCAACCGCA GACTTCTGGA AGCTGCCCAT AGCCACCTGA TCGAAACTAT TATTGACAAG ACGTCGTTGT TACGTTTACC TACCGAAGTT CTTCTTCAGG TGTTTCACCA TCTTGATCGC AAAGACCTCT TCAATCTCTT GACCGTATGT CAGGAATTTG CTGATCTCAT TATCGAGATT CTTTGGTTCA GACCCAACAT GCAAAATGAT TCTTCATTCA AGAAAATCAA GGATATAATG CAGCTTCCTT CAAGTAAAAC TCATTGGGAT TATCGTCAGT TCATCAAGAG ATTGAACCTC TCGTTTATGA CTAAGCTCGT AGATGACGAA TTACTTCTGC TCTTCATCGG CTGTCCCAAA TTGGAACGGC TCACTTTGGT CAACTGCACA AAATTGACAA GAAATCCCAT AACTCAGGTG CTTCATAACT GTGAGAAATT GCAATCCATA GATTTAACAG GAGTGACAGA CATCCACGAC GACATCATTA ATGCTTTAGC TCGGAACTGT GTTCGTTTGC AAGGCTTGTA TGCTCCTGGC TGTGGGAACG TCTCCGAGGA AGCCATTTTG AACTTGCTTG AATCGTGTCC CATGTTGAAA CGAGTCAAGT TCAACAACTC CAACAACATC TCTGACGAAA GCATTCTCAA GATGTACGAC AACTGTAAAT CTTTAGTAGA AATCGACTTG CACAACTGTC CCAAGGTCAC CGACAAGTAC TTGAAGAAAA TCTTCTTGGA CTTATCACAA CTCAGAGAGT TTAGAATCAG TAACGCTCCC GGAATCACAG ACAAGTTGTT TGAGTTGTTG CCGGAGGGCT TCTACCTTGA AAAATTAAGA ATAATAGACA TCTCCGGCTG CAATGCCATT ACTGACAAGT TGGTGGAGAA ATTGGTTTTG TGTGCACCAA GACTAAGAAA CGTTGTGCTA TCGAAATGTA TCCAGATCTC TGATGCTTCT TTGCGGGCGT TAAGCCAGTT GGGAAGAAGT TTGCATTACA TCCATCTAGG CCATTGCGGG CTTATCACCG ACTTTGGCGT AGCTTCATTG GTGCGTGCCT GTCACAGAAT ACAGTATATA GATTTGGCGT GTTGTTCACA GCTTACTGAC TGGACGTTGG TAGAGTTGGC CAATTTGCCC AAGCTTCGTA GAATAGGCTT GGTGAAGTGC AGTCTTATCA CAGACAGCGG CATTTTGGAG TTAGTCAGAC GTCGTGGAGA ACAGGACTGT TTGGAACGAG TCCATTTGTC GTACTGTACA AACTTGACGA TTGGTCCCAT TTACTTGCTT CTAAAGAGTT GTCCAAAGTT GACTCATTTG CTGTTAACGG GAATATCATC GTTTTTGCGT AGAGAAATCA CACAATACTG TCGTGACCCT CCTCCGGACT TTACAGAGAT GCAGAAAGCC CAGTTCTGCG TATTTTCTGG TAATGGAGTC AACCAGTTGC GCAACTACCT TAACCAAGTA ATGGAAGAAA GAGCATATCT GATTGACCAG GGCGAGATTC AGGCACTCTT TATGGAACGA AGAAGAAGGC AGATCAATGC CGACGTAGAC ATGGATGACG AGGAAATGAA CATATGGGTC CGTAGGGGTC TTGGCTTGCT CCAGCAAGAT GCTACCGAGC CGCAGAATCC GGAAATGGTG GAAATTAATA GAGAAATCTT TCGCGAGCTC AATGAAGGCA ACATGACTCC GGAAGAAATG CGAGACTATT TCATGAGATT GATCCGGAAC CGTCACCACA CACGAATCTT CGAACATCAA CAACAAAGAA TAGAACAACA GGTACAGCAG CTTCAGCAAC AGGGTCAAAT GCCGCAACAG GTACAACCTA TACGTCCTCC ACGGAATCCG CAGAATGCCC CACAGATCAC GCAGCCACCG GTATTTCCCA GCGACGACCA GAATATTCCA TCTCTTGCTG ACGACGAAGA CGATGTAGAC ATGGAACCGC TTTTCCCGCG GAATCCACCA GCTTAACTGG TTCAAGTTGG TGATAGCGGT GGTGCGGAGG CAGAATTTGC TTGACCCACT AGGTACATGC TTCCTAATTT ATTACGCTAA AATAATAGAT GGTATTCTAT GTCTCATATG ATGTAGTGTA TTGTTAATAG TGTGATAGTA TTATGGTGTT AGTCTTATGG TGTTAGAGTT TATAGTTAGC TATCATCTTG GCATTATAGT AGTATCACAG AATCAAAGTT
|
Protein sequence | MSSPNSIEDD SNGSASNSIR PASPSKSRDD SRSDPISKNN AKNDNSTYSH PYRRHHDSEF LMDNEYTDFR EVRDAIITNR RLSEAAHSHS IETIIDKTSL LRLPTEVLLQ VFHHLDRKDL FNLLTVCQEF ADLIIEILWF RPNMQNDSSF KKIKDIMQLP SSKTHWDYRQ FIKRLNLSFM TKLVDDELLS LFIGCPKLER LTLVNCTKLT RNPITQVLHN CEKLQSIDLT GVTDIHDDII NALARNCVRL QGLYAPGCGN VSEEAILNLL ESCPMLKRVK FNNSNNISDE SILKMYDNCK SLVEIDLHNC PKVTDKYLKK IFLDLSQLRE FRISNAPGIT DKLFELLPEG FYLEKLRIID ISGCNAITDK LVEKLVLCAP RLRNVVLSKC IQISDASLRA LSQLGRSLHY IHLGHCGLIT DFGVASLVRA CHRIQYIDLA CCSQLTDWTL VELANLPKLR RIGLVKCSLI TDSGILELVR RRGEQDCLER VHLSYCTNLT IGPIYLLLKS CPKLTHLSLT GISSFLRREI TQYCRDPPPD FTEMQKAQFC VFSGNGVNQL RNYLNQVMEE RAYSIDQGEI QALFMERRRR QINADVDMDD EEMNIWVRRG LGLLQQDATE PQNPEMVEIN REIFRELNEG NMTPEEMRDY FMRLIRNRHH TRIFEHQQQR IEQQVQPIRP PRNPQNAPQI TQPPVFPSDD QNIPSLADDE DDVDMEPLFP RNPPA
|
| |