Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_77168 |
Symbol | GIN1 |
ID | 4837929 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 139082 |
End bp | 142289 |
Gene Length | 3208 bp |
Protein Length | 907 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389244 |
Product | transcriptional regulator |
Protein accession | XP_001383312 |
Protein GI | 150864478 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCTGTCCCGT GTCTGTCCGC TGCTGTACAC CGTTAGTTTT GCTCTAGAAG CAGCCGTCTT TTTCGTGGTA GTTCACTCGT GGAAATTGTC ACATTCGTAA TTTTTGACAT ATTTCACAGA TTTGTCACAG ATTTCTCACG TAGTCTTTCA AAACAGATTC TACGCTGACG CCAGCGACCT GAAAGCAGTG CCATTCGAGT CAATTTCATT GTGCCACTGC GCAACTGCCA ACCTATAATT ACAGCCAAGT TGGTGTTCTG AATCAGTATC TCATCGCATC ATAATCAGAG ACTTAAGTTC AGAATAAGAA ATACAAGCTT AAAGCCTCTT ACTTTCCCAC AGCCATTCTC TTCTATTTTC AAAGCGTTCG AGTTTTTCCG TCATGATCAA AAGAGAGGGC GAACCGTTCC CCGACTACTC GCACCCGTCA GAAGAGTCCG ACAACGAGTT CCAGCTGTCG GGCCTTCTCC ATTCCACCTC CATCGCTGCG TCCAACTCCT CCAACACATC GAACAACAAC AACGGAAACA GCAACCGAGG AGACCAGAAA CGTAGACGAG TCACCAGAGC ATGCGACAAT TGCCGTCAGA AAAAGGTCAA ATGTGATGGT AAACAGCCTT GTATCCATTG CACTGTTTAT TCGTACAAAT GCACTTATGA TCAACCCAAC GTCAGAAACA AGAAGCATAG CGGAATACCC ATTCCGACCG CGCTGCCATC TTCCGCAGTT TTAGCCGCCG CCGCTGCCAA CATTGCCAAT TCTAACGGAA GCAACGTTGG CGACCTTTCT CAGACTTTGT CGGCAACATC TGCATATCCA GAGAGAAACC TATTGCTCGC CCAAGGCATC ATCAAGTTGT TGCTTCCCAA GTTGAAGATC AACTGCTTCG ACCACAACTT GCAGTTTGAC TTGGAAAAGT TGCAGAAAGT TGTCAACCAC ATCGACGGAA AGTCGTCTTC CGCCTTGAAC GATATCAGTG ACGTCTACTT GGACAACGCT CCCTTACCAT CTCCCAGTAA TCCAGGATCT ATTCGTCACG AACGGCAATC ATCCGTGTCT TCCTCTGACG AAACGACTAT GGGAACAGAA ATCAAATTAT ATTTGCCCAA GAAAGAAATC GCCTTGGACC TCATCTACAC CACATGGAAC AAGGCTTGTG TTTTGTTCAG ATTCTACCAT CGTCCATCGC TTCTAGAAGA AGTAGACCTT CTTTACTCTT TGGACCCGTA CAATTACGGC GACAGACAGC AAAAATTCTT GCCCTTCTTG TACTCGATTT TGGCCTGTGG ATCTCTTTTC TCCAAGTCAG CGACAACATC TTCTGATCGT AACGAGAACT TGGAAGATGA TGGGTTCAAA TATTTCTTGG AAGCCCGTAA GTTGATCGAT ATCAGCAATG TAGGCGATAT TAACTCCATC CAAACCATTG TCATGATGAT AATGTATTTG CAATGCTCAG CCAGATTATC CACCTGTTAT TCGTATATTG GTATTGCCTT GAGAAGTGCT CTCAAAGAAG GGTTGCATCG TAACTTGACC ATCTTCCAGA ACTCGCGAAG GAAATTGGAC CCAATTGAAG TCGATACCAG AAAGAGACTA TTCTACACCA TCTACAAGAT GGATATCTAC ATCAACTCCT TATTGGGCTT ACCCAGATCC ATCAATGAAG ATGAATTTGA CCAGGATTTG CCTGAAGACT TGGATGACGA AAACGTCACT CGTACAGAGT ACTTGTACGA CAAACAAGAA GGACGATTGT CGTCATCTGC ATGTGCTAAC CAACATACGA AGCTAATGCT TATACTTTCA CATATCGTCA AGAAATTGTA TCCAATCAAA GTGAAGACCA CTGAAAAGGA ATCGGTGACC CCAGACAGAA TACATTCTAA AGTTACTGAA TTGGAATTGG AATTGAAGAC TTGGCTAGAT GCATTACCAA AAGAATTGAA ACCTACTGAT CCCAATGATA TCAACTCAGG TAAAGAGATT CCAGAGAAAT TTGTATTAGC CAACTACTAC CTTCATTTGG CATTTTTGAA CTGCCAAATC ATGTTGTACC GTCCCTTCAT CCATTTCGTC AGCGATGGAC CCAATAACTT TCAAAAATCC GATCCTCGTT CGTTGATTAG AGGTAGAAAC TGCATCAAGG TAGCCAGAAT GGTCGTGAAG TTGGCCAACA AGATGATCGA CAAGAACTTG TTGGTGGGAA CATACTGGTT TTCTATGTAC ACTATTTTCT TCTCTGTTGC ATGTTTGATT TACTACTTCC ATTTCGCAAA CTATAATAAT ACTCAAAGTG GTGGTGCCAA CTATGTTGGT GTTCTCTTTG ACGATGATTT AAACATTGAT ATGATCAAAA AAGACATCGA GATCGGTAAG AAGGTGCTTG ATTGTCTTAA GAACAACTCC AATGCTTCAT TGCGTATTTT CAACATCTTG AACACATTAT TTGAACAATT GAATAGAAGA ACAGCCACCA CTTCACGTTT GAAGAATACT AGAGTGGTAG ACCCATATAT CATTAACAAT AACTTGCAGA ACGAAAATGT TCAATCTACA TTCAAGAACT TTGACATCAT GAATAACTTT AAGACAGGGG GCGCAAAATT GTCAGAATCC TCCAATGTGG TTACCAGCGA TAAGAATCCT TATGCTCCGT ACACTCAACA AGATCCACTG AAGAAGTCCG AGGCTCTTGG TCTGTTGTTT ACCGATACCT CGTTGGAAGC ATTCCACTCT TCTAATGATC CTGAGTTGGC TGCGGTTGCC CCTACTGTAC CTTCACTTCC AAGTAGTCTT CCAACTATTG AAGAACAACC TGGAATCACG AATTCAAATA CTACTGACGA AGACAGAAAC GACTATGTTC CTGGTGTATT TGACAAGTTG GATGCTCAAA TCTTTGGCAA GATCTTACCT CCTTACATGT TAGAGAAGAA TGCCCAGAGG TCACATGACA AGTCTGAACA AGAGCGTATT CCAGCACTTG CCAAAGCTGC TCTTTCACAA TACCAGGCTC AGGTTCCTTC CCTGTTGAAT GACCAAGCCT TTGGCCTGGT GAACTTAGAC GATTTTGATT TTTCTCTGTT GGGGGACCCT AACAATCTTG ACTACTTGGA CCCGTTTAAT AATGAATCGC AGTTCCGTTA CTGAGAATTA CGAAATTTAG GTTTAACTAT TTATTTGTAA TGTATATTAG TACCAGTT
|
Protein sequence | MIKREGEPFP DYSHPSEESD NEFQSSGLLH STSIAASNSS NTSNNNNGNS NRGDQKRRRV TRACDNCRQK KVKCDGKQPC IHCTVYSYKC TYDQPNVRNK KHSGIPIPTA SPSSAVLAAA AANIANSNGS NRNLLLAQGI IKLLLPKLKI NCFDHNLQFD LEKLQKVVNH IDGKSSSALN DISDVYLDNA PLPSPSNPGS IRHERQSSVS SSDETTMGTE IKLYLPKKEI ALDLIYTTWN KACVLFRFYH RPSLLEEVDL LYSLDPYNYG DRQQKFLPFL YSILACGSLF SKSATTSSDR NENLEDDGFK YFLEARKLID ISNVGDINSI QTIVMMIMYL QCSARLSTCY SYIGIALRSA LKEGLHRNLT IFQNSRRKLD PIEVDTRKRL FYTIYKMDIY INSLLGLPRS INEDEFDQDL PEDLDDENVT RTEYLYDKQE GRLSSSACAN QHTKLMLILS HIVKKLYPIK VKTTEKESVT PDRIHSKVTE LELELKTWLD ALPKELKPTD PNDINSGKEI PEKFVLANYY LHLAFLNCQI MLYRPFIHFV SDGPNNFQKS DPRSLIRGRN CIKVARMVVK LANKMIDKNL LVGTYWFSMY TIFFSVACLI YYFHFANYNN TQSGGANYVG VLFDDDLNID MIKKDIEIGK KVLDCLKNNS NASLRIFNIL NTLFEQLNRR TATTSRLKNT RVVDPYIINN NLQNENVQST FKNFDIMNNF KTGGAKLSES SNVVTSDKNP YAPYTQQDPS KKSEALGSLF TDTSLEAFHS SNDPELAAVA PTVPSLPSSL PTIEEQPGIT NSNTTDEDRN DYVPGVFDKL DAQIFGKILP PYMLEKNAQR SHDKSEQERI PALAKAALSQ YQAQVPSSLN DQAFGSVNLD DFDFSSLGDP NNLDYLDPFN NESQFRY
|
| |