Gene Information |
Locus tag | PICST_935 |
Symbol | FST4 |
ID | 4840929 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 419169 |
End bp | 422231 |
Gene Length | 3063 bp |
Protein Length | 760 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640392244 |
Product | Fungal transcriptional regulatory protein |
Protein accession | XP_001386474 |
Protein GI | 150866770 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
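The Gene Length field above can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated in this record; the helper name `span_length` is hypothetical. It also compares the 760 aa protein length with the gene span, which is consistent with the gene being marked as spliced.

```python
# Values copied from the Gene Information fields above.
START_BP = 419169
END_BP = 422231
PROTEIN_LENGTH_AA = 760


def span_length(start: int, end: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_length = span_length(START_BP, END_BP)

# Nucleotides needed to encode the protein (3 per residue, stop codon not counted).
coding_nt = PROTEIN_LENGTH_AA * 3

print(f"Gene span: {gene_length} bp")
print(f"Coding nucleotides for {PROTEIN_LENGTH_AA} aa: {coding_nt} nt")
print(f"Span not accounted for by codons: {gene_length - coding_nt} bp")
```

Under that assumption the computed span matches the reported 3063 bp. The difference between the span and the coding nucleotides is only a rough indicator of non-coding sequence within the gene, not an exact intron length.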
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.664373 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GCGCGTAAAA GGAACAGAAT AACGTTGGTC TGTAACGTGT GCAAGTTCAG AAAGGTTAAA TGCGATAGGG GACAGCCATG CTCATCGTGT ACAAAGTACA ACACGTCTCA TATCTGTGCT TACAATAAGC CCTACTATGG AGATGCTTCG GTAGATGTTG TTAATGTAGA ACCAGATGTT CTGCAAAAGA ATTCACTCTC TGAATCTTCG TCTGTTAACT CGGCTCAAAC TGCTAGTGGT ATAGTAGATT CCCCCAATTC CGGACCATCA TCCACGTCTA CATCATCTAC ATCTGTTTCG GCTATTTCGG CAAATCAGCC TGCTAGTTCT TCTGTATTTT CTAAAGTGAT TACACTGGCT ACTGCCCAAG ACACCACGTC TCTGAATTTG TTCAATAGAA ATGCCGTTCC AACTCCTTTA GAAAGCCACG ATAGATTATC CAATAGCAAT ACTTCGGCAT TCAAACCTGA CTACCAAACA CCTAGACGAA TAACACTGAA GCAAAAACTC CAAGCTCAGC ATCCTCAATT AGGTCTTCAA CAACCTCAAC AAGCACTAGG ACAACAACCA TTGGCACAGC AACCGTCATC ACAACAACCA TTGTCACAAC AACCCTTAGA TCAACAGCCC CAAGCTTTAC GCCATCAAAA TACTACTCTT CCTACTCCAC AACGACCACC TCCACAATTA ACACATGTGC CACAGCAAGA TGCAAGATTG CGAAGGGTGC ATCTGTCGGT TTCGCTGGAA TCGTATAAAT CTTCAGAGAG ACCATACAAG CTGTCAATTT CATCAACTGG TTCCTATTCA ACAACATCAC CAGTTGTAAC TGGCAGAAGT CCGCATGGCC TAAATAATAT CAATAGTGTA GATCAGTCCT TAAACAAACT TCTGGAAGTA CAGTCCAATG ATTCCGACAA AGGTAGTCAA CAATTCACTT CTCCGTCCTA TGCCTCTGCA ACAACTCATG CCAGTACAGG ACCTACATCC GTTAGTTCAA CTACCCATCT TGACTACAAA GAGCCGAACT TGACCCCAAA TTCTAGAGTA TACCTTGTAG CGAAACAGTA TGTAGACATA ATGAAGCATT TCCAGGACTA TAAATCAGTG ATTGGAATCA ACCCGGTTAG CTCACCTTCA GACACAATTA ATTTCTTTGA TTATCCGCCT ATTTCCACAA CCCCAGACAA ATATGAAGTC AACCACGGAC CTTTTACGTG GCATGCATTC TTGAAGATGG ACATGGGGTT GTCTGATTTG TGGGCATTCA TGTCAACTAA GTCTCAGAGC TTCCACGAGG CTAAGACTAA ATTTATGGAT TGTGTGCCTC CTGGCAAGAA CAAAGAAAGT TCAAGATCTT TTGAAAATCA GTCAGTTTTG AACAAAACAC CAATCCAAGT ACTTCAAAGA TTGAATATAG CAGCTAAGAA AAAATTTAAC ACTTCTAAGC ATAATATCAA TGAAGGTATA CCTTTGGGAC TTAACTTCAT TACCGACCAT GAATATTTAA ACAATAATGA TATTGAATTG TCTTCCAAGA TCTTGACAGT ATTGCCAAGC AAGAGAATTG TTTGGATCCA TATTAATAGG TTTTTCAAAT TTATCTACCC TTATATTCCT TTCCTAGATG AAACTTCGTT CAAGAAGTCT ATTGAGGCTA TTATTGGTCC AGAGTCATTT ACGGAAGAAA AGATAGAAGG CCTAAAGATA GAAGATAATA TTGACTATGC CAAACTTGGA ATCATGTTGA TTATTCTCAG AATGAGTTAT CTTTCGTTAA TCACCAACAA CCAGGAAGTC 
AACGAGTTCT TCTTGGGGTA TGACGGGCAG AAGCGCAATA CATTCTATAA AGAAAAGCAG AAATTGACGG ATTCGGAATT GCGTAGCTTG AAAATGCTAC TCCTTAACTC TATTGGAATT GATGTCATTG TATTAGCAAG AAACTGTTTG AACAAATTCC AGTTATTCCA GAAGATGAAC CTATCCATTT TGCAATTGGC ATTGTTCATG AGAATCTACG TTTTCATTGC GCCAGAAGAT TCCGAAGGTC CTGATCGCAA CCAGTTTCAG ATTTACAATG GCACCTTGCT TCTGATGGCT TATTCAATAG GATTGAATAG GGAACCGGAT TCGTTCAAGT CCGTCTTAAA TGATCCAAAG GAAAATAATA TCAGAAGAAA GATTTGGTTC CATTTGAACT ATATCGATTT ATTACATGGA TTTTCTTCCG GTAACCCATT AACCACTAAT CCAATAGCTT CAGATGTAAA GTTTCCTTTC CACGAGGATG GAAATGAAAA CATTAGTTTT GAAGCTCTTG ACTCTTATAC ACATGTTGCT TGGAAATTTT TGCAACCGCT CAGTACCAAA ATGAGAAATA TTCTTAATAT CGTGGGAAAT GTCAGTGAAG GAGTCAAAAT TATAGAATTA GTAGAAGCAT TGAATGACTT CGAGATATTT GTGGCTACTA ATCTCGGCAC CCACATTTCT GAATTTATAG ATGCTTTCAA GGAGGATTTT TTTAATGACA ACTTCAGAAG AGTCTTGCAA GTTCATTACT ATATCCAAGT TAAATTTTTC ATGGTATCTA TCTATCATCA TCTTTTTATT CATTACGAGA CAGAAAAGAA TCAGGAATTG AGCTTTTTCT ATCTCAAAAA GATGTTAATG GTATTGACTG AAGAATTTCT TCCTAGTTAC TACACTATTT TGGAACATGA GCATTATTAC TTCAGCTACA GTGCAAACCT TTTTTTGAAT CCTGTCATTG AGTCGGCAAT GCATAAGTCT AATGGTATCT TGTTCAGTCT CATTATCCGC ATGAGCTACA CAATCAGAGC CATGGAGGGA GCTAATCCTT CTGCTCATGT TGAAAAAATG GCTACGGATG AGTCGTATAA ATCATACTTC GATGATATGC TGCAATTGCA ATACTATCTC ACAAAGTGTG CTAAACTCGG GACTCTAGGA ATATCAAAGC TTGGTAACCG TTACTTGTAC GCTTGGAGAA TTAGCAAATC CAACTTGTTT ATTCTTCGGG CTATTCTGAG TGACGAGTTT TAC
|
Protein sequence | ARKRNRITLV CNVCKFRKVK CDRGQPCSSC TKYNTSHICA YNKPYYGDAS VDVVNVEPDV SQKNSLSESS SVNSAQTASG IVDSPNSGPS STSTSSTSVS AISANQPAMI GINPVSSPSD TINFFDYPPI STTPDKYEVN HGPFTWHAFL KMDMGLSDLW AFMSTKSQSF HEAKTKFMDC VPPGKNKESS RSFENQSVLN KTPIQVLQRL NIAAKKKFNT SKHNINEGIP LGLNFITDHE YLNNNDIELS SKILTVLPSK RIVWIHINRF FKFIYPYIPF LDETSFKKSI EAIIGPESFT EEKIEGLKIE DNIDYAKLGI MLIILRMSYL SLITNNQEVN EFFLGYDGQK RNTFYKEKQK LTDSELRSLK MLLLNSIGID VIVLARNCLN KFQLFQKMNL SILQLALFMR IYVFIAPEDS EGPDRNQFQI YNGTLLSMAY SIGLNREPDS FKSVLNDPKE NNIRRKIWFH LNYIDLLHGF SSGNPLTTNP IASDVKFPFH EDGNENISFE ALDSYTHVAW KFLQPLSTKM RNILNIVGNV SEGVKIIELV EALNDFEIFV ATNLGTHISE FIDAFKEDFF NDNFRRVLQV HYYIQVKFFM VSIYHHLFIH YETEKNQELS FFYLKKMLMV LTEEFLPSYY TILEHEHYYF SYSANLFLNP VIESAMHKSN GILFSLIIRM SYTIRAMEGA NPSAHVEKMA TDESYKSYFD DMSQLQYYLT KCAKLGTLGI SKLGNRYLYA WRISKSNLFI LRAISSDEFY
|
| |