Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_76670 |
Symbol | FST5 |
ID | 4837758 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1742226 |
End bp | 1745640 |
Gene Length | 3415 bp |
Protein Length | 1026 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389073 |
Product | Fungal transcriptional regulatory protein |
Protein accession | XP_001383619 |
Protein GI | 150864684 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTTTAACCAT TTTTCCTACG TCCTAAGCAC AACGCTGGTA CTGGCGTCAC TCAGCGAGTC CCGTGACGTT TTTCCTTACC CTGCCACCTC TTCCCACCGC TTCCATCCAT GGCCCAATAG CCATGTCTAC CAAGAAGCGC AATAGAGCCA CTAAGGTGTG CGACTTCTGT AGAAGGCGCA AAGTTAAGTG TGATTTGGGG AACCCCTGCT CCACCTGTGT CAAGTACGGC CACCGCAATT GCCGATACAA GGAACAAGAC AACGAGGCCG ACTCAGACTT CAAAGTCCAG GATCAGTTGC TCCAGTTAAA GGAGAAATTG AGACTGCTCG AAGAGTCTGT AAATCAGAAC CCAGCCTTCA GTGAGTACCA CAAGGAAAAG AGCAGCATCA GCAGTAGCAG CAGCAGCAGC AGCAACATTA ATAACTATAA CAGTAATCGT AACATTTCTA GTAACAACAG TGTCAATAAC AGTAGTAGCG TGAATAGCAA GATCAATAAC AATCTAAACA ACAATCTTAG CAATCTTAAC ATTAACAACA ACCCTAACAG CATCAATGTT AATGACAACC TCAATATCAA CAACGAAGAT GTCAACTTCA ACAGCAATAT CACCAACAAT AACAAACGAC AACATCTTGC GGCCAGCAGT TCGCCACCAT TTGAGAACCC TGGTTTTGGA ATCTTGGATG TACGGAGCGT CTTAGGGAGA AACCCCGTTG CATCAGAGGA GGATGTAATC AACTTCTACG ATGGATACTC GTCCGTCTAC GACAAGGAAC CAATTAGAAG ACGGAATTTC GGGCCCATGG CATGGGTCAC TTTAGTAAAA GTGGATAACT GTAGTGGGAA GCTCTGGGAC TATATACATT GTGTCAAGAA CTCGAACAAG GAAATCCGGG GCGCCAAGTT TAACGGCGAG ATGTTAAGCA AAGTAGACGT TTTTCCCACA GCCAACGCAG TGGACCAGGA TTTCCGAGAA AAGGTCACCG AAGATGAAGG GTTCAACGAA GTGAGGCCAT ATAAAGATGT TGCCTATAAA CTCTCACCGG GACACAATAA GTCTTCTATT GACAATAAGC ATTCTCAGAA TACTTTGAAT GAGAAGGCCA AGAGCTTAGG CTTGATGTTC TACCAAGGGG GATTGGACGA GGAGTTGGCG TTGATCGAAA AGATCCGCTT GGTCTTGCCC AAACGTAAAG TCATATGGCT ATTGTATAAG CGATACTTCA CCCACTTGTA TCTGGCCTTG CCATTGCTTG ACGAAGTCGT CTTCAAGGAA AAGATCCAGA AGCTCGTGGG CCAGGAAAGC TACGAGGATG TAGATGTCAA AGTGCAAGTG GAGATGCGAT TGGATTTCGC CCAGTTGGGA TTGTTGCTCA TAGTACTTCG TTTGAGTTAC TTGACGTTGT TCACGAATGT CGCTTCTATC AATGAGGCGA ACTTGATTCT GAATGATCCA TCACCAAGAG CCCAGGAAAT CAAGTATTTA TTGAACAATC CCATCAACAT CGACGTTATA GATGTGGCTC AGAGCTGTCT TAACCAGTTT AACCTAATGA AGAATGTCAA CATGACCACC ATGCAATTGG CCATGTACAT GAGGATCTAC CACATGTATG CTCCTGAAGA AGGCGACGGT ATCGACGGTG GAGATGCGCA GGTGTTTAAT GCCATGTTGA TTCAGATGGC TTACTCTCTA GGCTTGCATC GTGAACCAGA CAAATTTCCC AAAGAATTGA ACGACGAAAA GACAAACAAC CTTGGAAGAA AAATGTGGTA CATGTTGTTG GTGTTTGACA TGAACAACTG TATGGCTAAT GGTACTCCCA TGAATGTACA CAGGTTGTCT TTCGACACCG TATTCCCTTT TTATAGACCA GGAAACGAGA ACGTCATAGA TGTGGAGGTA GAAAAAAGCG TCTTAAGTTC ATTCAACCGT TTCAACAACG TGTACGCTCC CATGACAGAA ATTCTCGATA TGATTGTCAA AGTCAAGGGA GGGGTCAAGA TGACCGATCT TGCTGAGAAA TTGTCGTATA TGGAATCTCA CTTTGTAGAG GAGTATGGCA AGATAGGCAA TCATTTCGAC GCTGGAAACT TGAGTCAAGT TGAAGTATAC CAGACAACAT TAAAAGTCAA AATCCATCTA ACCTCCAATA TGTTTGTTGT CACAATTTTT TTCCACATCT TCAACTATTA CGAGAAGAAA GGATTTTCTG AGTTAGCCTT CTTTTACCTC AAGAAGATTT TCTTGATCAC CATCTTAGAC TTAATGCCTT TCTATTTTGA ATTCTTAGAC CGCTCCCATA TTATTTTCAA GAATTCCACG GATTTGAGCA TTACTCCCGC ATTTGAAATG GTGACTCATA AGGCTCTTAT TGTATTGATG AGTATACTAT TAAGAGTTAG ATTTGCTATC AAGACGAGCG AGGACTCCTT TGATCACAAT ATGAAATTAA TCAAGGTTAT CAGCTACAAG ATATATTACG ACTGTTTGAT TAGAATTAAG GGTCTTCTCG AAAAGTGTTT GGGCGTTTTC AGAGAGAGCA TTGCAAAGTT AAGTCACAGA TATTACTATT CGTGGAGAAT TACCAAGGCA CAGAACTTCT TAAACACACT TTTGACAAGC GACCAGTTAT ACGAGACTTA TGCTCCCCGC ATGAGCGGGC CCAGATTGGA ATTCACTAAT GCCATGTTGG AAGAATTGAC CACTATTCTT GAAAAGGCTT TATACAAAGT GAAGGAACAC AAGAAGGCCC AAAAAGGTGG GTCTCAAGCA TCTAAAGCAG ATCCAACGGT CAAGAGGGAT CCTGCGAGCT ACACCGTTAG TGGATCTGTT GGACTGAGTG ACGATCGCTG TAATTCATAC GAAGATCGTG ACAAGATTAC ACCTACTTTT TCGAGTACTT CTGTTGGATC AACTACCTCT AATGACAACT ATTTTGACGG TGACTACTTG CCCAATGACC AGATCGACTC CATTTGGTTA CAGATGATGT CGTTGAAGAG TCAGGCGGCA GATCCCACAG CCAACTTCAA CACCCCTGCC CCACTTGGAG TGGGCATTCC TGGATCAAGT ATGTATGACG CATTGGGAAC GCCTGGTGCG TCCCTTGGTG GGGACATGGG TCCTTATCTT GAGACAGATA ATCTCAACCT CTATCAGCGA GACGATTTCT TTGGAAACAT GCCTCTTGAA GAGATCTTCA AAGACTTCAG CTGATTTTTT CGTTTTTTTT CCACGACTTG TTTTACTATT GCTTTTGAAA CTAGAAATTA CCATAAAAAC CATCAAAAAG ACTAAAAAAA AGAATGCTAA TTGAATTCAG GAGATCTATT CCAGATTTTG TATATATCAT TGTTTATTAA CTTGTAAATA GAACGATTGT GAATGAATGA ATGTCTATGT AAATG
|
Protein sequence | MSTKKRNRAT KVCDFCRRRK VKCDLGNPCS TCVKYGHRNC RYKEQDNEAD SDFKVQDQLL QLKEKLRSLE ESVNQNPAFS EYHKEKSSIS SSSSSSSNIN NYNSNRNISS NNSVNNSSSV NSKINNNLNN NLSNLNINNN PNSINVNDNL NINNEDVNFN SNITNNNKRQ HLAASSSPPF ENPGFGILDV RSVLGRNPVA SEEDVINFYD GYSSVYDKEP IRRRNFGPMA WVTLVKVDNC SGKLWDYIHC VKNSNKEIRG ANKVDVFPTA NAVDQDFREK VTEDEGFNEV RPYKDVAYKL SPGHNKSSID NKHSQNTLNE KAKSLGLMFY QGGLDEELAL IEKIRLVLPK RKVIWLLYKR YFTHLYSALP LLDEVVFKEK IQKLVGQESY EDVDVKVQVE MRLDFAQLGL LLIVLRLSYL TLFTNVASIN EANLISNDPS PRAQEIKYLL NNPINIDVID VAQSCLNQFN LMKNVNMTTM QLAMYMRIYH MYAPEEGDGI DGGDAQVFNA MLIQMAYSLG LHREPDKFPK ELNDEKTNNL GRKMWYMLLV FDMNNCMANG TPMNVHRLSF DTVFPFYRPG NENVIDVEVE KSVLSSFNRF NNVYAPMTEI LDMIVKVKGG VKMTDLAEKL SYMESHFVEE YGKIGNHFDA GNLSQVEVYQ TTLKVKIHLT SNMFVVTIFF HIFNYYEKKG FSELAFFYLK KIFLITILDL MPFYFEFLDR SHIIFKNSTD LSITPAFEMV THKALIVLMS ILLRVRFAIK TSEDSFDHNM KLIKVISYKI YYDCLIRIKG LLEKCLGVFR ESIAKLSHRY YYSWRITKAQ NFLNTLLTSD QLYETYAPRM SGPRLEFTNA MLEELTTILE KALYKVKEHK KAQKGGSQAS KADPTVKRDP ASYTVSGSVG SSDDRCNSYE DRDKITPTFS STSVGSTTSN DNYFDGDYLP NDQIDSIWLQ MMSLKSQAAD PTANFNTPAP LGVGIPGSSM YDALGTPGAS LGGDMGPYLE TDNLNLYQRD DFFGNMPLEE IFKDFS
|
| |