Gene Information |
Locus tag | PICST_29444 |
Symbol | FST6 |
ID | 4837170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 274723 |
End bp | 278130 |
Gene Length | 3408 bp |
Protein Length | 1135 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388485 |
Product | Fungal specific transcription factor |
Protein accession | XP_001382818 |
Protein GI | 150864115 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
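The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp, End bp
length_bp = gene_length(274723, 278130)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.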
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.746145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTCC ACGTCAACAC ATTTCGGGCC ATGGACCAGT CCAAGCTCGA AAACCGCAAA CAGGAGCTGC ATTCACGTGA ACGACCAGCT TTCCACGATC CTCTTGGCCC TGTGGTTGGT ACCACCATTA CTAATCAGAC GGGTGCAAAT AATCGAAGTG AGCATACATT TGCTCCAGAA AATGGAAAAC AAATCAGGTT TGAAAACTCA AGTGGAAATT CAAGCGATAA CTCAAGCGAC AACGTAGTTG ATAATCTGAG CGTTGCCATG AGTCGAAGAT TTTCCAAAAA CGAAAATAAT GGTGGGGCTG CCACTTTGAA AAATTCTCGC AATACTATGG TAGATGAAGG CTACAGCCAT AAAGCTAGCT TTAAGGTCAA GCTGGAGCCT CGGAAGAGAC ACAGACTCAC TTTGGTGTGT AACAACTGCA AAAAAAGAAA GATCAAATGT AACAAACAAG TTCCATGTGA CTCTTGTGTC AAGTCCGGAA ACTCCCAAAG TTGTTGTTAC GATGCTGCAC TCAGTCCGTC TAACACCACT ATGCCATATG TCTCTAGCGC TAGAAGACCA ATAGTGCTTA AGGAGGAACA GATAGTTATG CCAAAGAGAC ACAAGACTTC ACAAGATAGA AAGCTCATAC CTCATACAAC AACTATTGGT AACAAGGCTG CCTATAATGC AAAAAATGAC GTTTTTCACT CTGCCATAGT TGCGCTGAAT GGTAACAATG GTGAAACCTA TTCTGGAAAT GTAGATTCTT CTGGAGGTGG ACAAAAAGTC AATATATATA AGAAGGAACT AGATGCCTTG AAGGAGAGAA TTCGGCAGTT GGAGTCCTTA AACGGTCTGC AAACCGCTTT GGCTCCAACA AGCGGTGCTT ACTCTGCTGT TTCGGAATCT AATATGACTC CTTCTCTCGT TCCTACATCA TCATACGCCG TGGACGAGTC TGCTCAGAAA CAGATGAATC TACCACCACC ACCTTCTTCT TCTATTCTAT CGTCATATTC TCCCTTGTCA GGTGAGCAAC AGAGTGGCAG TACATCGAGT ACTACTCTTG GCACCCCTTC TCTGGTAGCT ATACAAAAGA TATTGGATTC CAAGCCTTTA TATCAACCAA CATACAATGC ACCAATCAAG TTGCCTCCTA TCAATTGGAA GGCCACGCAA TCTAGCTCAA CTGTTTCATC GACTCCAGTG TCTGACGATC TAGCAAAATC CAGAAGTCAT TCACAGCCTG ACGGAAACGC CCAGGAATTG ACATCCCTTC TTGGTATCAA TCCGGTTCTG CAGCAGCTGG ACGGTATCAA CTTCTACCGT TTTGACAAAG ACACCCCCTT CAGTTGGTCG TCTCTAATGA AGAGTGATGA CCAATTACGA CTGCTCTTGC AATATGTCAA AACTCAACAC GCAGTGCTAG ACGAGGATAT CAGTTATCCC AAGAGTCCTT CCGAGTTGGA AATGTTTGAT ACTAGCACCC AGGAGTATCG GTCTTTAATT CGTCAAACTA TGTCTAGAAC TACTTATGAC CACCACCAAT CGCTCACTGA TCGTATTACC GAACGCTTAC CAGAAAAGGA AATAGTGTGG TTGTTGATCG ATCGATTCTT CGAATTCTTG TATCCGTTTT TACCCTTTGT CGATCAATTG ACATTTATAG AAGATATGGA AAGACTATTG GGTCCAAGAG AGCAAGAGTC TACTGGCTTC GATATTCCTT CACATATAAA GCTACACAAA GAAACGTACG AACATGATAT GGCAATGTTA GGAATATTGT TGCTTATGAT GAGATTTGGT 
TACCTCTCTC TTTTCAGAAA CAATGATGAT TTCAATATCG AGACAATTCA AGGAGGAACT AGTGCAAGAG TTATATCAGT ACTGCGTTTA CTTCGTCATC CTATATCTAT TGATTTAGCA GAGTTGGCCA GAGAATGCTT AGCTGGTTTT AATGTTCTTG ACGAAAACAT TCCATTCACT TTTTCTACCT TGCAGTTAAT GTTTTTCATG AGATTTTACT GTAGACTCCT GCCAGAGGAC GGCTGGGTTC ATGATGTAAT AGACTATCAT GGCGCCATAG TGCGAATGTC TCTTGCGATG AAGTTGAACA TTGATCCAGA TCAGGTACAT CCTTCGGAGA GTCCCAGAAT GAAGAATTTG CGAAGGAAGA TGTGGAATTT CTTGGTTATA GCTGATGTTC ACAACAGTCT AGCATTCGGT AGTCCGTTGT ATATCAAAGA AGAAACTTGC CACACAAAAG TTGTATACAT CACTGAAGAA AATAGCAACA TCATAGGCAA TTTTGAAAGA GAAAAATTCA TTTTTGAAGT TGTCCATAAA AAGTCTTACG ATTTTCATGT TCGTATGAAG TCAATGTTGA AGTTTATCTT GGATAGAAAC TCCACTACTC CATTAAGTCA AATAGTTGAC AAGTTGAATG ATTTTGAAGT ATTGGTGAAC GAGCATTTTG GAGGCTCTCT GATTGTCAAT GCTATCTGCG GAGACTGTAA GTGGAAGGTG GTTTGTAATA AGGGTGATTA TGTTAACATA TTGAATGAAG ACGACCGGCT CTCGTTGAGC TGTTTTGAGA GAATTCATAC AATCAAAGTC TTCTTATCGA TCAATACCTT CTTGATGACT ATTTATTATT ACATGTATTT GTTCTATGGG GACTCCCTGA ACAATATTTC GTGGTTTTAC TTGAAGAAAT CGATGATCTA TATTACTGAT TTGATCCCTA CCTATCACAG TCTCTTGAAT GAATCTCAAA CATGTTCTGA CTTCATTATC AACCCTACTT TGCAGGGATT GATCCACAAA TCCAATCAAG TACAGCTTTC TTTAATCATA AAGTTCAGTA TGAATAGCGG AGACGAGAAA TTGCGTTCTT TACTTGTGAC AACATACCAG ATCTTGATCA ATTTGATAGG TGGAATCAGT AAAAGATATT ACTACGCTTG GAAGATTACA AAAGGACATA CATATTTCTT GTCAGTGATC CAGAGTCAAG CGTTTCAACG GTGGATGTTG CAGCAAGGTG GTGTCGAGTC AAGGTACAGC AAGTATCAGT TGGACGGCTT GAAGACTATT GTTTCACGAT GCATAGACAA GACTCAAGAA AACAGGGAAA ACAGAAGAAG TGGTAAGAAT AGGAACGCAT TAACTGAGTC CGAGTGCCTG GAAAATACTC AGGAATTCTC TAGAGAAACT CTGAGAACTC CGCCTCTGTT ACCTGGTTCT AACGTGGCGA GCAGCCCATC GATGGCAAAT ACCGATCAGC AATGGATAGA CTTGATTTTC AACATAAAGA ACGATCCGGC CACACGCGAG TTGACGTTGT TCGACTTGTT CGACGAGTTG GAGAGTAATC CTTTGGGTGC AAAAAACAAG CTACAAGACG ATAAGTAG
|
Protein sequence | MAFHVNTFRA MDQSKLENRK QESHSRERPA FHDPLGPVVG TTITNQTGAN NRSEHTFAPE NGKQIRFENS SGNSSDNSSD NVVDNSSVAM SRRFSKNENN GGAATLKNSR NTMVDEGYSH KASFKVKSEP RKRHRLTLVC NNCKKRKIKC NKQVPCDSCV KSGNSQSCCY DAALSPSNTT MPYVSSARRP IVLKEEQIVM PKRHKTSQDR KLIPHTTTIG NKAAYNAKND VFHSAIVASN GNNGETYSGN VDSSGGGQKV NIYKKELDAL KERIRQLESL NGSQTALAPT SGAYSAVSES NMTPSLVPTS SYAVDESAQK QMNLPPPPSS SILSSYSPLS GEQQSGSTSS TTLGTPSSVA IQKILDSKPL YQPTYNAPIK LPPINWKATQ SSSTVSSTPV SDDLAKSRSH SQPDGNAQEL TSLLGINPVS QQSDGINFYR FDKDTPFSWS SLMKSDDQLR SLLQYVKTQH AVLDEDISYP KSPSELEMFD TSTQEYRSLI RQTMSRTTYD HHQSLTDRIT ERLPEKEIVW LLIDRFFEFL YPFLPFVDQL TFIEDMERLL GPREQESTGF DIPSHIKLHK ETYEHDMAML GILLLMMRFG YLSLFRNNDD FNIETIQGGT SARVISVSRL LRHPISIDLA ELARECLAGF NVLDENIPFT FSTLQLMFFM RFYCRLSPED GWVHDVIDYH GAIVRMSLAM KLNIDPDQVH PSESPRMKNL RRKMWNFLVI ADVHNSLAFG SPLYIKEETC HTKVVYITEE NSNIIGNFER EKFIFEVVHK KSYDFHVRMK SMLKFILDRN STTPLSQIVD KLNDFEVLVN EHFGGSSIVN AICGDCKWKV VCNKGDYVNI LNEDDRLSLS CFERIHTIKV FLSINTFLMT IYYYMYLFYG DSSNNISWFY LKKSMIYITD LIPTYHSLLN ESQTCSDFII NPTLQGLIHK SNQVQLSLII KFSMNSGDEK LRSLLVTTYQ ILINLIGGIS KRYYYAWKIT KGHTYFLSVI QSQAFQRWML QQGGVESRYS KYQLDGLKTI VSRCIDKTQE NRENRRSGKN RNALTESECS ENTQEFSRET SRTPPSLPGS NVASSPSMAN TDQQWIDLIF NIKNDPATRE LTLFDLFDEL ESNPLGAKNK LQDDK
|
| |
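The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 90 bases of the gene sequence above. The helper names (`codon_table`, `translate`) are hypothetical, and the table is built from the standard code with only the CTG assignment overridden; it is a minimal illustration, not the annotation pipeline used to produce this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg: str = "S") -> dict:
    """Codon-to-residue map; ctg='S' mimics table 12, ctg='L' the standard code."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    table["CTG"] = ctg
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 90 bases of the gene sequence in this record
PREFIX = (
    "ATGGCTTTCC ACGTCAACAC ATTTCGGGCC ATGGACCAGT CCAAGCTCGA "
    "AAACCGCAAA CAGGAGCTGC ATTCACGTGA ACGACCAGCT"
)

table12_protein = translate(PREFIX, codon_table("S"))
standard_protein = translate(PREFIX, codon_table("L"))
print(table12_protein)
print(standard_protein)
```

With the CTG override the result matches the first 30 residues of the protein sequence in the record; with the standard code, residue 23 comes out as L instead of S.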