Gene Information |
Locus tag | PICST_58009 |
Symbol | HAP1.2 |
ID | 4838313 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 381492 |
End bp | 384710 |
Gene Length | 3219 bp |
Protein Length | 1047 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389628 |
Product | Fungal transcriptional regulatory protein |
Protein accession | XP_001383702 |
Protein GI | 150864741 |
COG category | |
COG ID | |
TIGRFAM ID | |
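The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, that the coding sequence is the protein length plus one stop codon, and that any remaining bases are intron sequence (the record marks the gene as spliced). The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 381492
end_bp = 384710
protein_length_aa = 1047

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding sequence = one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Assumption: the remainder of the genomic span is intron sequence.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 3219, matching the reported Gene Length
print(coding_length_bp)  # 3144
print(non_coding_bp)     # 75
```

Under these assumptions the reported gene length of 3219 bp is reproduced exactly, and roughly 75 bp of the span would not be translated.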
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.513512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0535661 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGA AACGCCAGAG ACAGCGGAAC AGAGTTCCCG TGTCATGTCT CAACTGTAAG AAGCGTAAGG TCAAATGTGA CAAGGGCAAG CCGTCCTGTT CAGGGTGTAT CAAGAATGGT GTGCCGCATC TTTGCGAGTA TTTGGAGCCA GTGTGGTCGA AAAAGAGTTC TCAGGTGAAG GCAGAAGACG CAGAAGACTC ACATGATGCG AACGCATCTT TACTACAAGT CAAGATCGAA GAGACGAGTG AGTTTAAACA GTTTCGAGCG CATACTGACA AAGTCATCCT TTCTCAAAGA AAGGAGATCG ACGACTTGAA GCGACAGCTT TCGGTGCTCC AGCAGCTCTC GCCGAAGGTC CACGATGCCA CGGCTATGGG CTGTAAGCCT ATCTTGATTT TGACGAAGTT GAACCTCTCT CTCGTTAACA ACCGAGATCC ACTCACAATT CACCACGATC CAGCATACAG TGTAATAGGA CGTACTAGTT CTAAAGTTAA CCACATAGAT ACGTACTCGT GGATCAACTT GATTAAATTG GATCCACAGC TTACCACTCT CTGGTTCAAA ATCACAAATC TCCAGAAGAT CTACCACATG TACAAAATGA ACATGTTGAA CAACACTTCT AGAAACAGCC CGGGAGCCTT CAGCTTACCA AACCAGTCTG TCACATCCAA CCCTTTGTCG AAGAAATCAC CCTATAGAAT CAACGAAATC GACTTCACCT ACAGTGTGGT CAAGTCTGAA GAGCCAAATA AATTAAGGTG TCCAGTTATC GAGTGCGATT TCAACTTCAT GACAGAAGAC CAGATCACAC CAAGTCCAGT AGGTGGGATA AGCTCTCCTG TTCCTCTGGC TCGATCCACT GATACCCCAA GGAAGTACAA CACTGTAGTA CCAAATCAGT ATGGAACGGA ACAGGAGCAA TTCGCTTATC ATGACCTCGT ATCGGAGAAG GGAAGAACGT TGTTACTTAA GGTTCAGAAC CTCTGGGATT CCTCGCTCAA CTTGGTGCGA GGAAACGAAA AGATCAATTT CAAACAGCTC TATTTCCTCA TCGACTTCTA CTTCAACAAC AAGGTGTACG ACATCGAATC GAGGCATATA CTCTCCTTCT ACAAGATAGA AATTCAGAGT ATAATTAAGA AGAACGGGAA TGAGATATCC ATAAATATTG CCAATGATCC TAGTCTTAAA TTGACTGACG AACAGCTTTT CGAGCGCCTT AAGATGAAGG GAGTCTACTT GTGTATGTTA GCGTTGATAA TCGAGGAGTC GTTGGATACC TTGAGACTGA ATGTTAAAGT GGGGTTAGAA GAAGATATTG GCCTCAAGTT CCGTTCATTG TTTCCTACGG AAGTAGTCTA TGTTGGGCAA GGCTCCAAGT TCAGAAATAC TTTGTACATA GTCCAGGAGT TTGTGTTGCA CATAACCAAC TTGAAGTTTT CAGAAACATC TTCTCCGTCT CTTTGCACCA TAGCTTGTTA TATAACATTG CTTAATCGTG AAGTCGCCGA ATACAAAAAA GACGGTGCAA CCTCAGATCC AAAGCCAGGG TTCACAAGCT TGTTCACTGT ATTGTTGAAA ACTATATTGA GTGATGAAGG CACAGTCGAA TTATGGAAAG ACCCTGAGCT CGTCATCTTC AAGGAACAAG AAGCCAGAAA AAGAAACAGA GACTTGAAAA TCCATATGTG CTATATATGG ACAGATCTTG TCAGATTGGC CAACTTGGTT GGATTCAACT TTGTGCCCTT GATAAAACAC TCAGAAGCAA TTGACAACCT CTTGCAAAGA 
CTCTATACAA AGATAGAAGA GGCAGATCTG CTTCTGTATC ATCTAAAGTA CATCACTTCC CTCAATTCCC ATAAATTTGA TGAATTAACT ATAACACTTC ATCTCCACTA TTTGATTGCC AGAATCTCAT CTGCTTTAGC TCATGGGATT TCGAAGGTAG GCGATCTCAA ATTGACTATC GCCAATTTGG AATCATTGAT CAGACAATGC AGTACCTGGA TTGTAGATTT AGGATTACGG AAATTGAGGC ATATACGTAG ATTTGAATGC GTTTCAATGT TAATGTATCT CAGATACTTC ATGAAGTACA TTATATTATT ACAGGCAGAG GAAAGTATGG ACGAAGAGTT GGTAGCATCA TCTGTTCCGG ATATTTTTAC CAAGTATCTT GAAATCATTG ATCTGTTGAG AAAAGAACTT ATCAACGACC ATGATGGCAT GAACAAGCAG TACGTGTTAC TGGCGATAAC AGAACTTTTG ACTAGACTTA TCCAGATTAT TGTTGCTTTG CTAATGAGAG TTTCGAATGA TGATAATATG ATGTCACAAG AAACTGTATT GCGAATTCAG CTCAACAAGT ATTCGGCTTT CAATGGCGAC AAGGCAATTT CTGATTACGG AATGAGCATG GAAGACTTAA TTAAGACACA TATAGTAGGC GTTGTTGAGG ATTCAGTAGA CTTGCTTGCG AAGAGTCCCT TATTGGATAA GGATAAATCT GGTAAACTCT CGAAGTTATG GAAGTTCTAT TTGACGTTTG TCCGTAATTC CAAGAGAATG ACAAGCATCA ACTACGCCAA GATACATGCA AACATTCCGC AATTCCGAGG AATCGGGGCT GCTGGTGATA TGAAGTCTTG CCCAGTTATA ACACCTAGGA GCTTCAAGAA CTCAACGCCA CCAGCGATTA CATCAAAGGA ATATACGAAA TGTCCTATTT CGCATATCAC CACACCCATA GACGAAGACA GTTCGCCTAT CGACTCCAGG CCGGGTAAAT GTCCTGTTAA CCACAGTATT GTAACAACAG CTTCACCAGT GCCAGTGGAT GCTAAGAAAA GGAAGTGTCC ATTTGACCAT ACAGCACTTG ATAGAAGTTC AATGTCTCAA GGGTATAATG CTATTGAAAG TAATATCCGC GGTGTGATCA AGCGCCAGAG AGATAGTTCG GACTCGCCTT CGGATGTTGA AAAAAGTAGC GGATCTACTC CGGTAGTAGA GAGGCCAGAA CCAGTTGTTG AGATGTCTAA CGTCTCCATT TCTGAGACTC GTCCGGACAA CTTTCCTCCA CCCAATCTTG GTTTCGATTT GCAAGCATTC AACGACTTTG ACTTTGACTT TTTGCAGAGT GCCGTACTCT TGGATCAGAT TGAGTTTGGA AACAGCGATG CAGGCAACAT CGAGGGATTT TTTCAATAA
|
Protein sequence | MEQKRQRQRN RVPVSCLNCK KRKVKCDKGK PSCSGCIKNG VPHLCEYLEP VWSKKSSQVK AEDAEDSHDA NASLLQVKIE ETSEFKQFRA HTDKVILSQR KEIDDLKRQL SVLQQLSPKV HDATAMGCKP ILILTKLNLS LVNNRDPLTI HHDPAYSVIG RTSSKVNHID TYSWINLIKL DPQLTTLWFK ITNLQKIYHM YKMNMLNNTS RNSPGAFSLP NQSVTSNPLS KKSPYRINEI DFTYSVVKSE EPNKLRCPVI ECDFNFMTED QITPSPVGGI SSPVPSARST DTPRKYNTVE QFAYHDLVSE KGRTLLLKVQ NLWDSSLNLV RGNEKINFKQ LYFLIDFYFN NKVYDIESRH ILSFYKIEIQ SIIKKNGNEI SINIANDPSL KLTDEQLFER LKMKGVYLCM LALIIEESLD TLRSNVKVGL EEDIGLKFRS LFPTEVVYVG QGSKFRNTLY IVQEFVLHIT NLKFSETSSP SLCTIACYIT LLNREVAEYK KDGATSDPKP GFTSLFTVLL KTILSDEGTV ELWKDPELVI FKEQEARKRN RDLKIHMCYI WTDLVRLANL VGFNFVPLIK HSEAIDNLLQ RLYTKIEEAD SLSYHLKYIT SLNSHKFDEL TITLHLHYLI ARISSALAHG ISKVGDLKLT IANLESLIRQ CSTWIVDLGL RKLRHIRRFE CVSMLMYLRY FMKYIILLQA EESMDEELVA SSVPDIFTKY LEIIDSLRKE LINDHDGMNK QYVLSAITEL LTRLIQIIVA LLMRVSNDDN MMSQETVLRI QLNNMEDLIK THIVGVVEDS VDLLAKSPLL DKDKSGKLSK LWKFYLTFVR NSKRMTSINY AKIHANIPQF RGIGAAGDMK SCPVITPRSF KNSTPPAITS KEYTKCPISH ITTPIDEDSS PIDSRPGKCP VNHSIVTTAS PVPVDAKKRK CPFDHTALDR SSMSQGYNAI ESNIRGVIKR QRDSSDSPSD VEKSSGSTPV VERPEPVVEM SNVSISETRP DNFPPPNLGF DLQAFNDFDF DFLQSAVLLD QIEFGNSDAG NIEGFFQ
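The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this on two short fragments taken from the gene sequence above. It is a hand-rolled illustration, not the tool used to produce this record; the function name `translate_table_12` and the constants are hypothetical. Because the gene is spliced, translating the full genomic sequence this way would not be expected to reproduce the listed protein.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, in TCAG codon order.
# It differs from the standard code at one position: CTG encodes S, not L.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table_12(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE_12[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, copied with their spacing.
first_codons = "ATGGAACAGA AACGCCAGAG ACAGCGGAAC"
print(translate_table_12(first_codons))  # MEQKRQRQRN

# A fragment containing CTG: read as P-S-A here, where the standard code gives P-L-A.
print(translate_table_12("CCTCTGGCT"))  # PSA
```

The first result matches the opening ten residues of the protein sequence, and the second matches the `PSA` within `SPVPSARST`, where the underlying codon is CTG.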