Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67893 |
Symbol | FHL1 |
ID | 4839400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1144796 |
End bp | 1148977 |
Gene Length | 4182 bp |
Protein Length | 1264 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390715 |
Product | transcriptional regulator of the forkhead/HNF3 family |
Protein accession | XP_001384893 |
Protein GI | 150865609 |
COG category | [K] Transcription |
COG ID | [COG5025] Transcription factor of the Forkhead/HNF3 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACTTTTCCAG CGCCTTTCCG CTGTGTCCCA ACGCCAGAAG CAGATCATAA CGGGTTACGC AAGAAGCTGT GTCCTCAATT CTCCTGAGAC CACAAGAAGA TTTGCATCCA GAAGTCAGCA ATTTCTTAAG ATCGAACATC CGTGCCGCTA TACAAAACAT CTCAATCTCA TCTGAAAAGT AGATTAAATC ACAGCAACAG CATCGAATCA CTGGCACTAT TTCCACACAC TTCACGCCAT TCCAGAACAC CATTGCCATA GACTCCACAA GCAGAGCACT ATCATCTTAA ATCTGGAAAA TGATTAGTGA TCCCAGGGCG TCCAAGAGCC CAGGTGCCGA TCCTGGCCAG GTGAATGAGC TTCTATCTGA CCATCTCATG GACGACATCT CCACAATACT AGATAATAAC CACAATAGCC ATAATAACCA CACAGATCAT AACAACGACC AGAGCAATAG CCAGAGCAAT ATCAGTAACG TCCAAGATAA TAAAACAAAG AATCCTGATT TCAAACACAT TGAAAACGAA TTGGGACCAC TAGACAAAAG CCACCCGGTA ACCAAAATAG TTGTAGAAAA CTACGATGAA GAAGTAAATG CAAACAGAAC TGTCAGCAAT TCTCACCTGC CTGTTCCGGA AGTCGACAAC GAGCGCAAGA AGCCATTACT GCTTTCGCAA CAAGAACTCA GACGTAACTC TGCTCTCGTT CCTATTACTG GAGACGGTGC AGTTGCCAGC TCATCTGTAG GTCATGAATC ATCGAAAATC TCAGCGTATG CACGTTTGGA TTTTGACAAC TTCACTTTTT TCGTCCAGAC ATTGCAAGTA GTTCTTGGTA GAAAGAGCAA CGACCAGCTA CTTCAAGGAA GTCACCATGC TGTGGATGTG CATTTGTCTT CTAAAAAGGC AATTTCACGT AGACACGCAA AGATTTTCTA CAATTTCGGA ACTCAGAGAT TCGAGATCCT GATTTTGGGA CGAAACGGAG CGTTTGTAGA TGACACTTTT GTAGAAAAAG GCATAACAGT CCCGTTACAG GATGGAACAA AATTGCAGAT AGGCGACATT CCGTTTCTGT TTGTTTTGCC ATCTATAGAA CCTAATGAGG AAGATGAGGC AAATTCTACA AGGAACAAAC AGTTCAACCC AACTGATGCC ATCAATTTGA GATCCAACAT TTACAACTCT TCCAAGTCTC CTACCCCTAA GAAATCACCA AAGAAGGATA AGATACGACA GGACATCGAT GAAAAGATTC TAGAACCACC ACCAGCTCCA AAATTGCCTG TTAAGTCAGA GGCTGCCGAA AAGACAAAAA CACAGAGAGC AATGTCGATT GTTGAAGATC CCCATGGCGA AAAACACCAG TCTACAAGCA GAAGAAACTC TCTCTTGAAA ATACGAAGAT TGTCCAATGC TCGTAGGAAG TCTTTGGCAA GTTCCGCAAA TGACGAAATC AACGATATCT TGAAAGAGTT GGGAGTAGCA TCTATAGATG CGATCAACGA AGAAGAGTTA GATTCACAAT TACAGGATCT TCTTGAAGAA CATGAGAGAG ACGATATAGG GATCGATAAT AGAAGTATGC TCAAGTTTTC CCAATACTCT GAATCGGCAA TAGAAGACGA GGAAGACGAA ATCGACAAGT TGGTAAAGCA ACACAACTTG GAGCAAGGCG TAGTCTTAGA TGATGATTCT CAGAACGAAG AAAACAGTAC CCATGATATC GACATGGACC TCTCCGTTCT AGACCAGGAA ATCGCTTCGT TAGCACCTTT GATAGATGCA CACAATCAAG AATTGATGAA GCAAAAGGAA GAGAAACGCA AAAAGCTTGA ACAGGAAAAA AAGAAGAAGC TTCAGAAGAT CGACCTCCAG AGAACTTCTC CACTTATGGG AAAGCCAACT GCTCCCAGAA TGGGCAAACC TGCCTCAATA CAACCTCCAG CTTCTAGAAT ATATGTCAAT GGTGTAGGAA AGCTTCAGGA TAGTTCTATA CCAGCAAATA CTCTTGCATC AGATCTTTCC ACTCCTTTCG CTACAGCACT AGTGGGTCCA CCGAGACCTC CACCTCCAGT ATTGGAGGCC CCAATCGGTA TTATAACCGC CGAACCAGCT ACTATCAGAT CCCGTCCACC TCTTCGTGCT ATCACAGTCA AACCAACAGC CAACTTAGCC AACTTCAGAG TTCCCAAAAC AAATGACGAG CCTTCCAAAT TCCCTAAACA AAAGAGAAGA AGAGATGTCA GTAGGAAACC ACCTAAAAAA GTATATACTA TGGATGAAAT TCCAGACCAG TTCAGATCCA AACCAAATGT CTCTTATCCA TTGATGATCA TGACGGTTTT GAAGTCCGAG ATTGCTAAGA ACGGAATGAC CATCAACGAA ATAAACGAAG CAATCAAGGA ATTCTATCCC TACTACAAGT ACTGTCCTGA CGGCTGGCAG TTTTCGATTT CTCACAACGT CAAATTGACC AAGACTTTCA GACGTTCGGT GAAAAAGGGA TCTGAATGGG TATACACTGT GGATGAATTG TATATCAACG AAAGAGAAAA GGTAAGAAAG AAGCAACAAG AGTTAGCTGC TGCTAAGGCG AAGGCTGCTG CCATTAGAGC AGAGGAAATC AAAGAAAAAC AACGTCTTGA GGCCCAAATG ACAACACAAG CAGTGCCTAC CAGACCTTAC ACTTCACCCT ACGGAGTACC TATGGGTACT ACACTTCATA GTGGTTCGTC ATATTCTGCT CAACTTCCCT ATAAGGCTGT AGGAGGAGTA TCTGGTACCG CTAATGGCGA CCAAAAACCA AAGACAATAG CGGAGCTTGC TAGTGAGATT AGAAGAGATG GAAACACTTC TAAAACTCCA TTGTACTTCA AACCTCAGGC TGCAAGTCCG CTGGATGGAT TCAAGCGTAC TGATGCCACT AGTTTCCCAC CCAATACTAC AATAAAAGAA CAGTTAGCGG CCAACAGGTC TCCCTCCAAC TCTCAAACAT CTTCAGCTAA TGCCACTGCC AGTCCAGCTC CAGCTCCAGC ATCGCAGACC GGAGTACAGC AGAAACCTGG TATCGCTGGC ATGAACCAGG AAACAAAGAA GTCGTTGGCT TACTTACAGA AGGAGTTGGT CAATTTGTAC AAAGCTAGAA AGCTTTCGTA CAATACAGCC ACCACCACTG AGATTATAAC CAAGGCTCTA GCTACTACAA TTGCTCAGGT TAATGTCATT GGTGCAAAAG CTGGTTATGG TGACAATGCC TTGAGTTTCT TGGTTGACAA AGCTCCGCAG CAAGTAAGTA AAATTTTGGA TATTGCCTTG ACCAAGTCCA TAAAGGAAAA GCAAGGCATT CTCTCTTCTC GTCCATCAAG CAGAGCAGGC ACCCCCGGCC CTTCAGCTAT GCCTTCGCCT AGTAGAATAG AAAAGCCAGC CGTTCCAAGC ACTCAGCCTG TTCAACAGCC ACAACCAATT GCACATCCAA CAGCTGTTAA AGTTGTCGTG CCACCTACGG TAGTGGATGC TACTAGAGAT GTAAAGCCTA CTGCTGCTAT ACCAATTGCA GAACCCAGAC CACAGCCACC ACAGGCTGTT CCTGTAGCCA CACCACCACC AATGGCAGTG ACACCAACAG TACCAGTCAC TACTACGGGA CCAATAGCGG TAACAACACC ACCAGTATCA GCAACATCTA AGATTGTAGA GGCGGCATCT GCTTCGAAAC CTACAGCTGC TACCCCACAA GCATCCCCTA TTGTTTCAGC TGCATTGCCA GGAGTGTCTC CATCTCCTGG ACCAACTACA AGTCCATACA AACCCACAGC TCTAGGATCG GGCCTTGCCA GGCCACCCAG CTACAAAACT GACTCACTTG GACGGCCTCC TAGTGCTTTA TCGAGACCAC AGAGCTTTGG AAAGCCAGGA AGTGGACTCT CGAGACCTCC TACTTTCTTG TCTAACAAGC CTGGATTGAG ATCTACCTTT GGGTCTGAAT CCAACTCTGG ACCCCAGCAT CTTGGAAATG GCAATAATAG CAAGAGAGAA CACCCAGAAG ACAAAGAGGA CGAGAACGCG AAAAAGATTG CAAGAACCGA ATGAAGCTAA ACATGTACTA TAGAAAGAGT GACTGTGGTA TGTACCACTA ACATATAGAA TCTCTGTATA ATTAATGTAA ATTATTGTTA AT
|
Protein sequence | MISDPRASKS PGADPGQVNE LLSDHLMDDI STILDNNHNS HNNHTDHNND QSNSQSNISN VQDNKTKNPD FKHIENELGP LDKSHPVTKI VVENYDEEVN ANRTVSNSHS PVPEVDNERK KPLSLSQQEL RRNSALVPIT GDGAVASSSV GHESSKISAY ARLDFDNFTF FVQTLQVVLG RKSNDQLLQG SHHAVDVHLS SKKAISRRHA KIFYNFGTQR FEISILGRNG AFVDDTFVEK GITVPLQDGT KLQIGDIPFS FVLPSIEPNE EDEANSTRNK QFNPTDAINL RSNIYNSSKS PTPKKSPKKD KIRQDIDEKI LEPPPAPKLP VKSEAAEKTK TQRAMSIVED PHGEKHQSTS RRNSLLKIRR LSNARRKSLA SSANDEINDI LKELGVASID AINEEELDSQ LQDLLEEHER DDIGIDNRSM LKFSQYSESA IEDEEDEIDK LVKQHNLEQG VVLDDDSQNE ENSTHDIDMD LSVLDQEIAS LAPLIDAHNQ ELMKQKEEKR KKLEQEKKKK LQKIDLQRTS PLMGKPTAPR MGKPASIQPP ASRIYVNGVG KLQDSSIPAN TLASDLSTPF ATALVGPPRP PPPVLEAPIG IITAEPATIR SRPPLRAITV KPTANLANFR VPKTNDEPSK FPKQKRRRDV SRKPPKKVYT MDEIPDQFRS KPNVSYPLMI MTVLKSEIAK NGMTINEINE AIKEFYPYYK YCPDGWQFSI SHNVKLTKTF RRSVKKGSEW VYTVDELYIN EREKVRKKQQ ELAAAKAKAA AIRAEEIKEK QRLEAQMTTQ AVPTRPYTSP YGVPMGTTLH SGSSYSAQLP YKAVGGVSGT ANGDQKPKTI AELASEIRRD GNTSKTPLYF KPQAASPSDG FKRTDATSFP PNTTIKEQLA ANRSPSNSQT SSANATASPA PAPASQTGVQ QKPGIAGMNQ ETKKSLAYLQ KELVNLYKAR KLSYNTATTT EIITKALATT IAQVNVIGAK AGYGDNALSF LVDKAPQQVS KILDIALTKS IKEKQGILSS RPSSRAGTPG PSAMPSPSRI EKPAVPSTQP VQQPQPIAHP TAVKVVVPPT VVDATRDVKP TAAIPIAEPR PQPPQAVPVA TPPPMAVTPT VPVTTTGPIA VTTPPVSATS KIVEAASASK PTAATPQASP IVSAALPGVS PSPGPTTSPY KPTALGSGLA RPPSYKTDSL GRPPSALSRP QSFGKPGSGL SRPPTFLSNK PGLRSTFGSE SNSGPQHLGN GNNSKREHPE DKEDENAKKI ARTE
|
| |