Gene PICST_67893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67893 
SymbolFHL1 
ID4839400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1144796 
End bp1148977 
Gene Length4182 bp 
Protein Length1264 aa 
Translation table12 
GC content44% 
IMG OID640390715 
Producttranscriptional regulator of the forkhead/HNF3 family 
Protein accessionXP_001384893 
Protein GI150865609 
COG category[K] Transcription 
COG ID[COG5025] Transcription factor of the Forkhead/HNF3 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACTTTTCCAG CGCCTTTCCG CTGTGTCCCA ACGCCAGAAG CAGATCATAA CGGGTTACGC 
AAGAAGCTGT GTCCTCAATT CTCCTGAGAC CACAAGAAGA TTTGCATCCA GAAGTCAGCA
ATTTCTTAAG ATCGAACATC CGTGCCGCTA TACAAAACAT CTCAATCTCA TCTGAAAAGT
AGATTAAATC ACAGCAACAG CATCGAATCA CTGGCACTAT TTCCACACAC TTCACGCCAT
TCCAGAACAC CATTGCCATA GACTCCACAA GCAGAGCACT ATCATCTTAA ATCTGGAAAA
TGATTAGTGA TCCCAGGGCG TCCAAGAGCC CAGGTGCCGA TCCTGGCCAG GTGAATGAGC
TTCTATCTGA CCATCTCATG GACGACATCT CCACAATACT AGATAATAAC CACAATAGCC
ATAATAACCA CACAGATCAT AACAACGACC AGAGCAATAG CCAGAGCAAT ATCAGTAACG
TCCAAGATAA TAAAACAAAG AATCCTGATT TCAAACACAT TGAAAACGAA TTGGGACCAC
TAGACAAAAG CCACCCGGTA ACCAAAATAG TTGTAGAAAA CTACGATGAA GAAGTAAATG
CAAACAGAAC TGTCAGCAAT TCTCACCTGC CTGTTCCGGA AGTCGACAAC GAGCGCAAGA
AGCCATTACT GCTTTCGCAA CAAGAACTCA GACGTAACTC TGCTCTCGTT CCTATTACTG
GAGACGGTGC AGTTGCCAGC TCATCTGTAG GTCATGAATC ATCGAAAATC TCAGCGTATG
CACGTTTGGA TTTTGACAAC TTCACTTTTT TCGTCCAGAC ATTGCAAGTA GTTCTTGGTA
GAAAGAGCAA CGACCAGCTA CTTCAAGGAA GTCACCATGC TGTGGATGTG CATTTGTCTT
CTAAAAAGGC AATTTCACGT AGACACGCAA AGATTTTCTA CAATTTCGGA ACTCAGAGAT
TCGAGATCCT GATTTTGGGA CGAAACGGAG CGTTTGTAGA TGACACTTTT GTAGAAAAAG
GCATAACAGT CCCGTTACAG GATGGAACAA AATTGCAGAT AGGCGACATT CCGTTTCTGT
TTGTTTTGCC ATCTATAGAA CCTAATGAGG AAGATGAGGC AAATTCTACA AGGAACAAAC
AGTTCAACCC AACTGATGCC ATCAATTTGA GATCCAACAT TTACAACTCT TCCAAGTCTC
CTACCCCTAA GAAATCACCA AAGAAGGATA AGATACGACA GGACATCGAT GAAAAGATTC
TAGAACCACC ACCAGCTCCA AAATTGCCTG TTAAGTCAGA GGCTGCCGAA AAGACAAAAA
CACAGAGAGC AATGTCGATT GTTGAAGATC CCCATGGCGA AAAACACCAG TCTACAAGCA
GAAGAAACTC TCTCTTGAAA ATACGAAGAT TGTCCAATGC TCGTAGGAAG TCTTTGGCAA
GTTCCGCAAA TGACGAAATC AACGATATCT TGAAAGAGTT GGGAGTAGCA TCTATAGATG
CGATCAACGA AGAAGAGTTA GATTCACAAT TACAGGATCT TCTTGAAGAA CATGAGAGAG
ACGATATAGG GATCGATAAT AGAAGTATGC TCAAGTTTTC CCAATACTCT GAATCGGCAA
TAGAAGACGA GGAAGACGAA ATCGACAAGT TGGTAAAGCA ACACAACTTG GAGCAAGGCG
TAGTCTTAGA TGATGATTCT CAGAACGAAG AAAACAGTAC CCATGATATC GACATGGACC
TCTCCGTTCT AGACCAGGAA ATCGCTTCGT TAGCACCTTT GATAGATGCA CACAATCAAG
AATTGATGAA GCAAAAGGAA GAGAAACGCA AAAAGCTTGA ACAGGAAAAA AAGAAGAAGC
TTCAGAAGAT CGACCTCCAG AGAACTTCTC CACTTATGGG AAAGCCAACT GCTCCCAGAA
TGGGCAAACC TGCCTCAATA CAACCTCCAG CTTCTAGAAT ATATGTCAAT GGTGTAGGAA
AGCTTCAGGA TAGTTCTATA CCAGCAAATA CTCTTGCATC AGATCTTTCC ACTCCTTTCG
CTACAGCACT AGTGGGTCCA CCGAGACCTC CACCTCCAGT ATTGGAGGCC CCAATCGGTA
TTATAACCGC CGAACCAGCT ACTATCAGAT CCCGTCCACC TCTTCGTGCT ATCACAGTCA
AACCAACAGC CAACTTAGCC AACTTCAGAG TTCCCAAAAC AAATGACGAG CCTTCCAAAT
TCCCTAAACA AAAGAGAAGA AGAGATGTCA GTAGGAAACC ACCTAAAAAA GTATATACTA
TGGATGAAAT TCCAGACCAG TTCAGATCCA AACCAAATGT CTCTTATCCA TTGATGATCA
TGACGGTTTT GAAGTCCGAG ATTGCTAAGA ACGGAATGAC CATCAACGAA ATAAACGAAG
CAATCAAGGA ATTCTATCCC TACTACAAGT ACTGTCCTGA CGGCTGGCAG TTTTCGATTT
CTCACAACGT CAAATTGACC AAGACTTTCA GACGTTCGGT GAAAAAGGGA TCTGAATGGG
TATACACTGT GGATGAATTG TATATCAACG AAAGAGAAAA GGTAAGAAAG AAGCAACAAG
AGTTAGCTGC TGCTAAGGCG AAGGCTGCTG CCATTAGAGC AGAGGAAATC AAAGAAAAAC
AACGTCTTGA GGCCCAAATG ACAACACAAG CAGTGCCTAC CAGACCTTAC ACTTCACCCT
ACGGAGTACC TATGGGTACT ACACTTCATA GTGGTTCGTC ATATTCTGCT CAACTTCCCT
ATAAGGCTGT AGGAGGAGTA TCTGGTACCG CTAATGGCGA CCAAAAACCA AAGACAATAG
CGGAGCTTGC TAGTGAGATT AGAAGAGATG GAAACACTTC TAAAACTCCA TTGTACTTCA
AACCTCAGGC TGCAAGTCCG CTGGATGGAT TCAAGCGTAC TGATGCCACT AGTTTCCCAC
CCAATACTAC AATAAAAGAA CAGTTAGCGG CCAACAGGTC TCCCTCCAAC TCTCAAACAT
CTTCAGCTAA TGCCACTGCC AGTCCAGCTC CAGCTCCAGC ATCGCAGACC GGAGTACAGC
AGAAACCTGG TATCGCTGGC ATGAACCAGG AAACAAAGAA GTCGTTGGCT TACTTACAGA
AGGAGTTGGT CAATTTGTAC AAAGCTAGAA AGCTTTCGTA CAATACAGCC ACCACCACTG
AGATTATAAC CAAGGCTCTA GCTACTACAA TTGCTCAGGT TAATGTCATT GGTGCAAAAG
CTGGTTATGG TGACAATGCC TTGAGTTTCT TGGTTGACAA AGCTCCGCAG CAAGTAAGTA
AAATTTTGGA TATTGCCTTG ACCAAGTCCA TAAAGGAAAA GCAAGGCATT CTCTCTTCTC
GTCCATCAAG CAGAGCAGGC ACCCCCGGCC CTTCAGCTAT GCCTTCGCCT AGTAGAATAG
AAAAGCCAGC CGTTCCAAGC ACTCAGCCTG TTCAACAGCC ACAACCAATT GCACATCCAA
CAGCTGTTAA AGTTGTCGTG CCACCTACGG TAGTGGATGC TACTAGAGAT GTAAAGCCTA
CTGCTGCTAT ACCAATTGCA GAACCCAGAC CACAGCCACC ACAGGCTGTT CCTGTAGCCA
CACCACCACC AATGGCAGTG ACACCAACAG TACCAGTCAC TACTACGGGA CCAATAGCGG
TAACAACACC ACCAGTATCA GCAACATCTA AGATTGTAGA GGCGGCATCT GCTTCGAAAC
CTACAGCTGC TACCCCACAA GCATCCCCTA TTGTTTCAGC TGCATTGCCA GGAGTGTCTC
CATCTCCTGG ACCAACTACA AGTCCATACA AACCCACAGC TCTAGGATCG GGCCTTGCCA
GGCCACCCAG CTACAAAACT GACTCACTTG GACGGCCTCC TAGTGCTTTA TCGAGACCAC
AGAGCTTTGG AAAGCCAGGA AGTGGACTCT CGAGACCTCC TACTTTCTTG TCTAACAAGC
CTGGATTGAG ATCTACCTTT GGGTCTGAAT CCAACTCTGG ACCCCAGCAT CTTGGAAATG
GCAATAATAG CAAGAGAGAA CACCCAGAAG ACAAAGAGGA CGAGAACGCG AAAAAGATTG
CAAGAACCGA ATGAAGCTAA ACATGTACTA TAGAAAGAGT GACTGTGGTA TGTACCACTA
ACATATAGAA TCTCTGTATA ATTAATGTAA ATTATTGTTA AT
 
Protein sequence
MISDPRASKS PGADPGQVNE LLSDHLMDDI STILDNNHNS HNNHTDHNND QSNSQSNISN 
VQDNKTKNPD FKHIENELGP LDKSHPVTKI VVENYDEEVN ANRTVSNSHS PVPEVDNERK
KPLSLSQQEL RRNSALVPIT GDGAVASSSV GHESSKISAY ARLDFDNFTF FVQTLQVVLG
RKSNDQLLQG SHHAVDVHLS SKKAISRRHA KIFYNFGTQR FEISILGRNG AFVDDTFVEK
GITVPLQDGT KLQIGDIPFS FVLPSIEPNE EDEANSTRNK QFNPTDAINL RSNIYNSSKS
PTPKKSPKKD KIRQDIDEKI LEPPPAPKLP VKSEAAEKTK TQRAMSIVED PHGEKHQSTS
RRNSLLKIRR LSNARRKSLA SSANDEINDI LKELGVASID AINEEELDSQ LQDLLEEHER
DDIGIDNRSM LKFSQYSESA IEDEEDEIDK LVKQHNLEQG VVLDDDSQNE ENSTHDIDMD
LSVLDQEIAS LAPLIDAHNQ ELMKQKEEKR KKLEQEKKKK LQKIDLQRTS PLMGKPTAPR
MGKPASIQPP ASRIYVNGVG KLQDSSIPAN TLASDLSTPF ATALVGPPRP PPPVLEAPIG
IITAEPATIR SRPPLRAITV KPTANLANFR VPKTNDEPSK FPKQKRRRDV SRKPPKKVYT
MDEIPDQFRS KPNVSYPLMI MTVLKSEIAK NGMTINEINE AIKEFYPYYK YCPDGWQFSI
SHNVKLTKTF RRSVKKGSEW VYTVDELYIN EREKVRKKQQ ELAAAKAKAA AIRAEEIKEK
QRLEAQMTTQ AVPTRPYTSP YGVPMGTTLH SGSSYSAQLP YKAVGGVSGT ANGDQKPKTI
AELASEIRRD GNTSKTPLYF KPQAASPSDG FKRTDATSFP PNTTIKEQLA ANRSPSNSQT
SSANATASPA PAPASQTGVQ QKPGIAGMNQ ETKKSLAYLQ KELVNLYKAR KLSYNTATTT
EIITKALATT IAQVNVIGAK AGYGDNALSF LVDKAPQQVS KILDIALTKS IKEKQGILSS
RPSSRAGTPG PSAMPSPSRI EKPAVPSTQP VQQPQPIAHP TAVKVVVPPT VVDATRDVKP
TAAIPIAEPR PQPPQAVPVA TPPPMAVTPT VPVTTTGPIA VTTPPVSATS KIVEAASASK
PTAATPQASP IVSAALPGVS PSPGPTTSPY KPTALGSGLA RPPSYKTDSL GRPPSALSRP
QSFGKPGSGL SRPPTFLSNK PGLRSTFGSE SNSGPQHLGN GNNSKREHPE DKEDENAKKI
ARTE