Gene PICST_58009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_58009 
SymbolHAP1.2 
ID4838313 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp381492 
End bp384710 
Gene Length3219 bp 
Protein Length1047 aa 
Translation table12 
GC content42% 
IMG OID640389628 
ProductFungal transcriptional regulatory protein 
Protein accessionXP_001383702 
Protein GI150864741 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.513512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0535661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGA AACGCCAGAG ACAGCGGAAC AGAGTTCCCG TGTCATGTCT CAACTGTAAG 
AAGCGTAAGG TCAAATGTGA CAAGGGCAAG CCGTCCTGTT CAGGGTGTAT CAAGAATGGT
GTGCCGCATC TTTGCGAGTA TTTGGAGCCA GTGTGGTCGA AAAAGAGTTC TCAGGTGAAG
GCAGAAGACG CAGAAGACTC ACATGATGCG AACGCATCTT TACTACAAGT CAAGATCGAA
GAGACGAGTG AGTTTAAACA GTTTCGAGCG CATACTGACA AAGTCATCCT TTCTCAAAGA
AAGGAGATCG ACGACTTGAA GCGACAGCTT TCGGTGCTCC AGCAGCTCTC GCCGAAGGTC
CACGATGCCA CGGCTATGGG CTGTAAGCCT ATCTTGATTT TGACGAAGTT GAACCTCTCT
CTCGTTAACA ACCGAGATCC ACTCACAATT CACCACGATC CAGCATACAG TGTAATAGGA
CGTACTAGTT CTAAAGTTAA CCACATAGAT ACGTACTCGT GGATCAACTT GATTAAATTG
GATCCACAGC TTACCACTCT CTGGTTCAAA ATCACAAATC TCCAGAAGAT CTACCACATG
TACAAAATGA ACATGTTGAA CAACACTTCT AGAAACAGCC CGGGAGCCTT CAGCTTACCA
AACCAGTCTG TCACATCCAA CCCTTTGTCG AAGAAATCAC CCTATAGAAT CAACGAAATC
GACTTCACCT ACAGTGTGGT CAAGTCTGAA GAGCCAAATA AATTAAGGTG TCCAGTTATC
GAGTGCGATT TCAACTTCAT GACAGAAGAC CAGATCACAC CAAGTCCAGT AGGTGGGATA
AGCTCTCCTG TTCCTCTGGC TCGATCCACT GATACCCCAA GGAAGTACAA CACTGTAGTA
CCAAATCAGT ATGGAACGGA ACAGGAGCAA TTCGCTTATC ATGACCTCGT ATCGGAGAAG
GGAAGAACGT TGTTACTTAA GGTTCAGAAC CTCTGGGATT CCTCGCTCAA CTTGGTGCGA
GGAAACGAAA AGATCAATTT CAAACAGCTC TATTTCCTCA TCGACTTCTA CTTCAACAAC
AAGGTGTACG ACATCGAATC GAGGCATATA CTCTCCTTCT ACAAGATAGA AATTCAGAGT
ATAATTAAGA AGAACGGGAA TGAGATATCC ATAAATATTG CCAATGATCC TAGTCTTAAA
TTGACTGACG AACAGCTTTT CGAGCGCCTT AAGATGAAGG GAGTCTACTT GTGTATGTTA
GCGTTGATAA TCGAGGAGTC GTTGGATACC TTGAGACTGA ATGTTAAAGT GGGGTTAGAA
GAAGATATTG GCCTCAAGTT CCGTTCATTG TTTCCTACGG AAGTAGTCTA TGTTGGGCAA
GGCTCCAAGT TCAGAAATAC TTTGTACATA GTCCAGGAGT TTGTGTTGCA CATAACCAAC
TTGAAGTTTT CAGAAACATC TTCTCCGTCT CTTTGCACCA TAGCTTGTTA TATAACATTG
CTTAATCGTG AAGTCGCCGA ATACAAAAAA GACGGTGCAA CCTCAGATCC AAAGCCAGGG
TTCACAAGCT TGTTCACTGT ATTGTTGAAA ACTATATTGA GTGATGAAGG CACAGTCGAA
TTATGGAAAG ACCCTGAGCT CGTCATCTTC AAGGAACAAG AAGCCAGAAA AAGAAACAGA
GACTTGAAAA TCCATATGTG CTATATATGG ACAGATCTTG TCAGATTGGC CAACTTGGTT
GGATTCAACT TTGTGCCCTT GATAAAACAC TCAGAAGCAA TTGACAACCT CTTGCAAAGA
CTCTATACAA AGATAGAAGA GGCAGATCTG CTTCTGTATC ATCTAAAGTA CATCACTTCC
CTCAATTCCC ATAAATTTGA TGAATTAACT ATAACACTTC ATCTCCACTA TTTGATTGCC
AGAATCTCAT CTGCTTTAGC TCATGGGATT TCGAAGGTAG GCGATCTCAA ATTGACTATC
GCCAATTTGG AATCATTGAT CAGACAATGC AGTACCTGGA TTGTAGATTT AGGATTACGG
AAATTGAGGC ATATACGTAG ATTTGAATGC GTTTCAATGT TAATGTATCT CAGATACTTC
ATGAAGTACA TTATATTATT ACAGGCAGAG GAAAGTATGG ACGAAGAGTT GGTAGCATCA
TCTGTTCCGG ATATTTTTAC CAAGTATCTT GAAATCATTG ATCTGTTGAG AAAAGAACTT
ATCAACGACC ATGATGGCAT GAACAAGCAG TACGTGTTAC TGGCGATAAC AGAACTTTTG
ACTAGACTTA TCCAGATTAT TGTTGCTTTG CTAATGAGAG TTTCGAATGA TGATAATATG
ATGTCACAAG AAACTGTATT GCGAATTCAG CTCAACAAGT ATTCGGCTTT CAATGGCGAC
AAGGCAATTT CTGATTACGG AATGAGCATG GAAGACTTAA TTAAGACACA TATAGTAGGC
GTTGTTGAGG ATTCAGTAGA CTTGCTTGCG AAGAGTCCCT TATTGGATAA GGATAAATCT
GGTAAACTCT CGAAGTTATG GAAGTTCTAT TTGACGTTTG TCCGTAATTC CAAGAGAATG
ACAAGCATCA ACTACGCCAA GATACATGCA AACATTCCGC AATTCCGAGG AATCGGGGCT
GCTGGTGATA TGAAGTCTTG CCCAGTTATA ACACCTAGGA GCTTCAAGAA CTCAACGCCA
CCAGCGATTA CATCAAAGGA ATATACGAAA TGTCCTATTT CGCATATCAC CACACCCATA
GACGAAGACA GTTCGCCTAT CGACTCCAGG CCGGGTAAAT GTCCTGTTAA CCACAGTATT
GTAACAACAG CTTCACCAGT GCCAGTGGAT GCTAAGAAAA GGAAGTGTCC ATTTGACCAT
ACAGCACTTG ATAGAAGTTC AATGTCTCAA GGGTATAATG CTATTGAAAG TAATATCCGC
GGTGTGATCA AGCGCCAGAG AGATAGTTCG GACTCGCCTT CGGATGTTGA AAAAAGTAGC
GGATCTACTC CGGTAGTAGA GAGGCCAGAA CCAGTTGTTG AGATGTCTAA CGTCTCCATT
TCTGAGACTC GTCCGGACAA CTTTCCTCCA CCCAATCTTG GTTTCGATTT GCAAGCATTC
AACGACTTTG ACTTTGACTT TTTGCAGAGT GCCGTACTCT TGGATCAGAT TGAGTTTGGA
AACAGCGATG CAGGCAACAT CGAGGGATTT TTTCAATAA
 
Protein sequence
MEQKRQRQRN RVPVSCLNCK KRKVKCDKGK PSCSGCIKNG VPHLCEYLEP VWSKKSSQVK 
AEDAEDSHDA NASLLQVKIE ETSEFKQFRA HTDKVILSQR KEIDDLKRQL SVLQQLSPKV
HDATAMGCKP ILILTKLNLS LVNNRDPLTI HHDPAYSVIG RTSSKVNHID TYSWINLIKL
DPQLTTLWFK ITNLQKIYHM YKMNMLNNTS RNSPGAFSLP NQSVTSNPLS KKSPYRINEI
DFTYSVVKSE EPNKLRCPVI ECDFNFMTED QITPSPVGGI SSPVPSARST DTPRKYNTVE
QFAYHDLVSE KGRTLLLKVQ NLWDSSLNLV RGNEKINFKQ LYFLIDFYFN NKVYDIESRH
ILSFYKIEIQ SIIKKNGNEI SINIANDPSL KLTDEQLFER LKMKGVYLCM LALIIEESLD
TLRSNVKVGL EEDIGLKFRS LFPTEVVYVG QGSKFRNTLY IVQEFVLHIT NLKFSETSSP
SLCTIACYIT LLNREVAEYK KDGATSDPKP GFTSLFTVLL KTILSDEGTV ELWKDPELVI
FKEQEARKRN RDLKIHMCYI WTDLVRLANL VGFNFVPLIK HSEAIDNLLQ RLYTKIEEAD
SLSYHLKYIT SLNSHKFDEL TITLHLHYLI ARISSALAHG ISKVGDLKLT IANLESLIRQ
CSTWIVDLGL RKLRHIRRFE CVSMLMYLRY FMKYIILLQA EESMDEELVA SSVPDIFTKY
LEIIDSLRKE LINDHDGMNK QYVLSAITEL LTRLIQIIVA LLMRVSNDDN MMSQETVLRI
QLNNMEDLIK THIVGVVEDS VDLLAKSPLL DKDKSGKLSK LWKFYLTFVR NSKRMTSINY
AKIHANIPQF RGIGAAGDMK SCPVITPRSF KNSTPPAITS KEYTKCPISH ITTPIDEDSS
PIDSRPGKCP VNHSIVTTAS PVPVDAKKRK CPFDHTALDR SSMSQGYNAI ESNIRGVIKR
QRDSSDSPSD VEKSSGSTPV VERPEPVVEM SNVSISETRP DNFPPPNLGF DLQAFNDFDF
DFLQSAVLLD QIEFGNSDAG NIEGFFQ