Gene PICST_56968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_56968 
SymbolHAP4 
ID4838340 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp363956 
End bp367900 
Gene Length3945 bp 
Protein Length1085 aa 
Translation table12 
GC content42% 
IMG OID640389655 
Productpositive regulator of cytochrome C genes CYC1 and CYC7 
Protein accessionXP_001383699 
Protein GI150864739 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.489386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TACCGCAAAC GACATAGAAT CCCAGCTTCG TGCTCCATCT GTCGCAAACG GAAGTCCAAA 
TGTGACCGAG TCAGACCCGT CTGCGGAACA TGCAAAAAAA AATCCATAGC CCACCTCTGC
TACTACGAGC TGGATAAAGA CAACGTAGAC GACAGCGGGC TGCTAAACAG CCATGTACAC
CATCTACCAG AACAGCCCCC AGGCCCACCT CAACATCTCC AACATCCTCC CCCACAATTC
TTCCCCCCTC CTCCACATTT CCAGCCAGGA GTGCAACCTC AACTTCAACC ACCGCTCCAC
CAAAACCAGA TGGACCAACT CCACCAACAC CCCCATAATC CACAACAACC TATTCCACAA
CAGCATATTC AACCACAGCA TATTACAGCA CAGCAACCTG CTCCGTCGCA GCCCATACCC
CAGCAGTCTA TCCAATCACA GCAGCTTCCC CACCTTCAAG CTAACCAAGT GGTAGACTAC
CAGGGCTATC AGGGAAACGT TAACCTCAAA CAGTTGCCCA TCCCCCTTTC GAATCCAGCG
TATCCTGGAT TTCAGGGCCA GGAGCAGGAC TCCAACAGTG CGAATAGTGA CGTTCATGCC
AATGTGAACA CTGCGAATGG AGGCACTAGT AACTACGAAT ATCGTCAACC TCGACAGATT
CCTGTTCCTC CATATGTAGC GTCGCCAACT CAGGTACGTC CTGGAATTCC GCCACTACCT
GACAATAATG CCAATAATAG CATAACACCA GACAGTGAAA ACTTGCCTAA TATCACCAAT
GCGAATATAG GTCCTCCAAA TGCTGATTTG GCGAACAGTA ATGCTGCTAA TAGTAACGCC
CCCATAGCTC ATGTGGCTGC TTCTAATATC GGAGCTGATG GCATGAGTTC TGGTGTTCTA
GCCATGAACA ATACGATGAG CTCTATCAAT GCCAAGGGCT CCACCAGCTC CACCATGAAC
AATAATAACA GCGTCTCTCA CAAAGACAGT ATTGATAACA CCAGCAAGAA CTCCAATAAC
GACAACCCTG CAGCTGCTGT TGGTGCTATT AGCCCAGACG CCAATACTAG TAAGAATGCC
TACAACATCT CTGCTGTTCC AGCAACTGTT TCTAGTGCCA ATTCGGTTTC TGCATTTGCT
TATTCTCCAA CAGAAAGCAA TATTTCTATG AGCTCAGCCA ACCCCCCACT GACTCCTGGA
AAATTGCCAT CATCTTTTTC TAATAATAGA CTCGTTTCGA TTCCCTTGGG ACCCAACTCG
TCTCTACAGG TTAATCCTGA CGACACCATG AAGGTTTTTT CCAATGCAAG TTACGCACTT
AATCTTGAAG GGCCTCTTTG GCAATACCAA GGAGTGTTGT CCTACATTGG TTTAACCAAG
AGTGATCCTT TCATCAAGAT CATGCGGAAC TACGCCATCT TACTCTTCAA GTCTGGTGAA
ATGGGCAAGT TCATGAAGCT TGAAGCACCT AAAACGAAAA AGAGATCAGC TCCAGGAAGT
GCTACTTCTA GTATCCCTGA AACTGGAAGC TTAATCAGCA ATAATGGAAC AGAAGAGAAC
ACTTCGCCTG GTGCCAAGAA ACTGCGAATT GAATCATCTT TGGCTTCAGA TTCCATTCAT
GAATCTCCCG GTTCATATAT GTCTGAATGT GATAGCAAAA AAGATGTGGA TGTAGACCAT
GACAACATCC TCGAGGACGA CGGCTTGATA ACTACTAGAA TAGATGTAGT AGAACAGAAC
AGCACAACTG ACAAGGAAGA GGAATCCGAA CTTGAATCAC GCACTGAAAT TAAGACAACT
GATCCAGCTA TCAAAAAAGA ACCAGAAAAG AAAACGCACA AGCGTAAGAC AAATCCTCAT
GCCATTCCGA ACATTTTGCC TGGATTAAAA TCGCTTTATT CGGGTAAGAA AAATAGAAAG
GAGTACTATG AATTGGTAGA AAAAGCCATC ATAGGTGTTT TCCCCAGCAA GTTGAATATG
TTCATGCTCT TTTGTCGGTT TTTTAAGTAC GTCCATCCAT TTGCTCCCAT TATTGACGAG
AATTCTTTAA TGATGGATAT CACAACATTA TTGCAACAGT ACCCTTCTTT CAACCACGGG
TTTTACGACC AAGTCATCAT TGAATCTGAC CACGACTTAA CGGTGTTGGG TATTTTCCTA
CTTGTTTTAC GCTTGGGGTA CATGTCCTTG ATTCACAATG ATCCAGTAAA CAATCTGTAC
ACCAAAGAGG AACAGGGTAT GGTCCGTGAT ATGAGGCGTA TCAGCACAGC ATCTTATCTC
AATGTTGTGG ACCTATGTTT GGCAGACGAC CTTATCTGTA CTAAATCTAC TTTCAGAAAA
GTTCAAAGTT TGACTCTTCT TTATTTCTTC CGTAAAATGT CTCCAGACGA TTGTCACGGT
ATTGGTGGAA CTGACTCTAA CATTTTGTTG GGAGTTGCAA TAACCCATGC ATTTTCTATC
GGACTCAACC GGGATCCAAC ATGCTACGGC TCTCAGGATT TAATTAGTAA AAGAGAGCCA
TTAATTAGAA TCTGGAGGTC TCTTTGGAAT TATTTAATTA CTTCCGATGT TACATCAGCA
ATTCACTCGA GTACGCCGTT GAAGGTAGCT TCAACAGACG TTTTTGACGT CAAGTTGCCT
CTGTATAGTG AAGATTCAAC CGGAACCAAG AACGATACTA TCAACAAGCT ACATACGATT
GTTGAGGGCT ATAGAAACAT TATTAAAAAG ATGAACAACA TTCAAGATAA ACCAAAAGTT
ATTGACATTT TGATGGAAAC AAACAACTTG GAAAAGATTT TCTTCAGTTT CTTTGGCAAG
GATTTCTTCA AGGATTGCAT TTGCAAGCCG GCTGCTGTGC CACTGAATGG TACCAGTCAT
GATATCAATT CCCTCGCACA CCAGGAAAGT TACATGAAAG TCATCAAATA CTGTTTGTTT
ATTCAGTTAA GGACAGATCT TTCGTGCATG TATTACATGA TTGCCATCCA TTATGAAAAC
GAATACAATG AATCGCAAAC ACCATCAATG AATGCTGGCA TAGAGCTTTT TAAGATCTAT
ATAAAGAGTG TCGTCCAGTT GGTTTACATC ATGTCTTATG TCTTGGACAA TTCTGTCGAA
TTGTTCGGCA AGAACTATGA TTACGTTTTA ACCTCCAACA ACGAAAGATG TATCATAAAG
ACGCATGCTT TCTTGACATC ATTCTTTGTT AGATTGTTGC ATCACAAAAT CGATCTTACC
AAGAAGGTTG CATCTGATCC AACTCTCCAG CCAAGATTAG AAATTTTGGA CACATTATTC
ATGATGGTGT TGATAGAAGC CGAATTTTTC GTTGGAAACT TCAGAAAGTT GAGTAAGAAT
TACATTAATT CATACAAGTT GTATGTGATG ACGTATTTTG TTTTGAAGCA ATGTATGGAT
AACCCAATGG CTTTCTTCGA AGCCACGATG TACAATCCAA AGTTTTTCCA CGAAGGTACC
AATATGCTTG AATTTTTTAC CAATGGTGAG TTGACTTATT TGTGTAAATT GTGTCAAGAA
TTCAGAGGTG CTAAAGAAGA ACAACTGAAA AAGAAAACCT CTACCCGTCC ACCTACTAAT
CCTTCCAAGG AGCATTTCAA GTATCCAGCA CCACTAAATA ATGTGGATGT GCCCATCTCG
TTTGACCCTT CCAAGTTTTT GGCCAACAAT TCTAGGATAT CTAATCCCAT TAATGCCGGG
TCAGGTAGAG TGTTTGCAAT GGGAGCATCA CCCTCAATAC CAAGGTCGGC AATGTCTAGT
AATGCGCTGG CGATTAGTTC CAATTTGCAG CAGTCGCCAG GAGACCTTAG TTCTTTTTCA
CATCTGACAC CAGAGTTTGG CTATTTCAGC AACGGGCACA TTAGTACGGA AGACCTCTTG
AAGTTGTTCG AAATGTATGG AGACTTGGAC CATTTTGATG TCTAG
 
Protein sequence
YRKRHRIPAS CSICRKRKSK CDRVRPVCGT CKKKSIAHLC YYESDKDNVD DSGSLNSHVH 
HLPEQPPGPP QHLQHPPPQF FPPPPHFQPG VQPQLQPPLH QNQMDQLHQH PHNPQQPIPQ
QHIQPQHITA QQPAPSQPIP QQSIQSQQLP HLQANQVVDY QGYQGNVNLK HPDANTSKNA
YNISAVPATV SSANSVSAFA YSPTESNISM SSANPPSTPG KLPSSFSNNR LVSIPLGPNS
SLQVNPDDTM KVFSNASYAL NLEGPLWQYQ GVLSYIGLTK SDPFIKIMRN YAILLFKSGE
MGKFMKLEAP KTKKRSAPGK NTSPGAKKSR IESSLASDSI HESPGSYMSE CDSKKDVDVD
HDNILEDDGL ITTRIDVVEQ NSTTDKEEES ELESRTEIKT TDPAIKKEPE KKTHKRKTNP
HAIPNILPGL KSLYSGKKNR KEYYELVEKA IIGVFPSKLN MFMLFCRFFK YVHPFAPIID
ENSLMMDITT LLQQYPSFNH GFYDQVIIES DHDLTVLGIF LLVLRLGYMS LIHNDPVNNS
YTKEEQGMVR DMRRISTASY LNVVDLCLAD DLICTKSTFR KVQSLTLLYF FRKMSPDDCH
GIGGTDSNIL LGVAITHAFS IGLNRDPTCY GSQDLISKRE PLIRIWRSLW NYLITSDVTS
AIHSSTPLKV ASTDVFDVKL PSYSEDSTGT KNDTINKLHT IVEGYRNIIK KMNNIQDKPK
VIDILMETNN LEKIFFSFFG KDFFKDCICK PAAVPSNGTS HDINSLAHQE SYMKVIKYCL
FIQLRTDLSC MYYMIAIHYE NEYNESQTPS MNAGIELFKI YIKSVVQLVY IMSYVLDNSV
ELFGKNYDYV LTSNNERCII KTHAFLTSFF VRLLHHKIDL TKKVASDPTL QPRLEILDTL
FMMVLIEAEF FVGNFRKLSK NYINSYKLYV MTYFVLKQCM DNPMAFFEAT MYNPKFFHEG
TNMLEFFTNG ELTYLCKLCQ EFRGAKEEQS KKKTSTRPPT NPSKEHFKYP APLNNVDVPI
SFDPSKSAMS SNASAISSNL QQSPGDLSSF SHSTPEFGYF SNGHISTEDL LKLFEMYGDL
DHFDV