Gene Information |
Locus tag | PICST_56968 |
Symbol | HAP4 |
ID | 4838340 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 363956 |
End bp | 367900 |
Gene Length | 3945 bp |
Protein Length | 1085 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389655 |
Product | positive regulator of cytochrome C genes CYC1 and CYC7 |
Protein accession | XP_001383699 |
Protein GI | 150864739 |
COG category | |
COG ID | |
TIGRFAM ID | |
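The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the reported Gene Length), and that the coding region includes one stop codon. The helper names `span_length` and `coding_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 363956
END_BP = 367900
REPORTED_GENE_LENGTH = 3945   # bp
PROTEIN_LENGTH_AA = 1085      # aa


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

# Difference between the genomic span and the coding length.
# For a gene marked "Is gene spliced: Yes", a positive remainder is
# what one would expect if the span also contains intron sequence.
remainder = gene_len - cds_len

print(f"span length:   {gene_len} bp (reported: {REPORTED_GENE_LENGTH} bp)")
print(f"coding length: {cds_len} bp")
print(f"remainder:     {remainder} bp")
```

Under these assumptions the span length agrees with the reported 3945 bp, and the coding length implied by 1085 aa is shorter than the span, which is consistent with the record flagging the gene as spliced.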
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.489386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TACCGCAAAC GACATAGAAT CCCAGCTTCG TGCTCCATCT GTCGCAAACG GAAGTCCAAA TGTGACCGAG TCAGACCCGT CTGCGGAACA TGCAAAAAAA AATCCATAGC CCACCTCTGC TACTACGAGC TGGATAAAGA CAACGTAGAC GACAGCGGGC TGCTAAACAG CCATGTACAC CATCTACCAG AACAGCCCCC AGGCCCACCT CAACATCTCC AACATCCTCC CCCACAATTC TTCCCCCCTC CTCCACATTT CCAGCCAGGA GTGCAACCTC AACTTCAACC ACCGCTCCAC CAAAACCAGA TGGACCAACT CCACCAACAC CCCCATAATC CACAACAACC TATTCCACAA CAGCATATTC AACCACAGCA TATTACAGCA CAGCAACCTG CTCCGTCGCA GCCCATACCC CAGCAGTCTA TCCAATCACA GCAGCTTCCC CACCTTCAAG CTAACCAAGT GGTAGACTAC CAGGGCTATC AGGGAAACGT TAACCTCAAA CAGTTGCCCA TCCCCCTTTC GAATCCAGCG TATCCTGGAT TTCAGGGCCA GGAGCAGGAC TCCAACAGTG CGAATAGTGA CGTTCATGCC AATGTGAACA CTGCGAATGG AGGCACTAGT AACTACGAAT ATCGTCAACC TCGACAGATT CCTGTTCCTC CATATGTAGC GTCGCCAACT CAGGTACGTC CTGGAATTCC GCCACTACCT GACAATAATG CCAATAATAG CATAACACCA GACAGTGAAA ACTTGCCTAA TATCACCAAT GCGAATATAG GTCCTCCAAA TGCTGATTTG GCGAACAGTA ATGCTGCTAA TAGTAACGCC CCCATAGCTC ATGTGGCTGC TTCTAATATC GGAGCTGATG GCATGAGTTC TGGTGTTCTA GCCATGAACA ATACGATGAG CTCTATCAAT GCCAAGGGCT CCACCAGCTC CACCATGAAC AATAATAACA GCGTCTCTCA CAAAGACAGT ATTGATAACA CCAGCAAGAA CTCCAATAAC GACAACCCTG CAGCTGCTGT TGGTGCTATT AGCCCAGACG CCAATACTAG TAAGAATGCC TACAACATCT CTGCTGTTCC AGCAACTGTT TCTAGTGCCA ATTCGGTTTC TGCATTTGCT TATTCTCCAA CAGAAAGCAA TATTTCTATG AGCTCAGCCA ACCCCCCACT GACTCCTGGA AAATTGCCAT CATCTTTTTC TAATAATAGA CTCGTTTCGA TTCCCTTGGG ACCCAACTCG TCTCTACAGG TTAATCCTGA CGACACCATG AAGGTTTTTT CCAATGCAAG TTACGCACTT AATCTTGAAG GGCCTCTTTG GCAATACCAA GGAGTGTTGT CCTACATTGG TTTAACCAAG AGTGATCCTT TCATCAAGAT CATGCGGAAC TACGCCATCT TACTCTTCAA GTCTGGTGAA ATGGGCAAGT TCATGAAGCT TGAAGCACCT AAAACGAAAA AGAGATCAGC TCCAGGAAGT GCTACTTCTA GTATCCCTGA AACTGGAAGC TTAATCAGCA ATAATGGAAC AGAAGAGAAC ACTTCGCCTG GTGCCAAGAA ACTGCGAATT GAATCATCTT TGGCTTCAGA TTCCATTCAT GAATCTCCCG GTTCATATAT GTCTGAATGT GATAGCAAAA AAGATGTGGA TGTAGACCAT GACAACATCC TCGAGGACGA CGGCTTGATA ACTACTAGAA TAGATGTAGT AGAACAGAAC AGCACAACTG ACAAGGAAGA GGAATCCGAA CTTGAATCAC GCACTGAAAT TAAGACAACT 
GATCCAGCTA TCAAAAAAGA ACCAGAAAAG AAAACGCACA AGCGTAAGAC AAATCCTCAT GCCATTCCGA ACATTTTGCC TGGATTAAAA TCGCTTTATT CGGGTAAGAA AAATAGAAAG GAGTACTATG AATTGGTAGA AAAAGCCATC ATAGGTGTTT TCCCCAGCAA GTTGAATATG TTCATGCTCT TTTGTCGGTT TTTTAAGTAC GTCCATCCAT TTGCTCCCAT TATTGACGAG AATTCTTTAA TGATGGATAT CACAACATTA TTGCAACAGT ACCCTTCTTT CAACCACGGG TTTTACGACC AAGTCATCAT TGAATCTGAC CACGACTTAA CGGTGTTGGG TATTTTCCTA CTTGTTTTAC GCTTGGGGTA CATGTCCTTG ATTCACAATG ATCCAGTAAA CAATCTGTAC ACCAAAGAGG AACAGGGTAT GGTCCGTGAT ATGAGGCGTA TCAGCACAGC ATCTTATCTC AATGTTGTGG ACCTATGTTT GGCAGACGAC CTTATCTGTA CTAAATCTAC TTTCAGAAAA GTTCAAAGTT TGACTCTTCT TTATTTCTTC CGTAAAATGT CTCCAGACGA TTGTCACGGT ATTGGTGGAA CTGACTCTAA CATTTTGTTG GGAGTTGCAA TAACCCATGC ATTTTCTATC GGACTCAACC GGGATCCAAC ATGCTACGGC TCTCAGGATT TAATTAGTAA AAGAGAGCCA TTAATTAGAA TCTGGAGGTC TCTTTGGAAT TATTTAATTA CTTCCGATGT TACATCAGCA ATTCACTCGA GTACGCCGTT GAAGGTAGCT TCAACAGACG TTTTTGACGT CAAGTTGCCT CTGTATAGTG AAGATTCAAC CGGAACCAAG AACGATACTA TCAACAAGCT ACATACGATT GTTGAGGGCT ATAGAAACAT TATTAAAAAG ATGAACAACA TTCAAGATAA ACCAAAAGTT ATTGACATTT TGATGGAAAC AAACAACTTG GAAAAGATTT TCTTCAGTTT CTTTGGCAAG GATTTCTTCA AGGATTGCAT TTGCAAGCCG GCTGCTGTGC CACTGAATGG TACCAGTCAT GATATCAATT CCCTCGCACA CCAGGAAAGT TACATGAAAG TCATCAAATA CTGTTTGTTT ATTCAGTTAA GGACAGATCT TTCGTGCATG TATTACATGA TTGCCATCCA TTATGAAAAC GAATACAATG AATCGCAAAC ACCATCAATG AATGCTGGCA TAGAGCTTTT TAAGATCTAT ATAAAGAGTG TCGTCCAGTT GGTTTACATC ATGTCTTATG TCTTGGACAA TTCTGTCGAA TTGTTCGGCA AGAACTATGA TTACGTTTTA ACCTCCAACA ACGAAAGATG TATCATAAAG ACGCATGCTT TCTTGACATC ATTCTTTGTT AGATTGTTGC ATCACAAAAT CGATCTTACC AAGAAGGTTG CATCTGATCC AACTCTCCAG CCAAGATTAG AAATTTTGGA CACATTATTC ATGATGGTGT TGATAGAAGC CGAATTTTTC GTTGGAAACT TCAGAAAGTT GAGTAAGAAT TACATTAATT CATACAAGTT GTATGTGATG ACGTATTTTG TTTTGAAGCA ATGTATGGAT AACCCAATGG CTTTCTTCGA AGCCACGATG TACAATCCAA AGTTTTTCCA CGAAGGTACC AATATGCTTG AATTTTTTAC CAATGGTGAG TTGACTTATT TGTGTAAATT GTGTCAAGAA TTCAGAGGTG CTAAAGAAGA ACAACTGAAA AAGAAAACCT CTACCCGTCC ACCTACTAAT CCTTCCAAGG 
AGCATTTCAA GTATCCAGCA CCACTAAATA ATGTGGATGT GCCCATCTCG TTTGACCCTT CCAAGTTTTT GGCCAACAAT TCTAGGATAT CTAATCCCAT TAATGCCGGG TCAGGTAGAG TGTTTGCAAT GGGAGCATCA CCCTCAATAC CAAGGTCGGC AATGTCTAGT AATGCGCTGG CGATTAGTTC CAATTTGCAG CAGTCGCCAG GAGACCTTAG TTCTTTTTCA CATCTGACAC CAGAGTTTGG CTATTTCAGC AACGGGCACA TTAGTACGGA AGACCTCTTG AAGTTGTTCG AAATGTATGG AGACTTGGAC CATTTTGATG TCTAG
|
Protein sequence | YRKRHRIPAS CSICRKRKSK CDRVRPVCGT CKKKSIAHLC YYESDKDNVD DSGSLNSHVH HLPEQPPGPP QHLQHPPPQF FPPPPHFQPG VQPQLQPPLH QNQMDQLHQH PHNPQQPIPQ QHIQPQHITA QQPAPSQPIP QQSIQSQQLP HLQANQVVDY QGYQGNVNLK HPDANTSKNA YNISAVPATV SSANSVSAFA YSPTESNISM SSANPPSTPG KLPSSFSNNR LVSIPLGPNS SLQVNPDDTM KVFSNASYAL NLEGPLWQYQ GVLSYIGLTK SDPFIKIMRN YAILLFKSGE MGKFMKLEAP KTKKRSAPGK NTSPGAKKSR IESSLASDSI HESPGSYMSE CDSKKDVDVD HDNILEDDGL ITTRIDVVEQ NSTTDKEEES ELESRTEIKT TDPAIKKEPE KKTHKRKTNP HAIPNILPGL KSLYSGKKNR KEYYELVEKA IIGVFPSKLN MFMLFCRFFK YVHPFAPIID ENSLMMDITT LLQQYPSFNH GFYDQVIIES DHDLTVLGIF LLVLRLGYMS LIHNDPVNNS YTKEEQGMVR DMRRISTASY LNVVDLCLAD DLICTKSTFR KVQSLTLLYF FRKMSPDDCH GIGGTDSNIL LGVAITHAFS IGLNRDPTCY GSQDLISKRE PLIRIWRSLW NYLITSDVTS AIHSSTPLKV ASTDVFDVKL PSYSEDSTGT KNDTINKLHT IVEGYRNIIK KMNNIQDKPK VIDILMETNN LEKIFFSFFG KDFFKDCICK PAAVPSNGTS HDINSLAHQE SYMKVIKYCL FIQLRTDLSC MYYMIAIHYE NEYNESQTPS MNAGIELFKI YIKSVVQLVY IMSYVLDNSV ELFGKNYDYV LTSNNERCII KTHAFLTSFF VRLLHHKIDL TKKVASDPTL QPRLEILDTL FMMVLIEAEF FVGNFRKLSK NYINSYKLYV MTYFVLKQCM DNPMAFFEAT MYNPKFFHEG TNMLEFFTNG ELTYLCKLCQ EFRGAKEEQS KKKTSTRPPT NPSKEHFKYP APLNNVDVPI SFDPSKSAMS SNASAISSNL QQSPGDLSSF SHSTPEFGYF SNGHISTEDL LKLFEMYGDL DHFDV
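The record lists translation table 12. In the NCBI genetic code tables this is the Alternative Yeast Nuclear Code, which differs from the standard code in reading CTG as serine rather than leucine. The following is a minimal sketch that translates two short stretches copied from the gene sequence above and compares the result under both tables. The function and variable names are illustrative, and the reading frame is assumed to start at the first base of each stretch.

```python
# Codon tables are built in the conventional TCAG order:
# index = 16 * first_base + 4 * second_base + third_base.
BASES = "TCAG"
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"   # T-- codons
    "LLLLPPPPHHQQRRRR"   # C-- codons
    "IIIMTTTTNNKKSSRR"   # A-- codons
    "VVVVAAAADDEEGGGG"   # G-- codons
)
assert len(STANDARD_AAS) == 64


def build_table(amino_acids: str) -> dict:
    """Map each of the 64 codons to a one-letter amino acid ('*' = stop)."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


TABLE_1 = build_table(STANDARD_AAS)
# Table 12: same as the standard code except CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence as shown in the record.
head = "TACCGCAAAC GACATAGAAT CCCAGCTTCG"
# Bases 121-150 of the displayed sequence, which contain a CTG codon.
ctg_stretch = "TACTACGAGC TGGATAAAGA CAACGTAGAC"

print(translate(head, TABLE_12))          # start of the listed protein
print(translate(ctg_stretch, TABLE_1))    # standard code: CTG read as L
print(translate(ctg_stretch, TABLE_12))   # table 12: CTG read as S
```

The table 12 translation of the second stretch gives a serine at the CTG codon, matching the `YYESDKDNVD` block in the protein sequence, whereas the standard code would give a leucine there.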
|