Gene Information |
Locus tag | PICST_37775 |
Symbol | HAP1.1 |
ID | 4851322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1494974 |
End bp | 1497787 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | |
GC content | 42% |
IMG OID | 640393030 |
Product | Fungal transcriptional regulatory protein |
Protein accession | XP_001387931 |
Protein GI | 126274363 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.709365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.204154 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGGCC GTGGTCCAAT CCCAAACCAG ATAAATCCTC CACAGCAAAT GGGCCAACCC CTTTACAATA CACCACCACA GGCACACAGT GCGAATATAG AAAATGTTCG CAAGCGTACT TCAACTTCAC TAATGGGTGC ATCTTCGCGG GCATCTGCCA CATATCCACG AAAAAGAGCT CTTACTGCGT GCGACACTTG TCGTTTGAAG AAGATCAAGT GTGACAATGT CAGGCCGCGA TGCGGGTCTT GTGTCAAGAA CGGCAACATG AACTGTCACT ACCGTACCGA TGATCAGCAG AAAGACTATT CAAGCTATGA TCCAGCGTCA TTAAACATCT TGACCAAGTT GGATGTGATT CTCCGCGACT TGCGCGATCT CAAAAATGTC AACGGACTTG AATCTTCGAC TCCAGAAGAA CTTGGCCCAG GCTCAGGTCC TGGCCCAGGT TCAGGTACTA CTGCGTCCGG CTCAGGATCG GGTTCCGCTT CAAAAAGAAG ACAATACGGT TCAGAACATC GAGAGTTTCA TTTCGACAAC TGCATCTGGG ACATGTCGAT AACCTCGATC TTGAGGTGGA AGTACTTCAT CAAATGCTTT GGTGATACAC CAGAAGAAAC CGATAGAGTA TCCAACAGTC TCATTAAAAT GTACAATCGG TCGATTGTCG CCGTTAATCG TAACGGAACT CTAGAATCAC GACTCCTCAG GACCAAATCT CTTGAGGGGT TGTTGAGCAA GAACTTTTCC AATATCGTCA ACTCATTCTT TGTAAATTGC CACTCCAAGA TCCCAATCTT GGACACCTTG GAGTTGTTCG AGTCCTTAGA GATCTACAAA TGCTTGACTT CTCATTATAA ACTGTTCAGT TTCATCCAGA TATTGGAGTC TTACGATCTG GAAAATCCAG AATCTGACCA GCTTCCACGA GTTGTGCTTG ATGCTCTTAG AGCAAACAAC TTGGAAGATA CGCCTTTCCG TCGTAGAGCA TTTAAGACGT TGTGTCTTTC TGTTCCCAAC ATCATAGTGA TTTGTGCTCT CGGAGTAGTT TCCACTCCAG TGCAATTGGA GAATTTGACT AAATTCGACA GCTCTATAGA AGAGAGAAAG TCCATCGCCA TAGGCTGCCT TTCAGACTCC AGTGCCTTCA ATGGCGTTCA AGACGTGAGA CGCGACAGAC TCGAAATCTC AATTCTCTTG ATCAGATATG CGGAGTTGTT ACGCACTGCA TTTCCCTTCA CCGTTGACCA GAGTTCCTTA AGAGCAGTCG AGTTCCATCT TCTTTTGAAC CAGTACTACT TATATACCAT GACTCCATTA TTGGCGTACA GACATATCTC AACCGCATGC CAGCATATGA TGTACTATAT CAACATGAGG AGGGGTGATC CAATTAATAC CGATAATGCC TTGGGTGCAT CGAAAAAGGA AATGATAGAT CGTCTCTTCT GGTCTTGCTT GAAGTTGGAG TGTGAGTTGA GAGTGGAACT ATCACCGTAT GTACCCGTAT CTGGCATCAC ACAACAAGTG CCTCCAACTT CTTTCCCCAA GATTCCTGAT CCTTTATCTG ATGAAATAAA ACTGAATCAC AGCGAAGCAT GCATTAAGCT CGCCAACAAA TATGAAGATG AATATTCCTG GTACTATTTT CTCACTGAAA TTGCAGTGCG CAAGGTGGAC AACAAGATGT TTGACGAAAT ATACTCATAT GAAAGCCGTT TGAGAAACCT TTGGGACCAG GATAGTTTTG CTAATGAATC TGCATGGATT ATCTTCATAA AGTATCTAAA TCAGTACAAC 
GGGATCATCA ATTCATTGAG TCCCCAAATT AGAAACTTTG TTCTCCAAGA AATTAACGTC GATCAAATCC ACAGACGTAT GAAGAAGAAG TATGAGAAAA AACAACTGAA TATTAGCAGT GATGCCGATG TGTTTGACAC ATTAGACGAT TTTTTGATAG ACGATGACCT CTTGATTCGA GCCCAATCCG AATCAATCAT GTTCATAAAG ACAAGAATTA TAACCTCCAA GTTATTGTTG TTCCGCCCTA TCATCTACTT GCTTTTAGAA GATAAAATCC CCATTACCGA ATTGATGGAA GCTGCAATTT CAGTCATGGG TGCACAAGCC AATATTACTT CTGTATCGAT GAACAATCTA AATGCAATGG AATCTCCCGA CTCAGCTGGC TCAGTTCCGA ATTCCTTTTC CGGTGAAACA AACCCCTCGG ACGCAGACTT GGAGATGGAC TACTTTAATT TGATTAATGC ACCCTTGTTT TACCAGAGAC AGTATCCGGA CGAAGATTTC TCTAACGTGA TAGAGTACAC CAACAAAGAT AAAAGTGACA AAGACGAAGA CTTTGATGAC GAAAACAGTT TTTGCTTGAA AAGTCTTCCT TTGGCTCGAT CTCGGATCTT GAGGATCTTT TTGCAGAACT TGATTTCTTT GCCCAAAATG AATATTCCAA AATTGGGAGC ACATAGACAC CCTGGCCTGT GGTACTATTT GAGAAATCTT TTCATTGGTA ATGTTTTCCA GTTCTTATTG TACAATAAGT TGCAAGAGAT GTTACAAGTG GCAACTGCAG ACGAAGGAAT GAGAGCATTC CTTTCGCAAG TGCCCGAGAT TTCTTCGATG AACGATGTTA TGGATATGTT CAACGTTGTA ATAAACAAAA ATGACATAAT TGCTGGATTC GAACATTCGT TGATTTTATT TGATTACTGG AAGGAAGAAA TGAGCGATTG TGAAATTTAT CTGGATTATA TCAAGAGGTG TATTGAAAAG CTAGAACAAG GTACCAAAAA TTAA
|
Protein sequence | MPGRGPIPNQ INPPQQMGQP LYNTPPQAHS ANIENVRKRT STSLMGASSR ASATYPRKRA LTACDTCRLK KIKCDNVRPR CGSCVKNGNM NCHYRTDDQQ KDYSSYDPAS LNILTKLDVI LRDLRDLKNV NGLESSTPEE LGPGSGPGPG SGTTASGSGS GSASKRRQYG SEHREFHFDN CIWDMSITSI LRWKYFIKCF GDTPEETDRV SNSLIKMYNR SIVAVNRNGT LESRLLRTKS LEGLLSKNFS NIVNSFFVNC HSKIPILDTL ELFESLEIYK CLTSHYKLFS FIQILESYDL ENPESDQLPR VVLDALRANN LEDTPFRRRA FKTLCLSVPN IIVICALGVV STPVQLENLT KFDSSIEERK SIAIGCLSDS SAFNGVQDVR RDRLEISILL IRYAELLRTA FPFTVDQSSL RAVEFHLLLN QYYLYTMTPL LAYRHISTAC QHMMYYINMR RGDPINTDNA LGASKKEMID RLFWSCLKLE CELRVELSPY VPVSGITQQV PPTSFPKIPD PLSDEIKLNH SEACIKLANK YEDEYSWYYF LTEIAVRKVD NKMFDEIYSY ESRLRNLWDQ DSFANESAWI IFIKYLNQYN GIINSLSPQI RNFVLQEINV DQIHRRMKKK YEKKQLNISS DADVFDTLDD FLIDDDLLIR AQSESIMFIK TRIITSKLLL FRPIIYLLLE DKIPITELME AAISVMGAQA NITSVSMNNL NAMESPDSAG SVPNSFSGET NPSDADLEMD YFNLINAPLF YQRQYPDEDF SNVIEYTNKD KSDKDEDFDD ENSFCLKSLP LARSRILRIF LQNLISLPKM NIPKLGAHRH PGLWYYLRNL FIGNVFQFLL YNKLQEMLQV ATADEGMRAF LSQVPEISSM NDVMDMFNVV INKNDIIAGF EHSLILFDYW KEEMSDCEIY LDYIKRCIEK LEQGTKN
|
| |
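The coordinate, length, and GC fields in this record are related by simple arithmetic, which can be checked with a minimal sketch like the one below. It assumes 1-based inclusive coordinates and an unspliced CDS whose nucleotide length includes the stop codon (the record lists "Is gene spliced | No" and the gene sequence ends in TAA). The function names are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(1494974, 1497787)
print(length)                           # 2814, matches "Gene Length"
print(expected_protein_length(length))  # 937, matches "Protein Length"
```

Passing the full "Gene sequence" field to `gc_percent` gives a value that can be compared with the listed GC content; the space-separated 10-base blocks do not need to be joined first.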