Gene PICST_37775 details

Gene Information

Locus tag: PICST_37775
Symbol: HAP1.1
ID: 4851322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1494974
End bp: 1497787
Gene Length: 2814 bp
Protein Length: 937 aa
Translation table:
GC content: 42%
IMG OID: 640393030
Product: Fungal transcriptional regulatory protein
Protein accession: XP_001387931
Protein GI: 126274363
COG category:
COG ID:
TIGRFAM ID:
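
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. Neither assumption is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1494974
end_bp = 1497787
gene_length_bp = 2814
protein_length_aa = 937


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Report whether the computed values agree with the listed fields.
print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions, both checks agree with the listed values of 2814 bp and 937 aa.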


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.709365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.204154
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGGCC GTGGTCCAAT CCCAAACCAG ATAAATCCTC CACAGCAAAT GGGCCAACCC 
CTTTACAATA CACCACCACA GGCACACAGT GCGAATATAG AAAATGTTCG CAAGCGTACT
TCAACTTCAC TAATGGGTGC ATCTTCGCGG GCATCTGCCA CATATCCACG AAAAAGAGCT
CTTACTGCGT GCGACACTTG TCGTTTGAAG AAGATCAAGT GTGACAATGT CAGGCCGCGA
TGCGGGTCTT GTGTCAAGAA CGGCAACATG AACTGTCACT ACCGTACCGA TGATCAGCAG
AAAGACTATT CAAGCTATGA TCCAGCGTCA TTAAACATCT TGACCAAGTT GGATGTGATT
CTCCGCGACT TGCGCGATCT CAAAAATGTC AACGGACTTG AATCTTCGAC TCCAGAAGAA
CTTGGCCCAG GCTCAGGTCC TGGCCCAGGT TCAGGTACTA CTGCGTCCGG CTCAGGATCG
GGTTCCGCTT CAAAAAGAAG ACAATACGGT TCAGAACATC GAGAGTTTCA TTTCGACAAC
TGCATCTGGG ACATGTCGAT AACCTCGATC TTGAGGTGGA AGTACTTCAT CAAATGCTTT
GGTGATACAC CAGAAGAAAC CGATAGAGTA TCCAACAGTC TCATTAAAAT GTACAATCGG
TCGATTGTCG CCGTTAATCG TAACGGAACT CTAGAATCAC GACTCCTCAG GACCAAATCT
CTTGAGGGGT TGTTGAGCAA GAACTTTTCC AATATCGTCA ACTCATTCTT TGTAAATTGC
CACTCCAAGA TCCCAATCTT GGACACCTTG GAGTTGTTCG AGTCCTTAGA GATCTACAAA
TGCTTGACTT CTCATTATAA ACTGTTCAGT TTCATCCAGA TATTGGAGTC TTACGATCTG
GAAAATCCAG AATCTGACCA GCTTCCACGA GTTGTGCTTG ATGCTCTTAG AGCAAACAAC
TTGGAAGATA CGCCTTTCCG TCGTAGAGCA TTTAAGACGT TGTGTCTTTC TGTTCCCAAC
ATCATAGTGA TTTGTGCTCT CGGAGTAGTT TCCACTCCAG TGCAATTGGA GAATTTGACT
AAATTCGACA GCTCTATAGA AGAGAGAAAG TCCATCGCCA TAGGCTGCCT TTCAGACTCC
AGTGCCTTCA ATGGCGTTCA AGACGTGAGA CGCGACAGAC TCGAAATCTC AATTCTCTTG
ATCAGATATG CGGAGTTGTT ACGCACTGCA TTTCCCTTCA CCGTTGACCA GAGTTCCTTA
AGAGCAGTCG AGTTCCATCT TCTTTTGAAC CAGTACTACT TATATACCAT GACTCCATTA
TTGGCGTACA GACATATCTC AACCGCATGC CAGCATATGA TGTACTATAT CAACATGAGG
AGGGGTGATC CAATTAATAC CGATAATGCC TTGGGTGCAT CGAAAAAGGA AATGATAGAT
CGTCTCTTCT GGTCTTGCTT GAAGTTGGAG TGTGAGTTGA GAGTGGAACT ATCACCGTAT
GTACCCGTAT CTGGCATCAC ACAACAAGTG CCTCCAACTT CTTTCCCCAA GATTCCTGAT
CCTTTATCTG ATGAAATAAA ACTGAATCAC AGCGAAGCAT GCATTAAGCT CGCCAACAAA
TATGAAGATG AATATTCCTG GTACTATTTT CTCACTGAAA TTGCAGTGCG CAAGGTGGAC
AACAAGATGT TTGACGAAAT ATACTCATAT GAAAGCCGTT TGAGAAACCT TTGGGACCAG
GATAGTTTTG CTAATGAATC TGCATGGATT ATCTTCATAA AGTATCTAAA TCAGTACAAC
GGGATCATCA ATTCATTGAG TCCCCAAATT AGAAACTTTG TTCTCCAAGA AATTAACGTC
GATCAAATCC ACAGACGTAT GAAGAAGAAG TATGAGAAAA AACAACTGAA TATTAGCAGT
GATGCCGATG TGTTTGACAC ATTAGACGAT TTTTTGATAG ACGATGACCT CTTGATTCGA
GCCCAATCCG AATCAATCAT GTTCATAAAG ACAAGAATTA TAACCTCCAA GTTATTGTTG
TTCCGCCCTA TCATCTACTT GCTTTTAGAA GATAAAATCC CCATTACCGA ATTGATGGAA
GCTGCAATTT CAGTCATGGG TGCACAAGCC AATATTACTT CTGTATCGAT GAACAATCTA
AATGCAATGG AATCTCCCGA CTCAGCTGGC TCAGTTCCGA ATTCCTTTTC CGGTGAAACA
AACCCCTCGG ACGCAGACTT GGAGATGGAC TACTTTAATT TGATTAATGC ACCCTTGTTT
TACCAGAGAC AGTATCCGGA CGAAGATTTC TCTAACGTGA TAGAGTACAC CAACAAAGAT
AAAAGTGACA AAGACGAAGA CTTTGATGAC GAAAACAGTT TTTGCTTGAA AAGTCTTCCT
TTGGCTCGAT CTCGGATCTT GAGGATCTTT TTGCAGAACT TGATTTCTTT GCCCAAAATG
AATATTCCAA AATTGGGAGC ACATAGACAC CCTGGCCTGT GGTACTATTT GAGAAATCTT
TTCATTGGTA ATGTTTTCCA GTTCTTATTG TACAATAAGT TGCAAGAGAT GTTACAAGTG
GCAACTGCAG ACGAAGGAAT GAGAGCATTC CTTTCGCAAG TGCCCGAGAT TTCTTCGATG
AACGATGTTA TGGATATGTT CAACGTTGTA ATAAACAAAA ATGACATAAT TGCTGGATTC
GAACATTCGT TGATTTTATT TGATTACTGG AAGGAAGAAA TGAGCGATTG TGAAATTTAT
CTGGATTATA TCAAGAGGTG TATTGAAAAG CTAGAACAAG GTACCAAAAA TTAA
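
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs light cleaning before any computation. The following sketch shows one way to strip the layout whitespace and compute a GC fraction. The helper names are hypothetical, and this is not necessarily how the record's GC content field was derived. Only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = "ATGCCTGGCC GTGGTCCAAT CCCAAACCAG ATAAATCCTC CACAGCAAAT GGGCCAACCC"
seq = clean_sequence(first_line)

print(len(seq))
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence text through `clean_sequence` in the same way would give a value to compare against the GC content field.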
 
Protein sequence
MPGRGPIPNQ INPPQQMGQP LYNTPPQAHS ANIENVRKRT STSLMGASSR ASATYPRKRA 
LTACDTCRLK KIKCDNVRPR CGSCVKNGNM NCHYRTDDQQ KDYSSYDPAS LNILTKLDVI
LRDLRDLKNV NGLESSTPEE LGPGSGPGPG SGTTASGSGS GSASKRRQYG SEHREFHFDN
CIWDMSITSI LRWKYFIKCF GDTPEETDRV SNSLIKMYNR SIVAVNRNGT LESRLLRTKS
LEGLLSKNFS NIVNSFFVNC HSKIPILDTL ELFESLEIYK CLTSHYKLFS FIQILESYDL
ENPESDQLPR VVLDALRANN LEDTPFRRRA FKTLCLSVPN IIVICALGVV STPVQLENLT
KFDSSIEERK SIAIGCLSDS SAFNGVQDVR RDRLEISILL IRYAELLRTA FPFTVDQSSL
RAVEFHLLLN QYYLYTMTPL LAYRHISTAC QHMMYYINMR RGDPINTDNA LGASKKEMID
RLFWSCLKLE CELRVELSPY VPVSGITQQV PPTSFPKIPD PLSDEIKLNH SEACIKLANK
YEDEYSWYYF LTEIAVRKVD NKMFDEIYSY ESRLRNLWDQ DSFANESAWI IFIKYLNQYN
GIINSLSPQI RNFVLQEINV DQIHRRMKKK YEKKQLNISS DADVFDTLDD FLIDDDLLIR
AQSESIMFIK TRIITSKLLL FRPIIYLLLE DKIPITELME AAISVMGAQA NITSVSMNNL
NAMESPDSAG SVPNSFSGET NPSDADLEMD YFNLINAPLF YQRQYPDEDF SNVIEYTNKD
KSDKDEDFDD ENSFCLKSLP LARSRILRIF LQNLISLPKM NIPKLGAHRH PGLWYYLRNL
FIGNVFQFLL YNKLQEMLQV ATADEGMRAF LSQVPEISSM NDVMDMFNVV INKNDIIAGF
EHSLILFDYW KEEMSDCEIY LDYIKRCIEK LEQGTKN