Gene PICST_1028 details


Gene Information

Locus tag: PICST_1028
Symbol: HAL9
ID: 4840699
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 718579
End bp: 721659
Gene Length: 3081 bp
Protein Length: 868 aa
Translation table: 12
GC content: 40%
IMG OID: 640392014
Product: Fungal Zn2Cys6 Cluster domain
Protein accession: XP_001386333
Protein GI: 150866665
COG category:
COG ID:
TIGRFAM ID:
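
The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which the record does not state explicitly; the helper name `span_length` is hypothetical. It also compares the gene length with the number of nucleotides needed to encode the listed protein. The leftover is consistent with the gene being marked as spliced, although the record gives no intron coordinates.

```python
# Values copied from the Gene Information fields above.
START_BP = 718579
END_BP = 721659
PROTEIN_LENGTH_AA = 868


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_length = span_length(START_BP, END_BP)
coding_nt = PROTEIN_LENGTH_AA * 3       # nucleotides needed to encode the protein
non_coding_nt = gene_length - coding_nt  # remainder not accounted for by codons

print(gene_length)    # 3081, matching the Gene Length field
print(coding_nt)      # 2604
print(non_coding_nt)  # 477
```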


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.580079
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
AAAAAGAGAA TAAGAGTCGC CTGTGATCAT TGTCGAAGGA AAAAGATCAA ATGTGATGGC 
AACTTGCCGT GTGGGAACTG CTCACTGGCT AAGGAACGCA ATTGTCATTA TAAAGAAAGA
CCAGTAAAGA AGAAAATGAA ACCACTCAAG TCTGGCGACA AGAAGGACGG CTTAAAACGT
AACTCAAAGA CAAGGACGAT CGAAGTTCTA GATACTCGTT TGTCTACTCT TGAGAATGTC
ATCATCAGAC TCACAGACAA ATTAGAGGAT ATCACCAGGG TAGTACCTGT AGCCCCATTA
GCGTCCATGT CTTCCTCAAA CAATACAAAC AAAACATATT CAAATGTCAT TGATGAGATT
TCTAATCGAA ATCAGGTAGA AAACGGAAAA CATCTAAGAA GTTCTGCTGA GTCTGAGTCC
GAGGACGAGA ATGAGAATAA CACAGTAGAC GAACAAGAAA ACGATTTAGA CCGTACAACT
TCTGAAACTG AAGATGAGGA CGATGACGAT GACGAAGAAG ATGACGAAGA AGAAGAAGAC
GACGAACATG AACATGCAAG AGACTCTTCT CAACACAACT TAAAAGAACT GTCGAGCAAG
AGCAAAGCAG AACCAATACT GTCTTCCAAA GATTCCTCCA CAGCAGCTTC CACGCAGAGT
TCAAGCTCCA AGACTAAGAT CCTCTTGAAC AACCGTACTT TAGAACAATA TTTTGGAATC
CATTCTATGA TGTGTATCTT CTCTGACAAG TCTTTGGCAT GGATAGAGCG TACATTAGGT
ACAGAAGGCG AAGATCTAAT CACACCAATT AAAAACGTGC CCATTGTATT TTATTCCAAG
TTTAAGACTT TTATGCTTAA ATGGATAGAC CCCCCTCTCA TTGACAGTAA GGGCCGTCGA
AGATTGCTAG AGAACCCATT CCCCGAAAAC TCTAAACTAG TGTTCGATTT TATCGACACT
CACTACAAAG ATGTCAACGA TATCAATTCT CTTTGCGAAC AAAGCGAAAT GCGAGGCTAC
TTTGAGGAGT ACTACAACAA TTTTAGAGAA CCTAACCATA GTAAGAGAAA GAAGTTTAAG
TTGTCTGAAT ACTTGATCAT GACAGCAGGC TTACTTTTGT GTATCAGTTC GAGAATAAAT
TTGGAATCAG CCCCATCCCT CAAGAGCACT CCTGTGGGCA AAGAGCAGAC ACAGTCCAGT
GAAATTCTGC TTTCCGATGA TGAATTGTTG AGCTGGCAGA ATACGTTATT TGACGATGCT
ATTTACTACT ACCACAGAGT GTCTGTGATC AGTGATGGTA TTACCACTAT TGAAGGGATT
TTGCTTTTGA TTATGTACAT AGAGATGGAC TGGTTGACCA GTCATGTCAA CTATATGTTG
AGTTCCGTTG CTATAAGGTA TGCCCAAGAG ATAGGATTGC ATAGAGCTGA AACATACGAA
GACTTGGAGT TTGAAGAGCA ACACAAAAGA AGAAAGATCT GGATGTTCTG CCACTATTTC
GATATGGAGA TTTGTTTCAG AGGTGGAAAG CCGCCTTTGA TTAATGCTAA TGATGTCACT
GCTAATAACG ACGAAGACTT GATGCGTTTC TGCATGTTGA AGATGAAACA CAAAGTTAAT
GACAAGCAAT TGATGAATAT TGATGTAAAC ATGTCATTAT CCAAGCAGAT GCTTTCGATT
ATTGGAAGAG CAGATGATCC CCTCACATAC CACTTGTACT TCTTGTTGCT TACCAGAATC
AGATCCAAGT CGTACACTAA GCTTTTCGTT GCTTCAGTAG AAAATGAAAC TATTCAAAAG
GTAGCCGATA CATTGAACGA CTTGAACAAC GAGATGTTTG AACTTGCTTC GTCTATGCAT
GAAGCAGATC GTCCTCGGTT TTTCAACGAC CCTGAGTTCA GTTTCATTCC AGACACGATG
CCTTATTACA GAAGGGAGAC TGTGATCGCA GTTCAAATCA CATTCTTCTT CCACTTGATG
GTGATGAACA GACTTCCTAC CATGGTTAAT GCTGAAGAAT TGGACAATTC GTCCAGCATG
AAGTTTAGAA ACTTGCATTT GGATTCAGCT CGTACGATTT TGGTCTTGGT AAGACAGCTT
AACCGTGAAA ACACGGGTGT CTCGTATTAT TATTGGATCT TGTTTTTCCC TGTTTCCGCT
TTTTTGTGTT TAGCAGCTGC AATTTTGAAC CATCCAAACT TACCTGAGTC ATACAGTGAT
CTCAAGCTAT TGATTGATTG CTCAATGAAC TTCTTTTCCA GTAAGAAAAA TTCTCACGTG
GAAGGAAAGA GTAGGTTCAA AATCTACTCC AAGAAGGATC TTGTAGTCTC GTTAATTATC
AAATTGATGC TTAAGATTGT TATTAAGTTC TATGAACAAA CAACCAAGAT ATCAGTTCTT
AACGGTGATG AAAAATTACA ACGTCATTTG GACAGTGCTA AAAAGATGTT CCCAGATATC
TTCAAAGATA GAGCCGAGTT TAATTCCAAA GTTTCCAGTG TATTCGGATT TTCGCCCTTT
ACCAATAGCC AAAATTTTTC TGCTAGTAAT TCCATGAGTG GCTCAAACAG TAGCCATGCT
GGAGCTCCTG GAAGGACGGT GTGGACTCCT GGTGGAGGTA TACCAACTCC AGTTTCCTCG
TCTCTCCAGA ATGCTAACAC CTCTCAGCAA AGACAATTCG GTGTTAGTCC TTCGGCAAAT
AGTCCAATTA CGTTTTCTCC GAGTTACAAT CCAGCGTTGT CTAACATTTT GCACCCTAGC
GATTTGCCTA TGCGTGCATC ACCTTCTGAA AACGCGTTTA ACGTTCAACG TACTCCAAGC
CGCTTACAAC AGCAGACACA GCAATTATCG AATTCTGGTA CTGTAGGAAT GCTCCCTCAT
CATAGATCAA TGGAATCTGC TCCACTTTTT GACAATTCCA TCAATGGCGA CAACTCCATA
AATGGGGATC TCAATGCGAA TTTCATCAAT AATGGCTCGG ATATGTTCCC GGAGTCACAT
AATGACGATG GCATATCGTC GTTGTTCTAC ACGCAAATGA ACAGCTTGCC TAATTTCTTC
TTTGACAATA ACTTGGGTAT A
 
Protein sequence
KKRIRVACDH CRRKKIKCDG NLPCGNCSSA KERNCHYKER PVKKKMKPLK SGDKKDGLKR 
NSKTRTIEVL DTRLSTLENV IIRLTDKLED ITRNNTVDEQ ENDLDRTTSE TEDEDDDDDE
EDDEEEEDDE HEHARDSSQH NLKESSSKSK AEPISSSKDS STAASTQSSS SKTKILLNNR
TLEQYFGIHS MMCIFSDKSL AWIERTLGTE GEDLITPIKN VPIVFYSKFK TFMLKWIDPP
LIDSKGRRRL LENPFPENSK LVFDFIDTHY KDVNDINSLC EQSEMRGYFE EYYNNFREPN
HSKRKKFKLS EYLIMTAGLL LCISSRINLE SAPSLKSTPV GKEQTQSSEI SLSDDELLSW
QNTLFDDAIY YYHRVSVISD GITTIEGILL LIMYIEMDWL TSHVNYMLSS VAIRYAQEIG
LHRAETYEDL EFEEQHKRRK IWMFCHYFDM EICFRGGKPP LINANDVTAN NDEDLMRFCM
LKMKHKVNDK QLMNIDVNIA DDPLTYHLYF LLLTRIRSKS YTKLFVASVE NETIQKVADT
LNDLNNEMFE LASSMHEADR PRFFNDPEFS FIPDTMPYYR RETVIAVQIT FFFHLMVMNR
LPTMVNAEEL DNSSSMKFRN LHLDSARTIL VLVRQLNREN TGVSYYYWIL FFPVSAFLCL
AAAILNHPNL PESYSDLKLL IDCSMNFFSS KKNSHVEGKS RFKIYSKKDL VVSLIIKLML
KIVIKFYEQT TKISVLNGDE KLQRHLDSAK KMFPDIFKDR AEFNSKVSSV FGFSPFTNSQ
NFSASNSMSG SNSSHAGAPG RTVSMESAPL FDNSINGDNS INGDLNANFI NNGSDMFPES
HNDDGISSLF YTQMNSLPNF FFDNNLGI
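
The record lists translation table 12, the NCBI "Alternative Yeast Nuclear Code", which differs from the standard code in reading CTG as serine instead of leucine. The following is a minimal sketch of how that affects this gene. It translates only the first 90 nucleotides of the gene sequence above, a stretch that corresponds to the first 30 residues of the protein sequence. The function and variable names are illustrative and are not part of any database tool.

```python
from itertools import product

BASES = "TCAG"  # NCBI ordering for genetic code tables
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(amino_acids: str) -> dict:
    """Map each of the 64 codons (in TCAG order) to its one-letter amino acid."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    return dict(zip(codons, amino_acids))


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First 90 nt of the gene sequence shown above (line 1 plus half of line 2).
FIRST_90_NT = (
    "AAAAAGAGAA TAAGAGTCGC CTGTGATCAT TGTCGAAGGA AAAAGATCAA ATGTGATGGC "
    "AACTTGCCGT GTGGGAACTG CTCACTGGCT"
)

print(translate(FIRST_90_NT, build_codon_table(TABLE_12_AAS)))
# KKRIRVACDHCRRKKIKCDGNLPCGNCSSA
print(translate(FIRST_90_NT, build_codon_table(STANDARD_AAS)))
# KKRIRVACDHCRRKKIKCDGNLPCGNCSLA
```

With table 12 the in-frame CTG codon at position 29 yields S, matching the listed protein sequence (`NLPCGNCSSA`); the standard code would give L at that position. Because the gene is spliced, translating the full genomic sequence in a single frame would not reproduce the whole protein.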