Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_1028 |
Symbol | HAL9 |
ID | 4840699 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 718579 |
End bp | 721659 |
Gene Length | 3081 bp |
Protein Length | 868 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640392014 |
Product | Fungal Zn2Cys6 Cluster domain |
Protein accession | XP_001386333 |
Protein GI | 150866665 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.580079 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAAAAGAGAA TAAGAGTCGC CTGTGATCAT TGTCGAAGGA AAAAGATCAA ATGTGATGGC AACTTGCCGT GTGGGAACTG CTCACTGGCT AAGGAACGCA ATTGTCATTA TAAAGAAAGA CCAGTAAAGA AGAAAATGAA ACCACTCAAG TCTGGCGACA AGAAGGACGG CTTAAAACGT AACTCAAAGA CAAGGACGAT CGAAGTTCTA GATACTCGTT TGTCTACTCT TGAGAATGTC ATCATCAGAC TCACAGACAA ATTAGAGGAT ATCACCAGGG TAGTACCTGT AGCCCCATTA GCGTCCATGT CTTCCTCAAA CAATACAAAC AAAACATATT CAAATGTCAT TGATGAGATT TCTAATCGAA ATCAGGTAGA AAACGGAAAA CATCTAAGAA GTTCTGCTGA GTCTGAGTCC GAGGACGAGA ATGAGAATAA CACAGTAGAC GAACAAGAAA ACGATTTAGA CCGTACAACT TCTGAAACTG AAGATGAGGA CGATGACGAT GACGAAGAAG ATGACGAAGA AGAAGAAGAC GACGAACATG AACATGCAAG AGACTCTTCT CAACACAACT TAAAAGAACT GTCGAGCAAG AGCAAAGCAG AACCAATACT GTCTTCCAAA GATTCCTCCA CAGCAGCTTC CACGCAGAGT TCAAGCTCCA AGACTAAGAT CCTCTTGAAC AACCGTACTT TAGAACAATA TTTTGGAATC CATTCTATGA TGTGTATCTT CTCTGACAAG TCTTTGGCAT GGATAGAGCG TACATTAGGT ACAGAAGGCG AAGATCTAAT CACACCAATT AAAAACGTGC CCATTGTATT TTATTCCAAG TTTAAGACTT TTATGCTTAA ATGGATAGAC CCCCCTCTCA TTGACAGTAA GGGCCGTCGA AGATTGCTAG AGAACCCATT CCCCGAAAAC TCTAAACTAG TGTTCGATTT TATCGACACT CACTACAAAG ATGTCAACGA TATCAATTCT CTTTGCGAAC AAAGCGAAAT GCGAGGCTAC TTTGAGGAGT ACTACAACAA TTTTAGAGAA CCTAACCATA GTAAGAGAAA GAAGTTTAAG TTGTCTGAAT ACTTGATCAT GACAGCAGGC TTACTTTTGT GTATCAGTTC GAGAATAAAT TTGGAATCAG CCCCATCCCT CAAGAGCACT CCTGTGGGCA AAGAGCAGAC ACAGTCCAGT GAAATTCTGC TTTCCGATGA TGAATTGTTG AGCTGGCAGA ATACGTTATT TGACGATGCT ATTTACTACT ACCACAGAGT GTCTGTGATC AGTGATGGTA TTACCACTAT TGAAGGGATT TTGCTTTTGA TTATGTACAT AGAGATGGAC TGGTTGACCA GTCATGTCAA CTATATGTTG AGTTCCGTTG CTATAAGGTA TGCCCAAGAG ATAGGATTGC ATAGAGCTGA AACATACGAA GACTTGGAGT TTGAAGAGCA ACACAAAAGA AGAAAGATCT GGATGTTCTG CCACTATTTC GATATGGAGA TTTGTTTCAG AGGTGGAAAG CCGCCTTTGA TTAATGCTAA TGATGTCACT GCTAATAACG ACGAAGACTT GATGCGTTTC TGCATGTTGA AGATGAAACA CAAAGTTAAT GACAAGCAAT TGATGAATAT TGATGTAAAC ATGTCATTAT CCAAGCAGAT GCTTTCGATT ATTGGAAGAG CAGATGATCC CCTCACATAC CACTTGTACT TCTTGTTGCT TACCAGAATC AGATCCAAGT CGTACACTAA GCTTTTCGTT GCTTCAGTAG AAAATGAAAC TATTCAAAAG GTAGCCGATA CATTGAACGA CTTGAACAAC GAGATGTTTG AACTTGCTTC GTCTATGCAT GAAGCAGATC GTCCTCGGTT TTTCAACGAC CCTGAGTTCA GTTTCATTCC AGACACGATG CCTTATTACA GAAGGGAGAC TGTGATCGCA GTTCAAATCA CATTCTTCTT CCACTTGATG GTGATGAACA GACTTCCTAC CATGGTTAAT GCTGAAGAAT TGGACAATTC GTCCAGCATG AAGTTTAGAA ACTTGCATTT GGATTCAGCT CGTACGATTT TGGTCTTGGT AAGACAGCTT AACCGTGAAA ACACGGGTGT CTCGTATTAT TATTGGATCT TGTTTTTCCC TGTTTCCGCT TTTTTGTGTT TAGCAGCTGC AATTTTGAAC CATCCAAACT TACCTGAGTC ATACAGTGAT CTCAAGCTAT TGATTGATTG CTCAATGAAC TTCTTTTCCA GTAAGAAAAA TTCTCACGTG GAAGGAAAGA GTAGGTTCAA AATCTACTCC AAGAAGGATC TTGTAGTCTC GTTAATTATC AAATTGATGC TTAAGATTGT TATTAAGTTC TATGAACAAA CAACCAAGAT ATCAGTTCTT AACGGTGATG AAAAATTACA ACGTCATTTG GACAGTGCTA AAAAGATGTT CCCAGATATC TTCAAAGATA GAGCCGAGTT TAATTCCAAA GTTTCCAGTG TATTCGGATT TTCGCCCTTT ACCAATAGCC AAAATTTTTC TGCTAGTAAT TCCATGAGTG GCTCAAACAG TAGCCATGCT GGAGCTCCTG GAAGGACGGT GTGGACTCCT GGTGGAGGTA TACCAACTCC AGTTTCCTCG TCTCTCCAGA ATGCTAACAC CTCTCAGCAA AGACAATTCG GTGTTAGTCC TTCGGCAAAT AGTCCAATTA CGTTTTCTCC GAGTTACAAT CCAGCGTTGT CTAACATTTT GCACCCTAGC GATTTGCCTA TGCGTGCATC ACCTTCTGAA AACGCGTTTA ACGTTCAACG TACTCCAAGC CGCTTACAAC AGCAGACACA GCAATTATCG AATTCTGGTA CTGTAGGAAT GCTCCCTCAT CATAGATCAA TGGAATCTGC TCCACTTTTT GACAATTCCA TCAATGGCGA CAACTCCATA AATGGGGATC TCAATGCGAA TTTCATCAAT AATGGCTCGG ATATGTTCCC GGAGTCACAT AATGACGATG GCATATCGTC GTTGTTCTAC ACGCAAATGA ACAGCTTGCC TAATTTCTTC TTTGACAATA ACTTGGGTAT A
|
Protein sequence | KKRIRVACDH CRRKKIKCDG NLPCGNCSSA KERNCHYKER PVKKKMKPLK SGDKKDGLKR NSKTRTIEVL DTRLSTLENV IIRLTDKLED ITRNNTVDEQ ENDLDRTTSE TEDEDDDDDE EDDEEEEDDE HEHARDSSQH NLKESSSKSK AEPISSSKDS STAASTQSSS SKTKILLNNR TLEQYFGIHS MMCIFSDKSL AWIERTLGTE GEDLITPIKN VPIVFYSKFK TFMLKWIDPP LIDSKGRRRL LENPFPENSK LVFDFIDTHY KDVNDINSLC EQSEMRGYFE EYYNNFREPN HSKRKKFKLS EYLIMTAGLL LCISSRINLE SAPSLKSTPV GKEQTQSSEI SLSDDELLSW QNTLFDDAIY YYHRVSVISD GITTIEGILL LIMYIEMDWL TSHVNYMLSS VAIRYAQEIG LHRAETYEDL EFEEQHKRRK IWMFCHYFDM EICFRGGKPP LINANDVTAN NDEDLMRFCM LKMKHKVNDK QLMNIDVNIA DDPLTYHLYF LLLTRIRSKS YTKLFVASVE NETIQKVADT LNDLNNEMFE LASSMHEADR PRFFNDPEFS FIPDTMPYYR RETVIAVQIT FFFHLMVMNR LPTMVNAEEL DNSSSMKFRN LHLDSARTIL VLVRQLNREN TGVSYYYWIL FFPVSAFLCL AAAILNHPNL PESYSDLKLL IDCSMNFFSS KKNSHVEGKS RFKIYSKKDL VVSLIIKLML KIVIKFYEQT TKISVLNGDE KLQRHLDSAK KMFPDIFKDR AEFNSKVSSV FGFSPFTNSQ NFSASNSMSG SNSSHAGAPG RTVSMESAPL FDNSINGDNS INGDLNANFI NNGSDMFPES HNDDGISSLF YTQMNSLPNF FFDNNLGI
|
| |