Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_17616 |
Symbol | HYR5.2 |
ID | 4850877 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 307132 |
End bp | 311502 |
Gene Length | 4371 bp |
Protein Length | 488 aa |
Translation table | |
GC content | 46% |
IMG OID | 640392585 |
Product | hyphally regulated cell wall protein |
Protein accession | XP_001387297 |
Protein GI | 126273827 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGGCAT CAAGTGCTTT AGCACTCGAA GTTACTGAAG ATACCACAGA AGTAGGTACT ATTTCTCTTG ATGTTGGAGA CATTACAGTT GACGCTGGCG TCTACTACTC TATCATAAAC AACGCTCTTT CTGCCATCGT GGGTAATCTT GATGTTGAAG GTTCCTTCTA CATTACCAGT ACATCTGATT TGATTGCATT GACCGTCACA CTTGACGGTG TAATCAACTC CATCGTGAAC AATGGTCTTG TTTCTTTTAA TTCTGCTGAA TCTTTGACTG CTCCTACTTA TCAATTGGCC GGTATTTCCT TCGAAAACAA CGGTGAATTC TACTTAGGGG GAGATGGTTC TGTGGGTGTC CCAATAATGG CAATTACTTC TCTTACGTTT AACAACAATG GTTTAATGGT ATTCTACCAA AACCAAAGAT CCACTGGTGT AGTCACTCTT GGTGCTCCAG CTGCTACTAT TCACAACAAC GGTCAAATTT GTCTTTACAA CGAGATATAC CAACAAACAA CTGCTGTTGA TGGTACGGGT TGTATCACCG CCCAGGCTGA CTCCAGTGTT TTCATTTCCA ATGCATTGGT TCCATTCGCT CAAACTCAAA CCATCTACTT GGAATCTGAC ACAGCCAGTG TGAGAGCAAC TGCTATCAGT ACTCCTCAAA CATTCACAGT TGCAAACTTC GGTAACGGAA ATATCATTGG TTTAGATATT CCTCTTGTTA CTCTTCCTGG TTTGTCATCG TACAGTTACG ATGCAACAAG TGGTATCTTG ACCCTTCGTG GTGCTGGATT ATTATCCCAA AAATTCGATA TTGGTACTGG TTACGACACA ACCTTGTTCC TGATCACCAC AGATCCTGGT CTTGGATTGG TCTCAGTGCC TCTTGGTGCT GTTACATACT CTGGCCCTCC TCCTACTCCA GGTCAACCAG ATGCTTGTCA GGCTTGTAAG CCATTGCCCC CAGTTCCTTC CCCTACTAAC TCTCAAACTA CCACCACCAC CACTTGGACC GGAACTGACA CTGAAACAAC CACTGAGACT GCTACTGAAG GAGGAACCGA CACTGTTATC GTTGAAATTC CATCTAACTC CCAAACTACT ACCACCACCA CTTGGACCGG AACTGACACG GAAACTACCA CTATTACCGA CACCCAAGGA GGAACTGACA CTGTCATCGT TGAGGTTCCA TCTAACTCTC AAACCACTTT GACTTCTACT TGGACCGGAA CTGACACGGA AACCACCACT ATTACCGACA CCCAAGGAGG AACTGACACT GTCATCGTTG AAATTCCATC TAACTCTCAA ACCACTTTGA CTTCTACTTG GACCGGAACT GACACGGAAA CCACCACAAT TACCGACACC CAAGGAGGAA CTGACACTGT CATCGTTGAA ATTCCATCTA ACTCTCAAAC CACTTTGACT TCTACTTGGA CCGGAACTGA CACGGAAACC ACCACAATTA CTGACACTCA AGGAGGAACT GACACTGTCA TCGTTGAAAT TCCATCTAAC TCTCAAACCA CTTTGACTTC TACTTGGACC GGAACTGACA CGGAAACCAC CACAATTACC GACACTCAAG GAGGAACTGA CACTGTCATC GTTGAAATTC CATCTAACTC TCAAACCACT TTGACTTCTA CTTGGACCGG AACTGACACG GAAACCACCA CAATAACTGA CACCCAAGGA GGAACTGACA CTGTCATCGT TGAAGTTCCA TCTGTTCCTA ACAGTCAAAC TACCTTGACT TCGACTTGGA CCGGTCTTGA AACTACTACT GTTACCGAGA CTGACACTCC AGGTGGTACT GATACCGTCA TCGTTGAAGT TCCATCTTCT GCAAACAGTC AAACCACTTT GACATCTACC TGGACCGGTC TTGAAACTAC TACTGTTACC GAGACTGACA CTCCAGGTGG TACTGATACC GTCATCGTTG AAGTTCCATC TTCTGCAAAC AGTCAAACCA CTTTGACATC TACCTGGACC GGTCTTGAAA CTACTACTGT TACCGAGACT GACACTCCAG GTGGTACTGA TACCGTCATC GTTGAAGTTC CATCTTCTGC AAACAGTCAA ACCACTTTGA CATCTACCTG GACCGGTCTT GAAACTACTA CTGTTACCGA GACTGCTACT CCAGGTGGTA CTGATACCGT CATCGTTGAA GTTCCATCTT CTGCAAACAG TCAAACCACT TTGACATCTA CCTGGACCGG TCTTGAAACT ACTACTGTTA CCGAGACTGC TACTCCAGGT GGTACTGACA CTGTTATTGT TGAAGTTCCT TCATCCACCA ACAGTCAGTC TACCGTTACA TCTACATGGA CTGGTACTTT CTTTACTACT ACCACTATCA CGGCTGCTCC GGGTGGAACA GATACTGTTA TTGTAGAGGT TCCTTCCACT GCTAACAACC AAACTTTCTT AACATATACT TGGACTGGAA CTCAAACTAC AACTATTACC GAAACTGCTA GTCAAGGTGG AACTGACACC GTCATCGTTG AAGTTCCATC TTCTGCAAAC AGTCAAACCA CTTTGACATC TACCTGGACT GGTCTTGAAA CCACTACAAT TACCTTAACT GCTACTCCAG GTGGAACCGA CACTGTTATC GTTGAGGTTC CTTCATCTAC CAATAGTGAG ACTACATTAA CATCTACTGG AACTGGATCG GAAACTTCTA CTGTAACCGA AACTGCTAGT CCAGGTGGTA CTGATACTGT CATCGTTGTT GTACCATCTT CTGCAAACAG TCAAACCACT TTGACATCTA CTGGCACTGG ATCAGAAACT TCTACTGTAA CCGAAACTGC TAGTCCAGGT GGTACTGACA CTGTCGTCGT TGTTGTGCCA TCTTCTGCAA ACAGTCAAAC CACTTTGACA TCTACTGGAA CTGGATCGGA AACTTCTACT GTAACCGAAA CTGCTAGTCC AGGTGGTACT GACACTGTCG TCGTTGTTGT GCCATCTTCT GCAAACAGTC AAACCACTTT AACATCTACT GGCACTGGAT CAGAAACTTC TACTGTAACC GAAACTGCTA GTCCAGGTGG TACTGACACT GTCGTCGTTG TTGTACCATC TTCTGCTTCA TCATCTTCTG GTCCAAAGCT ATGCCTTAGA GATGGCGAAG GATGTCCTGA ACCAACGGAA TACACAACTA CTTTTACAAC CACCGAACCT GATGGATCAG TGGAAACCGA AAGTGGTGTC GTATTGATTT CTACTGATGA TTCCGGTAGC TGGTTCACAA CAACTTCTCT CTTCCCAGAA TCAACAGCTG AACCAACTGA ATACACAACA ACCTTCACAA CCACCGAACC TGATGGATCA GTGGAAACCG AAAGTGGTGT CGTATTGATT TCTACTGATG ATTCCGGTAG CTGGTTCACA ACAACTTCTC TCTTCCCAGA ATCAACAGCT GAACCAACTG AATACACAAC AACCTTCACA ACCACCGAAC CTGATGGATC AGTGGAAACC GAAAGTGGTG TCGTGTTGAT TTCTACTGAT GATTCCGGTA GCTGGTTCAC AACAACTTCT CTCTTCCCAG AACCTTCGAA TGCTGAACCA ACTGAATACA CAACTACTTT CGCAACCACC GAACCTGATG GATCAGTGGA AACCGAAAGT GGTGTCGTAT TGATTTCTAC TGATGATTCC GGTAGCTGGT TCACAACAAC TTCTCTCTTC CCAGAACCAT CGACTGCTAC CCTGTGGGAA TACGTGACAT CAACCGTTGT TGACTGTCCA CTCACTGTCT GCGTTAAGGG TAAATGTGCC ACTGAGACAA CTTCTTCCAC TCAAGTAGTA ACAGTGACAG TCTGTCCAGT TTGTGAACAC AAGCCTACTG AGGGCGCTCC AGGAGCCCCT GCCCCAGCTA CTACAGAAGC TGCTGCTCCA GGTGGTGCTA AGCCAGCTGC TCCGGAAGCC GCTACTCCTG CTTCTGTTGC CAAACCAGCC ACAACTGTTG CTGCTAAGCC AGAAACTACT GCTGGTGCTC CAGCTCCAGA AGCTGCTAAT CCAGAAACTA CTGCTGGTGC TCCAGCTCCA GAAGCTGCTA AGCCAGAAAC TACTGCTGGT GCCCCAGCTC CAGAAGCTGC TAAGCCTGAA ACTACTGCTG GTGCCCCAGC TCCAGAAGCT GCTAAGCCAG AAACTACTGC TGGTGCCCCA GCTCCAGAAG CTGCTAAGCC AGAAACTACT GCTAGTGCTG AAGAAGCTTC CCAATCTTTG ACTGTTGCAG CTGAATCTGC TGCAAGTGCA GTTCCTTCTG TTTCTGCAGT TGCTCCTTCT CCAGCTGAAC AACCATCTTC TGTATCTACA TTCGAAGGTG CTGCCTCGTT G
|
Protein sequence | LLASSALALE VTEDTTEVGT ISLDVGDITV DAGVYYSIIN NALSAIVGNL DVEGSFYITS TSDLIALTVT LDGVINSIVN NGLVSFNSAE SLTAPTYQLA GISFENNGEF YLGGDGSVGV PIMAITSLTF NNNGLMVFYQ NQRSTGVVTL GAPAATIHNN GQICLYNEIY QQTTAVDGTG CITAQADSSV FISNALVPFA QTQTIYLESD TASVRATAIS TPQTFTVANF GNGNIIGLDI PLVTLPGLSS YSYDATSGIL TLRGAGLLSQ KFDIGTGYDT TLFLITTDPG LGLVSVPLGA VTYSGPPPTP GQPDACQACK PLPPTATPGA EPTEYTTTFT TTEPDGSVET ESGVVLISTD DSGSWFTTTS LFPESTAEPT EYTTTFTTTE PDGSVETESE PSNAEPTEYT TTFATTEPDG SVETETATPA SVAKPATTVA AKPETTAGAP APEAANPETT AVPSVSAVAP SPAEQPSSVS TFEGAASL
|
| |