Gene Information |
Locus tag | PICST_17474 |
Symbol | HYR5.1 |
ID | 4840042 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 990088 |
End bp | 994134 |
Gene Length | 4047 bp |
Protein Length | 580 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391357 |
Product | hyphally regulated cell wall protein |
Protein accession | XP_001385882 |
Protein GI | 150866327 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATTTC GAAATTATTT CGCAGCTGCA TTCTGCTTGA TCTCAGGAGT GGTGGCTAGG ACCATCACTC AAGATACTGT CAGTCGTGGT ACCATATCTC TTGGATTGGG TGACACTATC ATTAATGATG GTGTTTATTG GTCCATCATT GATAACGTAG TAACAGCCTT TGCAGGTAAT GTCGACGTGG GCACAGGCTC TGGTTTATAC ATCACAGGTC TCAATCCTTT ACTTTCCTTA CAAGTAACCC TTTTGCTGGG TTCTCTTACA AATGATGGTG TTATCGCCTT TAATGCCATT CAATCTTTAC TCGCTCCAAC ATACAATCTT GTCGGTATCT CGTTTACAAA TAATGGTGAA ATGTACTTAG GTGCCGATGG ATCAGTTGGT ATTCCCAGCA TCCTGATTAC CACTCCAGTC TGGAACAATA ATGGTTTATT GGTCTTCTAC CAAAATACCA GAACGACTGG TCCTGTTAAT TTGGGTACCA TTGGTAGTAC GATTAACAAC AATGGCCAAA TTTGTTTTTA CAATGAACTA TATACCCAGA CGACAAATAT CGCAGGTACT GGTTGTATTA CTTTAGTCGA AGACTCTAGT ATCTTCTTCT CTAATACTCT ATTGAACATC GATACAAATC AAGTCTTCTA TTTGGAAGAC TCTGCCTCCT CAATCAGAGC CACCGCCATT AGTGCCCCTA AAACTTATAC TGTGGCTGGG TTCGGTAACG GAAACAAGAT TGGTTTAGAT ATTCCACTTG TCAATATTCC TCCATTGCTC ACAGGTTACA CATATAGTAC TACCACAGGT ATTTTGACTC TTAAAGGTGC AGGTGTGTTG GCCATGAACT TCAACATTGG TAAAGGCTAC AATCCATCTC TTTTCTCCAT TGTCACTGAC GACGATGTTG GATTAGCTTC GGTTCTCTTC GGTGCAGTTT CCTACTCAGG ACCTCCTCCA AACCCAGTTC CTTCGATTTG TAAGCAATGT AAAACGCTTC CCCCTGCACC AGGAACCAGT CCGACCGTAA CTACGACCAC AGTAGCAACT ACGAATACTG CTGGATTCAC TTGTTCAGAA GTCGATCAAA TCCTTGTTTC CACAGATACC AATTATTCTT GGTTCACATC TACTTCAACT ATTACTGTAC TGTGTCCTTC AAACCCAACA ACTACAGTAA CTTCTACTTG GACAGGTTCT CAAACTACCA CCGTCACTGT TACTGACACA GTTGGAGGCA CTGACACCGT GATTGTTGAA GTTCCATCTA ACGAACAGAC TACTCTCACT TCGACCTGGA CTGGTACCGA AACTACCACT GTCACGTTAA CCGATACACA AGGAGGCACT GACACAGTTG TAGTTGAAGT CCCTTCCAAT GAACAAACCA CTCTCACTTC GACCTGGACT GGTACCGAAA CTACCACTGT CACGTTAACC GATACACAAG GAGGCACTGA CACAGTTGTA GTTGAAGTCC CTTCCAATGA ACAAACCACT CTCACCTCGA CCTGGACCGG AACTGAAACC ACCACAGTTA CTATTTCCGA CACAGTCGGC GGTACTGACA CTGTCATTGT TGAAGTTCCA TCTAACGAGG AGACTACTGT TATTTCTACA TGGACAGGTA CTGTCACCAG CACAGTTACT ATTTCGGACA CAGTCGGCGG AACGGACACA GTAATTGTTG TTGTTCCTTC TACTCCAAAC AGTCAGACCA CTCTTACTTC GACCTGGACT GGTACTGAAA CTACCACAGT TACTATTTCC GACACAGTTG GGGGTACTGA TACTGTCATT 
GTTGAAGTTC CATCTAACGA ACAGACTACT CTTACTTCGA CCTGGACTGG TACTGAAACT ACCACAGTTA CTATTTCCGA CACAGTCGGC GGTACTGACA CTGTCATTGT TGAAGTTCCA TCTAACGAGG AGACTACTGT TATTTCTACC TGGACTGGTA CTGTCACCAG CACAGTTACT ATTTCAGACA CAGTCGGCGG AACGGACACA GTAATTGTTG TTGTTCCTTC TACTCCAAAC AGTCAGACCA CTCTTACTTC GACCTGGACT GGTACTGAAA CTACCACAGT TACTATTTCC GACACAGTCG GCGGTACTGA CACAGTTGTA GTTGAAGTCC CTTCCAATGA ACAAACCACT CTCACTTCGA CCTGGACTGG TACCGAAACT ACCACTGTCA CGTTAACCGA TACACAAGGA GGCACTGACA CAGTTGTAGT TGAAGTCCCT TCCAATGAAC AAACCACTCT CACCTCGACC TGGACCGGAA CTGAAACCAC CACAGTTACT ATTTCCGACA CAGTCGGCGG TACTGACACT GTCATTGTTG AAGTTCCATC TAACGAGGAG ACTACTGTTA TTTCTACATG GACAGGTACT GTCACCAGCA CAGTTACTAT TTCGGACACA GTCGGCGGAA CGGACACAGT AATTGTTGTT GTTCCTTCTA CTCCAAACAG TCAGACCACT CTTACTTCGA CCTGGACTGG TACTGAAACT ACCACAGTTA CTATTTCCGA CACAGTTGGG GGTACTGACA CTGTCATTGT TGAAGTTCCA TCTAACGAAC AGACTACTCT CACTTCGACC TGGACTGGTA CTGAAACTAC CACTGTTACG TTAACCGACA CACAAGGAGG CACTGACACT GTCATTGTTG AAGTTCCTTC TACTCCAAAC AGTCAGACCA CTCTTACTTC GACCTGGACA GGAACCGAGA CAACTACAGT TACTATTTCC GACACAGTTG GAGGTACTGA CACTGTCATT GTTGAAGTTC CATCTAACGA ACAGACTACT CTCACTTCGA CCTGGACTGG TACTGAAACT ACCACTGTTA CGTTGACCGA CACACAAGGA GGCACTGACA CTGTCATTGT TGAAGTTCCT TCTACTCCAA ACAGTCAGAC CACTCTTACT TCGACCTGGA CAGGAACCGA GACAACTACA GTTACTATTT CCGACACAGT TGGAGGTACT GACACTGTCA TTGTTGAAGT TCCTTCCAAC CCTCAGACAA CTGTTGTGTC AACTTGGATT GGCACTGAAA CTACTACTGT AACTGTTACG GATACCGTAG GAGGTACTGA TACAGTTGTC ATCGTTGTTC CACCAAATCC AACCACTACA GTTACTTCTA CCTGGACCGG AGTAGATACT ACAACCTTAA CATTGACAGA CACCCAAGGT GGTACTGATA CTGTTGTTGT TGAAGTTCCG TCTAATGCCC AGACTACTGT TACTTCGACT TGGACTGGTA CTGAAATTAC TACTGTTACT ATTTCGGACA CAGTTGGTGG TACCGACACT GTAATAGTCG AAGTTCCATC CACTGCAAAC ATTCAAACAA CTCTTACTAG TACGTGGACT GGTTCTGATA TCACTACTAC AACAGTGACC GACACGCCAG GAGGAACCGA TACCGTTATT ATTGAAGTTC CAACCACTGC AAACAGTCAA ACAACTCTCA CATCCACCTG GACCGGAACA GAAACTACTA CTGTTACATT AACGGATACC TTAGGTGGCA CTGACACTGT CATTGTTGAA GTTCCTTCTA CTCCAAACAG TCAGACAACT CTTACTTCGA 
CCTGGACTGG TACTGAAACT ACCACAGTTA CTATTTCCGA CACAGTTGGA GGTACTGACA CTGTCATTGT TGAAGTCCCT TCAAACGTAG AGACAACTGT TATTTCTACA TGGATCGGAA CTGTAACTAC TACTGTTACT GTAACTGATA CCGTAGGTGG AACTGATACA GTTATCGTCG TTGTCCCACC AAATCCAACC ACTACAGTTA CTTCTACCTG GACCGGAGTA GATACTACAA CCTTAACATT GACAGACACC CAAGGTGGTA CTGACACTGT TGTTGTTGAA GTTCCATCTA ACGAACAGAC TACTCTCACT TCGACCTGGA CTGGTACCGA AACAACCACT GTTACGTTAA CCGACACACA AGGAGGCACT GACACCGTAA TTGTTGAAGT TCCATCTAAC GAACAGACTA CTCTCACTTC GACCTGG
|
Protein sequence | MLFRNYFAAA FCLISGVVAR TITQDTVSRG TISLGLGDTI INDGVYWSII DNVVTAFAGN VDVGTGSGLY ITGLNPLLSL QVTLLSGSLT NDGVIAFNAI QSLLAPTYNL VGISFTNNGE MYLGADGSVG IPSISITTPV WNNNGLLVFY QNTRTTGPVN LGTIGSTINN NGQICFYNEL YTQTTNIAGT GCITLVEDSS IFFSNTLLNI DTNQVFYLED SASSIRATAI SAPKTYTVAG FGNGNKIGLD IPLVNIPPLL TGYTYSTTTG ILTLKGAGVL AMNFNIGKGY NPSLFSIVTD DDVGLASVLF GAVSYSGPPP NPVPSICKQC KTLPPAPGTS PTVTTTTVAT TNTAGFTCSE VDQILVSTDT NYSWFTSTST ITVSCGTDTV VVEVPSNAQT TVTSTWTGTE ITTVTISDTV GGTDTVIVEV PSTANIQTTL TSTWTGSDIT TTTVTDTPGG TDTVIIEVPT TANSQTTLTS TWTGTETTTV TLTDTLGGTD TVIVEVPSTP NSQTTLTSTW TGTETTTVTI SDTTPKVTTL TSTWTGTETT TVTLTDTQGG TDTVIVEVPS NEQTTLTSTW
|
| |
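The Gene Length field is consistent with Start bp and End bp if both endpoints are counted. The sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention). The helper names `span_length` and `gc_fraction` are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Return the number of bases in a range, counting both endpoints."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    # The record prints the sequence in space-separated 10-base blocks,
    # so all whitespace is removed before counting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in bases) / len(bases)


# Start bp and End bp as listed in the record above.
print(span_length(990088, 994134))  # 4047
```

Passing the Gene sequence field to `gc_fraction` would give a value to compare against the listed GC content, provided that figure was computed over the same span.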