Gene PICST_17474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_17474 
SymbolHYR5.1 
ID4840042 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp990088 
End bp994134 
Gene Length4047 bp 
Protein Length580 aa 
Translation table12 
GC content45% 
IMG OID640391357 
Producthyphally regulated cell wall protein 
Protein accessionXP_001385882 
Protein GI150866327 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATTTC GAAATTATTT CGCAGCTGCA TTCTGCTTGA TCTCAGGAGT GGTGGCTAGG 
ACCATCACTC AAGATACTGT CAGTCGTGGT ACCATATCTC TTGGATTGGG TGACACTATC
ATTAATGATG GTGTTTATTG GTCCATCATT GATAACGTAG TAACAGCCTT TGCAGGTAAT
GTCGACGTGG GCACAGGCTC TGGTTTATAC ATCACAGGTC TCAATCCTTT ACTTTCCTTA
CAAGTAACCC TTTTGCTGGG TTCTCTTACA AATGATGGTG TTATCGCCTT TAATGCCATT
CAATCTTTAC TCGCTCCAAC ATACAATCTT GTCGGTATCT CGTTTACAAA TAATGGTGAA
ATGTACTTAG GTGCCGATGG ATCAGTTGGT ATTCCCAGCA TCCTGATTAC CACTCCAGTC
TGGAACAATA ATGGTTTATT GGTCTTCTAC CAAAATACCA GAACGACTGG TCCTGTTAAT
TTGGGTACCA TTGGTAGTAC GATTAACAAC AATGGCCAAA TTTGTTTTTA CAATGAACTA
TATACCCAGA CGACAAATAT CGCAGGTACT GGTTGTATTA CTTTAGTCGA AGACTCTAGT
ATCTTCTTCT CTAATACTCT ATTGAACATC GATACAAATC AAGTCTTCTA TTTGGAAGAC
TCTGCCTCCT CAATCAGAGC CACCGCCATT AGTGCCCCTA AAACTTATAC TGTGGCTGGG
TTCGGTAACG GAAACAAGAT TGGTTTAGAT ATTCCACTTG TCAATATTCC TCCATTGCTC
ACAGGTTACA CATATAGTAC TACCACAGGT ATTTTGACTC TTAAAGGTGC AGGTGTGTTG
GCCATGAACT TCAACATTGG TAAAGGCTAC AATCCATCTC TTTTCTCCAT TGTCACTGAC
GACGATGTTG GATTAGCTTC GGTTCTCTTC GGTGCAGTTT CCTACTCAGG ACCTCCTCCA
AACCCAGTTC CTTCGATTTG TAAGCAATGT AAAACGCTTC CCCCTGCACC AGGAACCAGT
CCGACCGTAA CTACGACCAC AGTAGCAACT ACGAATACTG CTGGATTCAC TTGTTCAGAA
GTCGATCAAA TCCTTGTTTC CACAGATACC AATTATTCTT GGTTCACATC TACTTCAACT
ATTACTGTAC TGTGTCCTTC AAACCCAACA ACTACAGTAA CTTCTACTTG GACAGGTTCT
CAAACTACCA CCGTCACTGT TACTGACACA GTTGGAGGCA CTGACACCGT GATTGTTGAA
GTTCCATCTA ACGAACAGAC TACTCTCACT TCGACCTGGA CTGGTACCGA AACTACCACT
GTCACGTTAA CCGATACACA AGGAGGCACT GACACAGTTG TAGTTGAAGT CCCTTCCAAT
GAACAAACCA CTCTCACTTC GACCTGGACT GGTACCGAAA CTACCACTGT CACGTTAACC
GATACACAAG GAGGCACTGA CACAGTTGTA GTTGAAGTCC CTTCCAATGA ACAAACCACT
CTCACCTCGA CCTGGACCGG AACTGAAACC ACCACAGTTA CTATTTCCGA CACAGTCGGC
GGTACTGACA CTGTCATTGT TGAAGTTCCA TCTAACGAGG AGACTACTGT TATTTCTACA
TGGACAGGTA CTGTCACCAG CACAGTTACT ATTTCGGACA CAGTCGGCGG AACGGACACA
GTAATTGTTG TTGTTCCTTC TACTCCAAAC AGTCAGACCA CTCTTACTTC GACCTGGACT
GGTACTGAAA CTACCACAGT TACTATTTCC GACACAGTTG GGGGTACTGA TACTGTCATT
GTTGAAGTTC CATCTAACGA ACAGACTACT CTTACTTCGA CCTGGACTGG TACTGAAACT
ACCACAGTTA CTATTTCCGA CACAGTCGGC GGTACTGACA CTGTCATTGT TGAAGTTCCA
TCTAACGAGG AGACTACTGT TATTTCTACC TGGACTGGTA CTGTCACCAG CACAGTTACT
ATTTCAGACA CAGTCGGCGG AACGGACACA GTAATTGTTG TTGTTCCTTC TACTCCAAAC
AGTCAGACCA CTCTTACTTC GACCTGGACT GGTACTGAAA CTACCACAGT TACTATTTCC
GACACAGTCG GCGGTACTGA CACAGTTGTA GTTGAAGTCC CTTCCAATGA ACAAACCACT
CTCACTTCGA CCTGGACTGG TACCGAAACT ACCACTGTCA CGTTAACCGA TACACAAGGA
GGCACTGACA CAGTTGTAGT TGAAGTCCCT TCCAATGAAC AAACCACTCT CACCTCGACC
TGGACCGGAA CTGAAACCAC CACAGTTACT ATTTCCGACA CAGTCGGCGG TACTGACACT
GTCATTGTTG AAGTTCCATC TAACGAGGAG ACTACTGTTA TTTCTACATG GACAGGTACT
GTCACCAGCA CAGTTACTAT TTCGGACACA GTCGGCGGAA CGGACACAGT AATTGTTGTT
GTTCCTTCTA CTCCAAACAG TCAGACCACT CTTACTTCGA CCTGGACTGG TACTGAAACT
ACCACAGTTA CTATTTCCGA CACAGTTGGG GGTACTGACA CTGTCATTGT TGAAGTTCCA
TCTAACGAAC AGACTACTCT CACTTCGACC TGGACTGGTA CTGAAACTAC CACTGTTACG
TTAACCGACA CACAAGGAGG CACTGACACT GTCATTGTTG AAGTTCCTTC TACTCCAAAC
AGTCAGACCA CTCTTACTTC GACCTGGACA GGAACCGAGA CAACTACAGT TACTATTTCC
GACACAGTTG GAGGTACTGA CACTGTCATT GTTGAAGTTC CATCTAACGA ACAGACTACT
CTCACTTCGA CCTGGACTGG TACTGAAACT ACCACTGTTA CGTTGACCGA CACACAAGGA
GGCACTGACA CTGTCATTGT TGAAGTTCCT TCTACTCCAA ACAGTCAGAC CACTCTTACT
TCGACCTGGA CAGGAACCGA GACAACTACA GTTACTATTT CCGACACAGT TGGAGGTACT
GACACTGTCA TTGTTGAAGT TCCTTCCAAC CCTCAGACAA CTGTTGTGTC AACTTGGATT
GGCACTGAAA CTACTACTGT AACTGTTACG GATACCGTAG GAGGTACTGA TACAGTTGTC
ATCGTTGTTC CACCAAATCC AACCACTACA GTTACTTCTA CCTGGACCGG AGTAGATACT
ACAACCTTAA CATTGACAGA CACCCAAGGT GGTACTGATA CTGTTGTTGT TGAAGTTCCG
TCTAATGCCC AGACTACTGT TACTTCGACT TGGACTGGTA CTGAAATTAC TACTGTTACT
ATTTCGGACA CAGTTGGTGG TACCGACACT GTAATAGTCG AAGTTCCATC CACTGCAAAC
ATTCAAACAA CTCTTACTAG TACGTGGACT GGTTCTGATA TCACTACTAC AACAGTGACC
GACACGCCAG GAGGAACCGA TACCGTTATT ATTGAAGTTC CAACCACTGC AAACAGTCAA
ACAACTCTCA CATCCACCTG GACCGGAACA GAAACTACTA CTGTTACATT AACGGATACC
TTAGGTGGCA CTGACACTGT CATTGTTGAA GTTCCTTCTA CTCCAAACAG TCAGACAACT
CTTACTTCGA CCTGGACTGG TACTGAAACT ACCACAGTTA CTATTTCCGA CACAGTTGGA
GGTACTGACA CTGTCATTGT TGAAGTCCCT TCAAACGTAG AGACAACTGT TATTTCTACA
TGGATCGGAA CTGTAACTAC TACTGTTACT GTAACTGATA CCGTAGGTGG AACTGATACA
GTTATCGTCG TTGTCCCACC AAATCCAACC ACTACAGTTA CTTCTACCTG GACCGGAGTA
GATACTACAA CCTTAACATT GACAGACACC CAAGGTGGTA CTGACACTGT TGTTGTTGAA
GTTCCATCTA ACGAACAGAC TACTCTCACT TCGACCTGGA CTGGTACCGA AACAACCACT
GTTACGTTAA CCGACACACA AGGAGGCACT GACACCGTAA TTGTTGAAGT TCCATCTAAC
GAACAGACTA CTCTCACTTC GACCTGG
 
Protein sequence
MLFRNYFAAA FCLISGVVAR TITQDTVSRG TISLGLGDTI INDGVYWSII DNVVTAFAGN 
VDVGTGSGLY ITGLNPLLSL QVTLLSGSLT NDGVIAFNAI QSLLAPTYNL VGISFTNNGE
MYLGADGSVG IPSISITTPV WNNNGLLVFY QNTRTTGPVN LGTIGSTINN NGQICFYNEL
YTQTTNIAGT GCITLVEDSS IFFSNTLLNI DTNQVFYLED SASSIRATAI SAPKTYTVAG
FGNGNKIGLD IPLVNIPPLL TGYTYSTTTG ILTLKGAGVL AMNFNIGKGY NPSLFSIVTD
DDVGLASVLF GAVSYSGPPP NPVPSICKQC KTLPPAPGTS PTVTTTTVAT TNTAGFTCSE
VDQILVSTDT NYSWFTSTST ITVSCGTDTV VVEVPSNAQT TVTSTWTGTE ITTVTISDTV
GGTDTVIVEV PSTANIQTTL TSTWTGSDIT TTTVTDTPGG TDTVIIEVPT TANSQTTLTS
TWTGTETTTV TLTDTLGGTD TVIVEVPSTP NSQTTLTSTW TGTETTTVTI SDTTPKVTTL
TSTWTGTETT TVTLTDTQGG TDTVIVEVPS NEQTTLTSTW