Gene PICST_50059 details


Gene Information

Locus tag: PICST_50059
Symbol: HYR2.1
ID: 4840580
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 26684
End bp: 30589
Gene Length: 3906 bp
Protein Length: 473 aa
Translation table: 12
GC content: 45%
IMG OID: 640391895
Product: hyphally regulated cell wall protein
Protein accession: XP_001386218
Protein GI: 150866574
COG category:
COG ID:
TIGRFAM ID:
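
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon (an assumption, although it agrees with the reported gene length) and that the coding sequence carries one stop codon after the last residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 26684
end_bp = 30589
protein_length_aa = 473

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus a single stop codon.
coding_bp = (protein_length_aa + 1) * 3

# The record marks the gene as spliced; under the assumptions above,
# the remainder of the span would be non-coding (intronic) sequence.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # 3906 1422 2484
```

The first value matches the Gene Length field; the other two hold only under the stated assumptions.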


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTACTTT CAAGTCTTCT AGCAAGAATA CTTGTTCTTG TTTCTGTTGT CTATGCTACT 
AGAGATGTTG TTATTACTGG ATCAAAAGTC GATAAGGGCC CCATTAATTT GTCCATTGGT
GATATAACCA TCGAAGAAAA TTCTTATTGG TCTATCTTTG ATAACTCTGT GTCTGCTTTG
AAGGGTTCAC TCTGGGTAAA AAAGGGCGCC GGTTTCTATA TTACAGGTAC AGACAGGTTA
ATAAACTTAT CCGTTACACT TGCATTTGGT CAAAATTCCA TAAAGAACGA AGGTACTGTT
GTGTTCAACT CCATCTTCTG CAAACTTGCA ACCAAATATG AGCTTATTGG TTCTTCTTTC
TATAACTCAG GTGAAATGTT CTTGGCCACC AATGGAGCAC TTGGAAGTTT AACTGCAATC
ACGTCGAGTT CTTGGACAAA CTCAGGTTTG TTGGTCTTTT ACCAAAACTC AAGAACTAGT
GCAATTGTTT CGCTTGGTAA ACCACTAGGA ACGATTCGTA ACGATGGTCA AATCTGTTTG
CATAACGAAT TGTTCAAGCA AGAAACGAAC ATTGTTGGTA GTGGTTGTAT CACTGCTGAT
AAGAATTCTT CTATTTTCTT ATCGAATTGT TTATTGGAGA TCGATCCAAA GCAATCATTC
TACCTTGCTG ATTCCCAATC CTCCATTAGA GCAAAAGCAA TTTCTAAGTC AAAGACCTAT
ACCGTGTATG GGTTTGGTAA TGGAAACAAG ATTGGTTTGG ATCTTCCTCT TATTTCGCTT
CCAATTCTTG GTAGTTGGGG TTATGATGAG AAGAGAGGTA TTTTGACTCT TAGAGCACTT
GGAAACCTTG CACACAATTT CATAATCGGT ACTGGATACG ATAAAAAGAA GTTTCAAATT
ACTACTGATT CCAGTCTTGG TTTGTTGAGT GTATTTAACG GCGCTTTGAA ATATAATGGC
CCAGTTCCAA ACCTGAGTAT CCCATCTGCA TGTAGACCAT GCAAGCCAGT ACCTTCTGTT
CCAGATGGTC CTCATACAAC ATCCCAGACT ACTGCCAAGA CACCATGTAC AACATCGACA
AAGAAGCACT CTACCAGATC TTCTAAATCG TGTAATAGTC AGATCACAGT TACAAGCACA
TGGACTGAAA TCACCACAAC TACAGTTACT ATCAGTGAAA CTGAAGGAGG TACTGATACT
GTCATCATTG TTGTACCTAC TGAATGTACA ACTAGTACTG GAAACGATCA AGCTTCTTCT
ACTCCAAACC CTCAAACTAC TGTTACCACT ACATGGACCG GTTCTTACAC CACTACCATC
ACCGAGACTG ATACTCAGGG TGGCACTGAT ACTGTCATTG TGGAAGTTCC AACATCTGGA
AACGATCAGG CTTCTTCTAC TCCAAACCCT CAAACTACCG TTACCACTAC ATGGACCGGT
TCTTACACCA CTACCATCAC CGAGACTGAT TCTCAGGGTG GAACTGATAC TGTCATTGTA
GAGGTTCCAT CTACTCCAAA CCCTCAAACC ACCGTTACCA CTACATGGAC CGGTTCTTAC
ACCACTACCA TCACCGAGAC TGATACTCAG GGTGGCACTG ATACTGTCAT TGTGGAAGTT
CCAACATCTG GAAACGATCA GGCTTCTTCT ACTCCAAACC CTCAAACTAC CGTTACCACT
ACATGGACCG GTTCTTACAC CACTACCATC ACCGAGACTG ATTCTCAGGG TGGCACTGAT
ACTGTCATTG TGGAAGTTCC AACATCTGGA AACGATCAGG CTTCTTCTAC TCCAAACCCT
CAAACTACCG TTACCACTAC ATGGACCGGT TCTTACACCA CTACCATCAC CGAGACTGAT
TCTCAGGGTG GAACTGATAC TGTCATTGTA GAGGTTCCAT CTACTCCAAA CCCTCAAACC
ACCGTTACCA CTACATGGAC CGGTTCTTAC ACCACTACCA TCACCGAGAC TGATACTCAG
GGTGGCACTG ATACTGTCAT TGTGGAAGTT CCGTGCACTG AGAAAACTCA AGTTACAGTA
ACTACCACAT GGACAGGCTA CACAACTAAA ACAGTTACTG TAAGTGAAAC TGAGGGATCT
ACCGAAACAG TCATTATTGT GATCCCTTGT TCCAGTTCTG ATGAATCCGC AACTGAAAGT
AACTCTTCGG TCCCTTCAGA CTCCACTTCT GTTGAATTCA CTTCTGAAAT CTCCTCTACT
GAAAGTCCAT CTGGCTCGAA CACTGAAGGT CCATCTACTT CTGAAGGCCC ATCTTCTGAG
GGTCCATCTA CTTCTGAAGG TCCATCAGGT TCAAACACTG AAGGTCCATC TACTTCTGAG
GGCCCATCTT CTGAAGGTCC ATCTACTTCT GAAGGTCCAT CAGGTTCAAA CACTGAAGGT
CCATCTACTT CTGAAGGCCC ATCTTCTGAA GGCCCATCTT CTGAGGGTCC ATCTTCTGAG
GGTCCATCTA CTTCTGAAGG TCCATCTGGT TCAAACACTG AAGGCCCATC TACTTCTGAG
GGTCCATCTG GCTCGAACAC TGAAGGCCCA TCTACTTCTG AGGGCCCATC TTCTGAAGGT
CCATCTACTT CTGAAGGTCC ATCAGGTTCA AACACTGAAG GTCCATCTAC TTCTGAAGGC
CCATCTTCTG AAGGCCCATC TTCTGAGGGT CCATCTTCTG AGGGTCCATC TTCTGAGGGT
CCATCAGGTT CAAACACTGA AGGCCCATCT TCTGAGGGTC CATCTGGCTC GAACACTGAA
GGTCCATCTA CTTCTGAAGG TCCATCTGGC TCGAACACTG AAGGTCCATC TACTTCTGAA
GGTCCATCTG GCTCGAACAC TGAAGGTCCA TCTACTTCTG AAGGTCCATC AGGTTCAAAC
ACTGAAGGTC CATCTACTTC TGAGGGCCCA TCTTCTGAAG GTCCATCTAC TTCTGAAGGT
CCATCAGGTT CAAACACTGA AGGTCCATCT ACTTCTGAAG GCCCATCTTC TGAAGGCCCA
TCTTCTGAGG GTCCATCTTC TGAGGGTCCA TCTTCTGAGG GTCCATCTTC TGAGGGTCCA
TCTTCTGAGG GTCCATCTAC TTCTGAAGGT CCATCTTCTG AGGGTCCATC TACTTCTGAA
GGTCCATCTT CTGAAGGTCC ATCTACTTCT GAAGGTCCAT CAGGTTCAAA CACTAAAGGT
CCATCTACTT CTGAAGGCCC ATCTTCTGAA GGCCCATCTT CTGAGGGTCC ATCTTCTGAG
GGTCCATCTT CTGAGGGTCC ATCTACTTCT GAAGGTCCAT CTGGCTCGAA CACTGAAGGC
CCATCTACTT CTGAAGGCCC ATCTTCTGAG GGTCCATCAA CTTCTGAAGG TCCATCAGGT
TCAAACACTG AAGGCCCATC TTCTGAGGGT CCATCTGGCT CGAACACTGA AGGCCCATCT
TCTGAGGGTC CATCTGGCTC GAACACTGAA GGTCCATCTA CTTCTGAAGG TCCATCTGGC
TCGAACACTG AAGGTCCATC TACTTCTGAA GGTCCATCAG GTTCAAACAC TGAAGGTTCA
TCTTCTGAGG GTCCATCTAC TTCTGAAGGC CCATCTTCTG AAGGTCCATC TACTTCTGAG
GGCCCATCTT CTGAGGGACC ATCTACTTCT GAAACGCCAT CAGGCTCTGA CACAGAAGTT
TCAAGTTCTG TAGAAGCATC GTGTGTTCCA TCTTACATTA CAGTAACAGT CACAACTACT
GCAGAGCCAA TTGACACAGT CACCGTTACT AGTTGTCCAA CTATGAAACC ACCACATTCG
ACAGATACCA TTTATACAGT GACATCAGTT TATATTACCG AAACAACTAC CACCACAGCA
CGTACAACAC TCCCAACTGT TACAGTTACA GAAACGAAGA CTAAAGGGTA CTTTAGGTTC
TTCTAG
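
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; `join_blocks` and `gc_fraction` are hypothetical helper names, and the example input is only the first displayed line of the sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGTTACTTT CAAGTCTTCT AGCAAGAATA CTTGTTCTTG TTTCTGTTGT CTATGCTACT"
seq = join_blocks(first_line)

print(len(seq))                      # number of bases on that line
print(round(gc_fraction(seq), 2))    # GC fraction of that line only
```

Applied to the whole pasted sequence, the same two calls can be compared against the Gene Length and GC content fields.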
 
Protein sequence
MLLSSLLARI LVLVSVVYAT RDVVITGSKV DKGPINLSIG DITIEENSYW SIFDNSVSAL 
KGSLWVKKGA GFYITGTDRL INLSVTLAFG QNSIKNEGTV VFNSIFCKLA TKYELIGSSF
YNSGEMFLAT NGALGSLTAI TSSSWTNSGL LVFYQNSRTS AIVSLGKPLG TIRNDGQICL
HNELFKQETN IVGSGCITAD KNSSIFLSNC LLEIDPKQSF YLADSQSSIR AKAISKSKTY
TVYGFGNGNK IGLDLPLISL PILGSWGYDE KRGILTLRAL GNLAHNFIIG TGYDKKKFQI
TTDSSLGLLS VFNGALKYNG PVPNSSIPSA CRPCKPVPSV PDGPHTTSQT TAKTPCTTST
KKHSTRSSKS CNSQITVTST WTEITTTTVT IISSSVEASC VPSYITVTVT TTAEPIDTVT
VTSCPTMKPP HSTDTIYTVT SVYITETTTT TARTTLPTVT VTETKTKGYF RFF
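
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below builds that codon table by hand and translates the first 30 bases of the gene sequence; `translate` is a hypothetical helper written for illustration, not the method the database used. Because the record marks the gene as spliced, translating the whole gene sequence end to end should not be expected to reproduce the 473-residue protein.

```python
from itertools import product

BASES = "TCAG"

# Amino acids for the 64 codons in TCAG order for translation table 12.
# The only difference from the standard code is CTG -> S (fourth letter of row 2).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTTACTTT CAAGTCTTCT AGCAAGAATA"))  # MLLSSLLARI
```

The output matches the first ten residues of the protein sequence listed above.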