Gene PICST_50278 details

Gene Information

Locus tag: PICST_50278
Symbol: HYR5.4
ID: 4841023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 962252
End bp: 963448
Gene Length: 1197 bp
Protein Length: 399 aa
Translation table: 12
GC content: 42%
IMG OID: 640392338
Product: hyphally regulated cell wall protein
Protein accession: XP_001386767
Protein GI: 150866981
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGTTCA GCAAGTACTT GACTGCATGT GTTTCTCTAT TCTCTATGGC GATGGCTGTC 
ACCATCAGCC AAAATACAAT CAACAGAGGT TTTCTTAACT TGGATATTGG TGATATTACA
ATTGAAGATG GTGTCTATTG GTCTATTATC GACAATGCCA TTAGTGCAAT GGCCGGTTCC
CTTAAAGTCG GTCAGGGTTC TGGTTTTTAC ATTACTAGTA CTTCTAGTTT GATTGCATCC
ACAGTCACTC TTCTTTCTCC ATTGGGTAGT CTTGAAAATA ATGGTATTAT TTCGTTCAAT
GCGTTACAAT CATTAACTGC GCCTGTTTAC AATCTTGTCG GTCTCTCCTT CACCAACAAT
GGTGAAATGT ACTTAGGAGG TGATGGTTCT GTTCTTGTTC CAGTCCAGCT GATTACATCT
GCCACCTGGG ACAACAATGG TTTATTGGTT TTCTACCAGA ACACAAGATC TACTGGTCCA
GTTCTTCTTG GTGCTCCATT GGGAACAATA AACAACAATG GTCAAATCTG TTTGTACAAT
GAACTTTATA CTCAGACTAA CAATATTGTC GGTACAGGTT GTATCACTGC CAATACAGAC
TCCAGCATCT TCTTTTCCAA TACCCTCTTG AATATCGATC CAAACCAAAC AATCTATCTT
GCTGATCTGG CATCTTCACT TAGAGCTACT GCCATCAGTG TTCCAAAGAC TTTCACCGTC
GCTGGATTCG GAAACGGTAA CACAATTGGT TTAGATATTC CACTTGTCAA TCTTCCTCCT
TTACTTGATG CCTGGGATTA TGATGCTTCA GCTGGCATAT TGACCCTTCG TGGTGCTGGT
TTGTTATCGC AAAATTTCCT TATCGGTACT GGTTACGACA ACGGTCTCTT CTCTATAACT
ACTGACGATG ATCTTGGATT ACTTTCTGTT CCTTTTGGAG CAGTAACTTA TTCGGGACCA
GTTCCAAACC CTGGTATCCC AGCTGCATGT CAACCTTGTA AGACACTTCC CGACTCACCA
GGAACAAGTG CAACTACTTC TACCACTACA ATTGTATCCA CCAACTCTGA CGGATCAATA
TGTACTGAAG TCGACGGTAT TGTCATTGCC ACTGATAGCC AAGGTTCTTG GTTCACTACC
ACCACCACTA TCTCTGAAAC GTGTGCCAGC AATCCAACTA CTACTATCAC CTCTACC
 
Protein sequence
MLFSKYLTAC VSLFSMAMAV TISQNTINRG FLNLDIGDIT IEDGVYWSII DNAISAMAGS 
LKVGQGSGFY ITSTSSLIAS TVTLLSPLGS LENNGIISFN ALQSLTAPVY NLVGLSFTNN
GEMYLGGDGS VLVPVQSITS ATWDNNGLLV FYQNTRSTGP VLLGAPLGTI NNNGQICLYN
ELYTQTNNIV GTGCITANTD SSIFFSNTLL NIDPNQTIYL ADSASSLRAT AISVPKTFTV
AGFGNGNTIG LDIPLVNLPP LLDAWDYDAS AGILTLRGAG LLSQNFLIGT GYDNGLFSIT
TDDDLGLLSV PFGAVTYSGP VPNPGIPAAC QPCKTLPDSP GTSATTSTTT IVSTNSDGSI
CTEVDGIVIA TDSQGSWFTT TTTISETCAS NPTTTITST
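
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below is illustrative only: it checks the listed lengths against the listed coordinates and translates a 30-nt in-frame fragment of the gene sequence above under both codes. The names `TABLE_1`, `TABLE_12`, and `translate` are assumptions made for this example and are not part of the database; only the CTG reassignment is modeled, not any differences in start codons.

```python
from itertools import product

# Codon order follows the usual compact layout: bases T, C, A, G,
# first base varying slowest and third base varying fastest.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

TABLE_1 = dict(zip(CODONS, STANDARD_AAS))   # standard code
TABLE_12 = dict(TABLE_1, CTG="S")           # table 12: CTG -> Ser

def translate(dna, table):
    """Translate complete codons of dna; spaces and newlines are ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# Lengths implied by the coordinates in the record (inclusive range).
gene_length = 963448 - 962252 + 1     # 1197
protein_length = gene_length // 3     # 399

# In-frame fragment copied from the gene sequence (nucleotides 391-420).
fragment = "GTTCTTGTTC CAGTCCAGCT GATTACATCT"

print(gene_length, protein_length)
print(translate(fragment, TABLE_12))  # VLVPVQSITS, as in the protein sequence
print(translate(fragment, TABLE_1))   # VLVPVQLITS under the standard code
```

Translating with the standard code would put leucine where the listed protein has serine, so the table number matters whenever this sequence is re-translated.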