Gene PICST_74151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_74151 
SymbolYVH1 
ID4841141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp707291 
End bp708473 
Gene Length1183 bp 
Protein Length326 aa 
Translation table12 
GC content42% 
IMG OID640392456 
Productnitrogen starvation-induced protein phosphatase 
Protein accessionXP_001386543 
Protein GI150866820 
COG category[T] Signal transduction mechanisms 
COG ID[COG2453] Predicted protein-tyrosine phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.373498 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAACCACGAA CAACAATGGT GGTTCGTATT CTTGGAGGAG TATACCTCTC ATCTATAGAG 
CCCATCAACA ATAGCATAGA TTTAAAGACA AAATACAGCA TCTCCCATAT ACTTTCTGTG
GTTCCAGGTC CTTTACCCCA AGAGTATCTT AAAGACTATG AGCACAAGCA AATCGAAGTC
ACCGACGAGG AAACGTCGAA TTTACTAGAA TACTTTGATT CAGCCTACGA TTTCATCGAA
GAAGGTTTGT TTAAAGAGTC GACAGATCCA AAGAAGCACC TGAGATGCGT TCTAGTTCAT
TGTTCACAAG GAGTATCCCG TTCTGTAACT GTAGTTGTAG CATATCTCAT GAAGAAGTAC
AATTTGACTT TGGAACAAGC AATGCATGCC GTCACACGGA AGGTGCCAGA AGCACAGCCC
AACGATGGCT TCATGGAGCA GTTGAAGCTC TACAAGGAAA TGGATTTGAA AGTCGACTCT
TCGAACGACT TGTACAGAGA ATTCGTCATC AACAACCAAC TTAGCTTAGA TCCTACTGGT
GCTACATTGA GAGATATGGA CCTTTTCAAA CCAAAACTGC AGCAGCAGCT TCTGGAAGCA
GATAAAAATT ACGAATTGAG GTGCAAAAGA TGTCGTCAAG TATTGGCCGT TGGTGGTCAG
ATTGAAAACC ACGAGCATCC TGATGCTGAA TCTCGCCAAT CTCAATTCAT CAAGAAAGCT
CCTAACTCTC GTAGAATCAT TTCAGTGCAA GAGGCCAGCT CTAACTGTTC GCACCATTTC
TTGGCTGAAC CCTTGACATG GATGAAAGAA GAACTAGAAA AAGGCGAGTT GGAAGGCAAG
TTTATGTGCC CAAAGTGTAT TGCAAAGGTA GGGGGCTACA GTTGGAGAGG TTCTAGATGT
TCGTGTGGAA AATGGATGAT CCCAGCTATA CATTTACAAT CGGCCAAAGT GGATAGTATC
AAAAACATAG TCTTGCCGAA TCACTCTACA GTATAAATAG ACTTTTAATT AGCTAGCATC
AACCACCATG TCGACGTACG AGTTGTTTTG ATCGTTGTCT TCGTTTTGAG TGACATCTTC
GTTTTCATGT CCGATTTCGT TTTCATTTCC GATTTCGTTC TCAATTTCGT TCTCATCCTG
GTGGTTTTCG TCGATACGAT TTAATGAATA AGCATGAGAA TTC
 
Protein sequence
MVVRILGGVY LSSIEPINNS IDLKTKYSIS HILSVVPGPL PQEYLKDYEH KQIEVTDEET 
SNLLEYFDSA YDFIEEGLFK ESTDPKKHSR CVLVHCSQGV SRSVTVVVAY LMKKYNLTLE
QAMHAVTRKV PEAQPNDGFM EQLKLYKEMD LKVDSSNDLY REFVINNQLS LDPTGATLRD
MDLFKPKSQQ QLSEADKNYE LRCKRCRQVL AVGGQIENHE HPDAESRQSQ FIKKAPNSRR
IISVQEASSN CSHHFLAEPL TWMKEELEKG ELEGKFMCPK CIAKVGGYSW RGSRCSCGKW
MIPAIHLQSA KVDSIKNIVL PNHSTV