Gene PICST_40365 details

Gene Information

Locus tag: PICST_40365
Symbol: PSP1
ID: 4837096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 558749
End bp: 559669
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 12
GC content: 44%
IMG OID: 640388411
Product: phosphoserine phosphatase activity
Protein accession: XP_001382867
Protein GI: 150864153
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0560] Phosphoserine phosphatase
TIGRFAM ID: [TIGR00338] phosphoserine phosphatase SerB
            [TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.288179
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.883376
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCGAGG AATACGCTCT TACAGTGATA GCACACGGCT GTGAGTTATC TGCAGAAGAT 
TTGACAGCCG TAAAGACGTT GTTGACCGAT GTCTTGAAAG TTGGAAAATT CACCTCTCAG
GATCTATCCA CCAGAGCTGT AGACTTCTAT TTCCAATCTG CCAAACCAGA AGAACTCCAA
CTGGCTGTGA AAAATGAGTT GTTGAACAAT TCTAATCACT ATGATTTGGT TTTCCAGAAA
CAATCCACAA GAAAATCCAA GAAGTTGTTC ATCTTCGATA TGGATTCAAC TCTCATCTAC
CAGGAAGTCA TCGAGTTGAT CGCAGCCTAT GCTAACATTG AAGATAAAGT AGCAGAAATA
ACAGAAAGAG CCATGAATGG CGAACTTGAT TTCAATGCTT CATTGGCTGA AAGAGTGCTG
TTGCTCAAGG GAATCGATGC TACGTCTATC TGGGAAGAGT TGAAACACAA GATCGAAGTA
ACCAATGGAG CCAAAGAACT CTGTCTTGCG TTAAAGAAAC TTAATGTGGT CATGGGTGTC
TGTTCTGGGG GCTTCATTCC TTTGGCTGAA CATGTGAAGC TTCACTTGGG TTTGGACTAT
GCCTATGCCA ATGTTCTTGG AACCAACGAG AAGTTGGAGT TAGACGGTAC TACCACCGGC
CCAATTGTCA ATGGCGAAAT GAAGGCTGAG CTCCTTCTTA AAATCGCCAA GAACCATGGC
ATAGATCCCC AGGATGCGGT CGCTGTAGGT GACGGTGCCA ATGACTTGAA GATGATGTCT
GTTGCTGGCT TTGGTGTAGC CTGGAATGCC AAGCCAAAGG TGCAACAGTT GGCCCCTTCG
TGTTTGAATT CGGACTCATT ATTGGATATC TTGTACATAT TAGGCTATAC TGAAGCTGAG
ATCAAGGAGT TGGTCAACTA G
 
Protein sequence
MSEEYALTVI AHGCELSAED LTAVKTLLTD VLKVGKFTSQ DLSTRAVDFY FQSAKPEELQ 
SAVKNELLNN SNHYDLVFQK QSTRKSKKLF IFDMDSTLIY QEVIELIAAY ANIEDKVAEI
TERAMNGELD FNASLAERVS LLKGIDATSI WEELKHKIEV TNGAKELCLA LKKLNVVMGV
CSGGFIPLAE HVKLHLGLDY AYANVLGTNE KLELDGTTTG PIVNGEMKAE LLLKIAKNHG
IDPQDAVAVG DGANDLKMMS VAGFGVAWNA KPKVQQLAPS CLNSDSLLDI LYILGYTEAE
IKELVN
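
Note on translation: the protein sequence above matches the gene sequence only when the gene is read with translation table 12 (the alternative yeast nuclear code), in which CTG encodes serine rather than leucine. The following is a minimal Python sketch of that check, not the database's own pipeline. The helper names codon_table and translate are illustrative assumptions, and only two short fragments copied from the gene sequence are used rather than the full 921 bp.

```python
# NCBI genetic codes in their usual compact form: the 64 codons are ordered
# with bases T, C, A, G at each of the three positions.
BASES = "TCAG"
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code in one codon: CTG is read as
# serine (S) instead of leucine (L).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(amino_acids):
    """Map each codon (TTT, TTC, ...) to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


def translate(dna, amino_acids=TABLE_12):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    table = codon_table(amino_acids)
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (bases 1-60, codons 1-20).
first_row = "ATGTCCGAGG AATACGCTCT TACAGTGATA GCACACGGCT GTGAGTTATC TGCAGAAGAT"
# Start of the fourth row (bases 181-195, codons 61-65); begins with CTG.
ctg_fragment = "CTGGCTGTGA AAAAT"

print(translate(first_row))               # MSEEYALTVIAHGCELSAED
print(translate(ctg_fragment))            # SAVKN (matches the record)
print(translate(ctg_fragment, STANDARD))  # LAVKN (standard code disagrees)

# Length fields from the record are mutually consistent.
start_bp, end_bp = 558749, 559669
gene_length = end_bp - start_bp + 1       # inclusive coordinates
protein_length = gene_length // 3 - 1     # one codon is the stop codon
print(gene_length, protein_length)        # 921 306
```

Translating with the standard code would give L instead of S at residue 61, so tools that default to table 1 need the table set explicitly for this organism.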