Gene PICST_65227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_65227 
SymbolTEP1 
ID4837125 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1499914 
End bp1501118 
Gene Length1205 bp 
Protein Length329 aa 
Translation table12 
GC content40% 
IMG OID640388440 
Productprotein tyrosine phosphatase 
Protein accessionXP_001383060 
Protein GI150864302 
COG category[T] Signal transduction mechanisms 
COG ID[COG2453] Predicted protein-tyrosine phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.702487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCTC TACTTCGTTC GGTGGTGGCC GCACCGAAGC AGTCTCATTA TGATAGTAGA 
CTCAATATGT ATCTAGACTT ATCATATATA AATTCACAAC TTATCGTAAG TTCTGGACCA
GTGACTGGGT ACCTCAAGTC CTTCTATCGG TATCCTGTTC TGGACTTGTT GCTATTCCTC
AATGAGAACC ATTCGAACCA CTGGCACATT TGGAATTTCA GAGGTGAAGA ACCTGGATAC
GAAGATATTG ACGTCTTGAA TAGAGTTAGC CATTTCCCTT TTCCTGACCA CCAGGCACCT
ACTCTAGATA TAATCGTAAA TAGCGTTTAC GATATAGACC AATTTTTGCA AACTTCGCCA
CAAAATGTGG CGGTACTTCA TTGCAAGGCA GGTAAGGGAA GATCAGGATC GATCTGCTGT
GCTTACATTA TGTACGATCT CGCCAGAATG GGGCGGTCGT TCGAAGTTGA TGAGATCATT
GCAGATTTCA CAGAAAGGAG AATGAAACAA TTTGCAGGTG AAGGCATTTC CATTGTTTCT
CAACGTAGAT ATCTTGGTTA TTGGTACAAA TATCTACAGG CAGATGAAAC TCTCAGAAAT
CATTACAAAG AGCTGCGGCA GACAGAGCCG TTCTTAGACA GAGTACGCAT CAGAAATGGA
CCTATTGAGT CACAAGTGTT CAAAATGGCA CCATTCCTTG AAACTTATGA GAAAGTAGAC
CGAGGTGGTA GAAAAAGTCT TGCAATAAAG AAAGTGTTTG GATTCACACC AGAAACGACT
ACTGTCGTAA CTAAAGGTAG AGATATGACT CTTATTCCCA TTAAGAATGC TAGTTTGGGC
TTAGATACAA GAGATCTACG GTTGAATATA AATGGTTGGT GTTACTCTTG GTTTGACATT
CAGTTTGAGA CAAAAGTAGA AAATATATCA AATGGGGTTA TCAAGATGAA GTGGAACGAA
TTGGATGGGT TCAAGGGCAC AAGACAGAGA GGAGTCAAAC TTTTTGACCA GATCGAGATC
TACTGGCGGT TTTCATAGAG TCCTAGAGTA TAGAAGTCAT GCTACATTTT CAAAAGTTTT
CATAGAAGGT ATGCTGAATT TTGTATATCC ATTAATAGCA GTATATAGAT CGGAAGTTAA
TGCCTCTTAT TTAGGTATAA TCTTCATAAG TGCATTAACC ACCACGTTTT TCGTTGTCGA
CTTCC
 
Protein sequence
MKSLLRSVVA APKQSHYDSR LNMYLDLSYI NSQLIVSSGP VTGYLKSFYR YPVSDLLLFL 
NENHSNHWHI WNFRGEEPGY EDIDVLNRVS HFPFPDHQAP TLDIIVNSVY DIDQFLQTSP
QNVAVLHCKA GKGRSGSICC AYIMYDLARM GRSFEVDEII ADFTERRMKQ FAGEGISIVS
QRRYLGYWYK YLQADETLRN HYKESRQTEP FLDRVRIRNG PIESQVFKMA PFLETYEKVD
RETTTVVTKG RDMTLIPIKN ASLGLDTRDL RLNINGWCYS WFDIQFETKV ENISNGVIKM
KWNELDGFKG TRQRGVKLFD QIEIYWRFS