Gene PICST_36571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36571 
Symbol 
ID4840129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp328853 
End bp330814 
Gene Length1962 bp 
Protein Length653 aa 
Translation table12 
GC content42% 
IMG OID640391444 
Productpredicted protein 
Protein accessionXP_001385405 
Protein GI150865973 
COG category[T] Signal transduction mechanisms 
COG ID[COG2365] Protein tyrosine/serine phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.136589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0275107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATACG ATCCAATCCC AGTACCCAAA GAAATCCAGG TCACCATTGG AAAAGGTATC 
TCAGGTACTA TAGCCATACC CCACTCTGCC GAAGCTGAAA ACCCATATGA AGATGGGTAC
GCACCAGCTA CCCACAAGGC TGCTTTGATT CTCCATGGTC AGGGAGGTCA CAGAGACTAC
TGCTACCAAA AACGTCTTGC TCACAAGCTT GCAGCCGATC TCGGAATCTA CTCGCTTCGT
ATTGATTTCC GTGGCTGTGG ATCCTCGGCT GAAAATGAAG ATGCTCAGAA AGGAAGAGTC
CTTGCACAAG ATGTGGATGA CATTCAAGCT TGTGCTGAGT TCCTTAGAGA TGGAAAGCTC
AACCCTCTAG GCATGTCATT CACGTTGCTG TCGATTATCG GCCATTCGCG TGGTTCTGTA
GCCATGTTCT TGTGGGCCAT GCTCCAAGAT GAGTATCTGA AGCTTGGTGA TCCAAATGCT
ATCATCGTTC CAAATTTGAT TAACTGTTCA GGAAGATTTT CTCTGCCTAC TGTAGCTGAC
AGATACCCTC TTCATGACGA GTTCTTTAAA GAGGTACCCA TGATGTGTCT CAGACATGGC
CAGATGTCTG AGATCTTGAT CCCCAAAAGT GAGTTGGTGT CTCTATCCAA GCCCGATCTC
TCCAAGTTAC ACGGCTTGAC TACAGAATGG TCTGTCTTGA GTATTTATGG ACTTGAGGAC
GAGATTATAC CCATTAATGA TAGTTCCTTA TATGCCAATG CCTTGAACAG AGGTTATTTC
TCCCATAGAT TGGAATTAAT TCCCAAGGCT GACCACAATT TCTATGGAGT TGAACCAATT
GAACACGATG ACCACAACAT TGAACAAAAT CCAGAAAACT TACCACTTAA CAAAAAGCAG
GTTGTCAACT ACAACTTTAA GGTGATCGAT ATTATAGCCA ACTTCTTGAG TCCTGAAAAT
GAACTCCAAC GTTTCTTGCA CACGTCCTTG GAGATTGGAA GATTATCGAG ATGGAAAAAC
GTCGAAGGGG TGAGTAATTT TAGAGATATT GGTGGTTGGA AGATTCATAA TCCCACTTTC
CCCTTAAATT CAAGCTCAAG TTTCCCAGAA AAAAGCGCCT TGCAGTACTA TGTCAAGCCT
CATACCGCTT TCCGTTGTGC TAATATTTCT GGCATCAAAC CAGCAGGTTT GAAAACTCTC
CAAGAATTGG GGGTGAAGGC TGTGTTCGAT CTTCGTTCTG ATGGTGAAGT TGAGCAAGAT
GGAGTACCAC AAAACTTAGA GCAGTATGGA ATCAAAAGGA TACATGCACC AGTCTTCTCC
AAGGATGATT ACTCTCCTCA CGCAATTGCT ATTAGATATA CCAACTTAAT GACCAGTTGG
AACACTTATG TCCATGTTTA TGAGAATATG TTGGAATTTG GTATTGGTGC TTACAGAACT
ATTTTCGAGT ACATCCTCAA GGAAAACAAA CCTTTCGTGT TCCACTGCAC CGCTGGTAAG
GACAGAACTG GTATCTTAGG AATGTTGATA TTATTGTTAC TTGGTGTTGA TAAAAATACA
ATTGCCAAGG AATACGAGTT GACGACCATT GGCTTAAGAC CAGACCATCC TCAATTAAGG
GAAAAGTTTG TGGAAACGAC CAGAAAGTTG AGAGAGAAAT TGGGCGATAA TAGTGATGTC
GAACTCTTGA TTTCTCAAGG TAGAAAGAAT TGGACCATCG AAGAAGATGG ATTCAACAAC
TTGATCAGTT CCAGATACGA AGCTATGTTG GCCACAATTG AAATGTTCCA TGATACCTAT
GGTAACATTG TCAAATATAT GAAGACCGAA TTGGGCTTCA CAGACAGTGA AATCAAGAGA
ATCTACGAAA ACTTAATTAT TATTGATCCT CAAAGTCGTG GATTCGAAGT TTCGGGAGCT
CTCAACTGGG ACCACAGGAA CCTGGGAAGA GTCAAGTTGT AA
 
Protein sequence
MSYDPIPVPK EIQVTIGKGI SGTIAIPHSA EAENPYEDGY APATHKAALI LHGQGGHRDY 
CYQKRLAHKL AADLGIYSLR IDFRGCGSSA ENEDAQKGRV LAQDVDDIQA CAEFLRDGKL
NPLGMSFTLS SIIGHSRGSV AMFLWAMLQD EYSKLGDPNA IIVPNLINCS GRFSSPTVAD
RYPLHDEFFK EVPMMCLRHG QMSEILIPKS ELVSLSKPDL SKLHGLTTEW SVLSIYGLED
EIIPINDSSL YANALNRGYF SHRLELIPKA DHNFYGVEPI EHDDHNIEQN PENLPLNKKQ
VVNYNFKVID IIANFLSPEN ELQRFLHTSL EIGRLSRWKN VEGVSNFRDI GGWKIHNPTF
PLNSSSSFPE KSALQYYVKP HTAFRCANIS GIKPAGLKTL QELGVKAVFD LRSDGEVEQD
GVPQNLEQYG IKRIHAPVFS KDDYSPHAIA IRYTNLMTSW NTYVHVYENM LEFGIGAYRT
IFEYILKENK PFVFHCTAGK DRTGILGMLI LLLLGVDKNT IAKEYELTTI GLRPDHPQLR
EKFVETTRKL REKLGDNSDV ELLISQGRKN WTIEEDGFNN LISSRYEAML ATIEMFHDTY
GNIVKYMKTE LGFTDSEIKR IYENLIIIDP QSRGFEVSGA LNWDHRNSGR VKL