Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_36571 |
Symbol | |
ID | 4840129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 328853 |
End bp | 330814 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391444 |
Product | predicted protein |
Protein accession | XP_001385405 |
Protein GI | 150865973 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2365] Protein tyrosine/serine phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.136589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0275107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATACG ATCCAATCCC AGTACCCAAA GAAATCCAGG TCACCATTGG AAAAGGTATC TCAGGTACTA TAGCCATACC CCACTCTGCC GAAGCTGAAA ACCCATATGA AGATGGGTAC GCACCAGCTA CCCACAAGGC TGCTTTGATT CTCCATGGTC AGGGAGGTCA CAGAGACTAC TGCTACCAAA AACGTCTTGC TCACAAGCTT GCAGCCGATC TCGGAATCTA CTCGCTTCGT ATTGATTTCC GTGGCTGTGG ATCCTCGGCT GAAAATGAAG ATGCTCAGAA AGGAAGAGTC CTTGCACAAG ATGTGGATGA CATTCAAGCT TGTGCTGAGT TCCTTAGAGA TGGAAAGCTC AACCCTCTAG GCATGTCATT CACGTTGCTG TCGATTATCG GCCATTCGCG TGGTTCTGTA GCCATGTTCT TGTGGGCCAT GCTCCAAGAT GAGTATCTGA AGCTTGGTGA TCCAAATGCT ATCATCGTTC CAAATTTGAT TAACTGTTCA GGAAGATTTT CTCTGCCTAC TGTAGCTGAC AGATACCCTC TTCATGACGA GTTCTTTAAA GAGGTACCCA TGATGTGTCT CAGACATGGC CAGATGTCTG AGATCTTGAT CCCCAAAAGT GAGTTGGTGT CTCTATCCAA GCCCGATCTC TCCAAGTTAC ACGGCTTGAC TACAGAATGG TCTGTCTTGA GTATTTATGG ACTTGAGGAC GAGATTATAC CCATTAATGA TAGTTCCTTA TATGCCAATG CCTTGAACAG AGGTTATTTC TCCCATAGAT TGGAATTAAT TCCCAAGGCT GACCACAATT TCTATGGAGT TGAACCAATT GAACACGATG ACCACAACAT TGAACAAAAT CCAGAAAACT TACCACTTAA CAAAAAGCAG GTTGTCAACT ACAACTTTAA GGTGATCGAT ATTATAGCCA ACTTCTTGAG TCCTGAAAAT GAACTCCAAC GTTTCTTGCA CACGTCCTTG GAGATTGGAA GATTATCGAG ATGGAAAAAC GTCGAAGGGG TGAGTAATTT TAGAGATATT GGTGGTTGGA AGATTCATAA TCCCACTTTC CCCTTAAATT CAAGCTCAAG TTTCCCAGAA AAAAGCGCCT TGCAGTACTA TGTCAAGCCT CATACCGCTT TCCGTTGTGC TAATATTTCT GGCATCAAAC CAGCAGGTTT GAAAACTCTC CAAGAATTGG GGGTGAAGGC TGTGTTCGAT CTTCGTTCTG ATGGTGAAGT TGAGCAAGAT GGAGTACCAC AAAACTTAGA GCAGTATGGA ATCAAAAGGA TACATGCACC AGTCTTCTCC AAGGATGATT ACTCTCCTCA CGCAATTGCT ATTAGATATA CCAACTTAAT GACCAGTTGG AACACTTATG TCCATGTTTA TGAGAATATG TTGGAATTTG GTATTGGTGC TTACAGAACT ATTTTCGAGT ACATCCTCAA GGAAAACAAA CCTTTCGTGT TCCACTGCAC CGCTGGTAAG GACAGAACTG GTATCTTAGG AATGTTGATA TTATTGTTAC TTGGTGTTGA TAAAAATACA ATTGCCAAGG AATACGAGTT GACGACCATT GGCTTAAGAC CAGACCATCC TCAATTAAGG GAAAAGTTTG TGGAAACGAC CAGAAAGTTG AGAGAGAAAT TGGGCGATAA TAGTGATGTC GAACTCTTGA TTTCTCAAGG TAGAAAGAAT TGGACCATCG AAGAAGATGG ATTCAACAAC TTGATCAGTT CCAGATACGA AGCTATGTTG GCCACAATTG AAATGTTCCA TGATACCTAT GGTAACATTG TCAAATATAT GAAGACCGAA TTGGGCTTCA CAGACAGTGA AATCAAGAGA ATCTACGAAA ACTTAATTAT TATTGATCCT CAAAGTCGTG GATTCGAAGT TTCGGGAGCT CTCAACTGGG ACCACAGGAA CCTGGGAAGA GTCAAGTTGT AA
|
Protein sequence | MSYDPIPVPK EIQVTIGKGI SGTIAIPHSA EAENPYEDGY APATHKAALI LHGQGGHRDY CYQKRLAHKL AADLGIYSLR IDFRGCGSSA ENEDAQKGRV LAQDVDDIQA CAEFLRDGKL NPLGMSFTLS SIIGHSRGSV AMFLWAMLQD EYSKLGDPNA IIVPNLINCS GRFSSPTVAD RYPLHDEFFK EVPMMCLRHG QMSEILIPKS ELVSLSKPDL SKLHGLTTEW SVLSIYGLED EIIPINDSSL YANALNRGYF SHRLELIPKA DHNFYGVEPI EHDDHNIEQN PENLPLNKKQ VVNYNFKVID IIANFLSPEN ELQRFLHTSL EIGRLSRWKN VEGVSNFRDI GGWKIHNPTF PLNSSSSFPE KSALQYYVKP HTAFRCANIS GIKPAGLKTL QELGVKAVFD LRSDGEVEQD GVPQNLEQYG IKRIHAPVFS KDDYSPHAIA IRYTNLMTSW NTYVHVYENM LEFGIGAYRT IFEYILKENK PFVFHCTAGK DRTGILGMLI LLLLGVDKNT IAKEYELTTI GLRPDHPQLR EKFVETTRKL REKLGDNSDV ELLISQGRKN WTIEEDGFNN LISSRYEAML ATIEMFHDTY GNIVKYMKTE LGFTDSEIKR IYENLIIIDP QSRGFEVSGA LNWDHRNSGR VKL
|
| |