Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_71233 |
Symbol | |
ID | 4837845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 430019 |
End bp | 431143 |
Gene Length | 1125 bp |
Protein Length | 344 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389160 |
Product | predicted protein |
Protein accession | XP_001383360 |
Protein GI | 150864516 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG0500] SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.532537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAAACTTACT AAGCGGGTAG AAATGACAGT AGTAACACCT GTGGAGTCAG ACAGTGAAAA TTTGGCCTTC TCTGAGTTGA AAATCGAAGA CCTGAAGCTG CAAATCCAGG CTGAGGCTCC AGAAAAAGAA TCCAAACCAC CTTTAGAGTC TAGAATAGGC AAGGATTCTC CCTTCACCTT CGGACAAAGA TACTTGAAAA GTGACGAGGA TGTTTTCAAC CATAATGCAT GGGACCATGT GGAATGGGGT GAAGAACAGA TCGAAGAGGC CAAGTCTATG ATAGCTAAAC AGTACGATCA TCCTGTAAAG GACTTTGACA AGAAGCTCTA CAATTCCAAC CCAGCCAAGT ATTGGGACAT TTTCTACAGA CATAACAGAG AGAACTTTTT CAAAGACAGA AAGTGGCTTC AAATCGAGTT CCCATCTTTG TATCAGGTTA CGGCTGAAGA CTACCAGGAA AAATGTACAA TTTTGGAAAT CGGATGCGGT GCTGGAAATA CATTTTTTCC AGTATTGAGT CAGAACAAGA ACGAAAACTT GAAGATTGTG GGCTGTGACT ATTCGAAAGT GGCCGTAGAT TTGGTTCGCT CTAATGAACA GTTTGCTCCT AACCATGAGA AGGGTGTAGC ATTCTCGTCA GTTTGGGATT TGGCTAATCC TGAAGGACAG CTTCCTGAAG ATGTAGAAGA AAACTCGGTG GACATAGTCA TTATGGTTTT TGTGTTTCTG GCGCTTTCAC CTGACCAATG GAAGCAGGCT GTCTCCAACT TGGCCAAGAT TTTGAAGCCC GGTGGAGAGA TTCTCTTCAG AGACTATGGC AGATACGACT TGGCCCAAGT CAGATTCAAG AAGGGAAGAC TCTTGGACGA TAACTTCTAT ATTAGAGGAG ATGGTACTAG AGTGTATTTC TTTACGGAAG AGGAGTTGAG ACAGATATTT TGCATAGACG GTCCTTTCAC CGAAGAGAGA ATTGCCACCG ACAGAAGATT GTTGGTGAAT AGAAAGAAAC AGTTGAAGAT GTACCGTAAC TGGTTGCAGG CTGTGTTCAG AGGATAACGG TAATTGTAAA TTAGAGCTAA GGAAATTAGA ATTATTGTAA ACTAGAACTA TTAGAATGAA CTTTT
|
Protein sequence | MTVVTPVESD SENLAFSELK IEDSKSQIQA EAPEKESKPP LESRIGKDSP FTFGQRYLKS DEDVFNHNAW DHVEWGEEQI EEAKSMIAKQ YDHPVKDFDK KLYNSNPAKY WDIFYRHNRE NFFKDRKWLQ IEFPSLYQVT AEDYQEKCTI LEIGCGAGNT FFPVLSQNKN ENLKIVGCDY SKVAVDLVRS NEQFAPNHEK GVAFSSVWDL ANPEGQLPED VEENSVDIVI MVFVFSALSP DQWKQAVSNL AKILKPGGEI LFRDYGRYDL AQVRFKKGRL LDDNFYIRGD GTRVYFFTEE ELRQIFCIDG PFTEERIATD RRLLVNRKKQ LKMYRNWLQA VFRG
|
| |