Gene Information |
Locus tag | PICST_33238 |
Symbol | |
ID | 4840457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 94377 |
End bp | 95543 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391772 |
Product | conserved hypothetical protein |
Protein accession | XP_001386231 |
Protein GI | 150866582 |
COG category | [R] General function prediction only |
COG ID | [COG3568] Metal-dependent hydrolase |
TIGRFAM ID | |
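The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS whose length includes the stop codon; the function name `feature_lengths` is hypothetical and not part of the source database.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the CDS
    includes its terminal stop codon (assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the "Start bp" and "End bp" fields of this record.
print(feature_lengths(94377, 95543))  # (1167, 388)
```

The result agrees with the "Gene Length" (1167 bp) and "Protein Length" (388 aa) fields.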
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGT CCGCTGAAGA AGGCACACAC CTTAGTGAGA GTACACCATT GTTGACTAGC GGCTCGGGTC TGGAGGAAAC TCGAACTAAT CCTCGTAGTT CTGGTAGATC CTCTCGTTTC AGACTATTGA TTATCCCCAC GTTGATCATT ATTGTCTTGG TCTACGCTAC CAATTGGTAC ATTACTAGCC ACAGCAGTGT TAACCAAGCC TTGCCATTTT TGGGAAAGCC TTTGAAACTT CGCGTCTACA CCAACAACAT TAGACTTGAT AATCGTTACC CAGTCAAGGG CGAGCAGCCA TGGTCCAAGC GTAAGAAGCA AGTCATCAAC TCCATTGACT TCAACACAGC TTTGGGGCAT GCCAACGTGG TATGCTTACA GGAAGTATTG CACAACCAAT TGGTTGATAT CCTTGAGGGG TTGAATAAGA ACGCTGAGCA GATCTGGACC TATTATGGAG TGGGTCGCAA CGATGGTTTA GAAGCTGGCG AATATGCTCC TATATTGTAC AAGAACTCTG ATTGGATTTT GCTCGATAAC CAGACGTTCT GGCTTAGTGA AACTCCTTGG AAGCCAAGTA AGGGATGGGA TGCAGCCCTT GAGAGAATTG TCACTATGGT CACATTGGAA TCTAGGATTA ATCCTTTGAT CAAGGTGAAT GTGTTCAATA CACACTTTGA CCATCGGGGT GTATTGGCTA GGAAGAAGTC GGCGGAGTTG ATTGTTGACA AGATGGAAAA CTTTAACGAT AACCCATCGT TTCTTTGCGG TGACTTTAAT ACCCAGCCCA AGGATCAACC TTACCATGTT TTATCTGATG CTGGATTCAA AGATAGTAGA AAGTTGGTTG ACTATGATTA CTCATATGGC CATAGTACGA CGTTCACCGG CTTTAATAAG GAGAAGGAGG ACTCTTCTAT TATTGATTAC ATCTGGTCAC CATACTTTTC CCAAGGAAAT TTCGGAAACG ATACCTCGCC GGTTAAAGAT TATGAAGATG AAGTAGCCAA TGAGATGAAC AACTACTACA ACCTTGAACA CCATTTGGTT TATGATATAG TAATCAAGCA ATTTGGGATC TTGCACAATT ACTTCAAAGG TTTTTACTTC TCTGACCACA GACCTGTCGT CGCCAGCTAT GAGATAACTA GAACACATCT TCTTTAA
|
Protein sequence | MSTSAEEGTH LSESTPLLTS GSGSEETRTN PRSSGRSSRF RLLIIPTLII IVLVYATNWY ITSHSSVNQA LPFLGKPLKL RVYTNNIRLD NRYPVKGEQP WSKRKKQVIN SIDFNTALGH ANVVCLQEVL HNQLVDILEG LNKNAEQIWT YYGVGRNDGL EAGEYAPILY KNSDWILLDN QTFWLSETPW KPSKGWDAAL ERIVTMVTLE SRINPLIKVN VFNTHFDHRG VLARKKSAEL IVDKMENFND NPSFLCGDFN TQPKDQPYHV LSDAGFKDSR KLVDYDYSYG HSTTFTGFNK EKEDSSIIDY IWSPYFSQGN FGNDTSPVKD YEDEVANEMN NYYNLEHHLV YDIVIKQFGI LHNYFKGFYF SDHRPVVASY EITRTHLL
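The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this on the first 90 nucleotides of the gene sequence above. It is a simplified, self-contained translator written for illustration: the names `codon_table` and `translate` are hypothetical, and table 12 is modeled here only as the standard code with CTG reassigned to serine.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine=True):
    """Build a codon -> amino acid map; optionally reassign CTG to Ser."""
    table = {
        a + b + c: STANDARD[16 * i + 4 * j + k]
        for i, a in enumerate(BASES)
        for j, b in enumerate(BASES)
        for k, c in enumerate(BASES)
    }
    if ctg_as_serine:
        table["CTG"] = "S"  # the reassignment that distinguishes table 12
    return table


def translate(dna, ctg_as_serine=True):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-nt block spacing
    table = codon_table(ctg_as_serine)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = table[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 90 nt of the gene sequence in this record.
prefix = (
    "ATGTCCACGT CCGCTGAAGA AGGCACACAC CTTAGTGAGA GTACACCATT "
    "GTTGACTAGC GGCTCGGGTC TGGAGGAAAC TCGAACTAAT"
)
print(translate(prefix))                       # MSTSAEEGTHLSESTPLLTSGSGSEETRTN
print(translate(prefix, ctg_as_serine=False))  # MSTSAEEGTHLSESTPLLTSGSGLEETRTN
```

With CTG read as serine, the output matches the first 30 residues of the listed protein sequence. Under the standard code, residue 24 would be leucine instead.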