Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_83454 |
Symbol | |
ID | 4838631 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 159885 |
End bp | 161751 |
Gene Length | 1867 bp |
Protein Length | 560 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389946 |
Product | predicted protein |
Protein accession | XP_001383993 |
Protein GI | 150864963 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1736] Diphthamide synthase subunit DPH2 |
TIGRFAM ID | [TIGR00272] diphthamide biosynthesis protein 2 [TIGR00322] diphthamide biosynthesis protein 2-related domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.171187 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACTG AAGCAGGAGT TCCAGTTGCG TTATCGACGT ATCAGGATGA ATCAACTTTT CAATTTGAAC GAGTCAAAGG CTCAGAAATC GTCCGACCCC ATCTTTCACT TGGCAAAAAC CCCAGTCGAG ATGAGCTTGA ACTGAAAATT ACCGAATATT ATTGTTTAGA CGAACTTGTT GAAGTACTCA AGAAGAGTAA AGAAGACAAT ATAAGTAGGG AATACAACAG AATCACACTT CAATTTCCAG ATCTGTTAAT TTGTGACTCT GCCACTATAG TTCATGAGCT TCAGCGTAGA CTTGGAGTGA GTTTAGAAAG CAGCCTGGAT GTAGCAAAAA CAACTGCAAA CGAAAGTAAT AGTGACAGCA ATGGATGTGG AAGTTGTGGA TGTACTGGAC CTGATTGCAA TGAAAAAGTT AATGACGCTG TATCAAGACA AAAGCTCTGG ATATTGGCTG ACACATCCTA TTCTCCATGT TGTATAGATG AGGTTGCAGC TGAACATGTC AATAGTGATC TTGTGGTACA TTTTGGAGAT GCCTGTTTGA ACCCCATAGA CAAATTGCCC GCAGTTTATG TTTTTGGGAA GCCGGTGGTG GATGTTGCCA ATTTAGTGAA TCAATTCAAA ACAAGATATC CTATAGAAGA ATGCCAGCTG CTGAAGATAT TGCTTATGTC CGACTCCCCG CACACATATA TCTTGAAGCA AGTATATGAA CAGTTAGCTG TTGAATATCT GGGCTTATGT TATGCTGATT TAGCTTTGGT TCCATCCACT AAAGCTACAA TAATAGGTTA TAAACCTCAC TCTGTAGTTG ATGCCAAATT CAAAACCATG AACAGAGCAT TGGTAGGATT GGAAAACGTT GAAGACTATG AAAATGATGA ATTTGACATT GATACTATAT TGAGCGAGCA TGAGTTGTTC CATATTTCAA CTCCTGAAGC TCCCAGACTT CTTCAGCTTG TCACCAAGTT TCTGTCAGTT ACATTGTACG ATGCCTTTAC CAAGCAGATC TCACAGGGTC CATACCCCAA CTTGATGAGA AGATACCGGT ATATGCACAT GGCTCGCTCG GCTGGTACTG TAGGGTTATT GGTGAATACT CTCTCTTTGG CCAATACGAA GAAATTGATT AACACGATGG CCAAAAGGAT CAAAGACGCA GGCAAGAAAC ATTACATCTT TGTTGTTGGC AAGCCAAATG TAGCGAAGTT GGCCAATTTT GAGAATGTAG ACATGTGGTG TGTTTTGGGC TGCGATCACC AGGGTATTAT TGTTGACCAA AGCAACGAGT ACTTCAAGCC TATTGTTACA CCTTACGAGC TTCTTCTTGC TCTCAGTGAC GAACTCACCT GGACGGGCAA ATGGATTACC GACTTCAAGC AGGTTTTGAA ACAAGTAGAT GAAGAAGAGG ATGCAGACGA AGAAGAGAAA CACGATGAAG ATGATGACGA AGATGCTCCT CCAGAATTTG ATGCAGTAAC TGGAAGATAC GTCAGTACTT CCAGACCATT GAGACAGCTT CAACACCTTC AGATCTCGTC ACAGGAAGAA GTTAAAAACG ACGTTGAGTC TAAGGCACTA GTCAACAAGC TCTCATCGGC AGTGGCTATC AAGAACACCG TTTCTACCTC TGCACAATAT CTCCAAACTC GTCATTGGAC TGGCTTGGGG AGCGACTACA ATACAGAAGA AGGAGAAATT TCTTCGGCAG GAGCCAACTT GGAAGAAGGA AGAGGAGGAA TTGCTCGTGG CTACGACTAC GATAGAGAGG TTCATAGTTA ATATGTATAG TAACATGTAT ATAGAATATA GAATATAGAG GAATACGAAG AGGTAGCTGT GAATCTACCA GGTTAAT
|
Protein sequence | MATEAGVPVA LSTYQDESTF QFERVKGSEI VRPHLSLGKN PSRDELESKI TEYYCLDELV EVLKKSKEDN ISREYNRITL QFPDSLICDS ATIVHELQRR LGVINDAVSR QKLWILADTS YSPCCIDEVA AEHVNSDLVV HFGDACLNPI DKLPAVYVFG KPVVDVANLV NQFKTRYPIE ECQSSKILLM SDSPHTYILK QVYEQLAVEY SGLCYADLAL VPSTKATIIG YKPHSVVDAK FKTMNRALVG LENVEDYEND EFDIDTILSE HELFHISTPE APRLLQLVTK FSSVTLYDAF TKQISQGPYP NLMRRYRYMH MARSAGTVGL LVNTLSLANT KKLINTMAKR IKDAGKKHYI FVVGKPNVAK LANFENVDMW CVLGCDHQGI IVDQSNEYFK PIVTPYELLL ALSDELTWTG KWITDFKQVL KQVDEEEDAD EEEKHDEDDD EDAPPEFDAV TGRYVSTSRP LRQLQHLQIS SQEEVKNDVE SKALVNKLSS AVAIKNTVST SAQYLQTRHW TGLGSDYNTE EGEISSAGAN LEEGRGGIAR GYDYDREVHS
|
| |