Gene PICST_83454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_83454 
Symbol 
ID4838631 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp159885 
End bp161751 
Gene Length1867 bp 
Protein Length560 aa 
Translation table12 
GC content42% 
IMG OID640389946 
Productpredicted protein 
Protein accessionXP_001383993 
Protein GI150864963 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1736] Diphthamide synthase subunit DPH2 
TIGRFAM ID[TIGR00272] diphthamide biosynthesis protein 2
[TIGR00322] diphthamide biosynthesis protein 2-related domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.171187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACTG AAGCAGGAGT TCCAGTTGCG TTATCGACGT ATCAGGATGA ATCAACTTTT 
CAATTTGAAC GAGTCAAAGG CTCAGAAATC GTCCGACCCC ATCTTTCACT TGGCAAAAAC
CCCAGTCGAG ATGAGCTTGA ACTGAAAATT ACCGAATATT ATTGTTTAGA CGAACTTGTT
GAAGTACTCA AGAAGAGTAA AGAAGACAAT ATAAGTAGGG AATACAACAG AATCACACTT
CAATTTCCAG ATCTGTTAAT TTGTGACTCT GCCACTATAG TTCATGAGCT TCAGCGTAGA
CTTGGAGTGA GTTTAGAAAG CAGCCTGGAT GTAGCAAAAA CAACTGCAAA CGAAAGTAAT
AGTGACAGCA ATGGATGTGG AAGTTGTGGA TGTACTGGAC CTGATTGCAA TGAAAAAGTT
AATGACGCTG TATCAAGACA AAAGCTCTGG ATATTGGCTG ACACATCCTA TTCTCCATGT
TGTATAGATG AGGTTGCAGC TGAACATGTC AATAGTGATC TTGTGGTACA TTTTGGAGAT
GCCTGTTTGA ACCCCATAGA CAAATTGCCC GCAGTTTATG TTTTTGGGAA GCCGGTGGTG
GATGTTGCCA ATTTAGTGAA TCAATTCAAA ACAAGATATC CTATAGAAGA ATGCCAGCTG
CTGAAGATAT TGCTTATGTC CGACTCCCCG CACACATATA TCTTGAAGCA AGTATATGAA
CAGTTAGCTG TTGAATATCT GGGCTTATGT TATGCTGATT TAGCTTTGGT TCCATCCACT
AAAGCTACAA TAATAGGTTA TAAACCTCAC TCTGTAGTTG ATGCCAAATT CAAAACCATG
AACAGAGCAT TGGTAGGATT GGAAAACGTT GAAGACTATG AAAATGATGA ATTTGACATT
GATACTATAT TGAGCGAGCA TGAGTTGTTC CATATTTCAA CTCCTGAAGC TCCCAGACTT
CTTCAGCTTG TCACCAAGTT TCTGTCAGTT ACATTGTACG ATGCCTTTAC CAAGCAGATC
TCACAGGGTC CATACCCCAA CTTGATGAGA AGATACCGGT ATATGCACAT GGCTCGCTCG
GCTGGTACTG TAGGGTTATT GGTGAATACT CTCTCTTTGG CCAATACGAA GAAATTGATT
AACACGATGG CCAAAAGGAT CAAAGACGCA GGCAAGAAAC ATTACATCTT TGTTGTTGGC
AAGCCAAATG TAGCGAAGTT GGCCAATTTT GAGAATGTAG ACATGTGGTG TGTTTTGGGC
TGCGATCACC AGGGTATTAT TGTTGACCAA AGCAACGAGT ACTTCAAGCC TATTGTTACA
CCTTACGAGC TTCTTCTTGC TCTCAGTGAC GAACTCACCT GGACGGGCAA ATGGATTACC
GACTTCAAGC AGGTTTTGAA ACAAGTAGAT GAAGAAGAGG ATGCAGACGA AGAAGAGAAA
CACGATGAAG ATGATGACGA AGATGCTCCT CCAGAATTTG ATGCAGTAAC TGGAAGATAC
GTCAGTACTT CCAGACCATT GAGACAGCTT CAACACCTTC AGATCTCGTC ACAGGAAGAA
GTTAAAAACG ACGTTGAGTC TAAGGCACTA GTCAACAAGC TCTCATCGGC AGTGGCTATC
AAGAACACCG TTTCTACCTC TGCACAATAT CTCCAAACTC GTCATTGGAC TGGCTTGGGG
AGCGACTACA ATACAGAAGA AGGAGAAATT TCTTCGGCAG GAGCCAACTT GGAAGAAGGA
AGAGGAGGAA TTGCTCGTGG CTACGACTAC GATAGAGAGG TTCATAGTTA ATATGTATAG
TAACATGTAT ATAGAATATA GAATATAGAG GAATACGAAG AGGTAGCTGT GAATCTACCA
GGTTAAT
 
Protein sequence
MATEAGVPVA LSTYQDESTF QFERVKGSEI VRPHLSLGKN PSRDELESKI TEYYCLDELV 
EVLKKSKEDN ISREYNRITL QFPDSLICDS ATIVHELQRR LGVINDAVSR QKLWILADTS
YSPCCIDEVA AEHVNSDLVV HFGDACLNPI DKLPAVYVFG KPVVDVANLV NQFKTRYPIE
ECQSSKILLM SDSPHTYILK QVYEQLAVEY SGLCYADLAL VPSTKATIIG YKPHSVVDAK
FKTMNRALVG LENVEDYEND EFDIDTILSE HELFHISTPE APRLLQLVTK FSSVTLYDAF
TKQISQGPYP NLMRRYRYMH MARSAGTVGL LVNTLSLANT KKLINTMAKR IKDAGKKHYI
FVVGKPNVAK LANFENVDMW CVLGCDHQGI IVDQSNEYFK PIVTPYELLL ALSDELTWTG
KWITDFKQVL KQVDEEEDAD EEEKHDEDDD EDAPPEFDAV TGRYVSTSRP LRQLQHLQIS
SQEEVKNDVE SKALVNKLSS AVAIKNTVST SAQYLQTRHW TGLGSDYNTE EGEISSAGAN
LEEGRGGIAR GYDYDREVHS