Gene PICST_35812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_35812 
Symbol 
ID4838817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp391143 
End bp392408 
Gene Length1266 bp 
Protein Length421 aa 
Translation table12 
GC content43% 
IMG OID640390132 
Productpredicted protein 
Protein accessionXP_001384371 
Protein GI126135694 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1736] Diphthamide synthase subunit DPH2 
TIGRFAM ID[TIGR00322] diphthamide biosynthesis protein 2-related domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0979008 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGAAG CTATTGAAAC CAGCAAGAAA GTACCGAGAA GACGTTTTGT GGGAAAGAAA 
CCTGGAGCAC CTGATGAATC TTCTGAAAAT GGATCTGCTT TAGTTTCTTC TACCAAGAAC
AAACATCTAG GGAGAGTCAT GAACCAGATC CCTGACGATA TTTTGAACGA CAAGGAGTTG
AATGAAGCGA TAAAGCTTTT GCCATCGAAT TACAACTTTG AAATTCAGAA GACTGTATGG
AATATCAAAA AACATGGCGC CAAAAGAGTT GCTCTCCAGA TGCCCGAAGG CTTGCTAATC
TACTCATTGA TCATTTCTGA TATCTTGGAG CAGTTCTGTG AAGTCGAAAC TGTAGTGATG
GGAGATGTAT CATATGGTGC GTGTTGTATC GACGACTATA CGGCTAGATC ACTTGATTGT
GACTTCATTG TTCATTATGC CCATTCATGT TTGGTTCCTA TAGACATCAC TGCCATCAAG
GTGTTGTACG TTTTTGTCAC CATCAACATC GATGAATCCC ATTTGATCAA CACGATCAAG
TTAAACTTCG ACCAAGGCTC GCAATTAGCT GTGTTTGGTA CTATTCAGTT CAACCCCACT
ATTCACAGCG CGAAAGCCAA ACTAGAGAGC GATTCAGAGA AACCTATATA CTTAATACCT
CCCCAGACTA GACCTCTTTC CAAGGGTGAA GTATTGGGCT GCACATCGGC CAGACTCAAC
AAGGATCATA TAAGCGGAAT GATATACATC GGAGATGGCC GTTTCCACTT GGAAAGTTCC
ATGATCCACA ACCCAGAAAT CCCAGCATAT AGATACGACC CTTACTCAAG AAAGTTCACC
AGAGAGTATT ACGACCAGAA GGAAATGATC CAGGTAAGAG ACGATGCCAT AAAGACAGCA
TCAAAAGCAA AGAAGATAGG CCTCATACTC GGAGCTCTTG GAAGGCAAGG AAACCCCGTA
ACTCTAGACA AGCTTGAGCA ATCGTTATCT TCTCGAGGCA TTCAGGTAGT CAAGATCATC
TTGAGTGAAA TTTTTCCTCA GAAGCTTTCG ATGTTTGATG ATGTAGACGC CTTTGTGCAA
GTGGCCTGCC CCAGATTATC TATAGACTGG GGTTACGCCT TCAACAAGCC GCTACTAACT
CCTTACGAGG CTATGGTGAT GCTAGAACAA GATACTAAAT GGAGTGAAAA GTATTATCCT
ATGGATTACT ATGCTAAGGA CGGCTATGGA CGAGGAAAAA TCCCCGACCA TATCAACGTT
ATATAA
 
Protein sequence
MVEAIETSKK VPRRRFVGKK PGAPDESSEN GSALVSSTKN KHLGRVMNQI PDDILNDKEL 
NEAIKLLPSN YNFEIQKTVW NIKKHGAKRV ALQMPEGLLI YSLIISDILE QFCEVETVVM
GDVSYGACCI DDYTARSLDC DFIVHYAHSC LVPIDITAIK VLYVFVTINI DESHLINTIK
LNFDQGSQLA VFGTIQFNPT IHSAKAKLES DSEKPIYLIP PQTRPLSKGE VLGCTSARLN
KDHISGMIYI GDGRFHLESS MIHNPEIPAY RYDPYSRKFT REYYDQKEMI QVRDDAIKTA
SKAKKIGLIL GALGRQGNPV TLDKLEQSLS SRGIQVVKII LSEIFPQKLS MFDDVDAFVQ
VACPRLSIDW GYAFNKPLLT PYEAMVMLEQ DTKWSEKYYP MDYYAKDGYG RGKIPDHINV
I