Gene PICST_78962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_78962 
SymbolDLH1 
ID4840076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp346280 
End bp347397 
Gene Length1118 bp 
Protein Length269 aa 
Translation table12 
GC content42% 
IMG OID640391391 
Productcarboxymethylenebutenolidase 
Protein accessionXP_001385413 
Protein GI126137780 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0412] Dienelactone hydrolase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0108419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGACTATATG ATTACCATGA ACCGAATTCG ATATCCCAAT ATCCGAATCA GGAAAAGCTT 
CCTATTCCAA TATATATAAG CAAAGGGATT GGTCGGAGAA TTCCGCCTGC ACTATGCCAG
GAAAACCTGC AGTGCGGGGG TATATAATAT TCCCATAATT TTCACTTCGT TGAATACTTT
TCAGTTTATC ACTCTTTCTG TTCTCTACTT CAATTATCCA CTCAATTGCC AATTTTTTAC
CACTATGTTG ATCGAAGAAA CCTACCACGA TGTCAAGACC TCTTATGGCA CCACCATGAG
ATTATTTGTC TTCCATCCCA AGGTGCCTAA CTACCCCAAG GTCAAGTTCC CAGGTGTCAT
TGTCTACAGT GAAATCTACC AGGTTACCGG CCCAGTTTCC AGATTCGCCA AGGACATTGC
AGGTCAAGGA TTCATCGTCG TGTGTCCATC GATCTACCAT AACTTTGAGA GCTATGAAGC
ATTGACTTAC GATGATGAAG GTACTGACAA AGGTAACAAG TACAAGATTG AAAAGGAGTT
GAAGTCTTAC GATGAAGACA ACAATTTGTC CATTGAGTAT TTGTTGTCGT TGCCAACATG
TAACGGCAAG ATTGGTGCCA CTGGAATGTG CCTTGGTGGC CATTTAGCAT TCCGTTCATC
TTTGGATCCT AGAGTCAAAG CTGCTGTCTG TTTCTTTGCT ACAGACATCC ACATCCATGC
CTTGGGCAAG GGTAAGAATG ATGACTCCTT GAAGAGATCC AGCGAGATCA AGGGTGAAAT
CATCATGATC TTTGGCTGCA AAGATAACCA CGTTCCTTTG GAGGGTAGAG ACTTAATCAG
ATCTACATTG AGAGCCAACA ACGTTGATAT GACCTTTATT GAGATCAACG ATGCTCAACA
TGCCTTTGTC AGAGACGAGC TCAGTAAAGG CAGGTACGAC CCTGCTACTA CTAAGAACTG
TTTCGAATGG CTCTTGGAAT TGTTCAACAG AAAGCTCAAG TTGGACTATG GTGACCACGA
CGGTAAGGCT GAGGTGATTG AGAACATCTG CTGAGTAGTA TATGACATAG AAAAGGTAGA
AAAAAAAGAA AAACAAAATA AAAAATTATG CCTAATAT
 
Protein sequence
MLIEETYHDV KTSYGTTMRL FVFHPKVPNY PKVKFPGVIV YSEIYQVTGP VSRFAKDIAG 
QGFIVVCPSI YHNFESYEAL TYDDEGTDKG NKYKIEKELK SYDEDNNLSI EYLLSLPTCN
GKIGATGMCL GGHLAFRSSL DPRVKAAVCF FATDIHIHAL GKGKNDDSLK RSSEIKGEII
MIFGCKDNHV PLEGRDLIRS TLRANNVDMT FIEINDAQHA FVRDELSKGR YDPATTKNCF
EWLLELFNRK LKLDYGDHDG KAEVIENIC