Gene Information |
Locus tag | PICST_78962 |
Symbol | DLH1 |
ID | 4840076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 346280 |
End bp | 347397 |
Gene Length | 1118 bp |
Protein Length | 269 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391391 |
Product | carboxymethylenebutenolidase |
Protein accession | XP_001385413 |
Protein GI | 126137780 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0412] Dienelactone hydrolase and related enzymes |
TIGRFAM ID | |
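The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the stated Gene Length) and that the coding region is one uninterrupted stretch ending in a single stop codon (consistent with "Is gene spliced | No"). The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 346280
end_bp = 347397
protein_length_aa = 269

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced coding region, one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Bases of the gene span that fall outside the coding region.
noncoding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, noncoding_bp)  # 1118 810 308
```

Under these assumptions the 1118 bp gene span is longer than the 810 bp needed to encode a 269 aa protein, so the Gene sequence field appears to include about 308 bp of flanking, untranslated sequence.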
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0108419 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGACTATATG ATTACCATGA ACCGAATTCG ATATCCCAAT ATCCGAATCA GGAAAAGCTT CCTATTCCAA TATATATAAG CAAAGGGATT GGTCGGAGAA TTCCGCCTGC ACTATGCCAG GAAAACCTGC AGTGCGGGGG TATATAATAT TCCCATAATT TTCACTTCGT TGAATACTTT TCAGTTTATC ACTCTTTCTG TTCTCTACTT CAATTATCCA CTCAATTGCC AATTTTTTAC CACTATGTTG ATCGAAGAAA CCTACCACGA TGTCAAGACC TCTTATGGCA CCACCATGAG ATTATTTGTC TTCCATCCCA AGGTGCCTAA CTACCCCAAG GTCAAGTTCC CAGGTGTCAT TGTCTACAGT GAAATCTACC AGGTTACCGG CCCAGTTTCC AGATTCGCCA AGGACATTGC AGGTCAAGGA TTCATCGTCG TGTGTCCATC GATCTACCAT AACTTTGAGA GCTATGAAGC ATTGACTTAC GATGATGAAG GTACTGACAA AGGTAACAAG TACAAGATTG AAAAGGAGTT GAAGTCTTAC GATGAAGACA ACAATTTGTC CATTGAGTAT TTGTTGTCGT TGCCAACATG TAACGGCAAG ATTGGTGCCA CTGGAATGTG CCTTGGTGGC CATTTAGCAT TCCGTTCATC TTTGGATCCT AGAGTCAAAG CTGCTGTCTG TTTCTTTGCT ACAGACATCC ACATCCATGC CTTGGGCAAG GGTAAGAATG ATGACTCCTT GAAGAGATCC AGCGAGATCA AGGGTGAAAT CATCATGATC TTTGGCTGCA AAGATAACCA CGTTCCTTTG GAGGGTAGAG ACTTAATCAG ATCTACATTG AGAGCCAACA ACGTTGATAT GACCTTTATT GAGATCAACG ATGCTCAACA TGCCTTTGTC AGAGACGAGC TCAGTAAAGG CAGGTACGAC CCTGCTACTA CTAAGAACTG TTTCGAATGG CTCTTGGAAT TGTTCAACAG AAAGCTCAAG TTGGACTATG GTGACCACGA CGGTAAGGCT GAGGTGATTG AGAACATCTG CTGAGTAGTA TATGACATAG AAAAGGTAGA AAAAAAAGAA AAACAAAATA AAAAATTATG CCTAATAT
|
Protein sequence | MLIEETYHDV KTSYGTTMRL FVFHPKVPNY PKVKFPGVIV YSEIYQVTGP VSRFAKDIAG QGFIVVCPSI YHNFESYEAL TYDDEGTDKG NKYKIEKELK SYDEDNNLSI EYLLSLPTCN GKIGATGMCL GGHLAFRSSL DPRVKAAVCF FATDIHIHAL GKGKNDDSLK RSSEIKGEII MIFGCKDNHV PLEGRDLIRS TLRANNVDMT FIEINDAQHA FVRDELSKGR YDPATTKNCF EWLLELFNRK LKLDYGDHDG KAEVIENIC
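The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the start of the coding region in the Gene sequence field maps onto the start of the Protein sequence field. The names `TABLE_12`, `CODON_TO_AA` and `translate` are hypothetical helpers written for this illustration, and only the first 36 coding bases (beginning at the ATG in `CACTATGTTG`) are used, not the full gene.

```python
BASES = "TCAG"

# Amino acids for NCBI translation table 12, in TCAG codon order.
# It differs from the standard code at one codon: CTG -> S instead of L.
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon lookup from the compact string above.
CODON_TO_AA = {
    b1 + b2 + b3: TABLE_12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TO_AA[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 coding bases, copied from the Gene sequence field above.
cds_start = "ATGTTG ATCGAAGAAA CCTACCACGA TGTCAAGACC"

print(translate(cds_start))   # MLIEETYHDVKT
print(CODON_TO_AA["CTG"])     # S
```

The output matches the first twelve residues of the Protein sequence field. For whole-sequence work, an established library would be a safer choice than this hand-built table.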