Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_58067 |
Symbol | UMH1 |
ID | 4838347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1828322 |
End bp | 1829203 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389662 |
Product | degradation of aromatic compounds |
Protein accession | XP_001383961 |
Protein GI | 126134873 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCTA CTTGGAAGAG ATTAATTAGA TTCGTCGCTC AAGACGGAAA GACCTATAGA GGTGAACCTA TTGTCTCTGA TGCTGACTAC GATGTTGGCA AGCAATTCTT GCAAGGTAAA CAGATCATGG CCAAGGTTAT CACTGGTGAT ATCTTTGATA ATGCTGTTGT GACTGATGTT ATCAAGGAAG TAAAGACTTT GTTGGGTCCT CTTACTCCCG ATGATGTTCC CATGGTGAAG TGTGTAGGTT TGAACTTCAT GAAACACATC CAGGAAGGTG GCAGAACCCC ACCACCATTC CCATCCATTT TCTACAAGAC AAGTTTCAGT GTTGCAGACT TCGGAGAAGA CATTCCAATT CCTAAGATCG CTCAATCCAA ATGTGATTAT GAAGGTGAAT TGTGTATTGT TATTGGCAAG ACCGGAAAAA ATATCAAGGA AGAAGAAGCA TTGAGTTATG TTGCTGGTTA TGTTACTGGC AACGATGTGT CTTCCCGTAA CTGGCAGAAG GACCCAGAAT TCGCCGGAAG TGTTCCTCAA TGGTGTTTCT CTAAGAGTTT CGACAAGTAC GCTCCATTGG GTCCAGCCTT GGTTTCACCA CAAGTGATTC AAAATCCAGG AAACCTTAGC CTACAAACTA CCGTTAACGG CGAAATACGT CAAGATGCCA ACACTGACGA TCTTTTGTTT GGAGTTCCAA GAATCATTTC CTTCATTTCG CAAAGCACTA CATTGGAAAT GGGAACTGTC ATCATGACCG GAACTCCAAG TGGTGTTGCG TTAGGTATGA AGCCAACCCC TGTGTATCTT CAAAACGGAG ACGTTGTTGA GGTAGCTATT GATCAAATCG GCACACTTTC TAATAAAATG GTGTTCGAAT AA
|
Protein sequence | MTPTWKRLIR FVAQDGKTYR GEPIVSDADY DVGKQFLQGK QIMAKVITGD IFDNAVVTDV IKEVKTLLGP LTPDDVPMVK CVGLNFMKHI QEGGRTPPPF PSIFYKTSFS VADFGEDIPI PKIAQSKCDY EGELCIVIGK TGKNIKEEEA LSYVAGYVTG NDVSSRNWQK DPEFAGSVPQ WCFSKSFDKY APLGPALVSP QVIQNPGNLS LQTTVNGEIR QDANTDDLLF GVPRIISFIS QSTTLEMGTV IMTGTPSGVA LGMKPTPVYL QNGDVVEVAI DQIGTLSNKM VFE
|
| |