Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_46425 |
Symbol | TML1 |
ID | 4839703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 896761 |
End bp | 897963 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640391018 |
Product | Trimethyllysine dioxygenase (Epsilon-trimethyllysine 2-oxoglutarate dioxygenase) (TML-alpha-ketoglutarate dioxygenase) (TML hydroxylase) (TML dioxygenase) (TMLD) |
Protein accession | XP_001384841 |
Protein GI | 126136635 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | [TIGR02410] trimethyllysine dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCTC CCTCAATCAC TTCAGCTACT GCCATCTCGC CAGACTCTGT CGAGATCAAA TGGTCCAACG ACCAATCCTC CGTCTTCCAC AATATTTGGT TGAGAGACAA TTGTCACTGT GAAAAGTGCT ACTACCCTGC AACCAAACAG AGATTGTTGA ACTCTGCTAC AATTAACGCC GATATAGGAG CACGTCATAT CGAAGTCAAG TTCCCCGTTC TTGAAATTGT GTGGAACCAG GAAGACCACA AGTCGTCGTA TTCCTTTTCT TGGCTCTACC TTCACTCGTA CCAGCCTAGG TTGGTACCCG TCGACGAAAA GCTCGCAGGA GAAAAGACTA TTTTAGCCCA AAAGTTGTGG AAGGTTGCTG ATATCAAGGA TTCGTTGCCA GCTGTCGATT TCAACAAGAT CATCGACTCA GACGAAGGCT CCGACAACGA AGATGCTATT CGCGATTGGA CGCTAAAAAT CTGGAAGCAC GGTTTCTGTT TCATCGACAA CGTTCCAGTA ACTCCAGAGG ACACAGAGAA GCTCTGTGAG AAGCTCTGTT ACATCAGACC AACACATTAC GGTGGTTTCT GGGACTTCAC GAGTGATCTC TCAAAAGCAG ACACTGCATA TACAAACATC GACATTTCAT CTCATACAGA TGGAACCTAT TGGTCCGACA CCCCAGGATT GCAATTGTTC CACTTATTGT ACCACGATGG AACAGGAGGA ACGACTTCGC TTGTAGATGC ATTTCAATGT GCAAAGGTCT TGAAGAAGAA CCATCCGGAA AGTTTTGAAC TATTAACTAG AATTCCCATT CCAGCTCATT CCGCGGGCGA GGAAAAAGTC TGTATCCAGC CTGATATTCC TCAGCCTATC TTTAAGTTGG ACAATGAGGG CGAGTTGATC CAGGTTCGTT GGAACCAGAG CGACCGCTCG ACCATGGACA ATTGGACTAA CCCAGCTGAC ATACCCAAAT TCTATGCTGC TATCAGACAT TGGGTGCAAA TCATTACTGA TCCCGAAAAC GAAATCTTCT ACCAGTTGAA GCCTGGTCAG TGTTTGATAT TTGACAACTG GAGATGTTTC CACTCCAGAA CAGAGTTCAC GGGCAAGAGA AGAATGTGTG GTGCCTACAT CAACAGAGAC GACTTTGTCT CTAAATTAAA GTTGCTCAAC TTGGGCAGAC CAGCTGTATT GGAGTCTATT TGA
|
Protein sequence | MTSPSITSAT AISPDSVEIK WSNDQSSVFH NIWLRDNCHC EKCYYPATKQ RLLNSATINA DIGARHIEVK FPVLEIVWNQ EDHKSSYSFS WLYLHSYQPR LVPVDEKLAG EKTILAQKLW KVADIKDSLP AVDFNKIIDS DEGSDNEDAI RDWTLKIWKH GFCFIDNVPV TPEDTEKLCE KLCYIRPTHY GGFWDFTSDL SKADTAYTNI DISSHTDGTY WSDTPGLQLF HLLYHDGTGG TTSLVDAFQC AKVLKKNHPE SFELLTRIPI PAHSAGEEKV CIQPDIPQPI FKLDNEGELI QVRWNQSDRS TMDNWTNPAD IPKFYAAIRH WVQIITDPEN EIFYQLKPGQ CLIFDNWRCF HSRTEFTGKR RMCGAYINRD DFVSKLKLLN LGRPAVLESI
|
| |