Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_34605 |
Symbol | TCD4 |
ID | 4851794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2846132 |
End bp | 2847289 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | |
GC content | 45% |
IMG OID | 640393502 |
Product | Fe(II)-dependent sulfonate/alpha-ketoglutarate dioxygenase-like protein |
Protein accession | XP_001387108 |
Protein GI | 126275620 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCAA CTGCAACTTA CGGAAACTTC GACACCCATT TCTTTGCTGG CCAAGACGAA ATTGGGGAAG ACGGAATTTT GACCATCAAC AAGAGAAACA GAGAGCAATC TCATTACCCT GAATTCTTGC CTACTTGGGA TCCCAGCCAG AAGTACCCAC CTTTGAAGTT TTTCAAGCAT GAAGATCCTG GAAAGAGAGC CGATCCATCT TTCCCCAATC TATTTGCCAA AGATCACGAA CAAATCGTCA AGAAAGTCAC TCCCAAGTTG GGATCCGAAG TTAGGGGAGT TCAATTGTCT CAATTGGACT CAGCTGGTAA AGACGAGTTG GCTCTTTTTG TGGCACAAAG AGGAGTGGTA ATCTTCCGCG ACCAGGATTT CGCAGCCAAA GGTCCAGCTT TCGCAGTTGA ATATGGTAAA CACTTCGGAA GATTGCACAT CCACCCAACA TCTGGTGCTC CAAGAAACCA CCCAGAGTTG CACATCACCT ACAGAAGAGC TGATCCCGGC GAATTCGAGA GAGTTTTCTC CAATAGCACT AATGCTGTTC AGTACCATAC TGATGTATCC TACGAGTTGC AACCAGCAGG GATCACTTTT TTCTCAGTAT TGGAAGGGCC GGAATCCGGT GGTGACACCA TCTTCGCCGA TTCAGTCGAA GCATACAACA GATTATCTCC AGCTTTCCAG AAGAGGTTGG CCGGCTTACA TGTGTTGCAT ACTTCCGAAG ACCAAGCTTC TAACTCTAGA GGCCAAGGTG GAATTGAAAG AAGAAAGCCA GTTTCAAACA TCCATCCATT GGTCAGAATT CATCCAGTTA CCGGTGCAAA GAGTTTGTTT GTCAATAGAT CATTTGCTAG AAGAATCGTT GAGTTGAAAG AAGAAGAATC CGAGTCTTTG CTTAAATTCT TGTACGACCA CATTGAGCAA TCCCATGACT TGCAATTGAG AGCCAATTGG GAACCAAACA CAGTGGTTAT CTGGGACAAT AGAAGGGTGC ACCACTCAGC CATCATCGAC TGGGAAACTG CAGTTTCTAG ACATGCCTTC AGAATCACTC CACAAGCCGA AAGACCTGTG GAAGACTTGA AGGACTTGAA TAAAGAAGAG TACGACGTTG GTGATGTTGC TGAAGCTTTG AAAGCTGTTT TACATTAG
|
Protein sequence | MSSTATYGNF DTHFFAGQDE IGEDGILTIN KRNREQSHYP EFLPTWDPSQ KYPPLKFFKH EDPGKRADPS FPNLFAKDHE QIVKKVTPKL GSEVRGVQLS QLDSAGKDEL ALFVAQRGVV IFRDQDFAAK GPAFAVEYGK HFGRLHIHPT SGAPRNHPEL HITYRRADPG EFERVFSNST NAVQYHTDVS YELQPAGITF FSVLEGPESG GDTIFADSVE AYNRLSPAFQ KRLAGLHVLH TSEDQASNSR GQGGIERRKP VSNIHPLVRI HPVTGAKSLF VNRSFARRIV ELKEEESESL LKFLYDHIEQ SHDLQLRANW EPNTVVIWDN RRVHHSAIID WETAVSRHAF RITPQAERPV EDLKDLNKEE YDVGDVAEAL KAVLH
|
| |