Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_51444 |
Symbol | TCD5.2 |
ID | 4851225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1248091 |
End bp | 1249365 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | |
GC content | 46% |
IMG OID | 640392933 |
Product | taurine catabolism dioxygenase |
Protein accession | XP_001387469 |
Protein GI | 126274211 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.353625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.08977 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCTC CTGCTGCTAC CTCATCTTCC GCCCCAACTG AAGACGACAT TGAACAAACC GTCAGAAAGT TGGCTGCCTT GAAGCCAATC GGCCACAGAT TTTCCAACAA TGCTAAGACT GGTCCTCAAT TGGAATGGTT AGCAAAGTTG CCAGAACCTG CTAGAAAAAG ATTTGAAAAG GCAGGTATTG ATTTGTCCAA CGGTTATCCT GTTATTCCTA AATCGGAAGA TATTCCAAAG TTTGTTGATG AAGCATTTGA AATCAGAAAC AAGGACTATC CATACATTGA AAGGGGTGCA AATGCCGACC CTGAGAAGAA GGCATTGTTT GGAGCTGCCA AAGAAGTTAG ACACTTGACC AAGCACCTTG GTACAGAAAT TGTAGGTTTG CAGTTGAGCG ACTTGAACGA CAAGCAAAAA GACGAATTGG CCTTGTTGGT GGCTGAAAGA GTTGTCGTCT TTTTCAGAAA CCAAGACTTG TCTCCTCAGA AGCAATTGGA ATTGGGTGAA TACTGGGGTC AAGTTGAAAG ACACCCACAA GCTCCACACG TTCCATTGCC AATCCCTGAA GGTACTGAAA CTATTGCCAA GGGTAGTGGT GTCAGTGTAA TCTGGAGAAA GTTTTTCAGC GAATTCTATG GATTCCCTGG TGGTTTCAGG AAGAAGTCCA TCACCTCAGG CTGGCACACT GATTTGGTCC ATGAGCATCA ACCAGCAGGT ATCACCCACT TGCACAACGA CACGATTCCA AAGACTGGAG GTGACACTGC ATGGGCATCT GGTTATGCTG CATACGACAA GTTGTCTCCA GCCTTGCAAA AGTTCCTTGA CGGAAAGACA GCTATCTACC GTTCCGCTCA CCAGTACCTT GACCGTGAAA ATCCATTGAA GGGACCAAAG TACATCGAAA GAGAGCACCC TATTGTGAGA ACCCATCCTG CCACTGGCTG GAAGTACTTG TTCGTCAACA GATCCATGAC TGACAGAATT GTGGGTTTGG AACCAGGTGA ATCCAAGGTT ATTTTGGAGT ACTTGTTCTC AGTCTACGAG AAGAACTTGG ACATTCAAGT GAGATTCCAA TGGCAACCTA CAAACGAAGG CTTTGGAACT TCTGCTATCT GGGATAACAG AGTTTCTCAG CACAATGCTA TTTCTGACTA CGACTTCGAT GGCGATGAAC GTCATGGAAC TAGAGTCACT TCTTTAGCTG AGCTTCCTTA CTTCGACCCC AAGTCCAAGT CTCAAAGAGA AGCATTGGGC TTGTCGTTAG ATTAG
|
Protein sequence | MAPPAATSSS APTEDDIEQT VRKLAALKPI GHRFSNNAKT GPQLEWLAKL PEPARKRFEK AGIDLSNGYP VIPKSEDIPK FVDEAFEIRN KDYPYIERGA NADPEKKALF GAAKEVRHLT KHLGTEIVGL QLSDLNDKQK DELALLVAER VVVFFRNQDL SPQKQLELGE YWGQVERHPQ APHVPLPIPE GTETIAKGSG VSVIWRKFFS EFYGFPGGFR KKSITSGWHT DLVHEHQPAG ITHLHNDTIP KTGGDTAWAS GYAAYDKLSP ALQKFLDGKT AIYRSAHQYL DRENPLKGPK YIEREHPIVR THPATGWKYL FVNRSMTDRI VGLEPGESKV ILEYLFSVYE KNLDIQVRFQ WQPTNEGFGT SAIWDNRVSQ HNAISDYDFD GDERHGTRVT SLAELPYFDP KSKSQREALG LSLD
|
| |