Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30111 |
Symbol | TCD5.1 |
ID | 4836696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1965067 |
End bp | 1966311 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640388011 |
Product | taurine catabolism dioxygenase |
Protein accession | XP_001383149 |
Protein GI | 126133248 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCAG CTGCTACTTC CCAAAAGATC TCAGAAAAGG ACGACTTAGA TGCTACCATC AAGAAGTTGG CTTCCTTGAA GCCAATCGGT CACTCCAGTT ACACTGGTGA GATCAAGTCT GGTTACTCTG GTTCTTGGGC CGAAAAGCTT CCAGAAACTA CAAAGGCTAG ATACGCTAGA CATGGTGTTG ACATTTCCAA GGGTTACCCT TACGTCCCTG AAGTTGACAA GATTCCTAAG TTTGTCAACG AAGCTTATGC TATTAGAAAC GAAGTGTACC CATATGTCGA GAGAGGTGCC AAAGCTGATC CAGAAAAGAA GGCATTATTC GGTGCTGCCA AGGAAGTAAT CAACTTGACC AAGCATCTCG GTACTGAGAT TGTTGGTTTG CAATTGAGCG ACTTGAATGA CCAACAAAAG GACGAATTGG CTTTATTGGT AGCTGAAAGA GTTGTCGTTT TCTTCAGAGA CCAAGACTTG TCTCCCCAGA AGCAATTGGA ATTGGGCCAT TACTGGGGCC AAGTTGAAGT TCATCCACAA GTTCCTCGTA TAAGTGAAGA ATTCAACGGT GTCTCCGTGA TCTGGCAAGA TTACTACCGT GCCAAGTATG GTTTGCACCT TAGTTTCAAG AAGGCTATTG GTGGTAATGC GCAATGGCAC ACTGATTTGG TTCACGAGCT TCAGCCAGCT GGTATCACGC ACTTGCACAA CGATGCTATT CCATCTGTTG GCGGTGACAC TTTATGGGCT TCAGGTTATG CTGCTTACGA TAAGTTGTCT CCAGCCTTCC AGAAGTTCTT GGACGGCAAG ACTGCCATCT ACAGATCGGC CCATCAATAT GTTGACCCAG AAAACCCATT GAAGGGTCCT AAGTATGTTG AAAGAGAACA CCCTATTGTT AGAACTCATC CTGCTACTGG ATGGAAGTTC TTGTTCGTCA ACCGTTCCAT GACTGTCAGA ATTGTCGGCT TAGAGCCAGA AGAGTCTAAG ACTATTTTGG AATACTTGTT TAGCGTCTAC GAGAAGAACT TGGATATCCA GGTCAGATTC AACTGGAGAC CAACCAAGGA AGGCTTGGGT ACTTCTGCTA TTTGGGACAA CAGAGCTTCG CAGCACTTCG CTGTCTGGGA CCACGAAGGC AAAGAAAACA GACACGGCAC CAGAGTCACT TCTTTGGCCG AAATTCCATT CTTTGACGAA AACTCAAAGT CTCAGAGAGA AGCCTTGGGC TTATCGTTGG ATTAG
|
Protein sequence | MAPAATSQKI SEKDDLDATI KKLASLKPIG HSSYTGEIKS GYSGSWAEKL PETTKARYAR HGVDISKGYP YVPEVDKIPK FVNEAYAIRN EVYPYVERGA KADPEKKALF GAAKEVINLT KHLGTEIVGL QLSDLNDQQK DELALLVAER VVVFFRDQDL SPQKQLELGH YWGQVEVHPQ VPRISEEFNG VSVIWQDYYR AKYGLHLSFK KAIGGNAQWH TDLVHELQPA GITHLHNDAI PSVGGDTLWA SGYAAYDKLS PAFQKFLDGK TAIYRSAHQY VDPENPLKGP KYVEREHPIV RTHPATGWKF LFVNRSMTVR IVGLEPEESK TILEYLFSVY EKNLDIQVRF NWRPTKEGLG TSAIWDNRAS QHFAVWDHEG KENRHGTRVT SLAEIPFFDE NSKSQREALG LSLD
|
| |