Gene Information |
Locus tag | PICST_29419 |
Symbol | |
ID | 4836957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 212707 |
End bp | 213969 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388272 |
Product | predicted protein |
Protein accession | XP_001382805 |
Protein GI | 126132560 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
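The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes its terminal stop codon. Both are assumptions about how this record is reported, not something the record states. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 212707
END_BP = 213969


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, expected_protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (1263 bp) and Protein Length (420 aa).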
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.106431 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATACA AAGTTGCTGG TTCAGAATGG GTTGACCCTT TGGTAGCCAA AGCCATCAAA CAAAGTACAT TGGGCGGTTT CATTTCTCCT GACAAAATCG TAGAAGAGAG AAAGAAAATC AAGTTCCCAG AGTTCTTTGG TCCTTTCAAC CCAGAAGAGG CTGGAAAGTA TGAATTTCTT AAGAATCCTG ATGGAGCTAA GAATGACAAG GGTCACTTGG GAGATCCAAC ATTCAAAAAC TTATTCAAGC CAGGGTCCAA GGTGAAGACC ATTGACTTGT CACCAAACTA CGGTACTGAA ATTGACGGTA TTCAATTGAG TGAATTGGAT GAAGCCGGAA AGAACGACTT GGCTTTATAC TTAGAAACAA GAGGTTTGGC TGTCTTCAGA AATCAGGATT TTAGGGAGAA AGGTCCAGCT TTTGCAAAGA AATTTGGTCA ACATTTTGGT CCATTACATA TCCACCCATC GGTAAGCTAT TCAGCAGAAG AATCCCCAGA GTTGTTGGTT ACATACAGAC CAGCGGGAGG TCCAGAAAGG TACAATGCAC AATTTGCAGG TACTACTACA ACTACTGGGT GGCATTCGGA CGTCAGTTTT GAAGAATACC CAGCTTCTTT CAGTTTTTTC GTTGCTTTGG AAGCACCAGA AACTGGCGGA GATACTGTAT TCCTTGACTT GAGAGAAGCC TATAGGAGAT TGTCTCCCCC AATTCAAAAG TTCTTTGAAT CTTTGACAAT CATTCACACC AACTATTACC TAAACCAACT TGCTAAATTG AAGGACTTGG ATACGCGTGT CAATGCTGAT TCTTTTGCTG AACATCCATT GGTCAGAACT CATCCTGTTA CAGGTGAGAA GTCGTTGTTC TACTCTAAAG GATTTGCCCT CAGAGTAAAG GGACTCAAGC AGCAAGAGTC AGATGCCATT CTTAGTTTCT TGGAAGACCA TATTAACAAC AACCCTGAAA TTCAAGTGAG AGCAAGTCAT AGAGGAACCA ATTCGGGAAC TATTATTGCC TGGGATAATA GAATCTCTAT ACATACTGCT GTTGCTGATT TCTTACAACA CGAGACCGGA CCTCGTCACC ATTTCAGAAT TACTGTTGTA GGCGAAAAGC CTTACTTCGA GGAAGCTGCC GAAGAGAAAG TTGCAAATGG ACACATCAAA AGCTCATCTA ATGGACACTC TAACGGACAC TCTAATGGAA ACTCTAACGG TCACACAATT GCTTCTAATG GCTCCAATGG TAGTGAGAAC TAA
|
Protein sequence | MTYKVAGSEW VDPLVAKAIK QSTLGGFISP DKIVEERKKI KFPEFFGPFN PEEAGKYEFL KNPDGAKNDK GHLGDPTFKN LFKPGSKVKT IDLSPNYGTE IDGIQLSELD EAGKNDLALY LETRGLAVFR NQDFREKGPA FAKKFGQHFG PLHIHPSVSY SAEESPELLV TYRPAGGPER YNAQFAGTTT TTGWHSDVSF EEYPASFSFF VALEAPETGG DTVFLDLREA YRRLSPPIQK FFESLTIIHT NYYLNQLAKL KDLDTRVNAD SFAEHPLVRT HPVTGEKSLF YSKGFALRVK GLKQQESDAI LSFLEDHINN NPEIQVRASH RGTNSGTIIA WDNRISIHTA VADFLQHETG PRHHFRITVV GEKPYFEEAA EEKVANGHIK SSSNGHSNGH SNGNSNGHTI ASNGSNGSEN
|
| |
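The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any downstream use. A minimal sketch, using hypothetical helper names, that strips the spacing and computes the GC fraction of a nucleotide string:

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace from a sequence field and upper-case it."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a small sample.
sample = clean_sequence("ATGACATACA AAGTTGCTGG")
print(len(sample), gc_fraction(sample))
```

Applied to the full Gene sequence field, the same functions give a length and GC fraction that can be compared against the Gene Length and GC content values in the record.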