Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33061 |
Symbol | GBO1 |
ID | 4840419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1335771 |
End bp | 1337132 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391734 |
Product | Gamma-butyrobetaine dioxygenase (Gamma-butyrobetaine,2-oxoglutarate dioxygenase) (Gamma-butyrobetaine hydroxylase) (Gamma-BBH) |
Protein accession | XP_001385951 |
Protein GI | 150866374 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.563607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACCAA ATTTGTATAG AGTTAATAAA ACGAAGCTAC CCCGCGTTTT AGCGAGATCC GTGAGATTTC AGCTGTCGCT TGCTATTCAT AGATACGATG ATAACTATAC AACTTTGGTA TTTGATGGAG ATAGGTCAAT TCTGTTCAGT AACATCTTTC TCCGTGATTC ATGCAGGGAT CCTAAGTCTG TAGACACTTA CTCGAGCCAG AAGCTTTTCA CCACAGCAGA AATCGCAAAG AATTTGTCCA TTAACTCACC TCCCCGGATT AGAAAATCAC CAGATTCAAG CGAGTCTGTT TTAGAAATAG AATGGCTCCA AAACGGAAAA CTCCATCTTT CCCAGTACCA GGAGACGTTC TTGAGAGAGT ACATTGATGC TGAGTCAAGG CAAGCTGGTA AATTCTTTGA AGGTGAAAGA ACAATATGGG ACAACAAGGA ACTCGTTGGA AACCTTCCCA GCATACAAGC TGATTACAAG AAGTACTTGG AACTGGATTC TACTTTCTTT GAAACAGTCA GAAGCCTCAA CAAGTTTGGT TTGGCATTTG TAAACGACAT TCCGGAACCT TCAGCCGAGC TTCAGAAACT GGGAATGAAT GAAAAGAATG CCGCAGAATG GCCGGTGTCA AAGCTTGCTA ACAAATTTGG TTATATCAAG AAGACATTCT ATGGTACTTT ATTTGACGTT AAAAACGAAA AGGAGGAGGC AAAGAACATT GCAAACACAA ACACATTCTT GCCGTTGCAC ATGGACCTCT TGTACTACGA ATCGCCGCCA GGATTGCAAT TGCTTCATTT CATCAAGAAC TCTACAACAG GCGGAGAGAA TGTCTTCTGC GATTCCTTCC TTGCGGCTGA ACATGTCAAA AATGTAGATC CAACAGCATA TGTTGCTTTG ACGCTTGTCC CCATTACCTA TCATTATGAT AACAACAACG AGCACTATTT CTTCAAGAGG CCTTTGGTAG TGGAAGAAGT GAAAGGCGAT ACGGCTCGTA TCAAAGAAGT CAACTACGCT CCACCATTTC AAGGGCCATT TGAGTTTGGA ATAACCAGAA ATGACTCCGA GAGGGAAGGA TTGTTTTTGG CTAAAGATAC CACAGACGGT CTTTTGTTCC AGGACTTTAT CAGAGGATTC CAGCTCTTCG AAGACTTCAT CAACGACCCC GTGAACCACT ACGAAATCAA GATGCCAGAA GGCTCTTGTG TTATATTCGA CAACAGAAGA GTTCTCCACT CCCGTCTTGG ATTCAGTGAC TCCAACGGAG GAGATAGATG GCTCATGGGA ACCTATGTAG ACGGCGATAG TTTCAGATCC AAGTTGAGAA TGGGCTTCAG ACACTTGAAA GAAGCTATGT AA
|
Protein sequence | MIPNLYRVNK TKLPRVLARS VRFQSSLAIH RYDDNYTTLV FDGDRSISFS NIFLRDSCRD PKSVDTYSSQ KLFTTAEIAK NLSINSPPRI RKSPDSSESV LEIEWLQNGK LHLSQYQETF LREYIDAESR QAGKFFEGER TIWDNKELVG NLPSIQADYK KYLESDSTFF ETVRSLNKFG LAFVNDIPEP SAELQKSGMN EKNAAEWPVS KLANKFGYIK KTFYGTLFDV KNEKEEAKNI ANTNTFLPLH MDLLYYESPP GLQLLHFIKN STTGGENVFC DSFLAAEHVK NVDPTAYVAL TLVPITYHYD NNNEHYFFKR PLVVEEVKGD TARIKEVNYA PPFQGPFEFG ITRNDSEREG LFLAKDTTDG LLFQDFIRGF QLFEDFINDP VNHYEIKMPE GSCVIFDNRR VLHSRLGFSD SNGGDRWLMG TYVDGDSFRS KLRMGFRHLK EAM
|
| |