Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_45705 |
Symbol | |
ID | 4838641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 224769 |
End bp | 225647 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640389956 |
Product | predicted protein |
Protein accession | XP_001384003 |
Protein GI | 126134958 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACTT TAGATCAATC CTTTGTCACC AACGGTTCCT GGAAGCCCGA TCTCTTCAAG GGCAAGGTTG TCTTTGTCAC CGGAGGAGCA GGTACCATTT GCAGAGTGCA GACCGAAGCC TTGGTATTAC TTGGAGCCAA CGCTGTTATT ATTGGTAGAA ACGTCGAAAA GACTGAGAAG GCTGCCGCAG AAATTCAACA ATTGAGAGCC GGTGCTAAAG TCATTGGAAT TGGTGGAGTC GATGTCAGAT CAGTAGAATC GATCGCAAAG GCTGTGGAAG TCACTGTCAA AGAGCTTGGC CGTATCGATT TCGTCATTGC TGGTGCCGCT GGTAATTTCA TTTCAGACTT CAACCACTTG TCTTCCAATG CTTTCAAGTC TGTCGTTTCC ATTGACTTGT TGGGTTCGTA TAACACAGCA AAGGCCACAT TCCAAGAACT CAGAAAGACC AAAGGTGCCT ACTTATTTGT TTCGGCCACC TTGCACTATT ACGGTATTCC ATTCCAGTTG CATGTTGGCG CTGCTAAGGC TGGTGTAGAT GCTCTTTCCA ATGCCTTGGC TGTTGAATTG GGTCCTTTGG GAATCCGTTC CAATGCCATT GCTCCCGGTT TAATCGAAGG AACAGAAGGC TTTGCTAGAT TGGCACCACC TAGTGAGGAT GGGGGAAATG GCTTACGTGA CAAAATCCCA TTACAAAAAT TCGGTACGAG TAGAGATATT GCCGAAGCCA CTGTCTATTT GTTCTCTCCA GCTGCTTCTT ATGTGACCGG TACTATTGAA GTTGTTGACG GTGGGGCATG GCACATCGGG AGTTACATGC AAGATCTCTA TCCTTCCGTG GTTATCGCTG CTAACGAAGA TCCATCAGCA AAGATCTAA
|
Protein sequence | MSTLDQSFVT NGSWKPDLFK GKVVFVTGGA GTICRVQTEA LVLLGANAVI IGRNVEKTEK AAAEIQQLRA GAKVIGIGGV DVRSVESIAK AVEVTVKELG RIDFVIAGAA GNFISDFNHL SSNAFKSVVS IDLLGSYNTA KATFQELRKT KGAYLFVSAT LHYYGIPFQL HVGAAKAGVD ALSNALAVEL GPLGIRSNAI APGLIEGTEG FARLAPPSED GGNGLRDKIP LQKFGTSRDI AEATVYLFSP AASYVTGTIE VVDGGAWHIG SYMQDLYPSV VIAANEDPSA KI
|
| |