Gene Information |
Locus tag | PICST_32309 |
Symbol | EFH1 |
ID | 4839302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1189919 |
End bp | 1191076 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390617 |
Product | basic helix-loop-helix transcription factor |
Protein accession | XP_001385235 |
Protein GI | 150865852 |
COG category | |
COG ID | |
TIGRFAM ID | |
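
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the record itself. It assumes the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS includes its stop codon, which is not counted as a residue.

```python
# Values copied from the Gene Information table.
start_bp = 1189919
end_bp = 1191076

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1158 385
```

Under those assumptions the result agrees with the reported gene length (1158 bp) and protein length (385 aa).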
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.655952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAACAC CTAGTCAGGC AAACTTGTAC AATCAAGAGT TCACTGGCAC AACTGCTTCA GAACAGAATT TATTACCTCA GTCACAAAAC AATACCCACT CGGAAATCCA TATTCTTCAA CAACTTCCTC ACCAAATACA ACATTCCCGA CCAAAGTCTT ACATGCTAAA CGAACCTTAT ATTCCATTAC CGCCGAATTA TCAAGCAGTC CAACAAAATG AATACTACAC TAACAACGAC TATGGAAATA ACTTGAACAC TATTAGTCCA TCTCTTCCGC TAACCAACCT TACTCAAGGT ACTTCCTCTG GCGGACAACA GGGGTCAGCT GAACAGCTAT CTGTGCCTAT TCAGCAACAA AGTATAATTC AATCACAACA TGATCAGCAT CTGCATGTTT ATCCTTCTGT ATTCCAACAG CAACAACAAC AATTTTATCA ACAGCAGCAA CAGCCGCTTG GATTGCAAAA TCCCTACCAA GCCCAACATA CCCATATGAT GGCCCCTTCA ACATATCCCA AACCGCTCCA TAACCACTCT AACTCAAATA CTTCAACTAC AACCAACTCT TCACTGAAAA TCTCACACTC AAGAAATTCA TCCTCTTCTA GTACTGTCCA AGAATATCCA GACGTTGCAA AGCCGAAAAT TGCTACTGTC TTCTGGGAGG ACGAAAAGAC TATATGCTAC CAGGTGAGAG CAAGAGGTGT CTTAGTTTCT AGAAGAGAAG ATACTAACTT TGTCAATGGT ACAAAATTGT TGAATGTAAT AGGGATGACT CGAGGTAAAA GGGATGGTAT TCTCAAGACG GAAAAGACCC GCAACGTTGT CAAGGTAGGA TCCATGAACC TCAAAGGGGT CTGGATTCCT TTCGACAGAG CATTCGAAAT TGCTAGAAAC GAAGGAGTTG ATGAAGCATT GCACCCACTC TTTGTCAAGG ACATTAAGAC CTTTTACAAG ACTAAAGGTT ACAAATTGAA GATTTCAACA GAGGGTAATC AGATCGTCAA AACCCCGGTT GGCAGTCCCA TACAATCCAC AAGTCCTGGC ACCATTGATG AAAAGGGACC AATTAGAACC ACCACACCCA TGGTATTCAA CTCGTCAGAC GCTTTGAAGA ATCAAAATTC CTTTCGCACT GACTGCTATG AATCTTGA
Protein sequence | MSTPSQANLY NQEFTGTTAS EQNLLPQSQN NTHSEIHILQ QLPHQIQHSR PKSYMLNEPY IPLPPNYQAV QQNEYYTNND YGNNLNTISP SLPLTNLTQG TSSGGQQGSA EQLSVPIQQQ SIIQSQHDQH SHVYPSVFQQ QQQQFYQQQQ QPLGLQNPYQ AQHTHMMAPS TYPKPLHNHS NSNTSTTTNS SSKISHSRNS SSSSTVQEYP DVAKPKIATV FWEDEKTICY QVRARGVLVS RREDTNFVNG TKLLNVIGMT RGKRDGILKT EKTRNVVKVG SMNLKGVWIP FDRAFEIARN EGVDEALHPL FVKDIKTFYK TKGYKLKIST EGNQIVKTPV GSPIQSTSPG TIDEKGPIRT TTPMVFNSSD ALKNQNSFRT DCYES
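
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below illustrates how short fragments of the gene sequence above map to the protein sequence under that table. It is an illustration only: the helper name `translate` and the table-building approach are assumptions made for this example, and the sketch models only the CTG reassignment, not any differences in start-codon usage.

```python
# Standard genetic code (NCBI table 1) laid out in TCAG codon order.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]

# Table 12 differs from the standard code in one amino acid assignment:
# CTG encodes serine (S) instead of leucine (L).
TABLE_12 = dict(zip(CODONS, STANDARD_AAS))
TABLE_12["CTG"] = "S"


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored so the space-separated 10-base blocks used in the
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = TABLE_12[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCAACAC CTAGTCAGGC AAACTTGTAC"))  # prints: MSTPSQANLY

# In-frame fragment of the gene sequence containing a CTG codon.
print(translate("CAGCATCTGCATGTT"))  # prints: QHSHV
```

The second fragment would read QHLHV under the standard code. The S in the record's protein sequence at that position is consistent with table 12.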