Gene PICST_32309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32309 
SymbolEFH1 
ID4839302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1189919 
End bp1191076 
Gene Length1158 bp 
Protein Length385 aa 
Translation table12 
GC content42% 
IMG OID640390617 
Productbasic helix-loop-helix transcription factor 
Protein accessionXP_001385235 
Protein GI150865852 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.655952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAC CTAGTCAGGC AAACTTGTAC AATCAAGAGT TCACTGGCAC AACTGCTTCA 
GAACAGAATT TATTACCTCA GTCACAAAAC AATACCCACT CGGAAATCCA TATTCTTCAA
CAACTTCCTC ACCAAATACA ACATTCCCGA CCAAAGTCTT ACATGCTAAA CGAACCTTAT
ATTCCATTAC CGCCGAATTA TCAAGCAGTC CAACAAAATG AATACTACAC TAACAACGAC
TATGGAAATA ACTTGAACAC TATTAGTCCA TCTCTTCCGC TAACCAACCT TACTCAAGGT
ACTTCCTCTG GCGGACAACA GGGGTCAGCT GAACAGCTAT CTGTGCCTAT TCAGCAACAA
AGTATAATTC AATCACAACA TGATCAGCAT CTGCATGTTT ATCCTTCTGT ATTCCAACAG
CAACAACAAC AATTTTATCA ACAGCAGCAA CAGCCGCTTG GATTGCAAAA TCCCTACCAA
GCCCAACATA CCCATATGAT GGCCCCTTCA ACATATCCCA AACCGCTCCA TAACCACTCT
AACTCAAATA CTTCAACTAC AACCAACTCT TCACTGAAAA TCTCACACTC AAGAAATTCA
TCCTCTTCTA GTACTGTCCA AGAATATCCA GACGTTGCAA AGCCGAAAAT TGCTACTGTC
TTCTGGGAGG ACGAAAAGAC TATATGCTAC CAGGTGAGAG CAAGAGGTGT CTTAGTTTCT
AGAAGAGAAG ATACTAACTT TGTCAATGGT ACAAAATTGT TGAATGTAAT AGGGATGACT
CGAGGTAAAA GGGATGGTAT TCTCAAGACG GAAAAGACCC GCAACGTTGT CAAGGTAGGA
TCCATGAACC TCAAAGGGGT CTGGATTCCT TTCGACAGAG CATTCGAAAT TGCTAGAAAC
GAAGGAGTTG ATGAAGCATT GCACCCACTC TTTGTCAAGG ACATTAAGAC CTTTTACAAG
ACTAAAGGTT ACAAATTGAA GATTTCAACA GAGGGTAATC AGATCGTCAA AACCCCGGTT
GGCAGTCCCA TACAATCCAC AAGTCCTGGC ACCATTGATG AAAAGGGACC AATTAGAACC
ACCACACCCA TGGTATTCAA CTCGTCAGAC GCTTTGAAGA ATCAAAATTC CTTTCGCACT
GACTGCTATG AATCTTGA
 
Protein sequence
MSTPSQANLY NQEFTGTTAS EQNLLPQSQN NTHSEIHILQ QLPHQIQHSR PKSYMLNEPY 
IPLPPNYQAV QQNEYYTNND YGNNLNTISP SLPLTNLTQG TSSGGQQGSA EQLSVPIQQQ
SIIQSQHDQH SHVYPSVFQQ QQQQFYQQQQ QPLGLQNPYQ AQHTHMMAPS TYPKPLHNHS
NSNTSTTTNS SSKISHSRNS SSSSTVQEYP DVAKPKIATV FWEDEKTICY QVRARGVLVS
RREDTNFVNG TKLLNVIGMT RGKRDGILKT EKTRNVVKVG SMNLKGVWIP FDRAFEIARN
EGVDEALHPL FVKDIKTFYK TKGYKLKIST EGNQIVKTPV GSPIQSTSPG TIDEKGPIRT
TTPMVFNSSD ALKNQNSFRT DCYES