Gene PICST_32799 details


Gene Information

Locus tag: PICST_32799
Symbol: BET4
ID: 4840032
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 661167
End bp: 662355
Gene Length: 1189 bp
Protein Length: 379 aa
Translation table: 12
GC content: 40%
IMG OID: 640391347
Product: Geranylgeranyl transferase type II alpha subunit
Protein accession: XP_001385477
Protein GI: 150866017
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5536] Protein prenyltransferase, alpha subunit
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.193653
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.546499
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTATGTC CTCTTTTCAT TCCTTCTATG ACTCTATATG CTAACTTCTC AGCAACATGG 
AATCAAGAGA GTTTCACTCT CTGCCGAGGC CAAGAGGCTC AAGCTTGAAA AGGATAAATC
CAAGATTGCC CACTACAAGC AATTGACAGA AAATATCTTC TCGTTAAGAA ACCTACAGAC
TTATACGGTT GAATCTCTAA AAGAAACCAC CCAGATTCTC CAAATCAATC CTGAGTTCTA
CACAATGTGG AACTATCGTC GTGAAATATT TGAGCATTTA AAGAACAACA TACCTGTAGA
AGACTATGCT CAACTCATGG ACAACGATTT AAAGATGTTG ATGGTGATCT TGAAGCGATT
CCCCAAAGTG TACTGGATAT GGAACCACCG TAGATGGTGC TTGTTTGAGC TAGTCAAGAT
CAACAGAGTA GACTGGCAGT ATGAATATGC TGTGGTTTCA AAATTGTTGG AGTTAGACAG
TCGTAACTAC CATGGGTGGC AGTATAGACG GTTTGTCGTC CAAAATATGC AAATCCAGGC
TACCACCAAA GCAGCACCAG CATCTAAGAA TGAAGAGAGC TTGGTTGTGC TTGGTATAAA
CATTGAAGAA TTCAAATACA CCACATCAAA AATCAACAAG AACTTCTCCA ACTTCTCTGC
ATGGCACAAC CGTAGCACAT TGATTCCTAA GATATATAAC TTATATCTCC AATTGGAAGC
ACCCTCTGAA AAGCTGCCTG ATGTATACGA TATATTCAAA CTGCCTCGTT CCATTTTGAC
TCATGAGTTG GAATTGATCA AGACGGGAAT GTATATGGAC TCAGAAGATA CGTCAATCTG
GCTCTACATG TGGTGGCTAT TGACGGAAAA GTTCTTTACT GATGAGTTGC GCAAAGAGGA
TGGTGCCTAC CTTGCTGTAT TGGAAGAACA ACTTGCGAAC GTAGAAGAAT TGAATGAACT
TGAAAAGAGT GATCATATCT ACCATTGGGA CAATTGCTGG TGTCTAAAGA CTATCATACT
TGTGAAGGGT TTGATACAAC AGGAACATGT AAAAACCAAC AAAAGCTCTG CCTTGTTGAC
GCAAGATATA AAGAACCATA TACAAGCACT TATCGAAATC GATCCATTGA GAAAGGGCAA
ATACTTGGAT CAACTTGAAG GGAAAGCAAG TATAATTCCG ATTCTTTAG
 
Protein sequence
MQHGIKRVSL SAEAKRLKLE KDKSKIAHYK QLTENIFSLR NLQTYTVESL KETTQILQIN 
PEFYTMWNYR REIFEHLKNN IPVEDYAQLM DNDLKMLMVI LKRFPKVYWI WNHRRWCLFE
LVKINRVDWQ YEYAVVSKLL ELDSRNYHGW QYRRFVVQNM QIQATTKAAP ASKNEESLVV
LGINIEEFKY TTSKINKNFS NFSAWHNRST LIPKIYNLYL QLEAPSEKSP DVYDIFKSPR
SILTHELELI KTGMYMDSED TSIWLYMWWL LTEKFFTDEL RKEDGAYLAV LEEQLANVEE
LNELEKSDHI YHWDNCWCLK TIILVKGLIQ QEHVKTNKSS ALLTQDIKNH IQALIEIDPL
RKGKYLDQLE GKASIIPIL
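
Length check

The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene span contains only coding sequence, one stop codon, and intronic sequence. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 661167
END_BP = 662355
GENE_LENGTH_BP = 1189
PROTEIN_LENGTH_AA = 379


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def implied_noncoding_bp(gene_length_bp: int, protein_length_aa: int) -> int:
    """Bases in the gene span not accounted for by codons.

    Assumption: the coding sequence is one codon per residue plus a single
    stop codon; whatever remains is taken to be intronic.
    """
    coding_bp = 3 * (protein_length_aa + 1)
    return gene_length_bp - coding_bp


# The reported gene length matches the coordinate span.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP

# A positive remainder is consistent with "Is gene spliced: Yes".
print(implied_noncoding_bp(GENE_LENGTH_BP, PROTEIN_LENGTH_AA))
```

Under these assumptions the remainder is the number of bases removed by splicing. If the span also contained untranslated regions, the remainder would overestimate the intron length.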