Gene PICST_89338 details


Gene Information

Locus tag: PICST_89338
Symbol: ERG20
ID: 4838897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1433290
End bp: 1434426
Gene Length: 1137 bp
Protein Length: 350 aa
Translation table: 12
GC content: 44%
IMG OID: 640390212
Product: farnesyl diphosphate synthetase (FPP synthetase)
Protein accession: XP_001384571
Protein GI: 126136094
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0142] Geranylgeranyl pyrophosphate synthase
TIGRFAM ID:
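
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, although it reproduces the reported 1137 bp). The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(1433290, 1434426)  # values from the record above
cds_len = coding_nucleotides(350)         # 350 aa from the record above

print(gene_len)            # 1137
print(cds_len)             # 1053
print(gene_len - cds_len)  # 84
```

The 84 nt difference suggests that the listed gene span includes some flanking sequence around the open reading frame, which fits an unspliced gene of this length.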


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAAAAAACAA CACTTCGTCA ACATGAGCAA AGAGGCTTCT AGAGCTAGAT TTATAGGTGT 
GTTCGACCAA TTGGTTGAAG AATTGAAGGA GGTCTTGGTC GGCTACAATA TGCCTCAAGA
GGCTGTAGAT TGGTTTGTCA AGAACTTAGA CTACAACACC CCAGGAGGAA AGTTGAACAG
AGGTTTATCC GTAGTTGATA CCTACTGTAT TTTGAACAAG ACCACCGCCG TAGAGTTGGA
TGACGAAAAA TATGCCAAGG TAGCTCTTTT GGGTTGGGCC ATTGAATTGC TCCAGGCCTA
CTTCTTGGTA GCTGACGACA TGATGGACCA GTCCAAGACC AGAAGAGGCC AGCCATGTTG
GTACTTGGCT GAAGGAGTAG GAAACATTGC CATCAACGAT TCCTTCATGT TAGAAGGTGC
CATCTACGTG TTGTTGAAGA AGCACTTCAG AAACGACTCG TACTATGTAG ACTTGTTGGA
CTTGTTCCAC GAAGTCACCT TCAAGACGGA ATTGGGCCAG TTATTGGATT TAGTCACTGC
TGATGAATAT GTGGTTGATT TGGACAAGTT TTCCTTGGAC AAGCACTCAT TTATTGTCAT
CTTCAAGACT GCCTACTACT CGTTCTACTT ACCTGTAGCC TTGGCAATGT ACATGTCAGG
CATCAACTCT GCAGAAGACT TGAAGCAAGT GCAAGACATC TTGATCCCAT TGGGCGAGTA
CTTCCAGATC CAGGACGACT TCTTGGATTG CTTTGGTACC CCAGAACAGA TCGGCAAGAT
CGGAACTGAT ATCAAGGACA ACAAATGTTC GTGGGTCATC AACCAAGCTC TTTCCCGTGT
AAACAAGGAA CAGCGTGAGC TCTTGGATAA CAACTACGGA AAGAAGGACG ACGTTTCTGA
ACAAAAATGT AAGGACTTGT TCAAGGAATT AGGCATTGAA CAGGTCTACC ACGACTACGA
AGAAGCTGTT GTTGCCAAGT TGAGATCACA AATTGAAAAG GTTGACGAAT CCAGAGGCTT
GAAGAAGGAA GTTTTGTCTG CCTTCTTGGC CAAGGTGTAC AAGCGTTCGA AGTAGGCTAT
ACATCTCTGT ATTGGAATGT TGGTATATCT ACATTAATAA ATATTATTAG AGACTTC
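
The gene sequence above is laid out in blocks of ten bases, so it needs whitespace stripped before its length or GC content can be computed. Below is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and only the first line of the sequence is used as demo input; paste the full block to compare against the reported gene length and GC content.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "AAAAAAACAA CACTTCGTCA ACATGAGCAA AGAGGCTTCT AGAGCTAGAT TTATAGGTGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```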
 
Protein sequence
MSKEASRARF IGVFDQLVEE LKEVLVGYNM PQEAVDWFVK NLDYNTPGGK LNRGLSVVDT 
YCILNKTTAV ELDDEKYAKV ALLGWAIELL QAYFLVADDM MDQSKTRRGQ PCWYLAEGVG
NIAINDSFML EGAIYVLLKK HFRNDSYYVD LLDLFHEVTF KTELGQLLDL VTADEYVVDL
DKFSLDKHSF IVIFKTAYYS FYLPVALAMY MSGINSAEDL KQVQDILIPL GEYFQIQDDF
LDCFGTPEQI GKIGTDIKDN KCSWVINQAL SRVNKEQREL LDNNYGKKDD VSEQKCKDLF
KELGIEQVYH DYEEAVVAKL RSQIEKVDES RGLKKEVLSA FLAKVYKRSK