Gene PICST_40742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_40742 
SymbolERG12 
ID4837010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2272877 
End bp2274175 
Gene Length1299 bp 
Protein Length432 aa 
Translation table12 
GC content47% 
IMG OID640388325 
Productmevalonate kinase 
Protein accessionXP_001382664 
Protein GI150863997 
COG category[I] Lipid transport and metabolism 
COG ID[COG1577] Mevalonate kinase 
TIGRFAM ID[TIGR00549] mevalonate kinase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0150941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACTC CATTCTTTGT CAGTGCTCCG GGCAAAGTCA TCATCTTTGG TGAACACTCA 
GCAGTATATG GGAAACCGGC CATCGCTGCT GCGCTCTCGC TAAGAGCATA TCTTCTAGTA
ACACCTTCGC AGGACCCAGA CACCATCAAC CTCCTGTTCC CTGACATCAA CTTAACCCAT
TCATGGAACA AAAACGACAT CCCCTGGGAC AGCATTGTCA AGCACATCAA CTTGGTCGAC
AACTTGCCTC AGACATCTGA GGAACTCGTT CCCGAAATCG TAGATCAGCT TGGTCTGGTG
TTGGCAGATT TGAACCTGTC GCTACACTAT ACAGCATGCT TGTGTTTTCT CTACTTGTAC
ACCCATCTTT GCAACCAAGA ACTTGCAGGT ATGTCCTTCT GCATCCGTTC AACGTTGCCG
ATTGGCGCAG GACTTGGCTC GTCGGCTCTG ACAGCTGTGT GTTTGGCATC TGCATTGGCC
ATTTTGGGAA ATCGGGTTAC TTCAGCTTCG TTCTTGCAAA CTGACAAAAT CCTTAAAAAA
GAGAACAACG ACTTGGACTT TATAGACAGC TGGTCCCTCA TGGGAGAAAA GTGTTTCCAC
GGTAATCCTT CAGGAATTGA CAACGCTGTG GCTACCCATG GTGGTGCTGT GATGTTCCAG
AGAATGAACA ACCCAGCCCA ACCCTCTGTT CGGACGTCAA TGAGAAACTT CCCTGCTATA
AAGTTGCTTC TTACCAACAC TAAGGTTCCT CGTAGTACAG CGGATCTCGT AGGAGGTGTG
GGAAAATTGA ATGTAGAATA CCCCAAAACG TCTAATTCCA TCTTGGAAGC AATGGAACAC
TTGAGCAACA CTGCTTACCA AATTATGGTG AGACCATTTT TTGGTGCTGA AGAAAGGAAA
AAGCTCCGAG AGTTGGTCAA CATCAACCAC GGCCTCTTGG TAGCATTGGG AGTATCGCAT
CCTTCGTTGG AAAAGGTCAA AATCATTACT GACACGAGCA AGTTGGGCTC CACCAAGCTC
ACAGGTGCTG GAGGAGGTGG ATGCGCCATC ACTCTTGTAG ATGAAGATGT TTCCGAGGCT
GACATTGCTC AAGGAATTGC AGAGTTGGAA AAGGAAGGGT ACGAATGCTT TGAAACCTCG
TTGGGTGGAA AGGGTGTAGG CTCATTGTCG TTTGAAGATG TTCCTCAAGA ATTGAGATCA
ACTGTATTTT CTCCAGAAAA GTTCTGTGCT TATTCCGACC GCATAGAGAT AGAAAAGGTT
TTAAGCACCA ATGCCCTTGA AGGATGGAGA TACTGGTGA
 
Protein sequence
MSTPFFVSAP GKVIIFGEHS AVYGKPAIAA ALSLRAYLLV TPSQDPDTIN LSFPDINLTH 
SWNKNDIPWD SIVKHINLVD NLPQTSEELV PEIVDQLGSV LADLNSSLHY TACLCFLYLY
THLCNQELAG MSFCIRSTLP IGAGLGSSAS TAVCLASALA ILGNRVTSAS FLQTDKILKK
ENNDLDFIDS WSLMGEKCFH GNPSGIDNAV ATHGGAVMFQ RMNNPAQPSV RTSMRNFPAI
KLLLTNTKVP RSTADLVGGV GKLNVEYPKT SNSILEAMEH LSNTAYQIMV RPFFGAEERK
KLRELVNINH GLLVALGVSH PSLEKVKIIT DTSKLGSTKL TGAGGGGCAI TLVDEDVSEA
DIAQGIAELE KEGYECFETS LGGKGVGSLS FEDVPQELRS TVFSPEKFCA YSDRIEIEKV
LSTNALEGWR YW