Gene PICST_83020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_83020 
Symbol 
ID4838335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1281543 
End bp1283124 
Gene Length1582 bp 
Protein Length447 aa 
Translation table12 
GC content45% 
IMG OID640389650 
Productpredicted protein 
Protein accessionXP_001383530 
Protein GI126134011 
COG category[I] Lipid transport and metabolism 
COG ID[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID[TIGR01833] 3-hydroxy-3-methylglutaryl-CoA-synthase, eukaryotic clade 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.203181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CACATCGGAT TTCAAATTCT ACACTCTTTT CTTCAGTGGC GATTTTGGTT GTCTTTTTGC 
TTCTTTTGCT TCACTTTCTT TAACAATTCA CATGTCTCCA CAGAATATCG GTATTAAGGC
CATTGAGGTC TACATTCCAA CCCAGGCTGT CAGCCAGTCT GAGTTGGAGA AGTTCGACGG
CATTCCTGCT GGCAAATACA CCATTGGCTT GGGCCAGACC AACATGGCCT TCGTCAACGA
CAGAGAAGAC ATCTATTCGC TCTCACTTAC AGTCTTGTCC AAGTTGATTT CTAACTATAA
GATCGACACC AACAACATCG GTCGTTTGGA AGTAGGCACT GAGACACTTT TGGACAAGTC
CAAGTCTGTC AAGTCTGTGT TGATGCAATT ATTTCCAGGC AACAACGACA TCGAAGGTAT
CGACACTGTT AATGCATGTT ATGGTGGTAC CGCTGCTGTG ATCAATGCCC TCAACTGGAT
CGAATCATCC TCGTGGGATG GTAGAGACGC TATCGTCGTC GCTGGTGACA TTGCTATCTA
CGATAAGGGT GCTGCCAGAC CCACTGGTGG TGTTGGTTCC GTGGCTCTTT TGATTGGTCC
AGATGCTCCA ATTGTGTTTG AATCTACTCG TGGTTCATAC ATGGAACACG CCTACGACTT
CTATAAGCCT GACTTCACTT CTGAATATCC CGTTGTTGAT GGCCACTTCT CTTTGGCTTG
TTATGTCAAG GCTCTTGACC AATGTTATCG TGCCTACTCC AAGAAGGTCA CCAAGGATGC
CACCAAGACC GTTGGACTCT ACAACCACTT CGATTACAAT GCTTTCCACG TTCCTACCTG
CAAGTTGGTG TCCAAGTCGT ACGCCAGATT GTTGTACAAC GACTACATAG CAGACCCAAC
CAAATTTGCT GAGACTATCG ATGAAGCTAC CAGAACTGCT CTCGACAGTT TGACCTACGA
GCAGTCATTG GTCGACAAGA ACTTGGAAAA GGTATTTGTA GGCTTGACTA AGCAAGAAGC
TAAATCCAGA TTGGAACCTG CTCTCACGGT ACCTACCAAC ACCGGTAACA TGTACACGGC
CTCTGCCTGG GCCTCGTTGT CCTCGTTGCT TTACTTCGTT GGCTCTGAGA AATTGCAGGG
CAAGAGAGTC GGAATCTTCT CCTACGGTTC CGGTTTGGCC TCTTCGTTGT TGTCTGTTGT
GGTCAAGGGA GATATCTCTG CCATCACTAC TAACTTGAAC TTTGACTACA AGTTGGGCGA
AGGAAGAAAG ATTGAATCCC CAGAACAGTA CATCGCTGCC ATTGCCTTGA GAGAAAAGGC
TCACTTGCAA AAGTCCTTCA AACCTACTGG TTCCATCGAC AACTTGGCCA AGGGTACCTA
CTACTTGGTT GAAGTTGACG ACAAGTTCAG AAGAAGTTAC GATGTTAAGA ACTAGATCTT
CTGAACCGTT TCATAGTACT CATCATGGCA TCTGTTAAAT CTTGTACTAT ATATTTGCTA
GCATATCGTT CCATCGTCTC GCTATTCTTT TACTTCGATA TATTCTACTT GTCCTTAGCT
ATTTAATAAT AGAACAACTC AG
 
Protein sequence
MSPQNIGIKA IEVYIPTQAV SQSELEKFDG IPAGKYTIGL GQTNMAFVND REDIYSLSLT 
VLSKLISNYK IDTNNIGRLE VGTETLLDKS KSVKSVLMQL FPGNNDIEGI DTVNACYGGT
AAVINALNWI ESSSWDGRDA IVVAGDIAIY DKGAARPTGG VGSVALLIGP DAPIVFESTR
GSYMEHAYDF YKPDFTSEYP VVDGHFSLAC YVKALDQCYR AYSKKVTKDA TKTVGLYNHF
DYNAFHVPTC KLVSKSYARL LYNDYIADPT KFAETIDEAT RTALDSLTYE QSLVDKNLEK
VFVGLTKQEA KSRLEPALTV PTNTGNMYTA SAWASLSSLL YFVGSEKLQG KRVGIFSYGS
GLASSLLSVV VKGDISAITT NLNFDYKLGE GRKIESPEQY IAAIALREKA HLQKSFKPTG
SIDNLAKGTY YLVEVDDKFR RSYDVKN