Gene PICST_37142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_37142 
SymbolCYP52 
ID4841090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp217096 
End bp218370 
Gene Length1275 bp 
Protein Length424 aa 
Translation table12 
GC content44% 
IMG OID640392405 
ProductCytochrome P450 52A3 (CYPLIIA3) (Alkane-inducible P450-ALK1-A) (P450-CM1) (CYP52A3-A) (Cytochrome P-450ALK) 
Protein accessionXP_001386440 
Protein GI150866745 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATGA CAACCGATCC CGAGAACTTC AAGGCTATGT TGGCTACCCA ATTTAATGAT 
TTTTCTATTG GCCGTAGATA CCAGATCTTG AGTCCAGTGA TTGGTGACAG TATCTTCACT
TTGGATGGTG AAGGTTGGAA GCACTCCAGG GCCATGTTAA GACCCCAGTT TGTCAGGGAG
CAAGTTGGAC ATGTCCAGGC TTTGGAACCT CACTTACAGT TACTTGCTAA ACATATTCGC
TCCTACAAAG GAGAAACAGT TGATTTGCAG CAGTTGTTCA CTAAGTTCAC TCTTGATACA
GCTACAGAAT TCCTTTTCGG TCAAAGTGTT CATACCTTGT ATGACGAAAG AATTGGCATG
AAGACTCCTG ATGATGTTCC ATATGCGAAA GACTTCACCG ATGGTTTGTT TATTACCCAA
AAGTACACCT CGGAAAGAGG CTATGCTCAA CAGTTCTACT GGTTAATTGA TGGCAAGGAA
TTCAGAACTG CGATTGCCAA CGTTCATAAG TTCGCCCGTT TTTACGTCGA TAGGGCTCTC
AACTTCTCGC AAGCTGAGCT TGAAAAGAAA TCACAGGAAA GTTATACCTT CTTATACGAG
TTGGTGCAAC AAACCAGAGA CCCTAAAGTT CTCCAGGATC AATTGCTTGC CATCATGTTA
GCTGGCAGAG ACACCACATC TTCACTACTT TCATTCATCT TCTACGAACT TTCCCGCAAC
CCTGGGATTT GGGAAAAGTT GAAAAAGGAA GTATACGAAA ACTTTGGCTC TGGAACAGAA
AAAGATATTG CCAAGATCAC GTTCGAATCG TTGAAGAAGT GTAACTACGT GAAGTGGGTG
ATTAACGAAA CGTTGAGAAT GTACCCTACT GTGCCTGTTA ATTTGAGGGT CTCTAATAAA
GATACTCTGT TGCCTAAAGG AGGTGGTGAA GACGGAAAGT CGCCAATTTT TATTCCACGG
GGCACTACAG TTGGGTTCAG AGTTTACTCC ACGCAGAGAA ATAAAGAATA CTACGGTGAA
GATCCTGACG TTTTCAGACC GGAAAGATGG GCCGACATCG GCAAGTTGGG ATGGGCATAC
CTTCCGTTCT TAGGAGGACC CAGAACATGT ATCGGACAAC AGTTTGCCCT CACCGAAGCC
GGGTACATTC TCGTGAGAAT AGCTCAATTG TTCCCTAACC TCAAGTCTAA GAACAGTGTT
CATTATCCTC CAAAGAAGAC TCTCAACGTT ATTTTCAATC TCTTTGAGGG CTGTTTGGTG
GAGATGGGTG AGTAG
 
Protein sequence
MFMTTDPENF KAMLATQFND FSIGRRYQIL SPVIGDSIFT LDGEGWKHSR AMLRPQFVRE 
QVGHVQALEP HLQLLAKHIR SYKGETVDLQ QLFTKFTLDT ATEFLFGQSV HTLYDERIGM
KTPDDVPYAK DFTDGLFITQ KYTSERGYAQ QFYWLIDGKE FRTAIANVHK FARFYVDRAL
NFSQAELEKK SQESYTFLYE LVQQTRDPKV LQDQLLAIML AGRDTTSSLL SFIFYELSRN
PGIWEKLKKE VYENFGSGTE KDIAKITFES LKKCNYVKWV INETLRMYPT VPVNLRVSNK
DTSLPKGGGE DGKSPIFIPR GTTVGFRVYS TQRNKEYYGE DPDVFRPERW ADIGKLGWAY
LPFLGGPRTC IGQQFALTEA GYILVRIAQL FPNLKSKNSV HYPPKKTLNV IFNLFEGCLV
EMGE