Gene PICST_58031 details

Gene Information

Locus tag: PICST_58031
Symbol: CP52M
ID: 4837981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 975795
End bp: 977345
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 12
GC content: 44%
IMG OID: 640389296
Product: Cytochrome P450 52A13 (Alkane hydroxylase 2) (Alkane-inducible p450alk 2) (DH-ALK2)
Protein accession: XP_001383817
Protein GI: 126134585
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
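
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table.
gene_len = span_length(975795, 977345)
print(gene_len)                  # 1551
print(protein_length(gene_len))  # 516
```

Under these assumptions, both results agree with the listed Gene Length (1551 bp) and Protein Length (516 aa).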


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.413796
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCTG AACTTGCTTT TGAATACTTG ACCAAATGGT ACTCGATATT GATCGGAGCT 
GCCTTGATCT ATGGTATTGC TCGTTACATC AAAATCCAGT TATTCATCAG GAAGCATGGT
TGTGAGGAGA CTCCTTTCCT TCCAGATGCT AAATGGTTTG CAATCCCAAT CATGTCTAGA
GTTCTTAAAG CCAAGAACGA AGGTAGATTG GTCGATTTGG CTCAAAGCTT TATGACGTCT
GATAGGAGAA CTACCCACGT CTACTTGGGC CCTGCCAGAA TTATCTTCAC CATCGACCCA
GAGAATATGA AGACCATGTT GGCTACCAAA TTTAACGACT ATGCTCTTGG ATTCAGACAC
ACCCATCTTG CCCCATTGTT GGGTGATGGT ATCTTCACTT TGGATGGCGA AGGATGGAAG
CATTCTAGAT CTATGTTAAG ACCTCAGTTT GCCAGAGAAC AAGTCGCCCA CGTCAGAGCC
TTGGAACCTC ACGTTCAAGT TTTAATGAAG CATATCAGGT TGAACAAGGG TAAGACGTTT
GATCTCCAAG AATTATTCTT CAAGTTGACC CTCGATACCT CAACTGAATT CTTGTTTGGT
GAGTCCATCT ACTCTTTGTA TGACTCTTCT ATTGGTTTAA CTCCTCCAAC TGACATCCAA
GGCAGATCCG AATTCGCTGA TGCTTTCAAC ACTTCGCAGA AGTACTTGGG TACCAGAGCA
TGGCTCCAAT TCATGTACTG GGTCGTTCAA AACAGGGAGT TCTATCAATG TAACGCTAAA
GTCCACAAGG TCGCTAAATA CTACGTCAAG AGAGCTTTGA ATTTCACTCC AGATGAACTC
GAAAAGGCTT CTGCCAACGG TTACACCTTC TTGTACGAAT TGGTCAAGCA AACTAGAGAC
CCAGTTGTGT TGCAAGATCA ATTGTTGAAC ATCTTGGTTG CTGGTAGAGA TACCACCGCT
GGTTTATTGT CGTTCACCTT CTTCGAATTG GCCAGAAACC CAGACGTCTT CGAAAAGTTG
AAGAATGAAA TCTACGAACA CTTCGGTAAG GGTGATGAGT CCAGAGTCGA AGACATCACT
TTCGAATCAT TGAAGCAGTG TGAATATTTG AAGTTCGTCT TGAACGAAGC CTTGAGAATG
TATCCATCTG TTCCTCTCAA CTTCAGAGTT TCTACAAAGG ACACCGTATT GCCAAATGGT
GGTGGTAAGG ATGGAACAAA GCCTGTTTTC GTTGGTAAGG GTACTACTGT TGCTTACACC
GTCTACTGTA CTCACAGAGA TGAAAAGTAC TACGGTAAGG ACGCCAATGT GTTCAGACCA
GAAAGATGGG CCACCTTGAA CAAATTGGGA TGGGCCTACC TTCCTTTCAA CGGTGGACCA
AGAATCTGTT TGGGTCAGCA GTTTGCATTG ACTGAAGCTT CTTATGTTAT TGTCAGATTA
TTGCAAAACT TCCCTAACTT GGTTTCCAAG GATGACAGAC CATACCCACC AGCAAAGTCG
ATGCATTTGA CAATGTGCCA CCAAGACGGA ATCTTTGTTG AATTGTCTTA G
 
Protein sequence
MSAELAFEYL TKWYSILIGA ALIYGIARYI KIQLFIRKHG CEETPFLPDA KWFAIPIMSR 
VLKAKNEGRL VDLAQSFMTS DRRTTHVYLG PARIIFTIDP ENMKTMLATK FNDYALGFRH
THLAPLLGDG IFTLDGEGWK HSRSMLRPQF AREQVAHVRA LEPHVQVLMK HIRLNKGKTF
DLQELFFKLT LDTSTEFLFG ESIYSLYDSS IGLTPPTDIQ GRSEFADAFN TSQKYLGTRA
WLQFMYWVVQ NREFYQCNAK VHKVAKYYVK RALNFTPDEL EKASANGYTF LYELVKQTRD
PVVLQDQLLN ILVAGRDTTA GLLSFTFFEL ARNPDVFEKL KNEIYEHFGK GDESRVEDIT
FESLKQCEYL KFVLNEALRM YPSVPLNFRV STKDTVLPNG GGKDGTKPVF VGKGTTVAYT
VYCTHRDEKY YGKDANVFRP ERWATLNKLG WAYLPFNGGP RICLGQQFAL TEASYVIVRL
LQNFPNLVSK DDRPYPPAKS MHLTMCHQDG IFVELS
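
The relationship between the gene and protein sequences can be illustrated with a minimal translation sketch. It assumes that "Translation table 12" refers to NCBI genetic code 12 (Alternative Yeast Nuclear Code), which differs from the standard code only in reading CTG as serine instead of leucine. The `translate` helper is hypothetical, not part of any database tooling, and it raises `KeyError` on characters other than A, C, G, and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order under NCBI table 12 (assumed).
# The only difference from the standard code is CTG -> S instead of L.
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence; each full row is 60 bases,
# so every row starts in frame.
first_row = "ATGTCCGCTG AACTTGCTTT TGAATACTTG ACCAAATGGT ACTCGATATT GATCGGAGCT"
last_row = "ATGCATTTGA CAATGTGCCA CCAAGACGGA ATCTTTGTTG AATTGTCTTA G"

print(translate(first_row))  # MSAELAFEYLTKWYSILIGA
print(translate(last_row))   # MHLTMCHQDGIFVELS
```

The two outputs match the first 20 and last 16 residues of the protein sequence listed above. Passing the full gene sequence to the same function is the natural way to check the whole record.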