Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_58031 |
Symbol | CP52M |
ID | 4837981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 975795 |
End bp | 977345 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389296 |
Product | Cytochrome P450 52A13 (Alkane hydroxylase 2) (Alkane-inducible p450alk 2) (DH-ALK2) |
Protein accession | XP_001383817 |
Protein GI | 126134585 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.413796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCTG AACTTGCTTT TGAATACTTG ACCAAATGGT ACTCGATATT GATCGGAGCT GCCTTGATCT ATGGTATTGC TCGTTACATC AAAATCCAGT TATTCATCAG GAAGCATGGT TGTGAGGAGA CTCCTTTCCT TCCAGATGCT AAATGGTTTG CAATCCCAAT CATGTCTAGA GTTCTTAAAG CCAAGAACGA AGGTAGATTG GTCGATTTGG CTCAAAGCTT TATGACGTCT GATAGGAGAA CTACCCACGT CTACTTGGGC CCTGCCAGAA TTATCTTCAC CATCGACCCA GAGAATATGA AGACCATGTT GGCTACCAAA TTTAACGACT ATGCTCTTGG ATTCAGACAC ACCCATCTTG CCCCATTGTT GGGTGATGGT ATCTTCACTT TGGATGGCGA AGGATGGAAG CATTCTAGAT CTATGTTAAG ACCTCAGTTT GCCAGAGAAC AAGTCGCCCA CGTCAGAGCC TTGGAACCTC ACGTTCAAGT TTTAATGAAG CATATCAGGT TGAACAAGGG TAAGACGTTT GATCTCCAAG AATTATTCTT CAAGTTGACC CTCGATACCT CAACTGAATT CTTGTTTGGT GAGTCCATCT ACTCTTTGTA TGACTCTTCT ATTGGTTTAA CTCCTCCAAC TGACATCCAA GGCAGATCCG AATTCGCTGA TGCTTTCAAC ACTTCGCAGA AGTACTTGGG TACCAGAGCA TGGCTCCAAT TCATGTACTG GGTCGTTCAA AACAGGGAGT TCTATCAATG TAACGCTAAA GTCCACAAGG TCGCTAAATA CTACGTCAAG AGAGCTTTGA ATTTCACTCC AGATGAACTC GAAAAGGCTT CTGCCAACGG TTACACCTTC TTGTACGAAT TGGTCAAGCA AACTAGAGAC CCAGTTGTGT TGCAAGATCA ATTGTTGAAC ATCTTGGTTG CTGGTAGAGA TACCACCGCT GGTTTATTGT CGTTCACCTT CTTCGAATTG GCCAGAAACC CAGACGTCTT CGAAAAGTTG AAGAATGAAA TCTACGAACA CTTCGGTAAG GGTGATGAGT CCAGAGTCGA AGACATCACT TTCGAATCAT TGAAGCAGTG TGAATATTTG AAGTTCGTCT TGAACGAAGC CTTGAGAATG TATCCATCTG TTCCTCTCAA CTTCAGAGTT TCTACAAAGG ACACCGTATT GCCAAATGGT GGTGGTAAGG ATGGAACAAA GCCTGTTTTC GTTGGTAAGG GTACTACTGT TGCTTACACC GTCTACTGTA CTCACAGAGA TGAAAAGTAC TACGGTAAGG ACGCCAATGT GTTCAGACCA GAAAGATGGG CCACCTTGAA CAAATTGGGA TGGGCCTACC TTCCTTTCAA CGGTGGACCA AGAATCTGTT TGGGTCAGCA GTTTGCATTG ACTGAAGCTT CTTATGTTAT TGTCAGATTA TTGCAAAACT TCCCTAACTT GGTTTCCAAG GATGACAGAC CATACCCACC AGCAAAGTCG ATGCATTTGA CAATGTGCCA CCAAGACGGA ATCTTTGTTG AATTGTCTTA G
|
Protein sequence | MSAELAFEYL TKWYSILIGA ALIYGIARYI KIQLFIRKHG CEETPFLPDA KWFAIPIMSR VLKAKNEGRL VDLAQSFMTS DRRTTHVYLG PARIIFTIDP ENMKTMLATK FNDYALGFRH THLAPLLGDG IFTLDGEGWK HSRSMLRPQF AREQVAHVRA LEPHVQVLMK HIRLNKGKTF DLQELFFKLT LDTSTEFLFG ESIYSLYDSS IGLTPPTDIQ GRSEFADAFN TSQKYLGTRA WLQFMYWVVQ NREFYQCNAK VHKVAKYYVK RALNFTPDEL EKASANGYTF LYELVKQTRD PVVLQDQLLN ILVAGRDTTA GLLSFTFFEL ARNPDVFEKL KNEIYEHFGK GDESRVEDIT FESLKQCEYL KFVLNEALRM YPSVPLNFRV STKDTVLPNG GGKDGTKPVF VGKGTTVAYT VYCTHRDEKY YGKDANVFRP ERWATLNKLG WAYLPFNGGP RICLGQQFAL TEASYVIVRL LQNFPNLVSK DDRPYPPAKS MHLTMCHQDG IFVELS
|
| |