Gene PICST_35590 details

Gene Information

Locus tag: PICST_35590
Symbol: ALK2
ID: 4838296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1154778
End bp: 1156349
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 12
GC content: 45%
IMG OID: 640389611
Product: n-alkane inducible cytochrome P-450
Protein accession: XP_001383506
Protein GI: 150864612
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
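
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The Python sketch below checks that relationship under two assumptions: coordinates are 1-based and inclusive, and the gene is unspliced with a single terminal stop codon (the record lists "Is gene spliced: No"). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length_bp(1154778, 1156349)
print(length_bp, protein_length_aa(length_bp))
```

With the coordinates in this record, the computed lengths agree with the listed Gene Length and Protein Length.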


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAATT TTATTGAATT CGTCACCACC AACTGGTACA TCATCATTCC AGCACTTCTA 
GTGTTGCACA AGGTCTTTGA CCTCTTGTAT GTTCAGTATT TGTACAGAAA GCTTGGAGCA
AAGCCTTGCA CCAACCAGAC AGATGACCAT GCTTTTGGTA TTCGTGCTGG ATTTGAAATG
TTGAAGAAAA AGAACGAAGG AACTGTTGTA GACTTTGGTG CAGAAAGATT TGAATCCCGC
ATCGACCCCA AGATCCCAAC CTTTTCCATG AGATTGTTCT TGATTCCAAT TGTGCTTACC
AGAGACCCCG AAAACATAAA GGCATTATTG GCCACTCAAT TCAACGAGTT CGTATTAGGC
TCTAGATTCG AACAGCTTGC CCCATTGTTG GGTAAAGGTA TTTTCACGTT GGACGGTGAA
GGCTGGAAGC ATTCCAGAGC CATGTTGAGA CCACAGTTCG CTAAGGAGCA AGTTGCCCAT
GTGCAATCTT TGGAACCTCA CATACAAGCC TTGGCCAAGC ATGTTCGTAA CGCCAAGGGC
AAACTGTTTG ACATCCAAGA ATTGTTCCAC AGATTGACTG TCGACTCTGC CACTGAATTC
TTGTTTGGTC AATCTGTTGA ATCATTGAGA GACGAATCTG TTGGTATGGC CGATGAAGCT
ACGGACTTCG CAGGGAAGAG TACCTTCGCC GCTTCGTTCA CCATTGCCCA AAACTGGTTG
GCTAACAGAG CCGTTGCCCA GAAGTTCTAT TTCCTTATCA ACCCCAAAGA AATGCGTGAT
TCTATCAAAG ATGTTCACAG ATTTGTCGAT TACTACGTTC AGGTCGCATT GGACACTCCT
CAAGACGAAT TGGACAAAAA GTCCAAGGAC GGTTACATCT TCTTGTACGA ATTGGTGAAA
CAGACTAGAG ATCCATACGT GTTAAGATCG CAGTTGTTGA ACGTCTTGTT GGCTGGCCGT
GACACCACCG CCGGTTTGTT GTCATTTGCT TTCTTCGAAT TAGCCAGAAG ACCAGATATA
TGGAGCAAGT TGAAGGACGA AATCTATGAG AATTTCGGCC TGGGTGAGAA CTCCAAAGTT
GACGAGATTA CTTTCGAATC GTTGAAGAGA TGTGAATACT TGAAGGCATT CCTTAACGAA
ACCTTGAGAT TGTACCCATC TGTCCCTGTT AACTTCAGAG TTGCTACTAA GGACACCACC
TTGCCAAGAG GTGGTGGTAA GGATGGTAGT GAGCCTATTC TTGTCAGAAA GGGCCAGTCT
GTCTTCTACA GTGTCTATGC CACTCACAGA AGCGAAGCAT ACTACGGCAA GGACAGACAT
GTGTTCAGAC CTGAAAGATG GTTCGAGCCT TCTGCTAGGA AGCTCGGCTG GGCTTACTTG
CCATTCAATG GTGGTCCAAG AATCTGTTTG GGTCAACAGT TCGCCTTGAC TGAGGCTTCG
TACGTTGTCG CCAGATTGAT TCAACTTTTC CCTAACATTG AAAACTATGA ACCGGAGGAA
GTTTACCCAC CATTTAAGAA CTCCCAATTG ACCATGAACC TTTTGAACGG GTTACACATT
GGCTTATACT AG
 
Protein sequence
MANFIEFVTT NWYIIIPALL VLHKVFDLLY VQYLYRKLGA KPCTNQTDDH AFGIRAGFEM 
LKKKNEGTVV DFGAERFESR IDPKIPTFSM RLFLIPIVLT RDPENIKALL ATQFNEFVLG
SRFEQLAPLL GKGIFTLDGE GWKHSRAMLR PQFAKEQVAH VQSLEPHIQA LAKHVRNAKG
KSFDIQELFH RLTVDSATEF LFGQSVESLR DESVGMADEA TDFAGKSTFA ASFTIAQNWL
ANRAVAQKFY FLINPKEMRD SIKDVHRFVD YYVQVALDTP QDELDKKSKD GYIFLYELVK
QTRDPYVLRS QLLNVLLAGR DTTAGLLSFA FFELARRPDI WSKLKDEIYE NFGSGENSKV
DEITFESLKR CEYLKAFLNE TLRLYPSVPV NFRVATKDTT LPRGGGKDGS EPILVRKGQS
VFYSVYATHR SEAYYGKDRH VFRPERWFEP SARKLGWAYL PFNGGPRICL GQQFALTEAS
YVVARLIQLF PNIENYEPEE VYPPFKNSQL TMNLLNGLHI GLY
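
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The Python sketch below shows how the protein sequence follows from the gene sequence under that table. It builds the standard code and overrides the CTG codon; the names `codon_table` and `translate` are hypothetical helpers written for this illustration, not functions from an existing library.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def codon_table(ctg_as_serine: bool = True) -> dict:
    """Codon-to-residue map; CTG is set to serine for translation table 12."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, ctg_as_serine: bool = True) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    table = codon_table(ctg_as_serine)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
print(translate("ATGGCAAATT TTATTGAATT CGTCACCACC AACTGGTACA TCATCATTCC AGCACTTCTA"))
# A CTG-containing stretch from the gene, under table 12 and the standard code.
print(translate("AAACTGTTTG AC"), translate("AAACTGTTTG AC", ctg_as_serine=False))
```

The second call illustrates why the table matters here: the same codons give KSFD under table 12 and KLFD under the standard code, and the protein sequence in this record contains KSFD at that position.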