Gene PICST_58279 details


Gene Information

Locus tag: PICST_58279
Symbol:
ID: 4838858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1385552
End bp: 1386808
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 12
GC content: 41%
IMG OID: 640390173
Product: predicted protein
Protein accession: XP_001384564
Protein GI: 150865376
COG category: [R] General function prediction only
COG ID: [COG3491] Isopenicillin N synthase and related dioxygenases
TIGRFAM ID:
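
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record: it assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information record.
START_BP = 1385552
END_BP = 1386808
GENE_LENGTH_BP = 1257
PROTEIN_LENGTH_AA = 418


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Codon count minus one, assuming an unspliced CDS that ends in a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported gene length and protein length agree with the reported coordinates.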


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.518781
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAGC TGGAAGTAGA GCAAATAGAT AAGAAGTATA ACGTGCGGCC ATTCATAGAA 
GCTCCGCTTT CGAAGTCTGA TATAGAGCCA GTCCAGTTGG ATGCTCTTGA TTTGTCTCTC
TACAAGGATG GATCAGAACA TTTTGAAACC AGGAAGAAGT TGGCAGACCA ATTGGAAAAG
TCAATTTCCA CTTCTGGATT CTTTTCTGTT GTAAATCATG GCATAGATGT AGAGAGATTC
GAGAGTTTGA AGGCTATCGC CCAATCACTT TTGGAAATCC CTGCTGAAGA GCAAGGTCCA
TATTTGGCTG GAGCATGGAA ATCTGATCTT GAAGACAGAA CTAAGTCTGT TGGCGCTGAA
AGAGGTGCTG GTTTTAAACC CAAAGGCTAC TGGTCCATGA GAAACGGTGT TCATGATTCA
ATTGTTCATT ACAACTTGAA CAATATGTTA CATCCATCGT TCTTCGACGA TTCCAAGAAC
AACCACCATC CCTTAGTCAA AGCACATTTG GAAGAAATAG CTGGATACTT CAGGTATTTG
CATAACGATG TCTTGAAAAA GATCACCTAC TTGTGTGATA TCATTTTAGA GATCCCTGAA
GGAACAATCT GGAAATTGTA CTACAGTGTT GAAGAAAATG ACTTCGAAAG ATCTGGTCAA
GGTGCTGGCA GGTTCATGTT GTACCACAAT ATGAAAGCAG AAGACGAGGC TAAGGTAGGG
AAAAACTGGC TCAGGGGTCA TTCTGATTCT GGCGGATTCA CATTCATCAC TTCTCAACCA
ATTTTATCTT TACAAGTTCG AGATTACTTC ACTGGAGAAT GGAGATATGT TGGCCACACT
CCTAATGCCT TTATTGTCAA TATTGCTGAT GCCATGGAGT TCATCACTGG GGGATACTTC
AAGTCATCGA TTCACAGAGT TGTCTCACCT CCGGAAGATC AAAAGAACTA CAGAAGATTG
GTATTGATAT ACTTCTCAAG TCCAAAAAAC ATCTCCATTG TAGACCCTGA AGCATTGGAC
TCTCCTAAAT TGGCAAGGTT GGGATTCCTG AAACCAGATG AATGGGCAAA GATCACGTTC
AAAGATTGGT ATAGTATTAA GGGACTGTTG TTTGGCAGAA AAGCAGTCAA TGATTCCAAT
AGTGATGAAC CAAACTTGGT TTTGTTGTAC GGAAGACTAC ATGAGAGGTG GCATCAAGCT
GAAGCCAACT TCACTCTCGA AGAGGCAAGA AAGAGATTCA AAGTAATTGA AATCTGA
 
Protein sequence
MTQSEVEQID KKYNVRPFIE APLSKSDIEP VQLDALDLSL YKDGSEHFET RKKLADQLEK 
SISTSGFFSV VNHGIDVERF ESLKAIAQSL LEIPAEEQGP YLAGAWKSDL EDRTKSVGAE
RGAGFKPKGY WSMRNGVHDS IVHYNLNNML HPSFFDDSKN NHHPLVKAHL EEIAGYFRYL
HNDVLKKITY LCDIILEIPE GTIWKLYYSV EENDFERSGQ GAGRFMLYHN MKAEDEAKVG
KNWLRGHSDS GGFTFITSQP ILSLQVRDYF TGEWRYVGHT PNAFIVNIAD AMEFITGGYF
KSSIHRVVSP PEDQKNYRRL VLIYFSSPKN ISIVDPEALD SPKLARLGFS KPDEWAKITF
KDWYSIKGSL FGRKAVNDSN SDEPNLVLLY GRLHERWHQA EANFTLEEAR KRFKVIEI
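
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. This is why the fourth codon of the gene sequence, CTG, corresponds to S in the protein sequence. The sketch below shows one way to reproduce that translation without external libraries. The `translate` helper and the variable names are illustrative assumptions, and the amino-acid string follows the NCBI codon ordering (bases in the order T, C, A, G).

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# It differs from the standard code only at CTG (Ser instead of Leu).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_block = "ATGACTCAGC TGGAAGTAGA GCAAATAGAT"
print(translate(first_block))  # MTQSEVEQID
```

Passing the full gene sequence to `translate` in the same way should give a protein that can be compared against the listed protein sequence once the spacing is removed.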