Gene Information |
Locus tag | PICST_58279 |
Symbol | |
ID | 4838858 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1385552 |
End bp | 1386808 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390173 |
Product | predicted protein |
Protein accession | XP_001384564 |
Protein GI | 150865376 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
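The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration and assumes that the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The variable names are made up for this example; the numbers are copied from the table.

```python
# Values copied from the Gene Information table.
start_bp = 1385552
end_bp = 1386808
protein_length_aa = 418

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon and is not counted as a residue.
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)  # 1257 0 418
```

Under those assumptions the computed values agree with the reported gene length (1257 bp) and protein length (418 aa).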
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.518781 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAGC TGGAAGTAGA GCAAATAGAT AAGAAGTATA ACGTGCGGCC ATTCATAGAA GCTCCGCTTT CGAAGTCTGA TATAGAGCCA GTCCAGTTGG ATGCTCTTGA TTTGTCTCTC TACAAGGATG GATCAGAACA TTTTGAAACC AGGAAGAAGT TGGCAGACCA ATTGGAAAAG TCAATTTCCA CTTCTGGATT CTTTTCTGTT GTAAATCATG GCATAGATGT AGAGAGATTC GAGAGTTTGA AGGCTATCGC CCAATCACTT TTGGAAATCC CTGCTGAAGA GCAAGGTCCA TATTTGGCTG GAGCATGGAA ATCTGATCTT GAAGACAGAA CTAAGTCTGT TGGCGCTGAA AGAGGTGCTG GTTTTAAACC CAAAGGCTAC TGGTCCATGA GAAACGGTGT TCATGATTCA ATTGTTCATT ACAACTTGAA CAATATGTTA CATCCATCGT TCTTCGACGA TTCCAAGAAC AACCACCATC CCTTAGTCAA AGCACATTTG GAAGAAATAG CTGGATACTT CAGGTATTTG CATAACGATG TCTTGAAAAA GATCACCTAC TTGTGTGATA TCATTTTAGA GATCCCTGAA GGAACAATCT GGAAATTGTA CTACAGTGTT GAAGAAAATG ACTTCGAAAG ATCTGGTCAA GGTGCTGGCA GGTTCATGTT GTACCACAAT ATGAAAGCAG AAGACGAGGC TAAGGTAGGG AAAAACTGGC TCAGGGGTCA TTCTGATTCT GGCGGATTCA CATTCATCAC TTCTCAACCA ATTTTATCTT TACAAGTTCG AGATTACTTC ACTGGAGAAT GGAGATATGT TGGCCACACT CCTAATGCCT TTATTGTCAA TATTGCTGAT GCCATGGAGT TCATCACTGG GGGATACTTC AAGTCATCGA TTCACAGAGT TGTCTCACCT CCGGAAGATC AAAAGAACTA CAGAAGATTG GTATTGATAT ACTTCTCAAG TCCAAAAAAC ATCTCCATTG TAGACCCTGA AGCATTGGAC TCTCCTAAAT TGGCAAGGTT GGGATTCCTG AAACCAGATG AATGGGCAAA GATCACGTTC AAAGATTGGT ATAGTATTAA GGGACTGTTG TTTGGCAGAA AAGCAGTCAA TGATTCCAAT AGTGATGAAC CAAACTTGGT TTTGTTGTAC GGAAGACTAC ATGAGAGGTG GCATCAAGCT GAAGCCAACT TCACTCTCGA AGAGGCAAGA AAGAGATTCA AAGTAATTGA AATCTGA
|
Protein sequence | MTQSEVEQID KKYNVRPFIE APLSKSDIEP VQLDALDLSL YKDGSEHFET RKKLADQLEK SISTSGFFSV VNHGIDVERF ESLKAIAQSL LEIPAEEQGP YLAGAWKSDL EDRTKSVGAE RGAGFKPKGY WSMRNGVHDS IVHYNLNNML HPSFFDDSKN NHHPLVKAHL EEIAGYFRYL HNDVLKKITY LCDIILEIPE GTIWKLYYSV EENDFERSGQ GAGRFMLYHN MKAEDEAKVG KNWLRGHSDS GGFTFITSQP ILSLQVRDYF TGEWRYVGHT PNAFIVNIAD AMEFITGGYF KSSIHRVVSP PEDQKNYRRL VLIYFSSPKN ISIVDPEALD SPKLARLGFS KPDEWAKITF KDWYSIKGSL FGRKAVNDSN SDEPNLVLLY GRLHERWHQA EANFTLEEAR KRFKVIEI
|
| |
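The record lists translation table 12 (the alternative yeast nuclear code), in which CTG encodes serine rather than leucine. This explains why the fourth codon of the gene sequence (CTG) appears as S in the protein sequence. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon table from the compact NCBI-style amino acid string, overrides CTG, and translates the first 30 bases of the gene sequence. The helper name translate is hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest; bases T, C, A, G).
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

codons = ["".join(c) for c in product(BASES, repeat=3)]
standard = dict(zip(codons, STANDARD_AAS))

# Translation table 12 differs from the standard code in one codon: CTG -> Ser.
table12 = dict(standard, CTG="S")


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence from the Sequence table.
first_30 = "ATGACTCAGC TGGAAGTAGA GCAAATAGAT"

print(translate(first_30, table12))   # MTQSEVEQID
print(translate(first_30, standard))  # MTQLEVEQID
```

The table 12 result matches the first ten residues of the listed protein sequence, while the standard code would give L at position 4.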