Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_34797 |
Symbol | |
ID | 4837262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 199313 |
End bp | 200581 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640388577 |
Product | predicted protein |
Protein accession | XP_001382803 |
Protein GI | 126132556 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.242371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACAA CTGAAAATTT GAATATTGAC GAAATTGACA AGAAGTACAA TTTGAGACCA TTTGTAGAGG CTACTCCGTC AGATACTGCT GCAGAAGTTA TCCAATTGAA CTCACTAGAT TTGTCTCTTT TCCAGGAAGG ACCAGATTTC TTGGATCAGA GAAAAAAACT TGCTACCCAG CTTGAAGAGT CTCTCTCAAC TGTAGGATTT TTCGCTTTGG TTAACCATGG AATCAGCCAA GACACGTTTG ACCAATTGAG GTCTGTTGCT CAATCCACGT TTGAGTTGCC GGATCAGGAA AAGAAGAAGT ACTTGTCTGG AGCATTGACT TCTGATACAG AAGACAGAAG TGTTTCATTA GGTGCGGAAA GAGGTGCCGG ATTCAAACCA AAGGGATATT GGTCTATGAA GAACGGAGTC AAAGATAGTA TTGAATTGTA CAATTTCAGG GACTTGCAAC AAAGGGAAGT TTATGATTCC TCCAAGCCCT ACCCAGAGAT AGTGAAAGCA CATCTTCCAA ATGTTGTGAG CTATTTTAGA TTCATACATG GCAATATCTT GAAGAAGTTG ACTATTTTAT GTGATATTAT ATTAGAGCTT CCAGAAGGTT ACTTGTGGGA GAACTACTTC AAGGTTGTGG ATGGTGATTC CTATAATTCA GGAAGTGGAT TCGGAAGATT CATGATCTAC CATGCTTTGA ATCCTGAAGA TGAAGCAAAA GTTGATAACA ATTGGCTCCG TGGACATTCT GATGGCACGG CGTTCACATT TATTACATCC CAGCCTATCT TGTCATTACA GATAAGAGAC TATTATACTG GTGATTGGAA GTATGTTGGC CATACACCTA ACGGACTTAT TGTTAATATA GGCGATGCAT TGGAATTTAT AACTGGTGCA TACTTCAAGT CTTCTATACA TCGAGTCGTA ACCCCACCTG ATGATCAGAA AAATTTTAAA AGATTGGTAA TCATTTACTT CTGTGATCCC AAGCTTCCTT CTATTCTCGA TCCCGAGCCA TTGAATTCTC CAAAATTGAA AAGATTGGGA TACAGAAAAC ACGATGAATG GGAAAGGATT ACATTCCAGC AATGGGACGA GGAAAAAGGT AGATTATTTG GAAGGAGTGA CGTAAACGAT GCCAAAAGTG ACGAACCAAA CTTGGTGCTA CTCTACGGAA GACTACATGA AAGGTGGCAT CAAGCAGAAC ACAATTTCTC TCTTGAAGAA GCTAGGAAGA AGTATAAGGT AATTGAAAAC AAAAGTTAA
|
Protein sequence | MATTENLNID EIDKKYNLRP FVEATPSDTA AEVIQLNSLD LSLFQEGPDF LDQRKKLATQ LEESLSTVGF FALVNHGISQ DTFDQLRSVA QSTFELPDQE KKKYLSGALT SDTEDRSVSL GAERGAGFKP KGYWSMKNGV KDSIELYNFR DLQQREVYDS SKPYPEIVKA HLPNVVSYFR FIHGNILKKL TILCDIILEL PEGYLWENYF KVVDGDSYNS GSGFGRFMIY HALNPEDEAK VDNNWLRGHS DGTAFTFITS QPILSLQIRD YYTGDWKYVG HTPNGLIVNI GDALEFITGA YFKSSIHRVV TPPDDQKNFK RLVIIYFCDP KLPSILDPEP LNSPKLKRLG YRKHDEWERI TFQQWDEEKG RLFGRSDVND AKSDEPNLVL LYGRLHERWH QAEHNFSLEE ARKKYKVIEN KS
|
| |