Gene Information |
Locus tag | PICST_36557 |
Symbol | |
ID | 4840122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 279889 |
End bp | 281094 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391437 |
Product | predicted protein |
Protein accession | XP_001385393 |
Protein GI | 150865963 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
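
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 279889
end_bp = 281094
reported_gene_length = 1206      # bp
reported_protein_length = 401    # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported gene length and protein length, and the gene length is a whole number of codons, which fits an unspliced CDS.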
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.325381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTGGCAC TATCTGCTGA GGAATTAGAT AGGAAGTATA ATGTGCGTCC GTTTGTTGAC CCTGAGCCAA CTAAAATCGA TGTGAATCCT TTAGTTTTAA CAGCAATAGA CCTTTCCTTA TTCAAAGAAG GGGATGAACA TCTAAACGAA AGAAAGAGTT TAGCAAAGGT GCTAGAATCA TCTGTAACCA CATATGGTTT TTTCAATTTG GTCAATTTTG GTATACCTAA AGAACGAATT GAACATATAA GAGCTATTAG TCAGAGCTTG CTTACAATTC CATACGAAGA AAAGTTGAAG TATTTGGCGA GTGCCGCTAC TAAAGAAGAA GAGAAGCCTA AAAGCATAGG TGCCGAGCGT GGCCAGGGAT TTAAGCCAAA AGGCTACTGG TCCATCAAGA ACGGAATTCG AGACTCGATT GATTTCTATA ATGTCAGAGA TACTTATCAT GATTCGTTCT TAGAGACTCC GGAAGCACAC CCTGAGTTGT TGCAAGTCCA TCTCAAAGAA GTGGCAGACT ATTATAACCA TTTACATAGA GTTGTATTGC CCAAACTTTT GCGATTGTTT GACTTAATTT TCAAGATTCC CGAAGGTACC TTGTTGAAAC GGTATTTCCA CAAAAGTGGA ACAAACGAAG ATACGTCAGG CAGCCATGGT CGTCTTATGT TGTACCGGCC ATATGAGAAT CAACAAGAGT TTGAGCAGAC AGACAAGATG TTCTTGCGTG GGCACTCAGA TATTAGCGCG CTTACCTTTA TAACTTCCCA GCCAATATTG GCCTTGCAGA TTATGGATGT CTACACTGGA GCGTGGAGAT ATGTTGCTCA TCGCGACGAC TCTTTGATTG TCAATATTGG GGATGCGCTT GAGTTCATCA GCGGTGGTCA TTTCAAGGCT TGTCTCCATA GAGTGGTCGA GCCTCCTGCG GATCAGAGAG GGTTTAATCG GCTTGTGGTT ATTTACTTTT GCAACCCAAG TGACAACTCC GAGATGGATC CCGAGCTCTT GGACTCTCCT GCATTACGCA GATTGGGGTA CACCAGGGAG GATAAGTTGA AGCAATGGGA AAAGATCCAA TTCCATGACT GGAACACTAC GAAAGGCGAG CTCCTTGGGA GAACCGCAGC TGGTGAGAGA AATCTACTTC AGTATCACGG AAGGTACATT GAGAGGTGGC ACCGATTTTC GGAATTGGCA AATTAG
Protein sequence | MSALSAEELD RKYNVRPFVD PEPTKIDVNP LVLTAIDLSL FKEGDEHLNE RKSLAKVLES SVTTYGFFNL VNFGIPKERI EHIRAISQSL LTIPYEEKLK YLASAATKEE EKPKSIGAER GQGFKPKGYW SIKNGIRDSI DFYNVRDTYH DSFLETPEAH PELLQVHLKE VADYYNHLHR VVLPKLLRLF DLIFKIPEGT LLKRYFHKSG TNEDTSGSHG RLMLYRPYEN QQEFEQTDKM FLRGHSDISA LTFITSQPIL ALQIMDVYTG AWRYVAHRDD SLIVNIGDAL EFISGGHFKA CLHRVVEPPA DQRGFNRLVV IYFCNPSDNS EMDPELLDSP ALRRLGYTRE DKLKQWEKIQ FHDWNTTKGE LLGRTAAGER NLLQYHGRYI ERWHRFSELA N
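
The protein sequence begins with MS although the second codon of the gene sequence is CTG, which the standard genetic code reads as leucine. This follows from translation table 12 (the alternative yeast nuclear code), in which CTG encodes serine. The sketch below illustrates the effect on the first three and the last 10-base blocks of the gene sequence. It builds the codon table by hand rather than relying on a library, and the helper names (`translate`, `standard_code`, `table12_code`) are hypothetical, introduced here for illustration only.

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G, third position fastest.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

standard_code = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}

# Translation table 12 differs from the standard code in its amino acid
# assignments only at CTG, which is read as serine instead of leucine.
table12_code = dict(standard_code, CTG="S")


def translate(dna, code):
    """Translate a DNA string codon by codon; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3   # ignore any trailing partial codon
    return "".join(code[dna[i:i + 3]] for i in range(0, usable, 3))


first_blocks = "ATGCTGGCAC TATCTGCTGA GGAATTAGAT"   # start of the gene sequence
last_block = "AATTAG"                                # final block of the gene sequence

print(translate(first_blocks, table12_code))   # MSALSAEELD
print(translate(first_blocks, standard_code))  # MLALSAEELD
print(translate(last_block, table12_code))     # N*
```

With table 12 the first ten residues match the start of the listed protein sequence, whereas the standard code would give leucine at position 2. The final block translates to N followed by a stop codon, matching the last residue of the protein.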