Gene Information |
Locus tag | PICST_37559 |
Symbol | |
ID | 4851544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2093775 |
End bp | 2094812 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393252 |
Product | predicted protein |
Protein accession | XP_001388030 |
Protein GI | 126274804 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
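The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS includes its stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2093775, 2094812)
print(length_bp)                  # 1038
print(protein_length(length_bp))  # 345
```

Both results agree with the Gene Length (1038 bp) and Protein Length (345 aa) fields of the record.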
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.960018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.136249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGAATG CTGCTGTCAG AACTCCGGTA GTAGTTTCCT TAAAGGAATT AGTCCAGGGA ATTGACCATG CTACTTTGGC AGAAGCTTTT GGACCCCAGT CGTTAGGAAT CATCGTTATC AAAGATTTAC CCCAGAAGTT TCATGATCTC AGATTGAAGG TGTTGAAGTC GATTTCGATA TTGGCCAACT TGGGGCCTGA CGTGTTAAGT AATCTAGAAT CAGAGGAGGC AATGTGGTTA ACAGGCTGGT CTTGTGGTAA AGAAATATTG GCTAACTCAG GAAAACCAGA CTTTAACAAA GGTTCCTACT ATGTGAACTG TGCCTTCCAC AAGAATCCTG AGTGGGAAGG ACCGACGGAA AAATTGACCA AAGAGTTCAT CAACCACAGG GCATACACCA CAGCCAATAT GTGGCCTTCT GCAGATCACA AAGGTCTTGA AAATTTTCAA GAAGATGCTA AGGAGCTTAT TAGCTTAATC ATAGATGTAG CCCAATCTGT AGCTGCTAAT TGTGACAAAT TCATTACAGA GAGCAAAATC TCTCCCAACT ACGAACAAAA CTACTTAGAA CGAATTGTGA AAAACTCGAC TTGTACGAAG GCAAGGTTAC TCCATTATTT TCCACTGAAG TCGTCGTCGG AATCGGGCAA AGATGATGAC TGGTGCGGTG AGCATTTGGA CCACTCTTGT CTCACAGGAT TGACATCTGC TTTGTTCATC GACGAATCTA AGGGTCTAAC CGCTGCTCTT GATAAATCCC CAGACCCTGA ACTGGGTTTG TACATTCGTG ACAGACAGAA TGAAGTGGTT AAAGTGAACA TTCCTCCCGA ATGTCTTGCT TTCCAGACTG GATCTACTCT CCAGGAAGTT TCTCGAGGAA AATTCCTGGC AGTACCCCAC TATGTCAAAG GAACTTCGAT TCCAAATATC GCTAGAAACA CTTTGGCTGT GTTCTGCCAG CCAGACTTGG ACGAAATGGT TAATGATTCT GAGAACTTTG CCCAGTATGC CGATAGAATT CTCAAGGCCA ACCACTAA
Protein sequence | MTNAAVRTPV VVSLKELVQG IDHATLAEAF GPQSLGIIVI KDLPQKFHDL RLKVLKSISI LANLGPDVLS NLESEEAMWL TGWSCGKEIL ANSGKPDFNK GSYYVNCAFH KNPEWEGPTE KLTKEFINHR AYTTANMWPS ADHKGLENFQ EDAKELISLI IDVAQSVAAN CDKFITESKI SPNYEQNYLE RIVKNSTCTK ARLLHYFPLK SSSESGKDDD WCGEHLDHSC LTGLTSALFI DESKGLTAAL DKSPDPELGL YIRDRQNEVV KVNIPPECLA FQTGSTLQEV SRGKFLAVPH YVKGTSIPNI ARNTLAVFCQ PDLDEMVNDS ENFAQYADRI LKANH
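The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping and compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the short string in the usage lines is a toy input, not the gene sequence from this record.

```python
def ungroup(sequence: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(sequence.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = ungroup(sequence)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input only; paste the Gene sequence field here to check the record.
print(ungroup("ATGC GCTA"))     # ATGCGCTA
print(gc_percent("ATGC GCTA"))  # 50.0
```

Applying `gc_percent` to the full Gene sequence field should give a value that rounds to the reported GC content, provided the database computes it over the same span.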