Gene Information |
Locus tag | PICST_31468 |
Symbol | |
ID | 4839050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 884156 |
End bp | 885181 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390365 |
Product | predicted protein |
Protein accession | XP_001384472 |
Protein GI | 126135896 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
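
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 884156
END_BP = 885181

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1026, matching "Gene Length"
print(protein_length(length_bp))  # 341, matching "Protein Length"
```

Under these assumptions the 1026 bp span holds 342 codons, and dropping the stop codon leaves the 341 residues reported in the table.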
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.561665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAA TTGATGGAAA CCCTTTGAAG ATTGTAGATA TCACCTCAGT TGATAAATCA ACTGCTGAAG AATTGTATGA CGCCGCTACG TCCCAGGGGT TCTTATTTGT TGAAGGTCAC GAGTTCAGCC AGGAAGAAAT AGATACTGTA TTCCAACTTT CTAAGGAATT CTTTGAGTTA CCCCATAGTT ATAAGTCAAA GTATCCAATA GGCTCTATGA ATCATGGATA CGCAGACTTT GGAGGCGAGA ACTTGGATCC TAAGGGCCAG AAGAAGGGAG ATCCCAAGGA AGCTCTCAAT ATCTGTTTGT TAAACTTCTT GACAGGTTTA TCATCCCAAG AGATTCCGGA CTGGTTTACG GAGGATCCCA AGAGGTTGGC GATTATCACC ACAACTGTAA AGAAGTTCTA TGCCTTGTCG ATGAAGATCT TGAAGTTGTT GGCTATTGGA TTGAAAATAG AGGATTCCAA CGAAATCAAA GGTGAAGACT GGTTTTCTTC AAGATATGAA GCCACTAAAG TCTCAGGTTC TACTTTCAGG TTTTTGCATT ACCCAGGCCA AAAGAGTTTG AACCCAGAAG CTGTGATCAG AGCTGGTGCC CATACCGATT ATGGATCTGT GACATTATTA TTCCAGCAGG AGAATCAGGA AGGACTAGAG ATCTACTCAC CGGTATCAAA GCAATGGGTT GCGGTTCCTT TTGTAGCTGC TAATACAGAA AAGTTTCCAG GAATGGGCCC TCCTATTGTA GTTAATATTG GAGATTTATT AAGTTACTGG ACAGCTGGTT TGTTGAAGTC AACTATTCAC AGAGTCAAGT TTCCGGCCAA AGTTCAAGCC ACCGGCCAGG ATAGATACTC GATTGTATTC TTTAGTCATC CTAACGATGA GGCGTTGTTA GAGGCTGTAC CTAGTGAGGT GGTGAGAAGT ATCAAGGGAA GAGGAGCCAA TAAGGATACT GTTGCCATCA CAGCTAAAGA GCATTTGGAC AGTAGGCTTG CAGCAACATA CGGCTGGAAG AAGTAG
|
Protein sequence | MAEIDGNPLK IVDITSVDKS TAEELYDAAT SQGFLFVEGH EFSQEEIDTV FQLSKEFFEL PHSYKSKYPI GSMNHGYADF GGENLDPKGQ KKGDPKEALN ICLLNFLTGL SSQEIPDWFT EDPKRLAIIT TTVKKFYALS MKILKLLAIG LKIEDSNEIK GEDWFSSRYE ATKVSGSTFR FLHYPGQKSL NPEAVIRAGA HTDYGSVTLL FQQENQEGLE IYSPVSKQWV AVPFVAANTE KFPGMGPPIV VNIGDLLSYW TAGLLKSTIH RVKFPAKVQA TGQDRYSIVF FSHPNDEALL EAVPSEVVRS IKGRGANKDT VAITAKEHLD SRLAATYGWK K
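
The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. The following is a minimal sketch of how the gene sequence relates to the protein sequence under that table, along with a simple GC-content calculation. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the codon table is built by hand rather than taken from a library, and the code assumes the sequence is already in coding orientation, as the listed gene sequence is (it starts with ATG even though the gene is on the minus strand).

```python
BASES = "TCAG"
# Standard genetic code in TCAG codon order; "*" marks stop codons.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def build_table_12() -> dict:
    """Codon table 12: the standard code with CTG reassigned to serine."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = "S"
    return table

TABLE_12 = build_table_12()

def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate in frame 0 until the first stop codon or the last full codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = TABLE_12[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First 30 bases of the gene sequence listed above.
prefix = "ATGGCTGAAA TTGATGGAAA CCCTTTGAAG"
print(translate(prefix))  # MAEIDGNPLK, the first ten residues of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be compared against the record.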