Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_47071 |
Symbol | |
ID | 4839441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1353935 |
End bp | 1355053 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640390756 |
Product | predicted protein |
Protein accession | XP_001385264 |
Protein GI | 150865873 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.454401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00245237 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCCACC ACACCATGAC CTTCGAATCT AATATCATTA ACGGTAATGC TAGGAACAAC CCTGCCACCA ATCACATCCA AATTGGAGGC AGAAAGTCCA CGCTTGCAGT GGTGCAATCC GAAATAGTCA AGAAGGAGAT CGAAGAAGCA TTTCCACACA TTAACTGCTC GATTTTGGCG CTTTCCACTT TGGGAGACAA GATCCAGAAC AAGCCTCTCT ATTCATTCGG CGGAAAATCG CTCTGGACAA AAGAACTCGA GATCTTGTTG TTGGACCTGA TTGACGAGTT TCCCCAGCTT GACTTGATAG TGCATTCCTT GAAGGACATG CCAACGAACT TGCCTGATGA GTTTGAGTTG GGTTGTATTT TGAAGAGAGA AGACCCTCGT GACGCTTTAG TTATGAGAGC CGGATCACCT TACAAGACGT TGGACGATTT GCCAGCAGGC TCTGTAGTGG GAACATCTTC CATCAGAAGA TCATCGCAAC TCGTGAAGAA CTATCCCCAC TTGAAGTTCG ACTCTGTTCG TGGTAACCTT CAGACTCGTT TGAGCAAATT AGACGACGAC TCCCAACCAT TCGAGTGTAT CATTTTGGCA CTGGCTGGCT TAATCAGAGT TGGTTTGGGC CACAGAGTTA CAGACTATTT GAACGCTCCA CACATGTACT ACGCCGTTGG CCAGGGAGCT TTGGGTGTAG AAATCAGAAA AAACGACACC AAGATGAAGA ACATCTTGGC TAAGATACTG CACATCCCAA CATCCCTCTG TTGTTACGCA GAGAGATCGT TGATGAGATA CTTGGAAGGA GGGTGTTCCG TGCCGATAGG TGTTCACACG AACTACGATG AAGACTCGAA GGTGTTGAAG TTTGAGGCTA TCATAGTCAG TCCCGACGGA ACTCAGTTTG TGGAAGACGA ACTTGAAGCT CAAGTCGAGA CCTTGCAACA GGCCGAAGCC TTGGGTATAC AACTAGGCGA CAGACTCATC GCTAAGGGTG CAAAGGATAT CTTGGACAAG ATCGACTTCA ATAGAATCAA CCAGGCTCCT AGCACCATCA ACACACCCAC TCCATCCATA GCTACCTCCA TAGAGGCCGT CGTTTCTACG GCTAACTAA
|
Protein sequence | MPHHTMTFES NIINGNARNN PATNHIQIGG RKSTLAVVQS EIVKKEIEEA FPHINCSILA LSTLGDKIQN KPLYSFGGKS LWTKELEILL LDSIDEFPQL DLIVHSLKDM PTNLPDEFEL GCILKREDPR DALVMRAGSP YKTLDDLPAG SVVGTSSIRR SSQLVKNYPH LKFDSVRGNL QTRLSKLDDD SQPFECIILA SAGLIRVGLG HRVTDYLNAP HMYYAVGQGA LGVEIRKNDT KMKNILAKIS HIPTSLCCYA ERSLMRYLEG GCSVPIGVHT NYDEDSKVLK FEAIIVSPDG TQFVEDELEA QVETLQQAEA LGIQLGDRLI AKGAKDILDK IDFNRINQAP STINTPTPSI ATSIEAVVST AN
|
| |