Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_46716 |
Symbol | HEM12 |
ID | 4839596 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1381163 |
End bp | 1382431 |
Gene Length | 1269 bp |
Protein Length | 362 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390911 |
Product | uroporphyrinogen decarboxylase |
Protein accession | XP_001384951 |
Protein GI | 126136855 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00457298 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTGAAT TCGCACCCTT GAAGAACGAC TTGATTCTCA GAGCTGCCAG AGGCGAGAAG GTCGAAAGAC CACCTATTTG GATCATGGTA AGTCTGAAGT GGAATATTTG AGATTCGGAA TTGAAGATAA AGTCACATAC AAAGCAGAGC TTCAAATTAG AATGCAAATA ACAATCGAAT TCTGTTAGTT GTAATATCAT TGTTATCGTA CTTGTCTCGT ATCTAAGATT CGTGGTCTTA CGTATCACAA TATACTAACC TTCGCAGAGA CAAGCTGGAA GATATCTTCC TGAATACCAC GAAGCCAAAG GTAACAGAGA TTTCTTTGAA ACTTGTAGAG ATGCAGAGAT AGCATCGGAA ATCACTATTC AGCCCGTAAA CCACTTTGAT GGCTTAATAG ATGCAGCCAT CATCTTCAGT GATATCTTGG TGATTCCCCA GGCCATGGGA TTCGAGATTG AGATGCTCGA AGGTAAAGGT CCAGTATTCG TAGCTCCTTT GAGATCTCCT GATGATTTGG CTAGAGTAAA CTTCCAGCCT GATGTCTTGA AGAGTTTGGA CTGGGCGTTC AAGTCCATCA CTCTTACCAG AACCAAATTG AACGGCAGAG TGCCATTGTT GGGCTTTGTA GGAGCACCTT GGACTTTGTT GGTTTATATG ACCGAAGGTC AGGGTTCCAA GATGTTCCGT TTTGTCAAGG AATGGATCTA CAAATATACT GAAGAGTCAC ACAAATTGTT ACAGGCCATC ACAGATGCCT GTGTCGAATT CTTAGCTCAA CAAGTTGTTG CTGGAGCTCA GATGTTGCAG GTTTTCGAGT CTTGGGCCGG TGAATTGGGA CCTCGTGAGT TTGACGAGTT CTCGTTGCCA TACTTGAGAC AGATCGCTGA AAAATTACCA AAGAGACTTG TAGAACTCGG AGTGACGGAA AAGATCCCCC TAACTGTATT TGCCAAGGGT GCCTGGTATG CCTTGGACGA TCTCTGTGAA TCTGGCTACG ACACTGTTTC CTTGGATTGG TTGTATAAGC CAGAAGACGC TGTCAAGGTG GTCAACAACA GAAGAATCAC TTTGCAAGGG AACTTAGATC CAGGTATCAT GTACGGTTCA GATGAAGTGA TCTCTCAAAA GGTAGAAGAA ATGATCAAGG GCTTTGGAGG TGGAAAACAA AACTACATCA TCAACTTTGG TCATGGAACT CATCCATTCA TGAAGCCCGA GAAGATCGAG CATTTCTTGA AGGAATGCCA TAAGTATGGT TCCCAATAG
|
Protein sequence | MPEFAPLKND LILRAARGEK VERPPIWIMR QAGRYLPEYH EAKGNRDFFE TCRDAEIASE ITIQPVNHFD GLIDAAIIFS DILVIPQAMG FEIEMLEGKG PVFVAPLRSP DDLARVNFQP DVLKSLDWAF KSITLTRTKL NGRVPLLGFV GAPWTLLVYM TEGQGSKMFR FVKEWIYKYT EESHKLLQAI TDACVEFLAQ QVVAGAQMLQ VFESWAGELG PREFDEFSLP YLRQIAEKLP KRLVELGVTE KIPLTVFAKG AWYALDDLCE SGYDTVSLDW LYKPEDAVKV VNNRRITLQG NLDPGIMYGS DEVISQKVEE MIKGFGGGKQ NYIINFGHGT HPFMKPEKIE HFLKECHKYG SQ
|
| |