Gene PICST_46716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_46716 
SymbolHEM12 
ID4839596 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1381163 
End bp1382431 
Gene Length1269 bp 
Protein Length362 aa 
Translation table12 
GC content43% 
IMG OID640390911 
Producturoporphyrinogen decarboxylase 
Protein accessionXP_001384951 
Protein GI126136855 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00457298 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGAAT TCGCACCCTT GAAGAACGAC TTGATTCTCA GAGCTGCCAG AGGCGAGAAG 
GTCGAAAGAC CACCTATTTG GATCATGGTA AGTCTGAAGT GGAATATTTG AGATTCGGAA
TTGAAGATAA AGTCACATAC AAAGCAGAGC TTCAAATTAG AATGCAAATA ACAATCGAAT
TCTGTTAGTT GTAATATCAT TGTTATCGTA CTTGTCTCGT ATCTAAGATT CGTGGTCTTA
CGTATCACAA TATACTAACC TTCGCAGAGA CAAGCTGGAA GATATCTTCC TGAATACCAC
GAAGCCAAAG GTAACAGAGA TTTCTTTGAA ACTTGTAGAG ATGCAGAGAT AGCATCGGAA
ATCACTATTC AGCCCGTAAA CCACTTTGAT GGCTTAATAG ATGCAGCCAT CATCTTCAGT
GATATCTTGG TGATTCCCCA GGCCATGGGA TTCGAGATTG AGATGCTCGA AGGTAAAGGT
CCAGTATTCG TAGCTCCTTT GAGATCTCCT GATGATTTGG CTAGAGTAAA CTTCCAGCCT
GATGTCTTGA AGAGTTTGGA CTGGGCGTTC AAGTCCATCA CTCTTACCAG AACCAAATTG
AACGGCAGAG TGCCATTGTT GGGCTTTGTA GGAGCACCTT GGACTTTGTT GGTTTATATG
ACCGAAGGTC AGGGTTCCAA GATGTTCCGT TTTGTCAAGG AATGGATCTA CAAATATACT
GAAGAGTCAC ACAAATTGTT ACAGGCCATC ACAGATGCCT GTGTCGAATT CTTAGCTCAA
CAAGTTGTTG CTGGAGCTCA GATGTTGCAG GTTTTCGAGT CTTGGGCCGG TGAATTGGGA
CCTCGTGAGT TTGACGAGTT CTCGTTGCCA TACTTGAGAC AGATCGCTGA AAAATTACCA
AAGAGACTTG TAGAACTCGG AGTGACGGAA AAGATCCCCC TAACTGTATT TGCCAAGGGT
GCCTGGTATG CCTTGGACGA TCTCTGTGAA TCTGGCTACG ACACTGTTTC CTTGGATTGG
TTGTATAAGC CAGAAGACGC TGTCAAGGTG GTCAACAACA GAAGAATCAC TTTGCAAGGG
AACTTAGATC CAGGTATCAT GTACGGTTCA GATGAAGTGA TCTCTCAAAA GGTAGAAGAA
ATGATCAAGG GCTTTGGAGG TGGAAAACAA AACTACATCA TCAACTTTGG TCATGGAACT
CATCCATTCA TGAAGCCCGA GAAGATCGAG CATTTCTTGA AGGAATGCCA TAAGTATGGT
TCCCAATAG
 
Protein sequence
MPEFAPLKND LILRAARGEK VERPPIWIMR QAGRYLPEYH EAKGNRDFFE TCRDAEIASE 
ITIQPVNHFD GLIDAAIIFS DILVIPQAMG FEIEMLEGKG PVFVAPLRSP DDLARVNFQP
DVLKSLDWAF KSITLTRTKL NGRVPLLGFV GAPWTLLVYM TEGQGSKMFR FVKEWIYKYT
EESHKLLQAI TDACVEFLAQ QVVAGAQMLQ VFESWAGELG PREFDEFSLP YLRQIAEKLP
KRLVELGVTE KIPLTVFAKG AWYALDDLCE SGYDTVSLDW LYKPEDAVKV VNNRRITLQG
NLDPGIMYGS DEVISQKVEE MIKGFGGGKQ NYIINFGHGT HPFMKPEKIE HFLKECHKYG
SQ