Gene CPF_1687 details

Gene Information

Locus tag: CPF_1687
Symbol: hemD
ID: 4202350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1909190
End bp: 1910668
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 30%
IMG OID: 638082561
Product: uroporphyrinogen-III methyltransferase/synthase
Protein accession: YP_696125
Protein GI: 110800584
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase; [COG1587] Uroporphyrinogen-III synthase
TIGRFAM ID: [TIGR01469] uroporphyrin-III C-methyltransferase
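
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names gene_length and protein_length are illustrative and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(1909190, 1910668)
print(length)                  # 1479
print(protein_length(length))  # 492
```

Under these assumptions both values agree with the Gene Length (1479 bp) and Protein Length (492 aa) listed in the record.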


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0805383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA AAGCATATAT AATAGGGGCA GGTCCTGGAG ATGAAGAGTT ATTAACACTT 
AAAGCAATAA ATGCATTACA AAAATGCACA GCTGTCTTAT ATGATAGATT AGTTGGGAAC
AATATACTTA ATTATTTAAA TGATGACTGT GAGATTTACT ATTGTGGAAA AGAGCCAGGA
TGCCATTATA AAACACAGGA AGAGATAAAT GAAAGTATAG TTGAGCTTGC TAAAAAAGGT
CATGTTGTTG GTAGAGTAAA AGGAGGAGAT CCATATGTTT TTGGAAGAGG TGGAGAAGAA
GTATTAGCCT TAGTAGAAGA AAACATACCT TTTGAAGTTA TTCCAGGGGT TACTTCACCT
ATTTCTGTTT TGAATTATGC AGGAATACCA ATAACACATA GAGGATTAGC ACAGAGTTTT
CATATTGTAA CAGGAATGTC AGCAAGAACT TTAAATGTAA ATTGGAAAGC TTTATCAAAA
GAAAATGGAA CCTTAGTATT TATGATGGGA CTTTCAAATT TAGAGACAAT TGTTGAGAAA
CTTTTAGAAA ATGGTAAAGA TATAGAAACT CCTTGTGGAG TAGTAATGAG AGGAACAACT
TCAAAGCAAA GAAAGGTTAT TGGAACATTA GAAAATATAT GTAAAAAGGT TAGAGAAGCT
AAGTTAGAGT CACCTTGTAT AATAGTTGTT GGAGATGTTG TCTCATTAAA TGAAAAACTT
TCATGGTATG AAAATTTACC TCTATTTGGA GCTAATATTT GCTTAACAAG ATCTAAGGAA
CAATCTAAAG AGATTAAGTG GAAATTAAAA GAGTTAGGTG CAGAAGTAAC AGAAATAAAC
TCTATAAAAA TAAAAGAAAC TGCTTATAAT TTAGATGAAT ATATTAATAC TTTAGAAAAA
TATGATCATA TAGTATTTAC TTCAGTAAAT GCTGTTAATG TATTCTTTGA TTATTTAGTA
AAAAATAGAG TTGATATAAG AAAAATAAAA GCAGATTTTG CTGTACTAGG AAAAGCAACT
AAAAAAGCTT TAATAGCTAG AGGGATTGTG CCAAGCATAA TGGCTCATTC ATTTACAGCA
GAAGGTTTAT TTGAAGTTTT AAAAGATAAT ATTAAAGAAG GAGAAGAGGT ATTAATTCCA
TGTTCTTCTT TAAGCAGAGA ATATTTGTTT GATAATTTAG CTTCTTTAGG AGCAAAATGT
CATAGAGTTA ATATTTATGA TACTATATGC GGAGATGTTA AAAATCCAAG AGCTTTCAAG
GAAGTTGATA TGGTATTATA CACAAGTCCT TCAACGGTTA AAAACATGAT TGATATGATT
GGACTTGAAG CTCTTAAAGA GAAAGTAAGC ATAGCTATAG GACCTATAAC TTTAAAAGCT
TTAAATGAAA GTGGAATTGA AGGAAAAATG TGCAAAACAC ATTGTGGAGA TGGATTTTTA
AGTGAAATTG AGGGTATATG GCAAGAGGTT AAAAAATAA
 
Protein sequence
MSKKAYIIGA GPGDEELLTL KAINALQKCT AVLYDRLVGN NILNYLNDDC EIYYCGKEPG 
CHYKTQEEIN ESIVELAKKG HVVGRVKGGD PYVFGRGGEE VLALVEENIP FEVIPGVTSP
ISVLNYAGIP ITHRGLAQSF HIVTGMSART LNVNWKALSK ENGTLVFMMG LSNLETIVEK
LLENGKDIET PCGVVMRGTT SKQRKVIGTL ENICKKVREA KLESPCIIVV GDVVSLNEKL
SWYENLPLFG ANICLTRSKE QSKEIKWKLK ELGAEVTEIN SIKIKETAYN LDEYINTLEK
YDHIVFTSVN AVNVFFDYLV KNRVDIRKIK ADFAVLGKAT KKALIARGIV PSIMAHSFTA
EGLFEVLKDN IKEGEEVLIP CSSLSREYLF DNLASLGAKC HRVNIYDTIC GDVKNPRAFK
EVDMVLYTSP STVKNMIDMI GLEALKEKVS IAIGPITLKA LNESGIEGKM CKTHCGDGFL
SEIEGIWQEV KK
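
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a rough sketch, not the database's own pipeline: it strips the 10-base spacing used in the listing, computes GC content, and translates codons with the standard amino acid assignments (which translation table 11 shares with the standard code for sense codons; table 11 differs in its permitted start codons). The helper names clean, gc_content, and translate are hypothetical, and only the first 30 bases of the gene are used as sample input.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling through TCAG).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the listing groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the listing above.
fragment = "ATGAGTAAAA AAGCATATAT AATAGGGGCA"
print(translate(fragment))   # MSKKAYIIGA
print(gc_content(fragment))  # GC percentage of this fragment only
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would be the natural way to check the reported protein sequence and GC content.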