Gene CPR_1421 details


Gene Information

Locus tag: CPR_1421
Symbol: hemD
ID: 4206042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1600005
End bp: 1601483
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 30%
IMG OID: 642565975
Product: uroporphyrinogen-III methyltransferase/synthase
Protein accession: YP_698740
Protein GI: 110803870
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase
        [COG1587] Uroporphyrinogen-III synthase
TIGRFAM ID: [TIGR01469] uroporphyrin-III C-methyltransferase
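
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1600005, 1601483)
print(length, "bp")                     # Gene Length field
print(protein_length_aa(length), "aa")  # Protein Length field
```

With the Start bp and End bp values from this record, the two functions reproduce the Gene Length (1479 bp) and Protein Length (492 aa) fields.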


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.16426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA AAGCATATAT AATAGGGGCA GGTCCTGGAG ATGAAGAGTT ATTAACACTT 
AAAGCAATAA ATGCATTACA AAAATGCACA GCTGTCTTAT ATGATAGATT AGTTGGAAAC
AATATACTTA ATTACTTAAA TGATGACTGT GAGATTTACT ATTGTGGAAA AGAGCCAGGA
TGTCATTATA AAACACAGGA AGAGATAAAT GAAAGTATAG TTGAACTTGC TAAAAAAGGT
CATATTGTTG GTAGAGTAAA AGGTGGAGAT CCCTATGTTT TTGGAAGAGG TGGAGAAGAA
GTATTAGCCT TAGTAGAAGA AAACATACCT TTTGAAGTTA TTCCAGGGGT TACTTCACCT
ATTTCTGTTT TAAATTATGC AGGGATACCA ATAACACATA GAGGATTAGC ACAGAGTTTT
CATATTGTAA CAGGAATGTC AGCAAGAACT TTAAATGTAA ATTGGAAAGC TTTATCAAAA
GAAAATGGAA CCTTAGTATT TATGATGGGG CTTTCAAATT TAGAGACAAT TGTTGAGAAA
CTTTTAGAAA ATGGTAAAGA TATAGAAACT CCTTGTGGAG TAGTAATGAG AGGAACAACT
TCAAAGCAAA GAAAGGTTAT TGGAACATTA GAAAATATAT GTAAAAAGGT TAGAGAAGCT
AAGTTAGAGT CACCTTGTAT AATAGTTGTT GGTGATGTTG TTTCATTAAA TGAAAAACTT
TCATGGTATG AAAAATTACC TCTATTTGGA GCTAATATTT GTTTAACAAG ATCTAAGGAA
CAATCTAAAG AGATTAAGTG GAAATTAAAA GAGTTAGGTG CAGAAGTAAC AGAAATAAAC
TCTATAAAAA TAAAAGAAAC TTCAGAAAAT TTAGATGAAT ATATTAATAC TTTAGAAAAA
TATGATCATA TAGTATTTAC TTCAGTAAAT GCTGTTAATG TATTCTTTGA TTATTTAGTG
AAAAAAAGAG TTGATATAAG AAAAATAAAA GCAGATTTTG CTGTACTAGG AAAAGCAACT
AAAAAAGCTT TAATAGCTAG AGGAATTGTA CCAAGCATAA TGGCTCATTC ATTTACAGCG
GAAGGTTTAT TTGAAGTTTT AAAAGATAAT ATTAAAGAAG GAGAAGAAGT CTTAATTCCA
TGCTCTTCTT TAAGTAGAGA ATATTTGTTT GATAATTTAG CTTCTTTAGG AGCAAAATGT
CATAGAGTTA ATATTTATGA TACTATATGC GGAGATGTTA AAAATCCAAG AGCTTTCAAG
GAAGTTGATA TGGTATTATA CACAAGTCCT TCAACGGTTA AAAACATGAT TGATATGATT
GGACTTGAAT CTCTTAAAGA GAAAGTAAGT ATAGCTATAG GACCTATAAC CTTAAAAGCT
TTAAATGAAA GTGGAATTGA AGGGAAAATG TGCAAAACAC ATTGTGGGGA TGGATTTTTA
AGTGAAATTG AAGGTATATG GCAAGAGGTT AAAAAATAA
 
Protein sequence
MSKKAYIIGA GPGDEELLTL KAINALQKCT AVLYDRLVGN NILNYLNDDC EIYYCGKEPG 
CHYKTQEEIN ESIVELAKKG HIVGRVKGGD PYVFGRGGEE VLALVEENIP FEVIPGVTSP
ISVLNYAGIP ITHRGLAQSF HIVTGMSART LNVNWKALSK ENGTLVFMMG LSNLETIVEK
LLENGKDIET PCGVVMRGTT SKQRKVIGTL ENICKKVREA KLESPCIIVV GDVVSLNEKL
SWYEKLPLFG ANICLTRSKE QSKEIKWKLK ELGAEVTEIN SIKIKETSEN LDEYINTLEK
YDHIVFTSVN AVNVFFDYLV KKRVDIRKIK ADFAVLGKAT KKALIARGIV PSIMAHSFTA
EGLFEVLKDN IKEGEEVLIP CSSLSREYLF DNLASLGAKC HRVNIYDTIC GDVKNPRAFK
EVDMVLYTSP STVKNMIDMI GLESLKEKVS IAIGPITLKA LNESGIEGKM CKTHCGDGFL
SEIEGIWQEV KK
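
The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before use. The following is a minimal sketch of how one might strip the layout whitespace, estimate GC content, and translate the gene sequence for comparison with the protein sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGAGTAAAA AAGCATATAT AATAGGGGCA"))  # MSKKAYIIGA
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is a quick consistency check; `gc_content` on the full gene sequence can likewise be compared with the GC content field.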