Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0452 |
Symbol | cysG |
ID | 4787592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 400934 |
End bp | 402379 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640092882 |
Product | uroporphyrinogen-III methylase |
Protein accession | YP_001023460 |
Protein GI | 124262990 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0007] Uroporphyrinogen-III methylase [COG0778] Nitroreductase |
TIGRFAM ID | [TIGR01469] uroporphyrin-III C-methyltransferase [TIGR02476] cob(II)yrinic acid a,c-diamide reductase |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 160 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAATG AGCCGAGCTT CGCTGCCGGC ACGGTGTGCA TCGTCGGCGC CGGGCCGGGT GCGGCGGACC TGCTGACGCT GCGTGCCCTG CGGCGCCTGC AGCAGGCCGA GGTGATGGTG CATGACCGGC TGATCGCGCC GGAGGTGCTG GCGCTCGCGC CGCCGCAAGC GCTGCGGGTG TACGTGGGCA AGGCTTCGGG CGACCACGCC GTGCCGCAGG CGGGCATCCA TGCGCTGTTG GTCGAGCATG CCCGGGCGGG CCGGCGCGTG GTGCGGCTCA AGGGTGGCGA TCCCTTCGTG TTCGGCCGCG GCGGCGAGGA GGCACTGGCG TTGCAGGCCG CCGGCATCGC CTGCGAGGTC GTGCCCGGTG TGACCGCCGC CGGCGGCTGT GCGGCGGCCG CCGGCATCCC GCTGACGCAC CGCGACCTGG TCAGCAGCTG CGTCTTCCTG CCCGGGCATC TGGCCGAGGG TGAGGGTGCG CACGGCCGCA CGCTGGACTG GGCTGCACTG GCGCGGCCCG GGCAGACGCG TGTGTTCTAC ATGGGCGTGC AGCGCCTGCC CCAGATCGCG CAGCGGCTCA TTGCCCACGG GCTGGCGCCC GAGACGCCGG CCGCGATCGT GCGCGACGGT ACGCGCGCGA CTCAAAGCGT GCTCGCCACT GGCCTAGCAG CGCTGGCGCA GGCCGCGCCC GCGTACGGCC CGCAGCCCGG GCTCCTGATC ATCGGCGAGA CGGTGAGCCT GAGCCCGGAC TATCGGCCCA CGGTGGCGCC GCAGCCGGAG CTTCGGCGCA CCATGCACTT CGATGCCAAC GAGCGCGATG CCGTCTACCG CGTCATCGCC GCGCGGCGCG ACATGCGGCA CTTCGCGGGC GGGACGGCGC TGGCCCCGGC CGTGCTCGAG CGCCTACTGT GCGCGGCGCA TCAGGCGCCG TCGGTGGGGC TGATGCAGCC CTGGCGCTTC ATCCGCGTCG CCAACCCGGC GCTGCGCGAG TCCCTGGCCG CGCAGGTGGC CGCCGAGCGG GAGCGCACGG CGCAGGCGCT GGGCGAGCGT GCCGAAGAGT TCCTGCGGCT CAAGGTCGAG GGCCTGCGCG AGTGCGCCGA GCTGCTGGCC CTGGTGCTCG CGCCCGACGA CGGAACGGTC TTCGGCCGCC GTACGCTGCC GCGCGAGATG GCGCTGTGTT CGGTGGGGGC GGCGGCGCAG AACCTGTGGC TGGCCGCACG TGCGGAAAAC CTGGGCCTGG GCTGGGTGTC GATGTTCGAG CCGGCGGCCG TCGCCGCGCT GCTGGGGCTG CCTGAGGGCG CCTTGCCGCT GGGCCTGCTG TGCCTTGGCC CGGTCGATGC CTTCTACGAC GAGCCGATGC TGCAGGCTGA GCACTGGCGT CAGGGCCAGC CGCTCGGTGA CATGGTGTTC GAAGACACCT GGGGTCGATC GGCTGACAAG CCGTGA
|
Protein sequence | MSNEPSFAAG TVCIVGAGPG AADLLTLRAL RRLQQAEVMV HDRLIAPEVL ALAPPQALRV YVGKASGDHA VPQAGIHALL VEHARAGRRV VRLKGGDPFV FGRGGEEALA LQAAGIACEV VPGVTAAGGC AAAAGIPLTH RDLVSSCVFL PGHLAEGEGA HGRTLDWAAL ARPGQTRVFY MGVQRLPQIA QRLIAHGLAP ETPAAIVRDG TRATQSVLAT GLAALAQAAP AYGPQPGLLI IGETVSLSPD YRPTVAPQPE LRRTMHFDAN ERDAVYRVIA ARRDMRHFAG GTALAPAVLE RLLCAAHQAP SVGLMQPWRF IRVANPALRE SLAAQVAAER ERTAQALGER AEEFLRLKVE GLRECAELLA LVLAPDDGTV FGRRTLPREM ALCSVGAAAQ NLWLAARAEN LGLGWVSMFE PAAVAALLGL PEGALPLGLL CLGPVDAFYD EPMLQAEHWR QGQPLGDMVF EDTWGRSADK P
|
| |