Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2069 |
Symbol | |
ID | 8416386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2435984 |
End bp | 2437552 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645025051 |
Product | uroporphyrin-III C-methyltransferase |
Protein accession | YP_003182421 |
Protein GI | 257791815 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0007] Uroporphyrinogen-III methylase |
TIGRFAM ID | [TIGR01469] uroporphyrin-III C-methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.198087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000347629 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGCGC AGGGGGTGGA GGCTCTGCTC GTGGGAGCGG GGCCGGGCGA TCCGAACCTG CTCACGCTCG CGGGAGCGGC CGCGCTTTCG CGAGCCGACG TCGTGGTGTA CGACTACTTG GCGAATCCGG CTCTGTTGGC GCACGCGCCG CAGGACGCCG AGCGCGTGTA CGTGGGCAAA AAGGGCTTTT CGGAGCATGT GACCCAACGT CAGATCAACG AGCTGCTGGT GCAGCGTGCG CGCGATCTGG CCGCGCGCGG AGGCGGGGTG CTCGTGCGTC TCAAGGGCGG CGACCCGTTC GTGTTCGGGC GCGGCGGCGA AGAGGCGCTC GCGCTCGCCG AGGCGGGGTT CCCTTGCCCG ATCGTGCCGG GCGTGACGAG CGGCGTGGCC GCACCCGCCT TCGCGGGCAT CCCCGTCACG CACCGCGGGT TGGCCTCGTC GGTGACGTTC GTCACGGGAA GCGAGGATCC GACGAAGGCC GAGACGGCCG TCGACTGGAG CGGCATCGCC CACGGCGCCG ACACGCTGTG CTTCTACATG GGCGTGCGCA ACCTGCCCGT CATCGCGCGG CGGCTGATGG AGGCGGGGCG CTCGGCCGAC ACGCCGGTCT CCCTCGTCCG CTGGGGCACG ACGCCCATGC AGGAGGTGCT TGCGGGAACG TTGGCCACCA TTGCCGAACG TGCGGCGGCC GTCGGGTTCA AGGCGCCGGC CATCATCGTC GTGGGCGCCG TGGCCGCCCT GCGCGAGCGG TTGGCTTGGT ACGAGCCCGG CCCGCTTGCG GGCACGACCG TCGCCGTCAC GCGCACGCGC GCCCAGGCGA GCGGGCTGAC GGAGCGGCTG CGTGCGCTCG GCGCTTCGGT CATCGAGCTG CCCGTCATCT CCATCGCAGC GCCGTCCTCG TTCAGCGGCG TCGACTCGTG CATCGAACGC CTCGCCGGCT ACCGGTTCGT CGTGTTCACG AGCGCGAACG GAGTGAAGGC GTTTTTCGAA CGCCTCGTGC TCGCAGGGCT GGACGCCCGC GCGCTCGCCT GCGCGCGCAT CGCCGCCATC GGGCCCGCCA CGGCGGCTGA GCTGGCCGAG CGCGGGATCG TCGCCGACCT CGTGCCCGGC GAGTTCCGGG CCGAAGCGGT GGCGGACCTG CTCATCGAGG CGGGCTTGAC GGACGGCGAC TGGGTGCTCG TGCCGCGAGC CCTCGAGGCG CGCGACGTGC TGCCTCGGAT GCTGCGCGCC TGCGGAGCGC GCGTGGACGT CGTCCCCGTG TACCGCACCG TGCCTCCGTC GCGCGCTTCG GCAGAACCCG CCTTGGCGAG CTTGATAGCC GGGGAAGCGG ACGCCGTGAC GTTCACCTCG TCGTCCACGG TGCGCAACTT CGTCGGCCTC GTGCGCGACG TTGCGCCGAA CCCGGTCGAG GTGCTCGAGC GCCTCGACTT CTATTCCATC GGGCCCATCA CCACCACCAC CGCTCGCGAC GAGTCGCTGC GCATCGCGGC CCAGGCCGAA GCGTACACGA TCGACGGCCT GGTCGAGGCC ATCGTCAAGC ACCGCGCCTC GTGCGTCAAC GAAACGTGA
|
Protein sequence | MTAQGVEALL VGAGPGDPNL LTLAGAAALS RADVVVYDYL ANPALLAHAP QDAERVYVGK KGFSEHVTQR QINELLVQRA RDLAARGGGV LVRLKGGDPF VFGRGGEEAL ALAEAGFPCP IVPGVTSGVA APAFAGIPVT HRGLASSVTF VTGSEDPTKA ETAVDWSGIA HGADTLCFYM GVRNLPVIAR RLMEAGRSAD TPVSLVRWGT TPMQEVLAGT LATIAERAAA VGFKAPAIIV VGAVAALRER LAWYEPGPLA GTTVAVTRTR AQASGLTERL RALGASVIEL PVISIAAPSS FSGVDSCIER LAGYRFVVFT SANGVKAFFE RLVLAGLDAR ALACARIAAI GPATAAELAE RGIVADLVPG EFRAEAVADL LIEAGLTDGD WVLVPRALEA RDVLPRMLRA CGARVDVVPV YRTVPPSRAS AEPALASLIA GEADAVTFTS SSTVRNFVGL VRDVAPNPVE VLERLDFYSI GPITTTTARD ESLRIAAQAE AYTIDGLVEA IVKHRASCVN ET
|
| |