Gene Elen_2069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2069 
Symbol 
ID8416386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2435984 
End bp2437552 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content72% 
IMG OID645025051 
Producturoporphyrin-III C-methyltransferase 
Protein accessionYP_003182421 
Protein GI257791815 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.198087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000347629 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGGCGC AGGGGGTGGA GGCTCTGCTC GTGGGAGCGG GGCCGGGCGA TCCGAACCTG 
CTCACGCTCG CGGGAGCGGC CGCGCTTTCG CGAGCCGACG TCGTGGTGTA CGACTACTTG
GCGAATCCGG CTCTGTTGGC GCACGCGCCG CAGGACGCCG AGCGCGTGTA CGTGGGCAAA
AAGGGCTTTT CGGAGCATGT GACCCAACGT CAGATCAACG AGCTGCTGGT GCAGCGTGCG
CGCGATCTGG CCGCGCGCGG AGGCGGGGTG CTCGTGCGTC TCAAGGGCGG CGACCCGTTC
GTGTTCGGGC GCGGCGGCGA AGAGGCGCTC GCGCTCGCCG AGGCGGGGTT CCCTTGCCCG
ATCGTGCCGG GCGTGACGAG CGGCGTGGCC GCACCCGCCT TCGCGGGCAT CCCCGTCACG
CACCGCGGGT TGGCCTCGTC GGTGACGTTC GTCACGGGAA GCGAGGATCC GACGAAGGCC
GAGACGGCCG TCGACTGGAG CGGCATCGCC CACGGCGCCG ACACGCTGTG CTTCTACATG
GGCGTGCGCA ACCTGCCCGT CATCGCGCGG CGGCTGATGG AGGCGGGGCG CTCGGCCGAC
ACGCCGGTCT CCCTCGTCCG CTGGGGCACG ACGCCCATGC AGGAGGTGCT TGCGGGAACG
TTGGCCACCA TTGCCGAACG TGCGGCGGCC GTCGGGTTCA AGGCGCCGGC CATCATCGTC
GTGGGCGCCG TGGCCGCCCT GCGCGAGCGG TTGGCTTGGT ACGAGCCCGG CCCGCTTGCG
GGCACGACCG TCGCCGTCAC GCGCACGCGC GCCCAGGCGA GCGGGCTGAC GGAGCGGCTG
CGTGCGCTCG GCGCTTCGGT CATCGAGCTG CCCGTCATCT CCATCGCAGC GCCGTCCTCG
TTCAGCGGCG TCGACTCGTG CATCGAACGC CTCGCCGGCT ACCGGTTCGT CGTGTTCACG
AGCGCGAACG GAGTGAAGGC GTTTTTCGAA CGCCTCGTGC TCGCAGGGCT GGACGCCCGC
GCGCTCGCCT GCGCGCGCAT CGCCGCCATC GGGCCCGCCA CGGCGGCTGA GCTGGCCGAG
CGCGGGATCG TCGCCGACCT CGTGCCCGGC GAGTTCCGGG CCGAAGCGGT GGCGGACCTG
CTCATCGAGG CGGGCTTGAC GGACGGCGAC TGGGTGCTCG TGCCGCGAGC CCTCGAGGCG
CGCGACGTGC TGCCTCGGAT GCTGCGCGCC TGCGGAGCGC GCGTGGACGT CGTCCCCGTG
TACCGCACCG TGCCTCCGTC GCGCGCTTCG GCAGAACCCG CCTTGGCGAG CTTGATAGCC
GGGGAAGCGG ACGCCGTGAC GTTCACCTCG TCGTCCACGG TGCGCAACTT CGTCGGCCTC
GTGCGCGACG TTGCGCCGAA CCCGGTCGAG GTGCTCGAGC GCCTCGACTT CTATTCCATC
GGGCCCATCA CCACCACCAC CGCTCGCGAC GAGTCGCTGC GCATCGCGGC CCAGGCCGAA
GCGTACACGA TCGACGGCCT GGTCGAGGCC ATCGTCAAGC ACCGCGCCTC GTGCGTCAAC
GAAACGTGA
 
Protein sequence
MTAQGVEALL VGAGPGDPNL LTLAGAAALS RADVVVYDYL ANPALLAHAP QDAERVYVGK 
KGFSEHVTQR QINELLVQRA RDLAARGGGV LVRLKGGDPF VFGRGGEEAL ALAEAGFPCP
IVPGVTSGVA APAFAGIPVT HRGLASSVTF VTGSEDPTKA ETAVDWSGIA HGADTLCFYM
GVRNLPVIAR RLMEAGRSAD TPVSLVRWGT TPMQEVLAGT LATIAERAAA VGFKAPAIIV
VGAVAALRER LAWYEPGPLA GTTVAVTRTR AQASGLTERL RALGASVIEL PVISIAAPSS
FSGVDSCIER LAGYRFVVFT SANGVKAFFE RLVLAGLDAR ALACARIAAI GPATAAELAE
RGIVADLVPG EFRAEAVADL LIEAGLTDGD WVLVPRALEA RDVLPRMLRA CGARVDVVPV
YRTVPPSRAS AEPALASLIA GEADAVTFTS SSTVRNFVGL VRDVAPNPVE VLERLDFYSI
GPITTTTARD ESLRIAAQAE AYTIDGLVEA IVKHRASCVN ET