Gene Information |
Locus tag | Elen_1710 |
Symbol | |
ID | 8416009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2015033 |
End bp | 2016358 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645024676 |
Product | Coproporphyrinogen dehydrogenase |
Protein accession | YP_003182064 |
Protein GI | 257791458 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | |
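The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention, though the reported gene length agrees with it) and that the final codon of the CDS is a stop codon not counted in the protein length. Variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2015033
end_bp = 2016358

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1326 441
```

Both values match the Gene Length (1326 bp) and Protein Length (441 aa) fields.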
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCCG AGCGCATACT GTCGCGCCTG ATCCATACCA TGACCAAGCA GCATCTGGCT CTCAACCCCA CGAGCGAGAC CATGATGCCG TCCGCCAATC CCGGGCAGAA GTACATGCTG TACATGCACG TCCCTTTCTG CGAGCGCCTG TGCCCGTACT GCAGCTTCAA CCGTTACCCG TTTCGCGAGG AGGTCGCCCG GCCCTACTTC GCGAACATGC GCAAAGAGAT GCTGATGTTG AAGGATCTCG GCTACGACTT CGAGAGCATC TACGTAGGCG GCGGCACACC CACCGTCATG ATCGACGAGC TGTGCGAGAC GCTCGACATG GCTCGCGACA ACTTCAACAT CAGGGAGGTC GCCTCCGAGA CCAACCCGAA CCATCTGACG CAGCCGTGGT TGGACAAGCT GCAGGGTCGT GTGCAGCGTT TGAGCGTGGG CGTGCAAAGC TTCGACGACG ACCTGCTCAA GCAGATGGAC CGCTACGAGA AGTACGGCAG CGGCGACGTC ATCCTGGAGC GCATCGCCGA GGCGGAGCCG TACTTCGACT CGCTGAACGT GGACATGATC TTCAATTTCC CCTCGCAGAC CGAGGACGTT CTCATGGCCG ACCTCGAGAA GATCGCGCTG TCCGGTTGCC GCCAGACCAC GTTCTCTCCG CTGTACGTGT CGTCGGCCAC TACGCGCAAA ATGGCCGCTA CGTTAGGCGC GATGGACTAC GACCGCGAGT ACCGTTACTA CCAGATCCTC GACGGCGTGC TGGCCGGCGG CGACGACCCG CTGTTCGACC GCACCACGCT GTGGACGTTC ACGCGCCCCG ACAAGGCGGG CAAGCCGGCC GACGCGCCTC AGATCGACGA GTACCAGGTG AACTACGACG AGTATCCGGC CATCGGCAGC GGCTCCATCA CGCACCTGAA CGGCTGCTTG TACGTGAACA ACTTCAGCAT CAAAGACTAC AACGCGGCCA TCGAGAGCGG GCGCATGTCC ATCATGGGCA AGACCGAGAT GAGCAAGCGC GACCTCATGC GCTACCGCTT CCTGCTCGAT TTGTACAAGC TGCGGCTCGA CAAGCGCGCC TTCGAGCGCG ACTTCGGCTG CAGCATCGAG ACAGGTCTTC CCATGGAATT GGCTTTCATG CGTCTGAACA GGGCGTTTGA AACCGACAAC GCCGAAGAGC TGACGCTGAC GCCTATCGGA CGCTACCTGA CCGTGGTCAT GTACCGCCAG TTCCTCAGCG GCATGAACAA CTTGCGCGAC CAGGCGCGCG CCGCGCTGAC CGGCCCGGAG CGCGAGCTCC TGTTCGGCGA CGGCGTTCCC GCGTAA
|
Protein sequence | MLSERILSRL IHTMTKQHLA LNPTSETMMP SANPGQKYML YMHVPFCERL CPYCSFNRYP FREEVARPYF ANMRKEMLML KDLGYDFESI YVGGGTPTVM IDELCETLDM ARDNFNIREV ASETNPNHLT QPWLDKLQGR VQRLSVGVQS FDDDLLKQMD RYEKYGSGDV ILERIAEAEP YFDSLNVDMI FNFPSQTEDV LMADLEKIAL SGCRQTTFSP LYVSSATTRK MAATLGAMDY DREYRYYQIL DGVLAGGDDP LFDRTTLWTF TRPDKAGKPA DAPQIDEYQV NYDEYPAIGS GSITHLNGCL YVNNFSIKDY NAAIESGRMS IMGKTEMSKR DLMRYRFLLD LYKLRLDKRA FERDFGCSIE TGLPMELAFM RLNRAFETDN AEELTLTPIG RYLTVVMYRQ FLSGMNNLRD QARAALTGPE RELLFGDGVP A
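The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, copied from the record above.
fragment = clean_sequence("ATGCTTTCCG AGCGCATACT GTCGCGCCTG")
print(len(fragment), gc_percent(fragment))
```

Applying the same two functions to the complete gene sequence should give a value comparable to the GC content field, assuming that field was computed over the same span.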
|
| |