Gene Information |
Locus tag | Elen_2067 |
Symbol | |
ID | 8416384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2433757 |
End bp | 2434749 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645025049 |
Product | Porphobilinogen synthase |
Protein accession | YP_003182419 |
Protein GI | 257791813 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0113] Delta-aminolevulinic acid dehydratase |
TIGRFAM ID | |
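The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length includes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2433757
END_BP = 2434749


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 993
print(protein_length(length_bp))  # 330
```

Under these assumptions both results agree with the Gene Length (993 bp) and Protein Length (330 aa) fields.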
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.355719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00258041 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGGATTTC CCGCTTACCG TCCGCGCCGC ATGCGTGCGA ACCCCGCGGT GCGCGCGTTC GTGCGCGAGA CGCGCGTGGA ACCGGGCGAT CTGGTATACC CGGTGTTCGT GAAGCCGGGG GCCGGCGTGC GCGACGAGGT GGCGTCCATG CCGGGCGTGT TCCAGCTTTC GATCGACCAG CTGGCGGCCG AGGTGGACGA GCTTCGAAGC TGCCGCGTGA ACTCGCTCAT GCTGTTCGGC CTTCCTGCGC GCAAGGACGA GCGGGGCAGC GAAGCTTACG ACGACCGCGG CGTGGTGCAG CAGGCCGTGC GCGCCATCAA GGAACACGCG CCCGACTTCC ACGTGATCAC CGACGTGTGC TTGTGCGAGT ACACGAGCCA CGGCCATTGC GGCGTGCTCG ACGAGCGCGG GGGCGTGGAC AACGACGAGA CGCTCGGGCT CCTGGCGGCC GAGGCGGTGA GCCATGCGCG CGCCGGAGCC GACATGGTGG CCCCGTCCGA CATGATGGAC GGGCGCGTGG GCGCGCTGCG CTCGGCGCTC GACGAGGCGG GCTTTTCGCA CGTGCCCATC ATGGCGTATG CGGCGAAGTA CGCGTCGGGC TACTACGGGC CGTTCCGCGA TGCGGCCGAT TCGGCGCCGG CGTTCGGCGA CCGCTCGGCG TACCAGATGG ATCCCGCGAA CAGCGTCGAG GCGCTGCGCG AGGTGCGCCT CGACATCGAG GAGGGGGCCG ACCTTGTCAT CGTGAAGCCG GCGCTATCCT ATCTGGACGT GGTGCGGCGC GTGAAGGACG CCTTCGCGTT TCCCACCGTG GCCTACAACG TGTCGGGCGA GTACGCCATG GTGAAGGCCG CCGCCGCGCA AGGGTGGATC GACGAGCGCC GCGTGGTGCT GGAGACGCTG CTTTCCATGA AGCGCGCCGG CGCCGACGCA ATCATCACCT ACCATGCGAA GGACGCTGCG CGCTGGATCA TCGGAGGCCG TCATGGCCGC TGA
Protein sequence | MGFPAYRPRR MRANPAVRAF VRETRVEPGD LVYPVFVKPG AGVRDEVASM PGVFQLSIDQ LAAEVDELRS CRVNSLMLFG LPARKDERGS EAYDDRGVVQ QAVRAIKEHA PDFHVITDVC LCEYTSHGHC GVLDERGGVD NDETLGLLAA EAVSHARAGA DMVAPSDMMD GRVGALRSAL DEAGFSHVPI MAYAAKYASG YYGPFRDAAD SAPAFGDRSA YQMDPANSVE ALREVRLDIE EGADLVIVKP ALSYLDVVRR VKDAFAFPTV AYNVSGEYAM VKAAAAQGWI DERRVVLETL LSMKRAGADA IITYHAKDAA RWIIGGRHGR
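The sequence fields can be related to one another with a short script. This is a minimal sketch, not the database's own method. It assumes the gene sequence is displayed in coding orientation, which seems likely because it begins with ATG and ends with TGA even though the strand is listed as "-". It relies on the fact that translation table 11 assigns the same amino acids as the standard genetic code and differs only in the permitted start codons. The helper names are hypothetical.

```python
# Codon table in the conventional TCAG ordering; the amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGGGATTTC CCGCTTACCG TCCGCGCCGC"
print(translate(prefix))  # MGFPAYRPRR
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence fields.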