Gene Information |
Locus tag | Elen_2026 |
Symbol | |
ID | 8416337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2372777 |
End bp | 2373988 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645025003 |
Product | Ethanolamine utilization protein EutH |
Protein accession | YP_003182379 |
Protein GI | 257791773 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3192] Ethanolamine utilization protein |
TIGRFAM ID | |
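The start and end coordinates, gene length, and protein length listed above agree with each other under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the gene length counts the terminal stop codon. A minimal Python sketch of that arithmetic (the function names are illustrative and not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Elen_2026.
length = gene_length_bp(2372777, 2373988)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```

Both results match the reported Gene Length (1212 bp) and Protein Length (403 aa).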
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.86347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00000329746 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGATGA TCGGAACGGC CGTGGTGTAC ATCATCATGG TCTGCGCATT GGCGGGCGCC GTCGCATCGG CCATCAAGCC GGAGAGCGAG CTCGGCCGGC AATTCGTGGC CGGCATCGAC TCCATCGGCC CCATCTTCCT TCCCGTAGCG GGCATCATGG CATCGGCTCC TTATCTGACG GCGTTCGTGA GCACGGTCTT CGGGCCGGCG TACGGGGCGC TCGGCGCCGA CCCAGCCATG GCAGCGACGA CGTTCATCGC CATCGACATG GGCGGATACC AGCTGGCGGA CGCGCTTGCG CAGACGCGTG AGAGCTGGAT CATGGCGATG ATGACCGGGT ATATGGCGGG CGCAACCATC GTTTTCACGA TACCGGTGGC GCTGAAGATG CTCGAGAAAC GCGATCGGAA GTACTTGGCG CTCGGAGTGA TGAGCGGCCT TCTCGCGATT CCTATCGGCG TGCTCGTTGC GAGCATCATC ATCGCGCTTT CGCACCCGGT GATCCGGGAG GTCGTATCGA CGAACGCCGA AGCGACCTAT CAGCTTGCGT TGAGCTTCGC CCAGATCGGC GTCAACCTCG TGCCGCTGAT CATCATATGC GTTGCGCTGG CATTGGGGCT CAAGTTCAAG CCCGACGCCA TGATCAAGGG GTTCATCGTG TTCGGTCGCG TGATGGAAGC GACGCTCAAA ATCGTGTTCG TGCTGGCGGT TATCGAATAC TTCACGGGCA TCTTCACCAC GGTCTTCGGC TCCTTCGGGT TCGATCCTAT CATCGCCGAC GAGGAGGATA TCTTCCGAGC ACTCGAGGTG TCGGGTGCTA TCGGCATGAT GTTGTGCGGC GCGTTTCCCA TGGTGTACCT CATCAAGCGC TATCTTGCGA AGCCACTCGC CAAAATCGGC GGCGCGGTGG GGCTGAGCTC GGACGCAACC GCTGGCTTGC TGGCCGCCTC GGCGAACGTG CTTGCGGCGC TGTCGATGGT GAAAGACCTC AAGGCGCGCG ACAAGGTGCT GGTCATGTCG TTTGCCGTGT GCTGCGCATT TCTGTTCGGC GATCATCTGT CGTTCACGGC GAACTTCCAA CCGACGCTGA TCGTGCCGGT GCTTGTAGGG AAGCTGTCAG CGGGTGTGTG TGCCGTCGTG TTCGCAAGCT TGCTTGCCGT GAAGAAGGCT GAGGAACTCG AGCGGATCGA TAGGGCAGAA GCCGAAAACT AG
|
Protein sequence | MEMIGTAVVY IIMVCALAGA VASAIKPESE LGRQFVAGID SIGPIFLPVA GIMASAPYLT AFVSTVFGPA YGALGADPAM AATTFIAIDM GGYQLADALA QTRESWIMAM MTGYMAGATI VFTIPVALKM LEKRDRKYLA LGVMSGLLAI PIGVLVASII IALSHPVIRE VVSTNAEATY QLALSFAQIG VNLVPLIIIC VALALGLKFK PDAMIKGFIV FGRVMEATLK IVFVLAVIEY FTGIFTTVFG SFGFDPIIAD EEDIFRALEV SGAIGMMLCG AFPMVYLIKR YLAKPLAKIG GAVGLSSDAT AGLLAASANV LAALSMVKDL KARDKVLVMS FAVCCAFLFG DHLSFTANFQ PTLIVPVLVG KLSAGVCAVV FASLLAVKKA EELERIDRAE AEN
|
| |
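The sequences above are printed in space-separated blocks of ten characters. As a rough sketch, the Python helpers below strip that grouping, compute a GC fraction, and translate a coding sequence. Two assumptions apply: the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TAG, which suggests so even though the gene lies on the minus strand), and translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. All function names here are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid
# assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGGAGATGA TCGGAACGGC CGTGGTGTAC")
print(translate(prefix))  # MEMIGTAVVY
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same kind of check against the reported GC content and protein sequence.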