Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1657 |
Symbol | |
ID | 8415956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1959578 |
End bp | 1960258 |
Gene Length | 681 bp |
Protein Length | 226 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645024626 |
Product | competence protein ComEA helix-hairpin-helix repeat protein |
Protein accession | YP_003182014 |
Protein GI | 257791408 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region [TIGR01259] comEA protein |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.11522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.000932836 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGTTTG CAGAGCGGGC GGAGTCGTGG CGGGCGAAGG CGCACCTGAC CGGCGTGCGC CTGCCGGTGC TCGTGGGCGT GACGGCGCTC GCGGCCATCG TGCTCATCGC GGCGGGAGGA GCGCTGGTCA AGGCCGGAAC GTCGGACGGC TTCTCGCTCT CTCGCGACGA CGGCGCGGCG ACTTCCGACG GTACCGACGA GGACGGCGCC GCTTCCGTGG AGGCGCGAAC CGTCTTCGTG CACGTGGGAG GCGCCGTGGT CGAACCGGGG GTGCGAGAGC TGGCGGAGGG CGCGCGGGTG CAGGACGCGG TGGACGCGGC CGGAGGGTTC GCCGACGGAG CCGCCCGCGA CGCGCTCAAC CTGGCACGCG TGCTCGCGGA CGGCGAGCAG ATCGTCGTGC CGTCGCAGGA GGAGGCTGTC TTGGAGCCGG GCGCGGCCGT GGACGGCGGC GATGCGGGCT CCAGGGCGGC TGCTTCGCCG ACGGGCGGCA AGATCGACCT CAATCGGGCG ACGGCGGCCG AGCTCGATGC GCTGCCTGGC GTGGGGCCGT CCACGGCGGA GAAGATCGTG GCCGACCGTG AGGCGAACGG CCCCTTCCGC ACGGTGGAGG ATCTCAAGCG CGTGTCCGGC ATCGGGGACA AGAAGTTCGC CGATCTGGCC GATCTCGTAT GCGTGGGATG A
|
Protein sequence | MGFAERAESW RAKAHLTGVR LPVLVGVTAL AAIVLIAAGG ALVKAGTSDG FSLSRDDGAA TSDGTDEDGA ASVEARTVFV HVGGAVVEPG VRELAEGARV QDAVDAAGGF ADGAARDALN LARVLADGEQ IVVPSQEEAV LEPGAAVDGG DAGSRAAASP TGGKIDLNRA TAAELDALPG VGPSTAEKIV ADREANGPFR TVEDLKRVSG IGDKKFADLA DLVCVG
|
| |