Gene Information |
Locus tag | Elen_1799 |
Symbol | |
ID | 8416103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2107465 |
End bp | 2108820 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645024770 |
Product | hypothetical protein |
Protein accession | YP_003182153 |
Protein GI | 257791547 |
COG category | [S] Function unknown |
COG ID | [COG4487] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
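The length fields above can be cross-checked against the coordinates. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2107465
end_bp = 2108820

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1356 bp) and Protein Length (451 aa).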
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.41875 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGA TCAAGTGCCC CCATTGCGGC GAGATGTTCA CCATCGACGA GGCCGGTTTC GCGGCCATTC TGAAGCAGGT GCGCGACGCC GAGTTCGACA AGGAGGTGCG TCGCCACGAG CAGCTGATGG CCTCCGAGAA GCAGCAAGCC GTGCAGTTGG CCGTGGCCGA GGCTCTTGCG AAAGCGCAGG GCGACGCGGC TCAGAAGGAG GCGCGCATCG CCGAGCTGGA GGCGCGGCTG CAGGCGGAGC AGCGTGAGCG CGAGAGCCAG CAGCGCCTGG CCCACGCCGA GCGCGAGCGG GCGCTGGCCG ACGCGGCGGC TGCGAAGGAC GCCCGCATCG TCGAGTTGGA GCAACGTGTG GAGGCGCAGG GTCGCGCCTT CGAGGCGGAG AAGAAGCTGG CGGTGGAGCA GGCGCGTTCG GCGCTGGAGC GCGAGCGCGA CGCGCTGGCC GCGCAGGTGG CGCTCAAGGA CGCCGAGAAG AGCCGATGCG AGAGCGCCCT CAAAGAGCAG CTGGCCATCG AGCTCAAAGC CAAGGACGAC ATCATCGCCT ACAAGGACGG CGAGATCGAG CGCTACAAGG ACATGAAGGC GCGCCTGTCC ACCAAGATGG TGGGCGAGTC GTTGGAGCAG CACTGCGAGA CCGAGTTCAA CAAGATCCGC GCCGCCGCGT TCCCGCGCGC GTACTTCGAG AAGGACAACG ACGCGTCCGA GGGCTCGAAG GGCGACTTCA TCTTCCGCGA ATGCGACGAG GAGGGCAACG AGATCGTGTC CATCATGTTC GAGATGAAGA ACGAGTCCGA CGATTCGTCG CATCGTCACA AGAACGAGGA CTTTTTCAAG AAGCTGGACG CCGACCGCAG GAAGAAGGGG TGCGAGTACG CGGTGCTGGT CACGTTGCTG GAGCCGGAAA GCGAGCTGTA CAACCAGGGC ATCGTGGATG TGTCGTACCG CTTCGAGAAG ATGTACGCCA TTCGCCCGCA GTTCTTCCTG CCGATGATCT CCATCCTGCG CAACGCGGCG TTGAACTCGA TGGCGTACAA GGCGGAGCTG GCGGTGGTGC GCAACCAGAA CATAGACATC ACGAAATTCG AAGAGCAGAT GGAAACGTTC AAAAGCGGTT TCGCGCGCAA TTACGATCTG GCCAGCCGCA AGTTCCAGAC GGCCATCGAC GAGATCGACA AGACCATCCT GCACTTGCAG AAGACCAAGG ACGCCCTCGT GTCGTCCGAG AACAACCTGC GCCTGGCGAA CAACAAGGCC CAAGACCTCA CCATCAAGCG CCTGACCAGG AACAACCCCA CCATGAAAGC GGCCTTCGAG GCGCTGGCCG AGGAGAAGGA TGACACGCAG CCGTGA
|
Protein sequence | MNEIKCPHCG EMFTIDEAGF AAILKQVRDA EFDKEVRRHE QLMASEKQQA VQLAVAEALA KAQGDAAQKE ARIAELEARL QAEQRERESQ QRLAHAERER ALADAAAAKD ARIVELEQRV EAQGRAFEAE KKLAVEQARS ALERERDALA AQVALKDAEK SRCESALKEQ LAIELKAKDD IIAYKDGEIE RYKDMKARLS TKMVGESLEQ HCETEFNKIR AAAFPRAYFE KDNDASEGSK GDFIFRECDE EGNEIVSIMF EMKNESDDSS HRHKNEDFFK KLDADRRKKG CEYAVLVTLL EPESELYNQG IVDVSYRFEK MYAIRPQFFL PMISILRNAA LNSMAYKAEL AVVRNQNIDI TKFEEQMETF KSGFARNYDL ASRKFQTAID EIDKTILHLQ KTKDALVSSE NNLRLANNKA QDLTIKRLTR NNPTMKAAFE ALAEEKDDTQ P
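The sequences above are printed in space-separated blocks of ten characters. The sketch below, in Python, shows one way to strip that grouping, compute GC content, and translate a CDS. The helper names (clean, gc_content, translate) are hypothetical, not part of any database tool. The codon table is the standard genetic code, on the assumption that translation table 11 uses the same codon assignments for elongation and differs only in its permitted start codons. Only the first three blocks of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
prefix = "ATGAACGAGA TCAAGTGCCC CCATTGCGGC"
print(translate(prefix))  # MNEIKCPHCG
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to gc_content and translate would allow the same comparison against the GC content and Protein sequence fields.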