Gene Information |
Locus tag | Elen_1462 |
Symbol | |
ID | 8415760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1745474 |
End bp | 1747660 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024431 |
Product | YhgE/Pip C-terminal domain protein |
Protein accession | YP_003181820 |
Protein GI | 257791214 |
COG category | [S] Function unknown |
COG ID | [COG1511] Predicted membrane protein |
TIGRFAM ID | [TIGR03061] YhgE/Pip N-terminal domain; [TIGR03062] YhgE/Pip C-terminal domain |
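The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions agree with the values in this record, but the record does not state them. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the gene length includes one terminal stop codon,
    which does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1745474, 1747660)
print(length_bp)                  # 2187
print(protein_length(length_bp))  # 728
```

Under these assumptions the computed values match the Gene Length (2187 bp) and Protein Length (728 aa) fields.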
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTAACG TGATCCGCAT CGTGCGCAGC GACTTCAAGC GGCTGTTCGC GAACGCGATG AGCGTGATCA TCGTGATCGG GCTCGTGGTG ATGCCGTCGA TCTTCGCGTG GTACAACGTC ATCGCGTGTT GGAACGTGTT CGACAACACG GGCAACCTGA CGGTGGCGGT AGCGAACGTC GACGACGGCT ACGAGAGCGA TCTCGTGCCT TTGCGCGTGA ACATCGGCGA GCGGGTGGTG TCGGCTCTGC GAGCAAATGA CCAGATCGAC TGGACGTTCA CCACCGAGGA GGACGCGGTG GACGGCGCGC GGTCGGGACG CTACTACGCG GCCGTGGTCA TCCCCGCGGG GTTCAGCAAG GATATGCTGA CGTTCTACTC CGAGGACGTG CAGCACGCCC GTATCGTCTA CTACGCCAAC GAGAAGAAGA GCGCTATTGC GCCGAAGATC ACCGATCAAG GTGCCGATTC GGTATCGTAC CAGGTGAACG AGGTGTTCGC CCAGACGTTG TCAGAAGTGG CGCTGGGCAT CGCCGAGTCG ATGTCCGCCT ACGCCGACGA GGCAGACGTG GGCGGGCGCA TTGCGGGTGT GTCGGGGAAA GTGCGCGACA TGGGAGCGCA AGCCGAACGG ATGGCGTCGG TGCTGGAACT GTATTCGTCG CTGGCCGGCG CTGCGCAGAC GCTCGCGAGC GACTCGGGCA AGCTGGTAGT CGCAGCGCAA AACGGGGTCG ATGGCCTCGA CGCCGCATCG TCTCAGGGCT CTACGTCGGC CTCCGACCTG GTGACGGCCG TGAAAGGGTC GGTCGACGAC CTTGCGGGCG CACTCGACGG TGCTGCCGCG CGCTTCGACG AGGTGTCTGC CTCCGCAGGG GCGCTGTTCG ACGCCGCCTC GACCGGTGCC GCGCAGGGCG CGGCAGGGCT GCGGGGCCAG GCCGAAGCTG TGGACGCGCA GATCGCGCGG TACCGTAGCG CCATAGAGCA GCTCGAGGAG CTGCGAGGCT CGTTGCCTTC CGATGCCCAG CAGGCGCTCG ACGCCGTCAT CGCGCGCATG AGCGCCGTCG TCTCGCTTAT GGAGGGCATG CGCGACAACC TGGGCGCCGC AGCCGAGAGG CTGGAGGCTG GAAACGCCGA CGTGGAGGCT CAGCGCGCGG AGGTCGAACG GCTTGCGGCT GAGGCGCGGC AGGCCTCCGA CGACCTCGCA GCCTCGTTCG ACAGCGGGCT CAAGCCGGGC TTGCAGCAGC TCGCGGACAG CGCGGGCGTG CTTGCGGCTA ACATGGGAGA CGGCCTGGAC GGTCTGCGCT CGTCGGGCGC CGATCTGTCC GCGTCCGCAG GGTCGGCCGC CGACGTGCTG GGCGATGCGC GCGCGAAGGT GGACGAGGCG GCGCAGAAGT TGCGCGAAAC GGCGCGCGAG CTGGATACGC TCGCCGACGG CGTGGACGAG GCGCTGGTCG CGGGCGACGC GGACGCGCTG CGCGCGGTGC TGGGCGCCGA CGCGCAGCTC TTGTCGAAGG CGCTCGCAGC GCCGGTGGGC ATCGAGCGCG AGGCCGTGTT CCCGGCTGAG AACTTCGGTT CGGCCATGGC CCCGCTGTAC ACGACGCTCG CCCTGTTCAT CGGGTCGCTT TTGATCCTCG TGGTGGTGAA GCCGACGGTT TCCGACCGCA CCCGCGAGCA GCTGTCCGAC CCGCAGCCGC GCCAGCTGTT CATGGGGCGC TTCGGCGTGC TGGCGTTCTT GTCGCTCGCG CAGACCACGG TGATGGGCTT GGGGAACCTG CTGTTCTTGC AGGTGCAGGT TGCCGAGCCC 
GCGCTGTTCA TGCTGTGCTT CTGGATAGCC GGCCTCGTGT TCACGTTCCT GATATACGCG CTGGTGGCGG CGTTCGCGAA CCTCGGCAAG GCCGTGGCGG TGCTGCTGTT GATCATCCAG GTGACGGGGT GCGGCGGGTC GTTCCCGCTG CAGCTACTGC CGCCGTTCGT GCAGGCCCTG AGCCCCTGGC TTCCCGCCAC GCACGTGGTG AACGCCATGC GGGCGGCCAT GTTCGGCACC TACGGCGCCG ACTTCTGGAC GGAGATCGGG CTGCTCCTGC TGTTCCTGAT CCCCGCGGCG CTCATCGGCC TCGTGCTGCG CAAACCCCTC GCCAAGTTCA TGACCTGGTA CGTCGAGCAG GTAGAGTCGT CCAAGCTCGT AGGGTAG
|
Protein sequence | MGNVIRIVRS DFKRLFANAM SVIIVIGLVV MPSIFAWYNV IACWNVFDNT GNLTVAVANV DDGYESDLVP LRVNIGERVV SALRANDQID WTFTTEEDAV DGARSGRYYA AVVIPAGFSK DMLTFYSEDV QHARIVYYAN EKKSAIAPKI TDQGADSVSY QVNEVFAQTL SEVALGIAES MSAYADEADV GGRIAGVSGK VRDMGAQAER MASVLELYSS LAGAAQTLAS DSGKLVVAAQ NGVDGLDAAS SQGSTSASDL VTAVKGSVDD LAGALDGAAA RFDEVSASAG ALFDAASTGA AQGAAGLRGQ AEAVDAQIAR YRSAIEQLEE LRGSLPSDAQ QALDAVIARM SAVVSLMEGM RDNLGAAAER LEAGNADVEA QRAEVERLAA EARQASDDLA ASFDSGLKPG LQQLADSAGV LAANMGDGLD GLRSSGADLS ASAGSAADVL GDARAKVDEA AQKLRETARE LDTLADGVDE ALVAGDADAL RAVLGADAQL LSKALAAPVG IEREAVFPAE NFGSAMAPLY TTLALFIGSL LILVVVKPTV SDRTREQLSD PQPRQLFMGR FGVLAFLSLA QTTVMGLGNL LFLQVQVAEP ALFMLCFWIA GLVFTFLIYA LVAAFANLGK AVAVLLLIIQ VTGCGGSFPL QLLPPFVQAL SPWLPATHVV NAMRAAMFGT YGADFWTEIG LLLLFLIPAA LIGLVLRKPL AKFMTWYVEQ VESSKLVG
|
| |
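The relationship between the gene sequence, translation table 11, and the protein sequence can be illustrated with a short sketch. It builds the codon table from the NCBI layout for table 11 (same amino acid assignments as the standard code, with alternative start codons such as GTG read as M at the first position). The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical, and only short fragments copied from the record are used, so the sketch does not by itself confirm the reported 67% GC content.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code),
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 accepts as translation starts.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group bases and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str, is_cds_start: bool = True) -> str:
    """Translate a CDS (or fragment) until the first stop codon.

    If is_cds_start is True and the first codon is a valid table 11
    start codon, it is translated as M regardless of its usual meaning.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
head = "GTGGGTAACG TGATCCGCAT CGTGCGCAGC"
print(translate_cds(head))  # MGNVIRIVRS

# Last 27 bases of the gene sequence, ending in the TAG stop codon.
tail = "GTAGAGTCGTCCAAGCTCGTAGGGTAG"
print(translate_cds(tail, is_cds_start=False))  # VESSKLVG
```

The two fragments reproduce the first ten and last eight residues of the protein sequence shown above. Calling `gc_fraction` on the full gene sequence would give the value to compare against the GC content field.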