Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0151 |
Symbol | |
ID | 8414435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 209446 |
End bp | 212640 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023131 |
Product | LPXTG-motif cell wall anchor domain protein |
Protein accession | YP_003180534 |
Protein GI | 257789928 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTTC GCACACTGTT GATAGGCGCC GCGTTGGGCC TTGCGCTTTG CCTGCCCGTC GCTCCGGGCA TGACGGCGGA AGCCGCCGAA GATACCGTGC AAATCACGGA CATCGAGCAA TTGGTGCCGT ACGTTCCCGA CGCGATCTTC CGCCAAGCCG TGTTCGATGC CGTGAAGGAC GGGGCTGACG GCGCCGAAGG CGCGGACGTC GAAGAAGCGC TGTACAATTT CAGGGGAACC GTTCTGTACA ACAAGAGCAA CGCGACGCCC TCGGATCAGA AGATCAAGGA CGTACACGGC ATTCAATACC TGCGCAACGC CCAGCTCGTC AGCCTGAAGT ACAACGAGAT TCGCGACTTT TCATGGCTCG AGAGGAGCGG CGGGCTCGAA GACAAGTACT ATGGCGAGCT GCTGGCGAAC GACCCCACCA TCGAAATCGA CGAGAGGAAC GTGGTGTGGG ATTTCGGCGG CAACCCCTTT GAGATGCTCC CGACGTTCTT CGGGGGACGC CTGAAGATCA TGCAGCCGGC AAGCTCTTCT TTCACGTACT CGGAAGACGT ATCCCAGCAC CTTGCCTACG TTCGCCCCGC CGGCGAAACC GCCGTCAGCG GAGCCCTCGA CATCGGCAAA TCGGCCATAT ACGAGCATGG CGCGAAGATC GAGGACGCGC ACGTCGTAGA ATGCGGAGTT CACACACGGC CGGGCGACGC CGCCACCAGC ATGGTCATCG CCTCTCATAA CGACACGACC GCCGCCTTCA CCGGCCTCGA GAAAAGCGGG GTCTTCCATA TCTACGTCGG CATGGATAAA GAGTTGAAGT ACGGCACCCA GGACGAGTGG GGCGCCATTA CGGAAGGCGA GCAATCCTAC AAATACTACC TCACGCCCAC CTTTCGCGTA TACGACCGGA TCACGGCCGC GTCCGCAGCC GGAAGCAGCG CCGTGCTGAC CAAGACCGAC TCCACCACCA ACGCGCCGGT GGCCGGAGCG ACCTATGCGG TGTACACGGA CAAGGGGGCT TTCGTCGAAG AGCGCACGAC CGATGATGCG GGCAGCCTCT CCACCTCCAG CCTCCTTCCG GGCGCCTACT ACTTCCAGGA GACCGAAGCG CCCACCGGCT ACCTGCTCAA CGACAAGAAA ATTCCCTTCA CCATCGTAGA GGGCACCACG GGCGCAACCA CCTCGGTCGG CGGCGGCGAG TCGCAGGTGA CCACGTCCGA CGGCCAAACC GTCAACGCCT CTGCCAACGA GCGGCTGTTC GCCGGCGGCA AGGACGGCTC CGGCACCCTG CTCAGCCCCG ACCTGGAGCT TTCATCGAGC AACCCCGACG ACGTCGTAGG CGTGCAGGTG ACGTACGACA AGCTCGACGG CGACCGAGGC GGCGACAACG TCGTGCGCAC GTTCGACAGC CTGACCGACG CCCAAGCCGA CATCAACGCC GAGAAGGGCG ACAACGCCAT CATGGGGCCC GTCTCCGTCA CGGCTAGGTA TCGCACGTCG ACCGCCGCGC CCGTGCAGGT GCAAACGAGC GACGAGCCCG TCGAGGCGAT CGACATCCCC GTGAAGAAGC ACTGGCAGGA CAACCCCGAC TGGCACGGAA CGCGCGCCGA CGTCACCATC CGCTTGTGGT GCGGCGGCGA CGAGGTGGGC ACCTGGACGC TGATCGGAGG AGAGCCGAGC GAAGGCGGGG CCGACGACTT CGACCACGTG TTCACCGGCC TGCCGAAAAC CGACCAGTAC GGCAACGACC TCGTGTACGA AGTGACCGAG GATCCCGTTC GGGACGCTTC GGGCATCACC GGCAACTACA TCTCGACCAT CGACGCCGAT CCAGCGGTTG ACAACGGCGT CATCGTCTCC AACCTGTACA ACGTTGCCGA GAAGTTCTAC CTGACCGGTC AGAAAACATG GAGCGGCGAC ACCGAGGCCG ACCGCCCCGC ATCCGTGTCC CTGACGCTCA CGCAGACGAA CGCGAGCGGG CACGCCCCCT ACATCTTCAA GACCACGGCG TCGGCGCCCG ACTGGACGTA CACGTTCACG AACATCCCGC TGCTCGAAGG CTCCGAGCGG GCAACGTACC AGCTCACCGA AACGCCCGTG AACGGGTACA CGTCGTTCGT CCCCATCGCC AACATCCAGG GCACCGGCGA CATCGAGACG GTGGTCGTTC CGGGCGAGCA AACCGTCAAC AAGATCAAGC GCGTCGACGT CGCCGGCACC AAAACATGGC ACGACTACGA CAACGCCCTC GGCACACGCC CGGCGGCGAT CACCGTGAAC CTGTATCAGG ACGACGTGCT GTTCGACACC GCCACCGCAA CCGCCGACGA CGGCTGGGCG TACCAGTTCA CCGGTCTGCC CGAAGCCGCC GCGAACGGCG CCATCCACGT GTACACCGTG CAGGAGGAGG CCGTCGAGCA CTACGCCACC GTCATCGACG GCACCGCCAT CGCGAACACG CTCGACCCGA AGCTGAACGA CATAGCCGAG GTGAAGGGCA CGAAGACCTG GGACGACAAC GACAACGCCG GCAACACGCG CCCCGAAAGC ATCACCGTGG AGCTGCTGGA CGGCGACGAT GTGGTGAAGT CGCTGGAAAC CACGGAGGCT GACGGTTGGG CGTACGCGTT CGCCGAGCTG CCGAAGTACG CCGACGACGG AACCGAGATC GCCTACACCG TGCGCGAGAA GGACGTCCCG GCCGGCTACG AGGCGGCCGT CTCGGGCCAC AACATCGCGA ACACGCTGAA GCCCCAGCCC GGCGGCGACA CCGTCGAAGT CGCCGGCACG AAGACCTGGG TGGACAACGA CAACGCCGGC ACCACGCGAC CGGAAACGCT CACCGTCTCG CTGTACCAGA ACGACAAGCT CTTCCGCACG CAGGAGACGT CCGCACAACG CGATTGGGCG TACCGCTTCG CCGACCTCCC CCGCTTCGAC GCGGATGGGA AGGAGTTCGC GTACTCGATA CGGGAGGACG CCGTCCCCAG CGGGTACACG GCCTCCGTGA AGGGTTACGA CCTGACCAAC ACCTTGAGCG AGATCCCGCA GAAGCCGAAG GACCCCACCC CGACCGACGC GCCGCACAAG CTGGCCGCCA CCGGCGATGC CCCGTTCGCC GCAGTCGCGC TTGGCGCAAC CTTCGCGGCC GCGCTCCTCG TCGTCGCCGC CTTCCGAAAG CGCAGGCGCG CCTAG
|
Protein sequence | MKVRTLLIGA ALGLALCLPV APGMTAEAAE DTVQITDIEQ LVPYVPDAIF RQAVFDAVKD GADGAEGADV EEALYNFRGT VLYNKSNATP SDQKIKDVHG IQYLRNAQLV SLKYNEIRDF SWLERSGGLE DKYYGELLAN DPTIEIDERN VVWDFGGNPF EMLPTFFGGR LKIMQPASSS FTYSEDVSQH LAYVRPAGET AVSGALDIGK SAIYEHGAKI EDAHVVECGV HTRPGDAATS MVIASHNDTT AAFTGLEKSG VFHIYVGMDK ELKYGTQDEW GAITEGEQSY KYYLTPTFRV YDRITAASAA GSSAVLTKTD STTNAPVAGA TYAVYTDKGA FVEERTTDDA GSLSTSSLLP GAYYFQETEA PTGYLLNDKK IPFTIVEGTT GATTSVGGGE SQVTTSDGQT VNASANERLF AGGKDGSGTL LSPDLELSSS NPDDVVGVQV TYDKLDGDRG GDNVVRTFDS LTDAQADINA EKGDNAIMGP VSVTARYRTS TAAPVQVQTS DEPVEAIDIP VKKHWQDNPD WHGTRADVTI RLWCGGDEVG TWTLIGGEPS EGGADDFDHV FTGLPKTDQY GNDLVYEVTE DPVRDASGIT GNYISTIDAD PAVDNGVIVS NLYNVAEKFY LTGQKTWSGD TEADRPASVS LTLTQTNASG HAPYIFKTTA SAPDWTYTFT NIPLLEGSER ATYQLTETPV NGYTSFVPIA NIQGTGDIET VVVPGEQTVN KIKRVDVAGT KTWHDYDNAL GTRPAAITVN LYQDDVLFDT ATATADDGWA YQFTGLPEAA ANGAIHVYTV QEEAVEHYAT VIDGTAIANT LDPKLNDIAE VKGTKTWDDN DNAGNTRPES ITVELLDGDD VVKSLETTEA DGWAYAFAEL PKYADDGTEI AYTVREKDVP AGYEAAVSGH NIANTLKPQP GGDTVEVAGT KTWVDNDNAG TTRPETLTVS LYQNDKLFRT QETSAQRDWA YRFADLPRFD ADGKEFAYSI REDAVPSGYT ASVKGYDLTN TLSEIPQKPK DPTPTDAPHK LAATGDAPFA AVALGATFAA ALLVVAAFRK RRRA
|
| |