Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0459 |
Symbol | |
ID | 8414743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 585130 |
End bp | 588342 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023431 |
Product | LPXTG-motif cell wall anchor domain protein |
Protein accession | YP_003180834 |
Protein GI | 257790228 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCTG CATCGCATGT CGAATCGCCA ATATCCGGAC GGAAGCTGTG CGCATTCGCA CTCGCCGCGG CGCTTGCGCT CATGGCGGGA GCACAGTTCG TTCCCGCGCA TTACGCCTGG GCTGAAGACG AGCCGGCGGC AACGGCTCAG GCGGATGCGA CCCTCCCTGA CGCGGCCCCA CTCGAGGAAG CCCCGCCCGA GGTGGCAGCC TTGGCGCCCA TAGAGAATGC GCCGCACCTC GACCTGAAGG ACGGTTCCAT CCTCTTGAAG GACTCCGGCT CGAACCTCCA ATACAGCCAG GACAACGAGG CAACGTGGAC GTCCTATTCC GGCAACGTGA CGATCGGTGG AACGGCAGGC AGCGGCACGA ACGTGCAAGT GCTCAACGGA ACCCATGCGA TACTTCTCAA CGGCGTTACC ATCAACGAGC CGCGCGCCAA TCACGCGGCG ATTGAAATCG GCAGCGACAC GCAGGCTGCG CAGCTTACGC TCTGGCTGTA TGGCTCGAAC AAGCTTTCCG GTTCGAGCGG GTATCCCGCA ATTTTCGCAC CTGGTAAGGC TCATCTGATT TTCAAAGGCG ATCTCTACGG CAGTCTGGAA GCGCAGGGAG GATACGAGGC TCCCGCCATC GGTGCGGACA GCGCGGTTTC AAGTTGCGGC ACCATCGAGT TTTGGCAACC CGGCAACGTA ACCGCGCGAA GCGGCAACGG CGCCCCTGCC CTCGGCGACC CGACCTCGAG CGCTGTGGAT GCAACGGGGT CGGTGTCCTT CTTTTCGGGA ACGGTGACCT TGCAGTCCTC AGGCACTTGC GACATTACCG CGAAAAGCAT CGTCACCGGC GGCGGCAACA TTCACCTCTC GCAGCGCCCG CCCGCCAACA CGACGCTCGC CGTCGATCAG GGCGGCTCCT TCCCCAGTTC CGCGTATGTT CTGGTGTCGG GCTTGGCGGC AGGTGAGAAT CTGCTCGGAG CCAGTTTTGG TGAGAAAGGC CCTGTCTTTG TCAGAGGCGA TCGGTGGGTG GATGCCAAGC ATTACATCGT AGACGGCTTC GCCACTTGGA TGGACGACGG CGATCTGGGA CTTTTTCTCG ACAGAAACCT GCTTAAGGAC GGCAGCTCCG TCAAGATGAG CTTTAGCGAA TCCGGCAACG TGTACGAAGG CACGATCAAG GGTTCTGCCG AAGCCGGATA CACGCTTCAT CTGGCCATTC CCGACCCGCC TGTTGAAAAC CCCGTCGTCA GCATGCACGA AAAAAAGGGC TGCAAGCTCA TCGTCGATGT GAGTTCGGAG GGCATTCCGC AGTACACCCT CGACGGGGGA GTGAGTTGGA CGCGCTACAG CGGGTACCTG GCTGTCACGG GAGCCGCAGA TTTTACCTCC TTCCCCGTAA GCAATCAGGT GATCGTAAGG AAAGGCACTC ACGCACTGCG CTTTCAGGAT CTGAGCGGCG TTGCGGTTTC AATCGAGGGA TCCTCGAACG CGACGCTCAC GCTGGTGGGA AGTAATTACA TATGCGGCCA CAGACTGGAG GCGTCGCCGG GGGTGGCGGT TCGAGGCACC TCGTCTCTTA CCATCAACGG CAAGGGAACC TTGAATACCG TGGGAGCAGG CACAGGTCGT CAGCCTTGCA TCCTGGGAGA GCCGAACACG ACCATCTCGA TAGCGGGCGG CACGGTTCTC GCTCATTCGG GAGTAACGGG CGGTAATGGC ATTACTGCAG GCAAGCTCGT TGTTTCCGGA GGCACTGTCG AAGCGCTCAG TCAGTCGAGT GGAAGCGCAG CGATCACGGC GAGCAAGTCC ATCGCCGTCA GCGGCGGGTC GGTTACCGCC AAAGGCTCCA AGTTCGTCGC TGGCATCGGG TCGGATCATT ACGGCTCGTG CGGCTCCATC ACCATCAGCG GCGGCATAGT GAACGCGCAC GGCGGCGACC TGTCTGCAGC CATAGGTTGC AGCGTCGGCG GCTCGTGCGG GGAAATCAAG ATCAGCGGGG GCACCGTCAA CGCCGTCGGC GGGCTGGGCA CTCATGGAGC CGGCCGGGTC GCGGCTATCA CAAGCTCCGT CAAGGCGCCC GTCATCACCG GCGGGACGGT AAAATGCAAC GAGGCGGTTT CAACGAGGGC GGCGTCTTTG GCGTCGGCAT CAGCTTCGCC GGCTCCGGCA GCGTTGAACT TCCTGCTCGC CTCCCCCGTC GCGCTCGCGT CCGACCTGGA CGAGGAAGCA ACGTCTCTTG CCTCCGAGGA CGAGCAGCGC TTCGTCAACG AAAACAACGA CGACCTCTCG CTTGTAACGA TACACGGGCT TCCGGCGAAC ACGCCCATCT CCTCCTTGCA GATGCGCCTG CTCGACACGG AATTCGTGGA AGAGCCGACC GGCGGCAGCA CCTACGGCAT GCGCGACGTC CAGACCGACG ACCAAGGCGC GCTCTACCTG CACCTCACGC CGCTCGAGGC GACGGACAAG CTGGTCGTCG CCTCGTGGAA CGGCCGCAAT TACAGCGGGC TGAGCGAGAG GAACCTGGAC GGAACCTTCG CCATCGCACT CGCCCCCACC GAGGCCGCCG TGTACTACCA GTGGCACAGC TTCGAGGACG GCTGGGCCTA CGAGGACGCC GGCGTCGATT GGAGCATCGA CGGGCAGACG AGCGGAAACG CCGCCGGCCA CGACGTGACC GCGCTGCGCA TCGAGACGTC CATCGGCGGC TTGAACGTTT CCTACGCGGT CGACAACGGA AGCGGGTGGA GCCCGGCGGT TGAGAACGGC GACATCGCGG GCGAGATGGA GCAGCCCGTG CAGGCGCTGC GCGTCAGCCT GTCGGGCGAC GAGGCTGCGC GCTACACCGT CTACTACCGC CTGTACGTGA AAGGCGTCGG CTGGATGGCT TGGGCGCATG ACGGCACCGC CAACGGCACG TCGGGATACG GCTACCCCGT CAAGGCGTTC CAGGTGGCGA TACTTCCTGC GGGAGCGACG CCCACGACCG GCGAGGCCAG CGCGACCGAG TACGCGTACC TGGTGAAAGA CGGCCAAAGC CAGCCGGGGC TCATCGGCGA GAAGCCTTCC GCCGGGCATG AGGAACGCGA GGCTGCGCCC CCTGCGAAGG CGCTCGCCTC TACCGGCGAC AGCCCGGCTT TCGCCATCGC CCTGGGCGCA GCCCTCGCAA GCGGCGCGCT GGTTCTTGCG TCGCGCGCCC GTCATCGCAG GTCGCTGTCC TGA
|
Protein sequence | MNAASHVESP ISGRKLCAFA LAAALALMAG AQFVPAHYAW AEDEPAATAQ ADATLPDAAP LEEAPPEVAA LAPIENAPHL DLKDGSILLK DSGSNLQYSQ DNEATWTSYS GNVTIGGTAG SGTNVQVLNG THAILLNGVT INEPRANHAA IEIGSDTQAA QLTLWLYGSN KLSGSSGYPA IFAPGKAHLI FKGDLYGSLE AQGGYEAPAI GADSAVSSCG TIEFWQPGNV TARSGNGAPA LGDPTSSAVD ATGSVSFFSG TVTLQSSGTC DITAKSIVTG GGNIHLSQRP PANTTLAVDQ GGSFPSSAYV LVSGLAAGEN LLGASFGEKG PVFVRGDRWV DAKHYIVDGF ATWMDDGDLG LFLDRNLLKD GSSVKMSFSE SGNVYEGTIK GSAEAGYTLH LAIPDPPVEN PVVSMHEKKG CKLIVDVSSE GIPQYTLDGG VSWTRYSGYL AVTGAADFTS FPVSNQVIVR KGTHALRFQD LSGVAVSIEG SSNATLTLVG SNYICGHRLE ASPGVAVRGT SSLTINGKGT LNTVGAGTGR QPCILGEPNT TISIAGGTVL AHSGVTGGNG ITAGKLVVSG GTVEALSQSS GSAAITASKS IAVSGGSVTA KGSKFVAGIG SDHYGSCGSI TISGGIVNAH GGDLSAAIGC SVGGSCGEIK ISGGTVNAVG GLGTHGAGRV AAITSSVKAP VITGGTVKCN EAVSTRAASL ASASASPAPA ALNFLLASPV ALASDLDEEA TSLASEDEQR FVNENNDDLS LVTIHGLPAN TPISSLQMRL LDTEFVEEPT GGSTYGMRDV QTDDQGALYL HLTPLEATDK LVVASWNGRN YSGLSERNLD GTFAIALAPT EAAVYYQWHS FEDGWAYEDA GVDWSIDGQT SGNAAGHDVT ALRIETSIGG LNVSYAVDNG SGWSPAVENG DIAGEMEQPV QALRVSLSGD EAARYTVYYR LYVKGVGWMA WAHDGTANGT SGYGYPVKAF QVAILPAGAT PTTGEASATE YAYLVKDGQS QPGLIGEKPS AGHEEREAAP PAKALASTGD SPAFAIALGA ALASGALVLA SRARHRRSLS
|
| |