Gene Elen_0151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0151 
Symbol 
ID8414435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp209446 
End bp212640 
Gene Length3195 bp 
Protein Length1064 aa 
Translation table11 
GC content65% 
IMG OID645023131 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003180534 
Protein GI257789928 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTC GCACACTGTT GATAGGCGCC GCGTTGGGCC TTGCGCTTTG CCTGCCCGTC 
GCTCCGGGCA TGACGGCGGA AGCCGCCGAA GATACCGTGC AAATCACGGA CATCGAGCAA
TTGGTGCCGT ACGTTCCCGA CGCGATCTTC CGCCAAGCCG TGTTCGATGC CGTGAAGGAC
GGGGCTGACG GCGCCGAAGG CGCGGACGTC GAAGAAGCGC TGTACAATTT CAGGGGAACC
GTTCTGTACA ACAAGAGCAA CGCGACGCCC TCGGATCAGA AGATCAAGGA CGTACACGGC
ATTCAATACC TGCGCAACGC CCAGCTCGTC AGCCTGAAGT ACAACGAGAT TCGCGACTTT
TCATGGCTCG AGAGGAGCGG CGGGCTCGAA GACAAGTACT ATGGCGAGCT GCTGGCGAAC
GACCCCACCA TCGAAATCGA CGAGAGGAAC GTGGTGTGGG ATTTCGGCGG CAACCCCTTT
GAGATGCTCC CGACGTTCTT CGGGGGACGC CTGAAGATCA TGCAGCCGGC AAGCTCTTCT
TTCACGTACT CGGAAGACGT ATCCCAGCAC CTTGCCTACG TTCGCCCCGC CGGCGAAACC
GCCGTCAGCG GAGCCCTCGA CATCGGCAAA TCGGCCATAT ACGAGCATGG CGCGAAGATC
GAGGACGCGC ACGTCGTAGA ATGCGGAGTT CACACACGGC CGGGCGACGC CGCCACCAGC
ATGGTCATCG CCTCTCATAA CGACACGACC GCCGCCTTCA CCGGCCTCGA GAAAAGCGGG
GTCTTCCATA TCTACGTCGG CATGGATAAA GAGTTGAAGT ACGGCACCCA GGACGAGTGG
GGCGCCATTA CGGAAGGCGA GCAATCCTAC AAATACTACC TCACGCCCAC CTTTCGCGTA
TACGACCGGA TCACGGCCGC GTCCGCAGCC GGAAGCAGCG CCGTGCTGAC CAAGACCGAC
TCCACCACCA ACGCGCCGGT GGCCGGAGCG ACCTATGCGG TGTACACGGA CAAGGGGGCT
TTCGTCGAAG AGCGCACGAC CGATGATGCG GGCAGCCTCT CCACCTCCAG CCTCCTTCCG
GGCGCCTACT ACTTCCAGGA GACCGAAGCG CCCACCGGCT ACCTGCTCAA CGACAAGAAA
ATTCCCTTCA CCATCGTAGA GGGCACCACG GGCGCAACCA CCTCGGTCGG CGGCGGCGAG
TCGCAGGTGA CCACGTCCGA CGGCCAAACC GTCAACGCCT CTGCCAACGA GCGGCTGTTC
GCCGGCGGCA AGGACGGCTC CGGCACCCTG CTCAGCCCCG ACCTGGAGCT TTCATCGAGC
AACCCCGACG ACGTCGTAGG CGTGCAGGTG ACGTACGACA AGCTCGACGG CGACCGAGGC
GGCGACAACG TCGTGCGCAC GTTCGACAGC CTGACCGACG CCCAAGCCGA CATCAACGCC
GAGAAGGGCG ACAACGCCAT CATGGGGCCC GTCTCCGTCA CGGCTAGGTA TCGCACGTCG
ACCGCCGCGC CCGTGCAGGT GCAAACGAGC GACGAGCCCG TCGAGGCGAT CGACATCCCC
GTGAAGAAGC ACTGGCAGGA CAACCCCGAC TGGCACGGAA CGCGCGCCGA CGTCACCATC
CGCTTGTGGT GCGGCGGCGA CGAGGTGGGC ACCTGGACGC TGATCGGAGG AGAGCCGAGC
GAAGGCGGGG CCGACGACTT CGACCACGTG TTCACCGGCC TGCCGAAAAC CGACCAGTAC
GGCAACGACC TCGTGTACGA AGTGACCGAG GATCCCGTTC GGGACGCTTC GGGCATCACC
GGCAACTACA TCTCGACCAT CGACGCCGAT CCAGCGGTTG ACAACGGCGT CATCGTCTCC
AACCTGTACA ACGTTGCCGA GAAGTTCTAC CTGACCGGTC AGAAAACATG GAGCGGCGAC
ACCGAGGCCG ACCGCCCCGC ATCCGTGTCC CTGACGCTCA CGCAGACGAA CGCGAGCGGG
CACGCCCCCT ACATCTTCAA GACCACGGCG TCGGCGCCCG ACTGGACGTA CACGTTCACG
AACATCCCGC TGCTCGAAGG CTCCGAGCGG GCAACGTACC AGCTCACCGA AACGCCCGTG
AACGGGTACA CGTCGTTCGT CCCCATCGCC AACATCCAGG GCACCGGCGA CATCGAGACG
GTGGTCGTTC CGGGCGAGCA AACCGTCAAC AAGATCAAGC GCGTCGACGT CGCCGGCACC
AAAACATGGC ACGACTACGA CAACGCCCTC GGCACACGCC CGGCGGCGAT CACCGTGAAC
CTGTATCAGG ACGACGTGCT GTTCGACACC GCCACCGCAA CCGCCGACGA CGGCTGGGCG
TACCAGTTCA CCGGTCTGCC CGAAGCCGCC GCGAACGGCG CCATCCACGT GTACACCGTG
CAGGAGGAGG CCGTCGAGCA CTACGCCACC GTCATCGACG GCACCGCCAT CGCGAACACG
CTCGACCCGA AGCTGAACGA CATAGCCGAG GTGAAGGGCA CGAAGACCTG GGACGACAAC
GACAACGCCG GCAACACGCG CCCCGAAAGC ATCACCGTGG AGCTGCTGGA CGGCGACGAT
GTGGTGAAGT CGCTGGAAAC CACGGAGGCT GACGGTTGGG CGTACGCGTT CGCCGAGCTG
CCGAAGTACG CCGACGACGG AACCGAGATC GCCTACACCG TGCGCGAGAA GGACGTCCCG
GCCGGCTACG AGGCGGCCGT CTCGGGCCAC AACATCGCGA ACACGCTGAA GCCCCAGCCC
GGCGGCGACA CCGTCGAAGT CGCCGGCACG AAGACCTGGG TGGACAACGA CAACGCCGGC
ACCACGCGAC CGGAAACGCT CACCGTCTCG CTGTACCAGA ACGACAAGCT CTTCCGCACG
CAGGAGACGT CCGCACAACG CGATTGGGCG TACCGCTTCG CCGACCTCCC CCGCTTCGAC
GCGGATGGGA AGGAGTTCGC GTACTCGATA CGGGAGGACG CCGTCCCCAG CGGGTACACG
GCCTCCGTGA AGGGTTACGA CCTGACCAAC ACCTTGAGCG AGATCCCGCA GAAGCCGAAG
GACCCCACCC CGACCGACGC GCCGCACAAG CTGGCCGCCA CCGGCGATGC CCCGTTCGCC
GCAGTCGCGC TTGGCGCAAC CTTCGCGGCC GCGCTCCTCG TCGTCGCCGC CTTCCGAAAG
CGCAGGCGCG CCTAG
 
Protein sequence
MKVRTLLIGA ALGLALCLPV APGMTAEAAE DTVQITDIEQ LVPYVPDAIF RQAVFDAVKD 
GADGAEGADV EEALYNFRGT VLYNKSNATP SDQKIKDVHG IQYLRNAQLV SLKYNEIRDF
SWLERSGGLE DKYYGELLAN DPTIEIDERN VVWDFGGNPF EMLPTFFGGR LKIMQPASSS
FTYSEDVSQH LAYVRPAGET AVSGALDIGK SAIYEHGAKI EDAHVVECGV HTRPGDAATS
MVIASHNDTT AAFTGLEKSG VFHIYVGMDK ELKYGTQDEW GAITEGEQSY KYYLTPTFRV
YDRITAASAA GSSAVLTKTD STTNAPVAGA TYAVYTDKGA FVEERTTDDA GSLSTSSLLP
GAYYFQETEA PTGYLLNDKK IPFTIVEGTT GATTSVGGGE SQVTTSDGQT VNASANERLF
AGGKDGSGTL LSPDLELSSS NPDDVVGVQV TYDKLDGDRG GDNVVRTFDS LTDAQADINA
EKGDNAIMGP VSVTARYRTS TAAPVQVQTS DEPVEAIDIP VKKHWQDNPD WHGTRADVTI
RLWCGGDEVG TWTLIGGEPS EGGADDFDHV FTGLPKTDQY GNDLVYEVTE DPVRDASGIT
GNYISTIDAD PAVDNGVIVS NLYNVAEKFY LTGQKTWSGD TEADRPASVS LTLTQTNASG
HAPYIFKTTA SAPDWTYTFT NIPLLEGSER ATYQLTETPV NGYTSFVPIA NIQGTGDIET
VVVPGEQTVN KIKRVDVAGT KTWHDYDNAL GTRPAAITVN LYQDDVLFDT ATATADDGWA
YQFTGLPEAA ANGAIHVYTV QEEAVEHYAT VIDGTAIANT LDPKLNDIAE VKGTKTWDDN
DNAGNTRPES ITVELLDGDD VVKSLETTEA DGWAYAFAEL PKYADDGTEI AYTVREKDVP
AGYEAAVSGH NIANTLKPQP GGDTVEVAGT KTWVDNDNAG TTRPETLTVS LYQNDKLFRT
QETSAQRDWA YRFADLPRFD ADGKEFAYSI REDAVPSGYT ASVKGYDLTN TLSEIPQKPK
DPTPTDAPHK LAATGDAPFA AVALGATFAA ALLVVAAFRK RRRA