Gene Elen_0459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0459 
Symbol 
ID8414743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp585130 
End bp588342 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content64% 
IMG OID645023431 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003180834 
Protein GI257790228 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTG CATCGCATGT CGAATCGCCA ATATCCGGAC GGAAGCTGTG CGCATTCGCA 
CTCGCCGCGG CGCTTGCGCT CATGGCGGGA GCACAGTTCG TTCCCGCGCA TTACGCCTGG
GCTGAAGACG AGCCGGCGGC AACGGCTCAG GCGGATGCGA CCCTCCCTGA CGCGGCCCCA
CTCGAGGAAG CCCCGCCCGA GGTGGCAGCC TTGGCGCCCA TAGAGAATGC GCCGCACCTC
GACCTGAAGG ACGGTTCCAT CCTCTTGAAG GACTCCGGCT CGAACCTCCA ATACAGCCAG
GACAACGAGG CAACGTGGAC GTCCTATTCC GGCAACGTGA CGATCGGTGG AACGGCAGGC
AGCGGCACGA ACGTGCAAGT GCTCAACGGA ACCCATGCGA TACTTCTCAA CGGCGTTACC
ATCAACGAGC CGCGCGCCAA TCACGCGGCG ATTGAAATCG GCAGCGACAC GCAGGCTGCG
CAGCTTACGC TCTGGCTGTA TGGCTCGAAC AAGCTTTCCG GTTCGAGCGG GTATCCCGCA
ATTTTCGCAC CTGGTAAGGC TCATCTGATT TTCAAAGGCG ATCTCTACGG CAGTCTGGAA
GCGCAGGGAG GATACGAGGC TCCCGCCATC GGTGCGGACA GCGCGGTTTC AAGTTGCGGC
ACCATCGAGT TTTGGCAACC CGGCAACGTA ACCGCGCGAA GCGGCAACGG CGCCCCTGCC
CTCGGCGACC CGACCTCGAG CGCTGTGGAT GCAACGGGGT CGGTGTCCTT CTTTTCGGGA
ACGGTGACCT TGCAGTCCTC AGGCACTTGC GACATTACCG CGAAAAGCAT CGTCACCGGC
GGCGGCAACA TTCACCTCTC GCAGCGCCCG CCCGCCAACA CGACGCTCGC CGTCGATCAG
GGCGGCTCCT TCCCCAGTTC CGCGTATGTT CTGGTGTCGG GCTTGGCGGC AGGTGAGAAT
CTGCTCGGAG CCAGTTTTGG TGAGAAAGGC CCTGTCTTTG TCAGAGGCGA TCGGTGGGTG
GATGCCAAGC ATTACATCGT AGACGGCTTC GCCACTTGGA TGGACGACGG CGATCTGGGA
CTTTTTCTCG ACAGAAACCT GCTTAAGGAC GGCAGCTCCG TCAAGATGAG CTTTAGCGAA
TCCGGCAACG TGTACGAAGG CACGATCAAG GGTTCTGCCG AAGCCGGATA CACGCTTCAT
CTGGCCATTC CCGACCCGCC TGTTGAAAAC CCCGTCGTCA GCATGCACGA AAAAAAGGGC
TGCAAGCTCA TCGTCGATGT GAGTTCGGAG GGCATTCCGC AGTACACCCT CGACGGGGGA
GTGAGTTGGA CGCGCTACAG CGGGTACCTG GCTGTCACGG GAGCCGCAGA TTTTACCTCC
TTCCCCGTAA GCAATCAGGT GATCGTAAGG AAAGGCACTC ACGCACTGCG CTTTCAGGAT
CTGAGCGGCG TTGCGGTTTC AATCGAGGGA TCCTCGAACG CGACGCTCAC GCTGGTGGGA
AGTAATTACA TATGCGGCCA CAGACTGGAG GCGTCGCCGG GGGTGGCGGT TCGAGGCACC
TCGTCTCTTA CCATCAACGG CAAGGGAACC TTGAATACCG TGGGAGCAGG CACAGGTCGT
CAGCCTTGCA TCCTGGGAGA GCCGAACACG ACCATCTCGA TAGCGGGCGG CACGGTTCTC
GCTCATTCGG GAGTAACGGG CGGTAATGGC ATTACTGCAG GCAAGCTCGT TGTTTCCGGA
GGCACTGTCG AAGCGCTCAG TCAGTCGAGT GGAAGCGCAG CGATCACGGC GAGCAAGTCC
ATCGCCGTCA GCGGCGGGTC GGTTACCGCC AAAGGCTCCA AGTTCGTCGC TGGCATCGGG
TCGGATCATT ACGGCTCGTG CGGCTCCATC ACCATCAGCG GCGGCATAGT GAACGCGCAC
GGCGGCGACC TGTCTGCAGC CATAGGTTGC AGCGTCGGCG GCTCGTGCGG GGAAATCAAG
ATCAGCGGGG GCACCGTCAA CGCCGTCGGC GGGCTGGGCA CTCATGGAGC CGGCCGGGTC
GCGGCTATCA CAAGCTCCGT CAAGGCGCCC GTCATCACCG GCGGGACGGT AAAATGCAAC
GAGGCGGTTT CAACGAGGGC GGCGTCTTTG GCGTCGGCAT CAGCTTCGCC GGCTCCGGCA
GCGTTGAACT TCCTGCTCGC CTCCCCCGTC GCGCTCGCGT CCGACCTGGA CGAGGAAGCA
ACGTCTCTTG CCTCCGAGGA CGAGCAGCGC TTCGTCAACG AAAACAACGA CGACCTCTCG
CTTGTAACGA TACACGGGCT TCCGGCGAAC ACGCCCATCT CCTCCTTGCA GATGCGCCTG
CTCGACACGG AATTCGTGGA AGAGCCGACC GGCGGCAGCA CCTACGGCAT GCGCGACGTC
CAGACCGACG ACCAAGGCGC GCTCTACCTG CACCTCACGC CGCTCGAGGC GACGGACAAG
CTGGTCGTCG CCTCGTGGAA CGGCCGCAAT TACAGCGGGC TGAGCGAGAG GAACCTGGAC
GGAACCTTCG CCATCGCACT CGCCCCCACC GAGGCCGCCG TGTACTACCA GTGGCACAGC
TTCGAGGACG GCTGGGCCTA CGAGGACGCC GGCGTCGATT GGAGCATCGA CGGGCAGACG
AGCGGAAACG CCGCCGGCCA CGACGTGACC GCGCTGCGCA TCGAGACGTC CATCGGCGGC
TTGAACGTTT CCTACGCGGT CGACAACGGA AGCGGGTGGA GCCCGGCGGT TGAGAACGGC
GACATCGCGG GCGAGATGGA GCAGCCCGTG CAGGCGCTGC GCGTCAGCCT GTCGGGCGAC
GAGGCTGCGC GCTACACCGT CTACTACCGC CTGTACGTGA AAGGCGTCGG CTGGATGGCT
TGGGCGCATG ACGGCACCGC CAACGGCACG TCGGGATACG GCTACCCCGT CAAGGCGTTC
CAGGTGGCGA TACTTCCTGC GGGAGCGACG CCCACGACCG GCGAGGCCAG CGCGACCGAG
TACGCGTACC TGGTGAAAGA CGGCCAAAGC CAGCCGGGGC TCATCGGCGA GAAGCCTTCC
GCCGGGCATG AGGAACGCGA GGCTGCGCCC CCTGCGAAGG CGCTCGCCTC TACCGGCGAC
AGCCCGGCTT TCGCCATCGC CCTGGGCGCA GCCCTCGCAA GCGGCGCGCT GGTTCTTGCG
TCGCGCGCCC GTCATCGCAG GTCGCTGTCC TGA
 
Protein sequence
MNAASHVESP ISGRKLCAFA LAAALALMAG AQFVPAHYAW AEDEPAATAQ ADATLPDAAP 
LEEAPPEVAA LAPIENAPHL DLKDGSILLK DSGSNLQYSQ DNEATWTSYS GNVTIGGTAG
SGTNVQVLNG THAILLNGVT INEPRANHAA IEIGSDTQAA QLTLWLYGSN KLSGSSGYPA
IFAPGKAHLI FKGDLYGSLE AQGGYEAPAI GADSAVSSCG TIEFWQPGNV TARSGNGAPA
LGDPTSSAVD ATGSVSFFSG TVTLQSSGTC DITAKSIVTG GGNIHLSQRP PANTTLAVDQ
GGSFPSSAYV LVSGLAAGEN LLGASFGEKG PVFVRGDRWV DAKHYIVDGF ATWMDDGDLG
LFLDRNLLKD GSSVKMSFSE SGNVYEGTIK GSAEAGYTLH LAIPDPPVEN PVVSMHEKKG
CKLIVDVSSE GIPQYTLDGG VSWTRYSGYL AVTGAADFTS FPVSNQVIVR KGTHALRFQD
LSGVAVSIEG SSNATLTLVG SNYICGHRLE ASPGVAVRGT SSLTINGKGT LNTVGAGTGR
QPCILGEPNT TISIAGGTVL AHSGVTGGNG ITAGKLVVSG GTVEALSQSS GSAAITASKS
IAVSGGSVTA KGSKFVAGIG SDHYGSCGSI TISGGIVNAH GGDLSAAIGC SVGGSCGEIK
ISGGTVNAVG GLGTHGAGRV AAITSSVKAP VITGGTVKCN EAVSTRAASL ASASASPAPA
ALNFLLASPV ALASDLDEEA TSLASEDEQR FVNENNDDLS LVTIHGLPAN TPISSLQMRL
LDTEFVEEPT GGSTYGMRDV QTDDQGALYL HLTPLEATDK LVVASWNGRN YSGLSERNLD
GTFAIALAPT EAAVYYQWHS FEDGWAYEDA GVDWSIDGQT SGNAAGHDVT ALRIETSIGG
LNVSYAVDNG SGWSPAVENG DIAGEMEQPV QALRVSLSGD EAARYTVYYR LYVKGVGWMA
WAHDGTANGT SGYGYPVKAF QVAILPAGAT PTTGEASATE YAYLVKDGQS QPGLIGEKPS
AGHEEREAAP PAKALASTGD SPAFAIALGA ALASGALVLA SRARHRRSLS