Gene Elen_2852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2852 
Symbol 
ID8417183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3309115 
End bp3312471 
Gene Length3357 bp 
Protein Length1118 aa 
Translation table11 
GC content69% 
IMG OID645025832 
Producthypothetical protein 
Protein accessionYP_003183188 
Protein GI257792582 
COG category 
COG ID 
TIGRFAM ID[TIGR02543] Listeria/Bacterioides repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0153974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.439704 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGAA CGCAGCGTAA ACGAAGCATG TCCAGGGGCC GCGCGGCGCG GGCCGTCGTC 
CGCGTCGCGT GCGCCGCGCT CGTGGCCGTC GCCCTGTGGC CGGCCCCCGC GCCCGCCGGG
GAGGCCGACG CCTCCCCGGT TCCGGCCGCG CGGGCCCCGG AGTCCGCCGC CCCGGCCGCG
CCCGATCCCG AAGCCGGCCC CGGTCCCGAC GCCGGCCGGC CCGACGTGCC CGGCCCCGAC
GCCGGCCGGC CCGACGTGCT CGGCCCCGGC GCGTCCGACG CCGTCCAGGA GGATGCCGCG
CAGCGCTCCG AGGCGGACGG CCCCCTCGTG GGCGCCCCCG CGCCGCAGCT CGACGGTCCC
CTCGGCGCCG ACCCGCGCGC GAACGACGAT CCGTTGAACT TTTCGTGGAC GTTCACCGAC
ATCGACGCTG CCACCTGCAA GATAACCGGC TTTTCCGGCT CCGTTCAGCC GACCGACACG
GTACACGTGC CGGCGACCAG CCCCGCGGGC AAGAAGGTCG TCGAGATCGC CCGCGGGAGC
TCGATCCAGA ACACGTACTC GAACCTGAAG TGCAACATCG ACTTCTCCCA GGCGACCAAC
CTGCAGACCA TATCGACGAG CGCCTTCCAG ATGCAGAACG AATACGAGCA GCGAGGACCG
AAGGGCGTCA TCGACCTGTC CAAGTGCACG AGCCTCACCA CGATCGGCTC GAATGCCTTC
GTCAGCTGCG GGGAGATCAC GAGCCTTGTC CTTCCCCCCG CCTTCGAGAC CGTGGGCTCG
AAGGCGTTTT ACAACTGCAA GAAGCTCGCG GGCGAGCTCG CGCTGCCGTC ATCCCTCAAA
GGGGTGAACG AGAGCGCGTT CGCCGGCGGC ACGAGGGATG ACGTCAGCCT GGCCCCGAAG
CTCACCTCCC TTCTCGGCCT CGAGAACACG AAGCTCGCCA CGGTCGGCGT CGCCGCGTTC
AAAGGCAACA TGATCGGCGG CGCGCTCACG TTCCCGGTGA CCCTGGACTC GGTGGGAGCT
TCGGCGTTCC AGAACAATCT GATCACGAAG GTTTCTTTCC TGAACACGAC GGTCGACAAG
ATCGTCTTGG GCAGCAGCTG CTTCCAGAAT AACAAGATAA AGACCAACCC CCTCACCCAG
GGGGTGAGGT TCGGCATGGC CGACTACGCG TTCGCGGACA ACAGTCTGAC GGGTGAGTTC
TCCTTCGAAA ACGAGAACGT CCCAGCCCCC GCGAGGGGCG TGATCTCGGG CAACCCGGGC
GTGACGAAGG TGACCATCTC GAAGTTCTGG GGGTTCATTC CCGGCGATGC GTTCAAGGGC
CTGGCGGGCT TGAAGGAGGT CGTCATCCCT GCGAACAGCA AGTTCACGCG CGTCAGGGCC
GGCGCCTTCC AGGATTGCAT CTCCCTCGGG GGCGTCGACT TCGGGCAGGC CCCCTTGGCG
GACGGCTCCG CGAGCGATCC CGCGATTGGG GAGAACGCCT TCAAGGGCTG CACGTCGCTG
AAATCGGTCA TGATGGGGGA GGGCTCCTAC AGCGGGGTGC CGTCGGCGAC CGTCACGCTC
GGCGCTGGCG CGTTCGAGAA TTGCGTCGCG CTCTCGATCA TCAAGATGCC CGAGGCGGTC
AGCGCCGGCG GGGTGATCCT CAACGTCAAG ATCGGCGCGC GCGCCTTCAA GGGCACGAAC
CTCGGCGCGT TGCCGGATCC GCAGACCGGC CAACCCATGG GCTACCTGCC CCTCGACAAG
TACCGGGTCC AGACGATCGG CGCGAGCGCC TTCGAGAACG CGAACATCTC GGACGTGAGG
CTGCCCTCCA CGCTGCGCTC CGTGGGCGAG AAAGCTTTCG CGGGCAACCA CATCCCCTAC
CTGGAGCTGC CGAACAACGC GGCGCTTGAC ACGGGCGTCG GCGCGAACGT GCTGGCCGAC
CAGACGCTCC CCAAGGGCAC GGCGGTGTAC GAGGGCACCA GCGATACGGT GGGCGACGTG
CTCCTCGACG TGCTCGAGTC CGGCTACGGC ATCCCCGTCT CTCACCTGAT CTCGGTCGCC
CTCAAGCTCG TGGACGGCTC CACGGAGGAT CCGAGCAACA CCGACTGGCA GACCAGCGGG
AGGGCGACGT TCAAGGATGA TTCGCTCAGA GCCTCGGGCA CGCGGTTCTC CTACGGCATG
AACGTATACC GCCAGGGGAG CTCGACCCCC GTGTCCGCCG GCAGCATGAT CCTCGGCAAG
GTGGAGGAGG GCGTCCCCTG CGAGTTCCTT TTCTACAACG ACGAGGGCTA CTCCGACCGC
AGGGACTCCC ACCTCCAGTG GGTGCCCTCC GGCTCCTCGC CCCAAGAGAT CCTGCACGGC
GCGCCGAACT ACGGCCTCGA CAGGCCCGGC TACCACACCG ACCCCCAGGC CTACGACGCG
GCAAGGAAGT CCGGATGGAG GAAGAGCAGC GGGGACAAGG CGCCCGTCGT CCCCAAGGAC
GTCGTCGTCA ACGCGGGAGA GAACCCCTCG TACTCGAACC GCTGGATCGC GAACTCCTAC
TCCGTGTCCT TCGACGACGG CTGGGACGCC ATGGTGGCGA AGGACGGGCG CCTTGGCATG
CCCGACCCGG CCGATCCGGA CAAGATCGTC CGCAACACCG CCTACGTGTC GGGGACGATG
GACCCCGTCG GGAGCCTGTC CTACGGCGCG CCCGCCGACC TGCCCGGGAA CGCGTTCACG
ATGGGCGGCT ACGAGTTCGA CGGCTGGACG ACCTCGCCGG ACCTCGCCGT CGACGACCGC
AAGCAGGGCT CCAACTACTT CGCCGCCGGC ACTCCGATCT CCACGCCCGA CCCGGCCCCC
GCCGACGGCG GCGCGCTGAC GCTGTTCGCG CAGTGGAAGG CGGTCGACTA CGGCGCGGAC
GACCCGGCCC TCCTGGGCTT CCTCTCCATT CCGAGCTCCC TGTCCCTGGA GCCCTACGGC
GACAGGGTCT GGTCCGAGCC CTCGGACCCC GCCGCCCCGG CGGGCGACCA CAGCGTGGTC
GTGGCCGCCG CCCCCGAGCT GCCGGGCGCG ACCTGGCCGG CCGGCAAGAC CTACCGGGTG
TCGGTGACGA GGCCCTCCGC GGGCGCGCCG CTGCTGTCGC TCTCGCTCCC CGGCGGCGGA
GAGGCTCGGG GCTTCGAGGT GCTCGACGCC TCGGGCGCGG TCTACAACCC CGGGGATGGC
TCCGCGCCCG CGCCGCTCAT GGTGATCGAC CCGTACGATC CGATCGCCGG CAGCGGGTCG
TTCTCCCTGA GGAGCGTCGA CCCGGTAGGC TCGTTCCTCG CCAACGCCCG CTACTCCGGG
ACCATGGAGT TCCGGGTCGA CATCGTGTCC GCGACGACCA GGGAGGTGGC GCCGTGA
 
Protein sequence
MDRTQRKRSM SRGRAARAVV RVACAALVAV ALWPAPAPAG EADASPVPAA RAPESAAPAA 
PDPEAGPGPD AGRPDVPGPD AGRPDVLGPG ASDAVQEDAA QRSEADGPLV GAPAPQLDGP
LGADPRANDD PLNFSWTFTD IDAATCKITG FSGSVQPTDT VHVPATSPAG KKVVEIARGS
SIQNTYSNLK CNIDFSQATN LQTISTSAFQ MQNEYEQRGP KGVIDLSKCT SLTTIGSNAF
VSCGEITSLV LPPAFETVGS KAFYNCKKLA GELALPSSLK GVNESAFAGG TRDDVSLAPK
LTSLLGLENT KLATVGVAAF KGNMIGGALT FPVTLDSVGA SAFQNNLITK VSFLNTTVDK
IVLGSSCFQN NKIKTNPLTQ GVRFGMADYA FADNSLTGEF SFENENVPAP ARGVISGNPG
VTKVTISKFW GFIPGDAFKG LAGLKEVVIP ANSKFTRVRA GAFQDCISLG GVDFGQAPLA
DGSASDPAIG ENAFKGCTSL KSVMMGEGSY SGVPSATVTL GAGAFENCVA LSIIKMPEAV
SAGGVILNVK IGARAFKGTN LGALPDPQTG QPMGYLPLDK YRVQTIGASA FENANISDVR
LPSTLRSVGE KAFAGNHIPY LELPNNAALD TGVGANVLAD QTLPKGTAVY EGTSDTVGDV
LLDVLESGYG IPVSHLISVA LKLVDGSTED PSNTDWQTSG RATFKDDSLR ASGTRFSYGM
NVYRQGSSTP VSAGSMILGK VEEGVPCEFL FYNDEGYSDR RDSHLQWVPS GSSPQEILHG
APNYGLDRPG YHTDPQAYDA ARKSGWRKSS GDKAPVVPKD VVVNAGENPS YSNRWIANSY
SVSFDDGWDA MVAKDGRLGM PDPADPDKIV RNTAYVSGTM DPVGSLSYGA PADLPGNAFT
MGGYEFDGWT TSPDLAVDDR KQGSNYFAAG TPISTPDPAP ADGGALTLFA QWKAVDYGAD
DPALLGFLSI PSSLSLEPYG DRVWSEPSDP AAPAGDHSVV VAAAPELPGA TWPAGKTYRV
SVTRPSAGAP LLSLSLPGGG EARGFEVLDA SGAVYNPGDG SAPAPLMVID PYDPIAGSGS
FSLRSVDPVG SFLANARYSG TMEFRVDIVS ATTREVAP