Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2852 |
Symbol | |
ID | 8417183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3309115 |
End bp | 3312471 |
Gene Length | 3357 bp |
Protein Length | 1118 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645025832 |
Product | hypothetical protein |
Protein accession | YP_003183188 |
Protein GI | 257792582 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02543] Listeria/Bacterioides repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0153974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.439704 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGAA CGCAGCGTAA ACGAAGCATG TCCAGGGGCC GCGCGGCGCG GGCCGTCGTC CGCGTCGCGT GCGCCGCGCT CGTGGCCGTC GCCCTGTGGC CGGCCCCCGC GCCCGCCGGG GAGGCCGACG CCTCCCCGGT TCCGGCCGCG CGGGCCCCGG AGTCCGCCGC CCCGGCCGCG CCCGATCCCG AAGCCGGCCC CGGTCCCGAC GCCGGCCGGC CCGACGTGCC CGGCCCCGAC GCCGGCCGGC CCGACGTGCT CGGCCCCGGC GCGTCCGACG CCGTCCAGGA GGATGCCGCG CAGCGCTCCG AGGCGGACGG CCCCCTCGTG GGCGCCCCCG CGCCGCAGCT CGACGGTCCC CTCGGCGCCG ACCCGCGCGC GAACGACGAT CCGTTGAACT TTTCGTGGAC GTTCACCGAC ATCGACGCTG CCACCTGCAA GATAACCGGC TTTTCCGGCT CCGTTCAGCC GACCGACACG GTACACGTGC CGGCGACCAG CCCCGCGGGC AAGAAGGTCG TCGAGATCGC CCGCGGGAGC TCGATCCAGA ACACGTACTC GAACCTGAAG TGCAACATCG ACTTCTCCCA GGCGACCAAC CTGCAGACCA TATCGACGAG CGCCTTCCAG ATGCAGAACG AATACGAGCA GCGAGGACCG AAGGGCGTCA TCGACCTGTC CAAGTGCACG AGCCTCACCA CGATCGGCTC GAATGCCTTC GTCAGCTGCG GGGAGATCAC GAGCCTTGTC CTTCCCCCCG CCTTCGAGAC CGTGGGCTCG AAGGCGTTTT ACAACTGCAA GAAGCTCGCG GGCGAGCTCG CGCTGCCGTC ATCCCTCAAA GGGGTGAACG AGAGCGCGTT CGCCGGCGGC ACGAGGGATG ACGTCAGCCT GGCCCCGAAG CTCACCTCCC TTCTCGGCCT CGAGAACACG AAGCTCGCCA CGGTCGGCGT CGCCGCGTTC AAAGGCAACA TGATCGGCGG CGCGCTCACG TTCCCGGTGA CCCTGGACTC GGTGGGAGCT TCGGCGTTCC AGAACAATCT GATCACGAAG GTTTCTTTCC TGAACACGAC GGTCGACAAG ATCGTCTTGG GCAGCAGCTG CTTCCAGAAT AACAAGATAA AGACCAACCC CCTCACCCAG GGGGTGAGGT TCGGCATGGC CGACTACGCG TTCGCGGACA ACAGTCTGAC GGGTGAGTTC TCCTTCGAAA ACGAGAACGT CCCAGCCCCC GCGAGGGGCG TGATCTCGGG CAACCCGGGC GTGACGAAGG TGACCATCTC GAAGTTCTGG GGGTTCATTC CCGGCGATGC GTTCAAGGGC CTGGCGGGCT TGAAGGAGGT CGTCATCCCT GCGAACAGCA AGTTCACGCG CGTCAGGGCC GGCGCCTTCC AGGATTGCAT CTCCCTCGGG GGCGTCGACT TCGGGCAGGC CCCCTTGGCG GACGGCTCCG CGAGCGATCC CGCGATTGGG GAGAACGCCT TCAAGGGCTG CACGTCGCTG AAATCGGTCA TGATGGGGGA GGGCTCCTAC AGCGGGGTGC CGTCGGCGAC CGTCACGCTC GGCGCTGGCG CGTTCGAGAA TTGCGTCGCG CTCTCGATCA TCAAGATGCC CGAGGCGGTC AGCGCCGGCG GGGTGATCCT CAACGTCAAG ATCGGCGCGC GCGCCTTCAA GGGCACGAAC CTCGGCGCGT TGCCGGATCC GCAGACCGGC CAACCCATGG GCTACCTGCC CCTCGACAAG TACCGGGTCC AGACGATCGG CGCGAGCGCC TTCGAGAACG CGAACATCTC GGACGTGAGG CTGCCCTCCA CGCTGCGCTC CGTGGGCGAG AAAGCTTTCG CGGGCAACCA CATCCCCTAC CTGGAGCTGC CGAACAACGC GGCGCTTGAC ACGGGCGTCG GCGCGAACGT GCTGGCCGAC CAGACGCTCC CCAAGGGCAC GGCGGTGTAC GAGGGCACCA GCGATACGGT GGGCGACGTG CTCCTCGACG TGCTCGAGTC CGGCTACGGC ATCCCCGTCT CTCACCTGAT CTCGGTCGCC CTCAAGCTCG TGGACGGCTC CACGGAGGAT CCGAGCAACA CCGACTGGCA GACCAGCGGG AGGGCGACGT TCAAGGATGA TTCGCTCAGA GCCTCGGGCA CGCGGTTCTC CTACGGCATG AACGTATACC GCCAGGGGAG CTCGACCCCC GTGTCCGCCG GCAGCATGAT CCTCGGCAAG GTGGAGGAGG GCGTCCCCTG CGAGTTCCTT TTCTACAACG ACGAGGGCTA CTCCGACCGC AGGGACTCCC ACCTCCAGTG GGTGCCCTCC GGCTCCTCGC CCCAAGAGAT CCTGCACGGC GCGCCGAACT ACGGCCTCGA CAGGCCCGGC TACCACACCG ACCCCCAGGC CTACGACGCG GCAAGGAAGT CCGGATGGAG GAAGAGCAGC GGGGACAAGG CGCCCGTCGT CCCCAAGGAC GTCGTCGTCA ACGCGGGAGA GAACCCCTCG TACTCGAACC GCTGGATCGC GAACTCCTAC TCCGTGTCCT TCGACGACGG CTGGGACGCC ATGGTGGCGA AGGACGGGCG CCTTGGCATG CCCGACCCGG CCGATCCGGA CAAGATCGTC CGCAACACCG CCTACGTGTC GGGGACGATG GACCCCGTCG GGAGCCTGTC CTACGGCGCG CCCGCCGACC TGCCCGGGAA CGCGTTCACG ATGGGCGGCT ACGAGTTCGA CGGCTGGACG ACCTCGCCGG ACCTCGCCGT CGACGACCGC AAGCAGGGCT CCAACTACTT CGCCGCCGGC ACTCCGATCT CCACGCCCGA CCCGGCCCCC GCCGACGGCG GCGCGCTGAC GCTGTTCGCG CAGTGGAAGG CGGTCGACTA CGGCGCGGAC GACCCGGCCC TCCTGGGCTT CCTCTCCATT CCGAGCTCCC TGTCCCTGGA GCCCTACGGC GACAGGGTCT GGTCCGAGCC CTCGGACCCC GCCGCCCCGG CGGGCGACCA CAGCGTGGTC GTGGCCGCCG CCCCCGAGCT GCCGGGCGCG ACCTGGCCGG CCGGCAAGAC CTACCGGGTG TCGGTGACGA GGCCCTCCGC GGGCGCGCCG CTGCTGTCGC TCTCGCTCCC CGGCGGCGGA GAGGCTCGGG GCTTCGAGGT GCTCGACGCC TCGGGCGCGG TCTACAACCC CGGGGATGGC TCCGCGCCCG CGCCGCTCAT GGTGATCGAC CCGTACGATC CGATCGCCGG CAGCGGGTCG TTCTCCCTGA GGAGCGTCGA CCCGGTAGGC TCGTTCCTCG CCAACGCCCG CTACTCCGGG ACCATGGAGT TCCGGGTCGA CATCGTGTCC GCGACGACCA GGGAGGTGGC GCCGTGA
|
Protein sequence | MDRTQRKRSM SRGRAARAVV RVACAALVAV ALWPAPAPAG EADASPVPAA RAPESAAPAA PDPEAGPGPD AGRPDVPGPD AGRPDVLGPG ASDAVQEDAA QRSEADGPLV GAPAPQLDGP LGADPRANDD PLNFSWTFTD IDAATCKITG FSGSVQPTDT VHVPATSPAG KKVVEIARGS SIQNTYSNLK CNIDFSQATN LQTISTSAFQ MQNEYEQRGP KGVIDLSKCT SLTTIGSNAF VSCGEITSLV LPPAFETVGS KAFYNCKKLA GELALPSSLK GVNESAFAGG TRDDVSLAPK LTSLLGLENT KLATVGVAAF KGNMIGGALT FPVTLDSVGA SAFQNNLITK VSFLNTTVDK IVLGSSCFQN NKIKTNPLTQ GVRFGMADYA FADNSLTGEF SFENENVPAP ARGVISGNPG VTKVTISKFW GFIPGDAFKG LAGLKEVVIP ANSKFTRVRA GAFQDCISLG GVDFGQAPLA DGSASDPAIG ENAFKGCTSL KSVMMGEGSY SGVPSATVTL GAGAFENCVA LSIIKMPEAV SAGGVILNVK IGARAFKGTN LGALPDPQTG QPMGYLPLDK YRVQTIGASA FENANISDVR LPSTLRSVGE KAFAGNHIPY LELPNNAALD TGVGANVLAD QTLPKGTAVY EGTSDTVGDV LLDVLESGYG IPVSHLISVA LKLVDGSTED PSNTDWQTSG RATFKDDSLR ASGTRFSYGM NVYRQGSSTP VSAGSMILGK VEEGVPCEFL FYNDEGYSDR RDSHLQWVPS GSSPQEILHG APNYGLDRPG YHTDPQAYDA ARKSGWRKSS GDKAPVVPKD VVVNAGENPS YSNRWIANSY SVSFDDGWDA MVAKDGRLGM PDPADPDKIV RNTAYVSGTM DPVGSLSYGA PADLPGNAFT MGGYEFDGWT TSPDLAVDDR KQGSNYFAAG TPISTPDPAP ADGGALTLFA QWKAVDYGAD DPALLGFLSI PSSLSLEPYG DRVWSEPSDP AAPAGDHSVV VAAAPELPGA TWPAGKTYRV SVTRPSAGAP LLSLSLPGGG EARGFEVLDA SGAVYNPGDG SAPAPLMVID PYDPIAGSGS FSLRSVDPVG SFLANARYSG TMEFRVDIVS ATTREVAP
|
| |