Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1588 |
Symbol | |
ID | 8415887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1885945 |
End bp | 1888782 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645024557 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003181945 |
Protein GI | 257791339 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0385421 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAGAACA TGGCTGAAAC CACGTTGACA CGTCGAAGCT TCGTCAAGGC GTCCGCTCTG GTGGGCGCGA CGGCGGCGTT CGGCGCTTCG ATGGCCGGCT GTATGCAGGA GGCTCCGCAG GAGCAGGCTC CGTCCGGCGG CGGCGCCGAC GAGGGTCTCG TGAGGATGAA GACGTCCTGC CACGGCTGCA TCCAGATGTG CCCGGCTATC GCGTATCTGA AGGACGGCGT CGTCGTAAAG CTGGAAGGCG ACCCCGACGC GCCGGTAAGT CGCGGAAGCC TGTGCATCAA GGGCCTCAAC CAGCTGCACA CCATGTACAG CCCCCGCCGC GTGCTGCATC CGCTTCGGCG TGCCGGCGAG CGCGGCGAGA ACAAATGGGA GGTCATCAGC TGGGACGAGG CCGTCGAGGA AGCGGCCACG CATATCTGCG ACGCCATCGA CAAATACGGC CCCTATTCCT TCTTCGCCAG CGTGGGCGGC GGCGGGGCCT ACTCGTTCAT GGAGGCCATG ACCCTGCCCA TGGCGTTCGG GTCGCCCACC GTGTTCGAGC CCGGCTGCGC CCAGTGCTAT CTGCCGCGCT GGAGCATGTC GAAACTGTTC TACGGCGGCA ACGACCAGTC CATCGCCGAC AACGCCGTGC AGGAGATATT CCGTCCCGAC CCCGACAACA AGGCCGAGGT CGTGGTGCTG TGGGGCGCCC AGCCTTCCGT CAGCCAGACG GCGGAATCGG GTCGCGGAAT GGCCGAGCTG CGCGCCAAGG GCGTGAAGAC CATCGTGGTC GATCCCAACT TCTCGCCCGA CGCGGTGAAG GCCGACGTGT GGCTGCCGGT GCGCCCGGCC ACCGACACGG GTCTGCTCCT GTGCTGGTTC CGCTACATCT TCGAGAACAA GCTCTACGAC GAGCAGTTCA CGAAGTACTG GACGAACCTG CCCTTCCTTA TCGACCCCGA GACGAAGCTG CCGGTGAAGG CGCAGGAGCT GTTCCCCGAC TTCCAGCAGA CCACGCCCGA GAACACCCCG GCCTACGTCT GCTACGACCT CAAGACGAAC GCAGTGGCGC CCTTCGAGTT CTCCGCGCCC GCCGACGCCG CAGTGGACCC CGAGATCTTC TGGACGGGCG ACTTCGAAGG GAAGACGTAC AAGACCGCCG GCCAGATCTA CAAGGAAGAA GCCGATCCCT GGACGCTCGA GCACACCGCC GAGAACTGCT GGCTCGATGC CGGCAAGATC GAAAAAGCCA TCAAGATCTA CGCCGAGGCC TCCGTGGCCG GCATCGCCAA CGGCGTGGCG TCGGATATGA CCGAGTCCGC CTCGCAGGTG CCGCTGGGCT GCATGGGCCT GGACTCCATC ATGGGCTACG TCAACAAGCC CGGCTGCACG ATGACGCAGT ACGGCGCCGC CGGCGCGCCG CCGACGAAGC GCCCGGTCAC GTACAACAAC GGCTTCGACG GCATGTTCTC GGACATGTAC GGCATCGGCG CGGTCATCGG CATGAGCGAT GCCGAGAACG AGGCGCGCGC CAAGAAGCTG GGGGAGGAGA ACCCGCAGCA GAAGCTGGCG AACCAGCTGC TCGTCGATCG ACTTGGCATG AAGGACCACA AGGGCCTGTA CGCATGGTGC CACAGCCATA TCCCCACGGT GCGCGAGGCC ATCGCCACCG GCGAGCCGTA CAAGCCGCGC GTGTGGTTCG ACATGTCGGG CAACAAGCTG GCCATGCTGG GCAATGCGAA GTCGTGGTAC GACGTGTTCC CCGAGGTCGA CTACATCATC GGCCAGTACC CGATGCTCAC CTCCTTCCAC ATCGAGGCCG CCGACCTCGT GTTCCCCGTG CGCGAGTGGC TGGAAGAGCC CATGGTGAAC ATGACCCAGC TCAACACCCA GTGGCTGCAG AACGAGTGCG TGCACATCGG CGAGACGGTG TCGCACTCCA TCCCAGCCGC GCAGGTGGTG GCGAAATGCG CGGAGAAGAT GGGCGGCGAG CTGCCCGGTC TCAAGCCCGG ATACCTGGGC AGCGCCACCG AGGAGGAGGT CAAGGCCTCC GTCGCCGAGA CGCTGCACGC GCCCAGCTGG GACGAGCTGG TGAAGGACGC CGACAAGTAC GTGCCCTACG TCACGCCGGC CAGCGAGTAC TTCCATTACG ACCAACATGA AACCGTGGTG GACGATGGCC TGCCTGCCGG CTTCGGCACC GAGTCCCGCA AGATCGAGGT GTACTGCCAG ATCCTGCTCA AGCTGGCGCG CACGGGATAT CCGTTCTGCT ATCCGGAGCC GCAGGAGCCC TGCGAGGACT ACAGCCCCAT CTGCTCGTAC ATCGAGCCGG CGGAGAGCCC GCTATCCGAC GAGGAGTACC CCTTCGTGCT CACGTCGGGC CGCGTGCCGT ACTTCCACCA TGGCACCATG CGCCACGCCG CGCTGTCGCG CGAGCTGTTC CCCACGGCCG AGATCCGCAT CAATCCGGCG AGCGCCAAGG AGCTGGGCAT CGAGCATATG GATTGGGTGA AGGTGACCAG CCGCCGCGGC GAGGTGCACG CGCGCGCCTA CCTCACCGAG GGCGTGCATC CGAAAACCGT GTGGATGGAG CGCTTCTGGA ACCCGGAGTG CTACGACGAG TCCCAGACGA ACCCGACGGC CGGCTGGCGC GAGTGCAACG TGAACGTGCT CACGAAGAAC GACGCCCCGT TCAACGAAGT GTACGGCTCC TACACGAACC GCGGTTTCAC GGTGAAGATC GAGAAGTCCC AGAAACCTGC GAACGTATGG GTGGAGCCCG AGGAGTTCGC GCCGTTCCTG CCGACTGAGG AAATGCTGTC CGAAGCTCAG ACGAAGGATG TGTTCTGA
|
Protein sequence | MENMAETTLT RRSFVKASAL VGATAAFGAS MAGCMQEAPQ EQAPSGGGAD EGLVRMKTSC HGCIQMCPAI AYLKDGVVVK LEGDPDAPVS RGSLCIKGLN QLHTMYSPRR VLHPLRRAGE RGENKWEVIS WDEAVEEAAT HICDAIDKYG PYSFFASVGG GGAYSFMEAM TLPMAFGSPT VFEPGCAQCY LPRWSMSKLF YGGNDQSIAD NAVQEIFRPD PDNKAEVVVL WGAQPSVSQT AESGRGMAEL RAKGVKTIVV DPNFSPDAVK ADVWLPVRPA TDTGLLLCWF RYIFENKLYD EQFTKYWTNL PFLIDPETKL PVKAQELFPD FQQTTPENTP AYVCYDLKTN AVAPFEFSAP ADAAVDPEIF WTGDFEGKTY KTAGQIYKEE ADPWTLEHTA ENCWLDAGKI EKAIKIYAEA SVAGIANGVA SDMTESASQV PLGCMGLDSI MGYVNKPGCT MTQYGAAGAP PTKRPVTYNN GFDGMFSDMY GIGAVIGMSD AENEARAKKL GEENPQQKLA NQLLVDRLGM KDHKGLYAWC HSHIPTVREA IATGEPYKPR VWFDMSGNKL AMLGNAKSWY DVFPEVDYII GQYPMLTSFH IEAADLVFPV REWLEEPMVN MTQLNTQWLQ NECVHIGETV SHSIPAAQVV AKCAEKMGGE LPGLKPGYLG SATEEEVKAS VAETLHAPSW DELVKDADKY VPYVTPASEY FHYDQHETVV DDGLPAGFGT ESRKIEVYCQ ILLKLARTGY PFCYPEPQEP CEDYSPICSY IEPAESPLSD EEYPFVLTSG RVPYFHHGTM RHAALSRELF PTAEIRINPA SAKELGIEHM DWVKVTSRRG EVHARAYLTE GVHPKTVWME RFWNPECYDE SQTNPTAGWR ECNVNVLTKN DAPFNEVYGS YTNRGFTVKI EKSQKPANVW VEPEEFAPFL PTEEMLSEAQ TKDVF
|
| |