Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0452 |
Symbol | |
ID | 8414736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 575770 |
End bp | 579033 |
Gene Length | 3264 bp |
Protein Length | 1087 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645023425 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003180828 |
Protein GI | 257790222 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGC AGCATAAAGA CGGACGGCGA TGCGATGCCG CAAGCGAGCC TCAGCGTTCG AGCTTGTTCG GAACGCCTTC GCGGCGCGCC GTGGTCGGCG CCGGTTGCGC GGTCGCCGCA GGCGCCTTGG TCGCCGGAGG CGGGCTGGGC GCGTACATGG CCGCTGATGA CCCGCTGACC GACGAGCCTC ACGGGCGCGG CAACGCCACC GGCGCGTGCG CGGCCGACGA CGTCATCCTG TCCATGTGCA ACAACTGCAA CAGCTACTGC ACCATCAAGG TGCGCGTGAC CGATGCGGCC GACGGCAAGC AGGCGAACGA CGGCGCCACC GCGCTGGTGC GCAAGATCGC GGGCAATCCG TACTCGCCGC TCAACTCGCA GCCCTACGCG CCGATCCCCT ACGCTACGCG GCCCGAGGAA GCGCTCGCCC CCGGCGACGA CATGGCCGTG GCCGGTCGCG CTTCGAACGG GGGCATGATA TGCCTCAAGG GCCAGGCGGG CATTCAGCTC GTGCACGACC GGTTCCGCAT CACGCAGCCT TTGCGCCGCG TGGGGGAGCG CGGGTCGGAC GAGTGGGAGA CGGTCAGCTG GGACACGGCC CTCGACGAGA TCGTGAACGG GTCGCCCGCG CTGGGCACGC CGGGCATCGC CGAATGGTAC GCCTACGCGC CGAAGAAGCA GGTGGAGGCC GATGTCGCGC TTGTGGAGTC GGGCGAGATG ACCAAGGACG CCTTCGCGGC GAAATGGGCC GACAAGCTGA TCGACGTGGA GCATCCCGAC CTGGGCCCCA AGTCCAACCT GTTCTGCTCG GCCGGCGGCG ACCGCATGTT CCTCATCGGG GACCGCTTGA CGCAGCTGGG GTTCGGCTCG GTCAACAACT TCAACCACGG CGGCGTGTGC GGCATGACGG GCGTCATGGC CAACGTGCGC ACGCACCCCA CGACGAACCA TAAGCGCATG TACGCCGACA TCGACCATTG CGAATGCCTT ATCATCTGGG GCACCGAGCC CATGACGGCG AACAAGGGTC CGTCGTGGCT GGCTCCGCGC CTGTCGGTGG CGCGCGAGCG CGGCATGAAG CTGTACGTGG TCGACCCGCG CCAGGGGCGT AGCGCCTCGA AGGCCGACGT GTGGCTGCCC GTGATTCCCG GCAAGGACGC CGAGCTGGCG TTCGCCATGA TGTCGTGGAT CATCGCGAAC GAGCGCTACG ATGCGGCCTA CCTGTCGGCG CCTTCGAAGA AGGCAGCCGC GGCGCTGGGC GAGCCCACCT GGTCCGATGC GACGCACCTC GTGGCCGTCG ACCTGCCGAA CCGGCCCATC GTCACGGCCA AGGCGCTGGG GCGCGCGGGC GAGGTCGGCG CGAACGGCGA GGCGCTGGCC GACGACGCGC GCTTCGTGCT GGTGGACGGC GAGCTGACTC TCGCCGATGC CGCCGAGGGC GCCGCCGATC TGCTAGTGGA CGCAGAGATC GACATCAAGG GAAAGCCTGC GCGCGTGAAG AGCGTGTTCC AGCTTTTGAA AGAGCGCGTC GAGGAGCGAA CGCTGGAGGA GTACGCCGCC GATGCGGGCA TCGAGCCCTC CGTGGTGGAA GAGGTGGCGC GCGAGTTCAC GTCGCACGGC AAACGTGCCT GCGTCATGAG CTACCGCGGC CCGGCTATGC ACGCCAACGG CTTCGATGCG GTGCGCGCCG TGGGGTATCT CAACTTCCTC ATCGGCAACC ACGATTGGAA GGGCGGTCAC ATCGCCGCCG CGGCGAAGTT CGCGCCGTTC GAGGGCCGCT ACGACTTGAA GACCGTGCCG GACGCGCATG CCGGATGGGG CATTCCCATC ACGCGCCAGA AGACCGAGTA CGAGAAGACC AGCTACTTCA AGCAGGACGG CTATCCGGCG CCGCGTCCGT GGTACCCGCT GCCGGGCAAC CTGTCGCACG AGATCGTGCC CACCTTGCGC GCGGGCTACG TGTACGATCA TTTGGGCGCG CTGTTCATCC ATCGCCATTC GCTGGTGGAC TCCACGCCCG GCGGGCGGCG ACTGGCCGAC GTGCTGGGCG ACCAGGACAA GATCAACCTG CTGGTGTCGT TCGACGTGGA GATAGGCGAC ACGTCGCGCT TTGCCGACTT CGTGTTACCG GACAAGGTGT ACCTCGAGCG CTTCAGCCAG GAATCCATCT ATCCGAACCA GCAGTACCAG CTCATCCAGC TGGGGCAGCC GGCGGTGCGC GCCTTCGACG GCCCGCGCTC GGTGGAGGAC GTGTACTTCG ACATCATGCA ACGCCTCGGG CTGCCGGGCG TCGGCGAGCA TGCCGTGCCG GTGGGCAAGG ACGGCGACGG CGGCACGGCG GCCCTTTCGA CCGAGTACGA CTACTGGTTG AAGATGGCCG CGAACATCGC CTACGCCGGC GAGAAGCCGG TGCCGGACGC CGACGACGAC GAGCTCGCCC TGTTCGAGCG CGCCCGCAAG CGCGCGCTGG GCGAGGCGTT CGATCTGGAG GCGTGGAAAG CCGCCGTCAC CGAAGAGGAA TGGCCGAAGG TGGTGTACGT GCTGAACCGC GGCGGGCGGT TCGCGTCGGC CGACCCTGCG AAGGGCGACG GTTACGACGG AGATCTGATC AAGACGAAGT ACGCCGGCTT GTGCGCGTTC TACGACCCGA AGACGGCTTC GCTGAAAGAC GCGTTGACCG GTGAGAACTT CGACGGATTG GCCCATACGG CGCCGATCGC CTTCGCCGAC GGCACGCCGA TGGCGCGGCC GGCCGACCGA CCGTTCGCGT TCATCAACTG GAAAGCGCGC ACGAACGGCA CGCACCGCAC CATCGCCGCG TCGTGGCTGC GCGAGACGAC CACCGAGAAC TTCGTATGGA TGTCGCCGTC CGACGCTGCC GAGCGCGGCT TGAAGAACGG CGATGCCGTG GAGGTCGTGG GGCCTGAAGG CACGCTTTCC GGTCACGTGC GCGTGACCGA GGGCATTCGT CCCGGCGTGG TGGGGGCCAA CTACTCGTTC GGCCAGCAGG GCTACGCCGC GCGCGCGGTC ACCATCGACG GCATGTTGAC GGGCCCGGCT CCCGATTACC TCGAAGAAGA GGGCATCCTC GACGGCGACG AGCCCGGCAA GCAGAAGACC GGCTTCGCCG GCGGGCGGGG GCGCGGCTTC TGCATGAACG AGCTGCTGCC CGAGGAAACG CTTGCCGGAG GCGGTGGCGT GACCGATCCT ATTGGAGGCG GCGCCGCCCA GTTCGACCTC TGGGTGGACG TGCGGAAGGT GTAG
|
Protein sequence | MEAQHKDGRR CDAASEPQRS SLFGTPSRRA VVGAGCAVAA GALVAGGGLG AYMAADDPLT DEPHGRGNAT GACAADDVIL SMCNNCNSYC TIKVRVTDAA DGKQANDGAT ALVRKIAGNP YSPLNSQPYA PIPYATRPEE ALAPGDDMAV AGRASNGGMI CLKGQAGIQL VHDRFRITQP LRRVGERGSD EWETVSWDTA LDEIVNGSPA LGTPGIAEWY AYAPKKQVEA DVALVESGEM TKDAFAAKWA DKLIDVEHPD LGPKSNLFCS AGGDRMFLIG DRLTQLGFGS VNNFNHGGVC GMTGVMANVR THPTTNHKRM YADIDHCECL IIWGTEPMTA NKGPSWLAPR LSVARERGMK LYVVDPRQGR SASKADVWLP VIPGKDAELA FAMMSWIIAN ERYDAAYLSA PSKKAAAALG EPTWSDATHL VAVDLPNRPI VTAKALGRAG EVGANGEALA DDARFVLVDG ELTLADAAEG AADLLVDAEI DIKGKPARVK SVFQLLKERV EERTLEEYAA DAGIEPSVVE EVAREFTSHG KRACVMSYRG PAMHANGFDA VRAVGYLNFL IGNHDWKGGH IAAAAKFAPF EGRYDLKTVP DAHAGWGIPI TRQKTEYEKT SYFKQDGYPA PRPWYPLPGN LSHEIVPTLR AGYVYDHLGA LFIHRHSLVD STPGGRRLAD VLGDQDKINL LVSFDVEIGD TSRFADFVLP DKVYLERFSQ ESIYPNQQYQ LIQLGQPAVR AFDGPRSVED VYFDIMQRLG LPGVGEHAVP VGKDGDGGTA ALSTEYDYWL KMAANIAYAG EKPVPDADDD ELALFERARK RALGEAFDLE AWKAAVTEEE WPKVVYVLNR GGRFASADPA KGDGYDGDLI KTKYAGLCAF YDPKTASLKD ALTGENFDGL AHTAPIAFAD GTPMARPADR PFAFINWKAR TNGTHRTIAA SWLRETTTEN FVWMSPSDAA ERGLKNGDAV EVVGPEGTLS GHVRVTEGIR PGVVGANYSF GQQGYAARAV TIDGMLTGPA PDYLEEEGIL DGDEPGKQKT GFAGGRGRGF CMNELLPEET LAGGGGVTDP IGGGAAQFDL WVDVRKV
|
| |