Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0515 |
Symbol | |
ID | 8414799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 663588 |
End bp | 666482 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645023486 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003180889 |
Protein GI | 257790283 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.470196 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGAAC TGCTCATGAC CCGACGTGCG TTTGCGAAGG TGATGGCCGT GACGGCTGCG GCCGCCGGTT TCACGGGGGC TCAGTCGGCG TTGGCGGATA CCGAGCCGGC TGCCTCGTCG GGCGAAGTCA AGCGCATACG CTCGGCCTGT CGCGGATGCG GCAAGATGGA GTGCGGCGTT TGGGTGACCG TTCAGGATGG GCGGGTTGTC AAAACGGAGG GCGATGAAAG CGCTTTCCAG TCGGCGGGAA ACCATTGCGC GAAAGGGCAG GCGTCCTTGC AGGCGGCGTA TCATCCCGAC CGTCTCATGT ACCCGCTCAA GCGCACGAAT CCCAAAGGGC AAGAGCCCGG TTGGGTGCGC ATCAGCTGGG ATGAGGCGTA TCGATCCACC GTCGAGGCAA TCCATAAGAA CCAGGAAAAG TACGGCAACG AGACGTGCTT CTTCATGGGC GGCACGTCGC GTATCTGGGC CATGGGCCCT TATGGCGCGC TGAAGCAGTG CTTCGGGTCG CCGAACGGCA TACAGGCCAA CGAGATATGC AAAGGCCCGC GCTTCTACGC TACGAAGCTG AACGATTCGA ACGCCTACAG CTGGATGGAA GTGGTGGGGC GTCCGCGCGT GTACGTGCAA TGGGGCGGCG CGTCGGAGCT GTCCAATTAC GACGATAGCT GTCGCACCAC GGTCGACGTG GCGACGCGCG CCGACAAGCA CATCCTGGTC GATCCGCGCC AGACGAACTT GGGGAAAGAG GCGGATATCT GGGTGAACCT TCGTCCCGGA ACCGATGGGG CAGTGGCGAA CTGCTGGGCC AACGTGATCA TCGAGAACGA GCTGTACGAC GATTTGTACG TACGGAAGTG GATGAACGCT CCCATGTTGG TGGTGGAAGA TGAGAGCTTC GAGCCAACCC CTTCCTCGTC GAGCGCGCAG ACGGCCAAGG TGCGTACGCG CCTTCTCAAA GAGTCCGACC TCGTCGAGGG CGGGGCGGAT ACCCGCTTCA TGGTGCTCAA CGAGATCACA AACGAGCTGA GCTGGTACGA TGCCGGCGGC GACAACCCCG GCTGGGAGGG CGAGGACTGG GTTCCCGCAA CCGAGGGCAA AGAAGCTCAT CAGCCGGGAT TGGATCTGAC GGGTCAGACG CAGGGCTTCG TGCTTGACTA CGTACCGTTT CCCGACGGGC TGCTTCCCGC GCTGCATACC CCCGAAGGCG GCTTCGAGGT GGAGTTGAAA GACGGCTCCA GCGTGCACGT GCGCACGGTG TGGGAGCGCT ATATCGAGTT CTTGGAAGAC TACACTCCCG AAAAGGTGTC GGAGATATCG GGCGTTGACG TCGAAGTGCT AAAAGAGGCG GCCATCACGT ACGCAACGCG CGTCGACCCT TCGACCGGGT ACGGCAATGG CGGAATCCAG TACATGCTGG CGTTGGAACA CGCGTGCAAT TCGACGCAGA ACAACCGTGC CTGCGACCTG CTGGCGGGCA TAACGGGCAA CATGGACACT CCGGGCGGCA TGCGCGGCTC GACTCCCGGT TGGCCCGTCT ACGACCTCGG CATGTGCGTG CCCGACTCCG GCAAAACCAC CGAGTACACC ATCGAGAAGA TTTTGGGCAA AGAGCGTTTT CCGATGATCG GCTCGGACTG CAACCCCAGC TGGGCGGATG CGACGTCGGT GTACGACGCT ATCGAGAGCG GCGAGCCTTA TAACGTGACG TGCGGCATCG GGCAAACGGG CGACTTCATG AACCAGTCGA ACTCCCTCTT TGCCGCGGAG CAGCTTCAGA AGCTCGACTT CTGGTGCTCG ATCGATCTGT GGCACACCCC GTGCGTGGAT ATGATGGCCG ATATCGCAAT GCCCGCCGCT CATTGGCTGG AGCTCGATTG CATTCGCAAA AGCCAGGGCT CGTCGGGTGC GTTCGGCGCT ACGGTGAAAG CGGTGGAGCC TCCCGGAGAA GCGAAGAACG ATCTGGAGAT CGTGGTTGGC CTGTATAAGG CCGCCGGGGT CCCGTACTTC GACGAGGAGT ATCACGGTGC GGCGTGGCTC GAGGGGGATG AAGCCGTGGA CGCGTGCAAC AACGTTGCGC TCAAAAGCTT CCGCATTCCC GATTGGAATG ACTACAAGAA GGAATTCCAA GAGAAGGGCT GGTTCGATTC CAAGGTGGAG AAGCCCGACG ATTGGGGCGT CTACCGGCGC TATCAAACCG GAAACGGCCA CATCAACGGA GGGTTCCCGC CCAATCCGAA CCAGCATCAA GGCTGGAACA CGACCACGCA CAAGCAGGAG ATCTGGTCGA CGGTGCTGGA ATCATGGTTG CCCGGCGAGG GCGAGGAGTT CCCGAAATTC GTGGAGGCTC CGCATGGCCC CGTCGCCGAC CCCGATTTGT TCACGGACGA CAACTCGTTC TTGATGACCA CCGGGCGTCG TCAAGGAACC TACTTCCATT CCGAGCACCG GCAGCTGCCA TGGTGCCGCG AGCTGTGGCC CGTTCCTCGG CTTGAGATGA ATCCCGTTGA CGCCGAGCGA CTTGGTCTCG AGCAGGGGGA TTGGGTATGG ATCGAAACCG ATCAGCACAA GATTCGCGAA GTGGTCGATC TGTACTACGG CATCGCCCCG GGTGTGGTGA ACGCCGAGCA CCAATGGTGG TACCCCGAGC TCAATCAGCC CGATCACGGG TTCAAATTGT CCGGCGTGAA CTGTTTGATC GATCGCCATG CCCAAGATCG CATCATCGGG TCGTCGAACC TGCGTGCTTA CGGTGTGAAG GTGTACAAGG CCACGCCCGA GAATTCGCCG TTCGGCAACC CCGTGCCGTG CGGAGACGAC GGGACGCCCA TCATCCACAC CTGCGACGAT CCGCGCCTGA AGGAATGGCA ACCGTTGTAC GAGGGGAGGG AGTGA
|
Protein sequence | MGELLMTRRA FAKVMAVTAA AAGFTGAQSA LADTEPAASS GEVKRIRSAC RGCGKMECGV WVTVQDGRVV KTEGDESAFQ SAGNHCAKGQ ASLQAAYHPD RLMYPLKRTN PKGQEPGWVR ISWDEAYRST VEAIHKNQEK YGNETCFFMG GTSRIWAMGP YGALKQCFGS PNGIQANEIC KGPRFYATKL NDSNAYSWME VVGRPRVYVQ WGGASELSNY DDSCRTTVDV ATRADKHILV DPRQTNLGKE ADIWVNLRPG TDGAVANCWA NVIIENELYD DLYVRKWMNA PMLVVEDESF EPTPSSSSAQ TAKVRTRLLK ESDLVEGGAD TRFMVLNEIT NELSWYDAGG DNPGWEGEDW VPATEGKEAH QPGLDLTGQT QGFVLDYVPF PDGLLPALHT PEGGFEVELK DGSSVHVRTV WERYIEFLED YTPEKVSEIS GVDVEVLKEA AITYATRVDP STGYGNGGIQ YMLALEHACN STQNNRACDL LAGITGNMDT PGGMRGSTPG WPVYDLGMCV PDSGKTTEYT IEKILGKERF PMIGSDCNPS WADATSVYDA IESGEPYNVT CGIGQTGDFM NQSNSLFAAE QLQKLDFWCS IDLWHTPCVD MMADIAMPAA HWLELDCIRK SQGSSGAFGA TVKAVEPPGE AKNDLEIVVG LYKAAGVPYF DEEYHGAAWL EGDEAVDACN NVALKSFRIP DWNDYKKEFQ EKGWFDSKVE KPDDWGVYRR YQTGNGHING GFPPNPNQHQ GWNTTTHKQE IWSTVLESWL PGEGEEFPKF VEAPHGPVAD PDLFTDDNSF LMTTGRRQGT YFHSEHRQLP WCRELWPVPR LEMNPVDAER LGLEQGDWVW IETDQHKIRE VVDLYYGIAP GVVNAEHQWW YPELNQPDHG FKLSGVNCLI DRHAQDRIIG SSNLRAYGVK VYKATPENSP FGNPVPCGDD GTPIIHTCDD PRLKEWQPLY EGRE
|
| |