Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0471 |
Symbol | |
ID | 8414755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 602045 |
End bp | 605098 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023442 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003180845 |
Protein GI | 257790239 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAACC TGACCATGTC ACGACGCACG TTCGTCAAGA CCGCCGCCAT CACCGGTGCG GCAGCCGCGG CGTTCGGCGC TTCGACGCAC ACCGCGCTGG CCGAAGAGAC GTACAGCAGC GTGTCCGGCA ACGACACCGT GGCCGTGAAA ACGTGCTGCC GCGGCTGCGG CAAGATGGAG TGCGGCGTCA AGGTCATCGT CCAGAACGGG CGGGCCATAC GCGTCGAGGG CGACGAGGGA GCGTTCCAGT CCATGGGCAA CTGCTGCACG AAGTCGCAGT CGTCCATCCA GGCGGCGTAC CACCCCGACC GTCTGCACTA TCCCATGAAG CGCACGAACC CGAAGGGCGA GGAGCCCGGC TGGCAGCGCA TCTCGTGGGA CGAGGCCATG CAGTCCATCG TGGACAACTT CATGGACATC AAGGCGAAGC ACGGCGGCGA GGCCATCGCC TGCCAGGTGG GCACGTCGCG CATCTGGTGC ATGCACTCGG AGTCCATCCT GAAGAACATG CTGGAAACGC CGAACAACGT TGAGGCCTGG CAGATCTGCA AGGGCCCGCG CCACTTCGCC ACCACCATGG TGTCGCAGTT CGCCATGTCG TGGATGGAAA CCATCACGCG CCCGAAAGTG TACGTGCAGT GGGGCGGCGC GTCCGAGCTG TCCAACTACG ACGACTCCTG CCGCACCACG GTGGACGTGG CATCGCGCGC CGACGTGCAC ATCAGCGTCG ACCCCCGCAT GGCCAACATG GGCAAGGAAG CCGACTACTG GCAGCACCTG CGCCCCGGCA CCGACGGCGC CCTGGCCCTG GCCTGGACGA ACGTCATCAT CGAGAAGAAG CTCTACGACG AGCTGTACGT GAAGAAGTGG ACGAACGCCC CGTTCCTCGT GTGCGAGGAC ATGGAGCCCT CCGGCTTCCC CACCGTGCGC ACCGACGGTT CCTACTGGGA CGTGAAAACC GCGCTGCTCA AGGAAAGCGA CATCAAGGAG GGCGGCAGCC CCTATAAGTT CCTCGTGTAC GACAACAACT GGGAGAAGCT GAAGGCCGAG GGCGTCGAGC ACGAGTACGG CGCGTTCACC TGGTTCAACG CCGACCAGGA AGGCGTCATC GACGAGACCG GCGGCTTCTG GGAGGGCGAG AACTACGACT CCGAGAAGGC GCGCCAAGGC CGCGAGGCCG CGCAGGACAA CCTGCTGCCG GGCCAGACGC AGGGCTGGCT GCCCGATCCC ATGCCGTTCG ACCCCGCCAT CGACCCCGCG CTGGAAGGCG AGTTCGAGAT CACGCTGAAG GACGGCAAGA CCGTGAAGGT CAAGCCGGTG TGGGAGCACT ACAAGGCCCG CGCCGCCGAG TACAAGCCCG AGGTGGCGGC CGAGATCACC GGCATCCCCG CCTCCGAGAT CGAGGCGGCG GCCACGGCCT ACGGCACGCG CATCGACCCG TCCACCGGCT ACGGCAACGG CGGCATCCAG TACATGCTGG CGGTCGAGCA CTTCTGCAGC GCCATCCAGA ACTGCAGCGC CTTCGACAAC CTCGTGGGCA TCACCGGCAA CATGGACACC CCGGGCGGCA ACCGCGGCCC GACCATCGTC CCCATCGACG GCGACCTCCA GGGCTTCAGC GCCTGGGCCC CCGGCGCCAC CACCCCGCCG GAGGAAGTCA ACCGCAAGCA GATCGGCATC GACAAGTTCC CGCTTCTGGG CTGGTGGCAG TACTGGTGCG ACAGCCATTC GCTGTGGGAC GCCGTCATCA CGGGCGACCC CTACCCGGTG CGCGCCCTCT GGAACGAGTC CGGCAACTTC ATGAGCCAGA CGAACACCAC GCGCGCCTGG GAGGCCCTGT GCTCGCTTGA CTTCTACGTG GACCTCAACC TGTGGCACAC GCCGCAGAAC GACACCGCCG ACATCATCCT GCCGGTGGCC CATTGGATCG AGCTCAACTC GCCGCGCGCC AGCCAAGGTT CCGCCGGCGC CATGGGCGCC ACGGTCAAGT GCGTGCAGCC GCCCGCGGAA GCCAAGTACG ATCCCGAGAT CGTCATGGAC CTCGCCCGCC GCATGAACTG GAAGTGGACC GACGAGCCGG GCAACGAGTG GCCCGACATC AACTGGCAGC TGGACGACTC CATCAAGCTG CTCACCGACG ACGAGCTGAC CTACACCACG TGGCACGTCG AGAACGGCAA GCCCACGTTC GAGCGCCACG GCGTTCCGAT GGCCGAGGTC ACGCCCAAGT ACAAGACGTG GGACGAGTAC GTCAAGGCCT TCCAGGAGCA CGGCTGGTGG CAGGCGAAGG ACATCGAGCC GCGCAACTGG GGCACGTACC GCCGCTACCA GACCGGCGCG ATGCGCGCAC GCGACCGCGT GTGGGGCCGC CTCGACTACA CGGCCGGCAA GGGCATCGGC GACTGGAAGC CGGGCTGGTT CACCCCGACG ATGAAGCAGG AGATCTGGTC CACCGTCATG GAATCGCACC ATCCCGACCA TCCCGAGTGG AGGCTTCCCA CCTACACCGA GCCGCCTCAC GGCCCGAAGG ACGGCGACCG CATCAAGGAG TACCCGCTGA CCGCCACCAC CGGCCGTCGC ATCCCGGTGT ACTTCCACTC CGAGCATCGT CAGCTGCCCT GGTGCCGCGA GCTGTGGCCC GTGCCGCGCG TGGAGATCAA CCCGAAGACG GCCGCCGAGT ACGGCATCGA GCAGGGCGAC TGGGTGTGGA TCGAAACCGA ATGGGGCAAG ATCCGCGAAG TGGCCGACCT GTACTACGGC GTGAAGGAAG ACGTCATCAA CCTCGAGCAC ACGTGGTGGT ACCCCGAGGT GAAGGACGCC GGCCACGGCT GGCAGTTCTC CCAGGTGAAC CAGCTGATCG ACCACTACGC CCAGGATCCG CACTCCGGCA CATCCAACCT GCGCGCCTAC CAGGTGAAGA TCTACAAGGC CACGCCCGAG AACTCGCCGT TCAACAACCC CGTGCCGTGC GACTCCACCG GCACGCCCAT CATCCATACC TCCGACGACC CCCGTCTGAA GGAATGGCTG CCTACCTACG AAGGGAGGGA GTAA
|
Protein sequence | MGNLTMSRRT FVKTAAITGA AAAAFGASTH TALAEETYSS VSGNDTVAVK TCCRGCGKME CGVKVIVQNG RAIRVEGDEG AFQSMGNCCT KSQSSIQAAY HPDRLHYPMK RTNPKGEEPG WQRISWDEAM QSIVDNFMDI KAKHGGEAIA CQVGTSRIWC MHSESILKNM LETPNNVEAW QICKGPRHFA TTMVSQFAMS WMETITRPKV YVQWGGASEL SNYDDSCRTT VDVASRADVH ISVDPRMANM GKEADYWQHL RPGTDGALAL AWTNVIIEKK LYDELYVKKW TNAPFLVCED MEPSGFPTVR TDGSYWDVKT ALLKESDIKE GGSPYKFLVY DNNWEKLKAE GVEHEYGAFT WFNADQEGVI DETGGFWEGE NYDSEKARQG REAAQDNLLP GQTQGWLPDP MPFDPAIDPA LEGEFEITLK DGKTVKVKPV WEHYKARAAE YKPEVAAEIT GIPASEIEAA ATAYGTRIDP STGYGNGGIQ YMLAVEHFCS AIQNCSAFDN LVGITGNMDT PGGNRGPTIV PIDGDLQGFS AWAPGATTPP EEVNRKQIGI DKFPLLGWWQ YWCDSHSLWD AVITGDPYPV RALWNESGNF MSQTNTTRAW EALCSLDFYV DLNLWHTPQN DTADIILPVA HWIELNSPRA SQGSAGAMGA TVKCVQPPAE AKYDPEIVMD LARRMNWKWT DEPGNEWPDI NWQLDDSIKL LTDDELTYTT WHVENGKPTF ERHGVPMAEV TPKYKTWDEY VKAFQEHGWW QAKDIEPRNW GTYRRYQTGA MRARDRVWGR LDYTAGKGIG DWKPGWFTPT MKQEIWSTVM ESHHPDHPEW RLPTYTEPPH GPKDGDRIKE YPLTATTGRR IPVYFHSEHR QLPWCRELWP VPRVEINPKT AAEYGIEQGD WVWIETEWGK IREVADLYYG VKEDVINLEH TWWYPEVKDA GHGWQFSQVN QLIDHYAQDP HSGTSNLRAY QVKIYKATPE NSPFNNPVPC DSTGTPIIHT SDDPRLKEWL PTYEGRE
|
| |