Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0507 |
Symbol | |
ID | 8414791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 651274 |
End bp | 654132 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645023478 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003180881 |
Protein GI | 257790275 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.485596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.607063 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAACA CCACGCAAAG GCGGTCGTTC GGCACCATGA GCCGTCGCAG CTTCATGAGG CTCGCAGGGG TGACGAGCGC TGCGCTTGCG CTGACGTCGG CAACGGCGCC GGCGGCGCTT GCCGAAGAGC ACGACAGCGG AACGGTTGCG TCGTCGGACG GCGTGCAGCG TATTCGCACG ATGTGCCGCG GGTGCGGCAA GATGGAATGC GGCGTGTGGG TGACGGTGGA GAACGGCCGC GCCATCAAGA TCGAGGGGGA CGAGAGCTCG TTCGCGTCGT CGGGGAACAG CTGCAGCAAG TCGCAGGCTT CGCTGCAGGC GTGCTACCAT CCCGATCGGC TCGCGTATCC CATGAAACGC ACGAATCCCA AGGGCGACGA CGATCCGGGT TGGGTGCGCA TCAGCTGGGA CGAGGCTCTC GCCGAAGCCG GCACGAAGCT CAACGAGATC AAGGAGCAGC GCGGCGGGAA CTCCATGTTC TCGATGTGCG GCACCAGCCG CATCTACTGC ATGGCGAGCG CGCTCGGCAT GCAGGGCATT CTGAACACGG CGAACACCCA TCAGGCGTAC CAGATTTGCA AGGGTCCCCG CCATGTGGCC ACCGGTATGG TGTCGGCTCG TGCGTACAGC TGGATGGCCA CGGTCGACCG GCCGAGCGTG TTCGTGCAAT GGGGCGGCGC TTCGGAGCTG TCCAACTACG ATGACTCGTG TCGCACGACG GTCGATGCTG CGGTCAAGGC GGACAAGCAC ATCATCGTCG ATCCGCGTCA GACGAACCTC GGCAAAGAAG CCGACATATG GAATCCGCTG CGTCCCGGCA CCGACGGCGC GGTGGGGCTC GGCTGGCTCA ACGTGATCAT GGAGAACAAC CTGTACGACG AGCTTTGGGT GAAGCGGTGG ACGAACGGCC CGTTCCTCGT GTGCGAGGAT ATCGAGCCTT CCGGTTGGCA GCAGATGGGT GCCGGCGGTC CAGAAGAAAT CAAAACGCGC CTGCTCAAGG AATCCGATGT GCAGGAAGAC GGCAGTCCGA AACGGTTCAT GGTCTACGAT CAGTTGAACC AGCGGCTTAC GTATTTCGAT GCGGATACGG GATACTGGGA GGGCGAGCAG CCGCGTACGC TGACGGGGAA GGAGGCGCGG CAAAAGCACC TCGCACCCGG CGTGACGCAA GGTTGGGTGC CCGATCCTAC CGGGTTCGAT CCGGAGATCG ATCCTCAGAT ATTCGGGCAG GTGGAAGTCA CCCTGAAAGA CGCTTCGACC TCGGTCTGCA AAACGGTGTG GCAAACGTTC TCGGACTACG TAGCCGATTT CACGCCCGAG AAAGTGGAGG AGATCACCAG CGTTTCGGCT GATGCTTTGC GCGAGGCGGC CATCACCTAT GCGACGCCTA TCGATCCATC GACCGGATAC GGCAACGGCG GCATCCAGTA CATGCTGGCC ATCGAGCATG CGTGCAACTC GGTCCAGAAC AGCCGTATCT GCGATCTCAT CGTGGGGATC ACGGGGAACT TCGACACCCC CGGCGGCAAT CGCGGCGCAA CGGCCGCGAC TTTCGACGAA GAGTTCGCCA TGATGGGCAG CGGTCTGCCC ATGGCGTCGG CTGACTTGTG GGACAAGGTG CTGGGCGTGG AGGACATTCC TTTGCTCAAG CATCACGGCA TCTGGGCCGA TTCGACGGCC ATTTGGGATG CGTGCAACAA CGAAGGGGCG CCGTACCCGT TGTACGGCGG CGTCTGCCAG TCGGGCGATG TGATGAACAT GTCCAACGCT CTTTGGGGTT GGGAAGGCTT GAAGAAGCTG GACTTCCTGC TGGACATCGA CCTGTGGCAT ACGCCCACGT CGCAGCTGGC GGATATCCTG CTGCCTGCGC GCCATTGGCT CGAGGTGGAT TGCCCTCGCC GCTCCCAAGG GTCTGGCGGC ATGGAGGGAA GCCATTGCAA GTGCGTGGAG CCGCTCGGCG AGAGCTGGTT CGACGTGGAC ATCATCATCC AGCTGTGCAA GGCCATGGGC ATTCCCTGGA GCGCCGACCC CGACGATCCG TGGCCGGACT CCATCAAGGA GCTTGATGCG GCATGCGAGC CGATGGGTCT TACCTGGGAG GAGTGGAAGC AGGAGTTTCA GAAGACCGGT TTCCGCGACT GCAAGAAGGA ATACCCCGAC GACTGGGGCA CTTACCGACG CTACGAAACG GGCCACTGCC GCTCCGATGG CAAGCCGGGC TTGCAGACGC CCACGCTCAA GCAGGAGATA TGGTCGACCA TCATCGAGAC GTACCATCCT GACGGCCGGT ACAACCTTCC CACGTATTCC GAGCCCCCGG AGAGTCCCGT TGCGCAGCCT GAGCGGGCTC AGGAGTACCC CTACATCATG ACGACGGGCC GTCGCATTCC CGTGTACTTC CACTCCGAGC ACCGGCAGCT GCCGTGGTGC CGCGAGCTGT GGCCGGTGCC GCGCGTAGAG ATCAACCCGA AGGATGCGCT TGAGCTCGGC ATAGAGCAGG GCGATTGGGT GTGGATCGAG ACAGAGCGCG GCAAGGTGCG ACAAGTGGCG GATCTCTATC ACGGCATCCG CCCGGGGACC ATCAACTGCG AGCATCAGTG GTGGCTGCCC GAGTTCCACG GCGCGACGAA GGGTTTCGAC CTCATCAGCA TCAACTGCCT GGTGAACAAG GACATGCGCG ATCCTCTGTG CGGATCTTCG TACGCGCGCG CTTACAACGT GAAGGTGTAC AAAGCCACGC CCGAGAACTC GCCGTTCGGC AATCCCGTGC CGTGCGACGT CGACGGGACC GAGATGATCA CGTCGCCCGA TGACCCGCGT TTGAAGGAAT GGCTGCCGAA CTACGAGGGG AGGGACTAG
|
Protein sequence | MANTTQRRSF GTMSRRSFMR LAGVTSAALA LTSATAPAAL AEEHDSGTVA SSDGVQRIRT MCRGCGKMEC GVWVTVENGR AIKIEGDESS FASSGNSCSK SQASLQACYH PDRLAYPMKR TNPKGDDDPG WVRISWDEAL AEAGTKLNEI KEQRGGNSMF SMCGTSRIYC MASALGMQGI LNTANTHQAY QICKGPRHVA TGMVSARAYS WMATVDRPSV FVQWGGASEL SNYDDSCRTT VDAAVKADKH IIVDPRQTNL GKEADIWNPL RPGTDGAVGL GWLNVIMENN LYDELWVKRW TNGPFLVCED IEPSGWQQMG AGGPEEIKTR LLKESDVQED GSPKRFMVYD QLNQRLTYFD ADTGYWEGEQ PRTLTGKEAR QKHLAPGVTQ GWVPDPTGFD PEIDPQIFGQ VEVTLKDAST SVCKTVWQTF SDYVADFTPE KVEEITSVSA DALREAAITY ATPIDPSTGY GNGGIQYMLA IEHACNSVQN SRICDLIVGI TGNFDTPGGN RGATAATFDE EFAMMGSGLP MASADLWDKV LGVEDIPLLK HHGIWADSTA IWDACNNEGA PYPLYGGVCQ SGDVMNMSNA LWGWEGLKKL DFLLDIDLWH TPTSQLADIL LPARHWLEVD CPRRSQGSGG MEGSHCKCVE PLGESWFDVD IIIQLCKAMG IPWSADPDDP WPDSIKELDA ACEPMGLTWE EWKQEFQKTG FRDCKKEYPD DWGTYRRYET GHCRSDGKPG LQTPTLKQEI WSTIIETYHP DGRYNLPTYS EPPESPVAQP ERAQEYPYIM TTGRRIPVYF HSEHRQLPWC RELWPVPRVE INPKDALELG IEQGDWVWIE TERGKVRQVA DLYHGIRPGT INCEHQWWLP EFHGATKGFD LISINCLVNK DMRDPLCGSS YARAYNVKVY KATPENSPFG NPVPCDVDGT EMITSPDDPR LKEWLPNYEG RD
|
| |