Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2753 |
Symbol | |
ID | 8417079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3192632 |
End bp | 3194881 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645025728 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003183089 |
Protein GI | 257792483 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.28964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGT ATATCAATGT TTCGCGCCGC ACGTTCCTCA AGGGGACGGC GGCCGTTGCG GGTGCCGCGC TGGCCGGTGG GGCGTACGAG ATCGTGCATC CCGAAGGGGC CGTTGCGGAA GAGGCGCCCA TCGAGGTGAA GAACACCTAT TGCGACATGT GCAACCACGT GCCGAAGTGC GGCATCGCCG CGTCGGTGAA GGACGGCAAG GTGGTGCGCG TGGAGGCGCG CGACAAGTAC CCGGCCGACC CCATTTGCGC GAAGGGCATT TCCAGCTTGC AGGAGTTGTA CGATCCGCAT CGCATCACGT ATCCCATGGT GCGCACGAAC CCGAAGGGCA CGGGTGCGCC CGAATGGGAG CCCATCTCCT GGGACGACGC GTACGCGCGC ATCGCCAGCG AGCTCAACCG CATCAAAGAA GAGAGCGGTC CCGAAGCGGT GCTGTTCTAC TGCGGCGACC CGAAGGAGCC GCGCGGCGCT ATGCAGCGCC TGGCCACGCT GTTCGGCTCG CCCACGTACG GCACAGAAAG CTCCACCTGC GCGGCGGCCA CGTGGATCTG CTCGCAGCTG GTGACCGGCC AGCTGACCAT GGGTTCCGAT CCTACCGACG CCACGGCCAG CTGCCTGGTG TGGTCGCTCA ACCCGGCATG GTCGCAGCCC TACCGTTTCG GCGACATGAT GAAGCAGAAG GAGCGCGGCT GCAAGTTCGT CGTCGTCGAC CCGCGCATCA CGCCCACGGT GACCGGCTTG GCCGACGTGC ATCTGCAGCT GCGCCCCGGC ACCGACGGCG CGCTTGCGCT CGGGTTCATC CACGTCATGC TGCGCGACGG TCTGTACGAC AAGGACTTCG TGGAGAAGTG GACGCACGGC TTCGACGAGC TGTCCGGTTA CGTGCAGGAG TTCACGCCCG AGCGCGTGGA GGAGATCACC TGGGTTCCCG CTGCGAAGCT CGAGGAGGCC GTGCGCATCA TCTGCGAGAA CGCGCCCGCG ACGCTGGTGT CCAGCTCGGC CGGCGCGTGC CACGCCACCA ACGTGGGCAA CTTCCAGCGC GCGGTGTACT CGATCATCGC GCTGACGGGC GATTTGGACG TGGCGGGCGG CCTGGCCATG GGGCCGGGGC TGCCGTTCGA TTACTCGGCG TCCACTGCGG CGTTTCGACT CGAGGACATG TACGCCGAGA AGGGCCTGCA GGATATCCGC TACGACAAGG ATGACTTCCC GGTGTGGGCG CACTATTTCA AGATGATCCA GACCGCGCAC CTGCCCGAGC TGGTGGCCGA CGGCAAGATC CGCGCCGGCG TGCTGCTGGG CGTGAACTCC ATGATGTGGC CGCAGACGCC TGAGTACCAG AGGGCCATTA AGGACATGGA GTTCACCGTG GCCATCGACT ACTATATGCG TCCTTGGACG CACGACCTGG TGGACATGCT GCTGCCGGCC GCCATGTGCT ACGAGCGCAT GGCCCCGCCA GCCATCTTCG GCCGCAAGAT CTTCCATCGC GACCCGGTGG TGAAGCCGAT GGGCCAGTGC CGCGAGGATT GGCAGATCAT CCTGGAGATC GGCTGCGCGC TGGGCTTCGA GGAGGAGTGC TACGGCGGCA GCGTGGAGGC GGCGCTCGAC GATATGTACC GGGGTGCGGG CATCGACATC TCGCTGGAGC TGCTGCGGGA GCATCCCGAG GGTTACGAGG TGCCGGGCGG CTCGAAGGAC GAGAAGAAGT ACGAGACCGG CGGCCTGCGC AAGGACGGCC AGCCCGGCTT CAACACGCCG ACGGGCAAGA TCGAGCTGGT GAGCGAGATT CTGAAGCAGT ACGGCTTCGA GGGGCTGCCG GTGTACGAGG AGCCGGTGCA TAGCCCGGTG AGCACGCCCG ACGAGGCGAA GGACTACTCG CTGGTGCTGA ACAGCGGCTC GCGCGTGCCG TACTACACCC ACTCGAAGCT GCGCGATCTG CCGTGGCTCA ACCAGTTCAT GCCCGACCCG GTGGTGCGCC TGCACCCCGA CGACGCCGAG GCGCGCGGCA TCACGGACGG CGCGCAGGTG CGCGTGTTCA ACCAGTTCGG TGAAGTGACG ATGAAAGCCG AGATCACGAA CCTCGTGCTG CCCGGTGTCG TGGACGTCTT CCACGGTTGG CACCAGGCCG ATATCAACTT GCTGACCACG CGCGACTTCG ACCCCATCAC CGGATTCCCT CCCTTCCGAT CCGGCTTGTG CGAGGTCGAG CGTACGGGCA AGGGCAAGAT CACCGTATAG
|
Protein sequence | MSEYINVSRR TFLKGTAAVA GAALAGGAYE IVHPEGAVAE EAPIEVKNTY CDMCNHVPKC GIAASVKDGK VVRVEARDKY PADPICAKGI SSLQELYDPH RITYPMVRTN PKGTGAPEWE PISWDDAYAR IASELNRIKE ESGPEAVLFY CGDPKEPRGA MQRLATLFGS PTYGTESSTC AAATWICSQL VTGQLTMGSD PTDATASCLV WSLNPAWSQP YRFGDMMKQK ERGCKFVVVD PRITPTVTGL ADVHLQLRPG TDGALALGFI HVMLRDGLYD KDFVEKWTHG FDELSGYVQE FTPERVEEIT WVPAAKLEEA VRIICENAPA TLVSSSAGAC HATNVGNFQR AVYSIIALTG DLDVAGGLAM GPGLPFDYSA STAAFRLEDM YAEKGLQDIR YDKDDFPVWA HYFKMIQTAH LPELVADGKI RAGVLLGVNS MMWPQTPEYQ RAIKDMEFTV AIDYYMRPWT HDLVDMLLPA AMCYERMAPP AIFGRKIFHR DPVVKPMGQC REDWQIILEI GCALGFEEEC YGGSVEAALD DMYRGAGIDI SLELLREHPE GYEVPGGSKD EKKYETGGLR KDGQPGFNTP TGKIELVSEI LKQYGFEGLP VYEEPVHSPV STPDEAKDYS LVLNSGSRVP YYTHSKLRDL PWLNQFMPDP VVRLHPDDAE ARGITDGAQV RVFNQFGEVT MKAEITNLVL PGVVDVFHGW HQADINLLTT RDFDPITGFP PFRSGLCEVE RTGKGKITV
|
| |