Gene Information |
Locus tag | Elen_3031 |
Symbol | |
ID | 8417365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3518954 |
End bp | 3522535 |
Gene Length | 3582 bp |
Protein Length | 1193 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645026010 |
Product | molybdopterin dinucleotide-binding region |
Protein accession | YP_003183363 |
Protein GI | 257792757 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAACC TATCACTGAG TCGTCGAAGC TTCTTGAAGG CCTCTGCCAT GGCTGCCGCG GCAACCACGG TGGGCTTCGC AGCAACGCCT TCGACGGCGT TGGCCGAAGG TGAAGACGCC TCAGCGGGCG AGATCAAGCG CATCCGCTCC TGCTGCCGCG CGTGCGGCAA GGTGGAGTGC GGCGTGTGGG TGACCGTTCA GGACAACAAG GTCATCAAAG TCGAGGGCGA CGAGTCCAAC GCTCACAGCC GCGGTCACTG CTGCGCGAAG TCGCAGTCGT CCATGCTGGC TCTGTACCAT CCCGATCGTC TGCGCTACTG CATGAAGCGC ACGAACCCCA AAGGCGAGGA CGACCCCGGT TGGGTGCGCA TCACGCTTGC CGAGGCGTTC GACGAGGCGG GTGCCAAGTT CAACGAGATC GTGGAGAAGT ACGGCGGGGA AGCCAACTTC GCCATGGGCG GCACCTCGCG TGTGTGGGCG CAGCCGCCGT ACGGCACGCT GAAGTCCATC TTCCCCACGC CGAACGCGCA CCTGGCCTAC GAGATCTGCA AGGGCCCGCG CCACTTCGGC GGCATCCTCA CCGACGAGAT CGGCTCGCCG TGGATGGAGG TGGAGCAGGG CCCTCTGGTG TACGTGCAGT GGGGCACCGC GTCGGAGTAC TCGAACTACG ACTCCACGAA CCGTACGGTG GTGGACTGCT CGCAGCGCGC CTACAAGCAC ATCCTGGTGG ATCCCCGCAT GACGCCGCTG GGCAAGGAAG CCGACGTGTG GCTGCCGCTG CGCGTGGGCA CCGACCTGTG CCTGTCGCTG GGTTGGCTCA AGTGGATCCT CGACAACGAG GCGTACGACG ACGCGTTCGT GCGCCGTTGG ACGAACGCCC CGTTCTTGTG GAACCCCGAG AAGGACGGCC GCACGGCGAA GGGTTGGTTC ATGGAAATGA ACGGCGGCAT CGACATGGAG AGCCGCATCC TGACCGAGGC CGACTGCGAC CCCGAGTGGA TCGGCCAGTA CTGGGACTAC GAGGGCCGCT ACCAGCGCTT CATCTGCTGG GACGAGAACA ACAACAAGCC TACGTACTGG GATGCCGAGG CCTGCCAGTG GGAGGGCGAG AAGCACAAGA TCCCGACGAC GGGCACCTGG ATCGAGCATC CGTACAAGCC CATCATCGCG GACGCCTGGC TGCCCGACCC CTCCAAGTTC GCCAATCCGG CCGACGTGCA GTACGACGCC TACTGGGACG AGGGCAACGA GGGCGGCAAG CGCTCGAACC CGGCCGGGCT GCCCAAGAAC CCGGCGCTGT TCCCCGGCGG CGTGGAGGTG AAGCTGAAGG ACGGCTCCAC CATCAGCGCC GACACGGTGT GGGAGGCGTT CTCCGACAGC CTGGAGCAGT ACACGCTGGA GTACGTGTCC GAGGTCACCG AGGTGCCGGT CGACAAGATC GAGGAGGGCG TGCGCATCTA CACGACGCGC CTGAACCCGC TGCACGGCAA CGGCGGCATC CACTACCAGC TTGCTCCCGA CCAGACGGGC CACGCGGTGC AGAACACCCG CGCGCTGCAG CTGATCGCCT GCATCACCGG CAACTCCGAC GAGCCCGCCG GCAACCGCGG TTCGTCGAAG GCCCAGGTGG ACGGCTGCTG CGGCCGCGCC AACATGCTGG TGACCGACCA TGAGCCGAAG GACTGGGGCT TGGACGTGGG CACCATGGAG CTGGGCAACA CCCCGCGCGA CCTGTCCGTG GAAGACCAGA TCCCGCTGAT CCAGAACTTC GTGCAGTACC TCATCGACGA GAAATCGCCC 
TTGGCCGAGC GCTACGGCAA CAAGGTTCCC ACCGCCGAAG AGGCGCGCTG GATCGCCGAG CGTAAGGGCG GCGCCTACAA GCCCAGCCGC TCGTGGCCGG CGCTCAAGAC GTCGTTCGAC AGCAACGAGA AGCAGATTTC CGCCGAGCGC TTCCCGCTGC TGCGCTACTG GAACCGATGG GCCGACTCCG CTGCCATCTG GGACTCCATC AACGGCATCG ACACGCCGTA CCAGATCCAC GGCGGCGTGT GCATGTCGGG CGACTTCATG AACGAGTCCA ACCTGCTGGA GGCTTGGGAG GCGCTCACCC GTCTGGACTT CTGGCTGGAC TTCAACCTGT GGTCCTGCCC GAACAACGGC TGCGCCGACA TCGTGATCCC CGTGCTGCAC TGGCTTGAAG TGAACACGGG CCGCGTGTCG CAGGGCGCGG GCGGCATCTT CGGCGCCGGG CAGCGCTGCG TGGAGCCGAT GGGCGATTGC ATCTACGACC CGGTTGCCGT CATCTGCCTG TACAAGGCCA TGGGCGTGGT GTGGAACAAC CGCGACCCCG AATACGACGA ATGGAACAAC CTGGACTACC GCGACTTCGT GCAGATGGGC GGTAGCGTCG GTTACGAGGA GCAGGAGTAC CGCGTGCTCA AGGACGCCAC CGACTGGTGG AAGACCGAGG AGTTCCCCGA CGGCCCCGAC TTCCCGCAGT ACGCGGCGAA GTTCCAGGAA GAAGGCTGGT TCGACTGCCG CAAGTGGCAC CCCGAGCGTT GGGGCACGTA CCGCCGTTGG GAGATGGGCT ACCGTCGCCA GCAGGGTGGC TACAACCTGT ACGCGGCCAT CGACGAGAAG TGCGGCTTCA TGACCCCCAC CGCCAAGGTG GAGGTGTGGT CCACCATCGC CGAAACGTAC ATCCCCGACG GCGCGGCCAC CTTCGCCAGC ACGAACACGG TCGATCCGAA CATCCCCGAC ATCGACAAGT TCCCGCACTG GGTGGAGCCG AAGAACTCGC GCGTATCGAA TCCCGAGTAC TTCGACGCCT CGCTGGTCGA CCAGATCAAG ACGTCGGACG CCTACATCAA CGACAACTAC CAGGGCGATC ATCTGGTGGA GGAGTACAAG GAGGCGCTGA CGGCGCATCC GGACAGCGCG TTCATCATGA CGACGGGCTC CCGCCAGCCG GTGTACTTCC ACTCCGAGCA TCGCCAGCTG CCGTGGTGCC GCGAGCTGTG GCCCAGCCCT CGTTTGGAGA TGAACCCCAA CGACGCGGCG CGCCTGGGGT TGGAGCAGGG GCAGTGGATT TGGATCCGCA GCCCGTGGGG CGCCATCCGC GAGGTCGTGG ACTTGTACTA CGGCATCAAG GAGGGCACGG TCAACGCGAA CCACGCCTGG TGGTATCCCG AGATCGACAC CGCTTCGCAC GGCTTCGAGC TGGTGAACAT CAACTGCACG ATGGACAAGT ACGCGCAGTG TTGGATTTGC GGCGCGTCCC AGCTGCGCGG CGTGCCGGTG CTCGTGTACC CCGCGACGCC CGAGAACTCG CCGCATGGCA ACCCGGTTCC GTGCGACCCG CAAGGCAATC CGGTCATCAC GAACGCAAAC GATCCGCGCC TCAAGGAATG GTTGGCGAAC GATCCGCGCC TGGAGGATTC CAAGGTGGAG CTCACGTTCG CGAACATGGC GGCGGTCGGC TGCCAGCCGA GCGTCCAAAG CCCCGACCTG CTCTCCGGCG GCAAGCTGGC CGTCGGCAGC GTGGGCGGCG ATGCGCTGGG CGCCTATTCG AAATCGATGT AA
|
Protein sequence | MANLSLSRRS FLKASAMAAA ATTVGFAATP STALAEGEDA SAGEIKRIRS CCRACGKVEC GVWVTVQDNK VIKVEGDESN AHSRGHCCAK SQSSMLALYH PDRLRYCMKR TNPKGEDDPG WVRITLAEAF DEAGAKFNEI VEKYGGEANF AMGGTSRVWA QPPYGTLKSI FPTPNAHLAY EICKGPRHFG GILTDEIGSP WMEVEQGPLV YVQWGTASEY SNYDSTNRTV VDCSQRAYKH ILVDPRMTPL GKEADVWLPL RVGTDLCLSL GWLKWILDNE AYDDAFVRRW TNAPFLWNPE KDGRTAKGWF MEMNGGIDME SRILTEADCD PEWIGQYWDY EGRYQRFICW DENNNKPTYW DAEACQWEGE KHKIPTTGTW IEHPYKPIIA DAWLPDPSKF ANPADVQYDA YWDEGNEGGK RSNPAGLPKN PALFPGGVEV KLKDGSTISA DTVWEAFSDS LEQYTLEYVS EVTEVPVDKI EEGVRIYTTR LNPLHGNGGI HYQLAPDQTG HAVQNTRALQ LIACITGNSD EPAGNRGSSK AQVDGCCGRA NMLVTDHEPK DWGLDVGTME LGNTPRDLSV EDQIPLIQNF VQYLIDEKSP LAERYGNKVP TAEEARWIAE RKGGAYKPSR SWPALKTSFD SNEKQISAER FPLLRYWNRW ADSAAIWDSI NGIDTPYQIH GGVCMSGDFM NESNLLEAWE ALTRLDFWLD FNLWSCPNNG CADIVIPVLH WLEVNTGRVS QGAGGIFGAG QRCVEPMGDC IYDPVAVICL YKAMGVVWNN RDPEYDEWNN LDYRDFVQMG GSVGYEEQEY RVLKDATDWW KTEEFPDGPD FPQYAAKFQE EGWFDCRKWH PERWGTYRRW EMGYRRQQGG YNLYAAIDEK CGFMTPTAKV EVWSTIAETY IPDGAATFAS TNTVDPNIPD IDKFPHWVEP KNSRVSNPEY FDASLVDQIK TSDAYINDNY QGDHLVEEYK EALTAHPDSA FIMTTGSRQP VYFHSEHRQL PWCRELWPSP RLEMNPNDAA RLGLEQGQWI WIRSPWGAIR EVVDLYYGIK EGTVNANHAW WYPEIDTASH GFELVNINCT MDKYAQCWIC GASQLRGVPV LVYPATPENS PHGNPVPCDP QGNPVITNAN DPRLKEWLAN DPRLEDSKVE LTFANMAAVG CQPSVQSPDL LSGGKLAVGS VGGDALGAYS KSM
|
| |
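The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are hypothetical, the values are copied from the fields above, and it assumes that Start bp and End bp are inclusive, 1-based positions and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 3518954
end_bp = 3522535

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a
# stop codon, which does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (3582 bp) and Protein Length (1193 aa) fields listed in the record.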