Gene Information |
Locus tag | Elen_0497 |
Symbol | |
ID | 8414781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 635502 |
End bp | 638810 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023468 |
Product | molybdopterin dinucleotide-binding region |
Protein accession | YP_003180871 |
Protein GI | 257790265 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
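
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon, neither of which the record states explicitly. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 635502, 638810

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 3309, matching "Gene Length"
print(protein_length(length_bp))  # 1102, matching "Protein Length"
```

Under these assumptions the three fields agree: 638810 - 635502 + 1 = 3309 bp, and 3309 / 3 - 1 = 1102 aa.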
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.455569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAGC TCACGCTCAC GCGCCGCAGC TTCATGAAGG CGATGGCGGT CACCGGTGCC GCTGCGTCGC TCACGGCGGT CGCCGAGCCG ATGCGGGCGC TTGCCGAGGG CGTCGATGCC GATGCGGGGG AAGTCAAGCG CGTTCGCTCG TGCTGCCGCG CTTGCGGCAA GGTGGAGTGC GGCGTTTGGG TAACCGTGCG CGACGGCAAG GTCGTCAAGG TGGAAGGCGA CGAGTCCGCG CCGCAAAGCC GCGGGCACTG CTGCTCGAAG TCGCAGTCAT CGATGCAGGC GCTATATCAT CCCGACCGCC TTCGTTTCCC GGTGAAGCGC ACGAACCCTA AGGGCGAGGA CGATCCCGGT TGGGTGCGCA TCACGCTGGA CGAGGCGTTC GAGATATGCG GCGAGAAGCT CAAGGAAGTG AAGGAGAAGT ACGGCGGGGA GTCCATCTTC GTGATGTGCG GCACGTCGCG CGTGTGGTCG CTCGGGCCTT ACCAGGGCAT GAAGCAGCTG TTCGGCACGC CGAACGCGCA TTTGGCCTAC CAGGTGTGCA AAGGCCCGCG CCACTTCGGC GGCATCATGA CCGACGAGAT GGGGTCCCCG TGGATGGAGG TGGAGGCCGA GCCCAGCGTG TACGTGCAGT GGGGCACGGC GTGCGAGTAC TCCAACTACG ACTCCACGAA CCGCACGGTC ACCGACGTCG CGCATCGTGC GTCTAAGCAT ATCGTGGTGG ATCCGCGCGT CACGCCTTTG GGCAAGGAAG CCGACATCTG GCTGCCGCTT CGTCCCGGCA CCGACGGCGC GCTGGCGCTG TCGTGGCTCA ACTGGATCAT CGAAAACGAA GCTTACGATG ACACGATGGT GCGTCGCTGG TCGAACGCGT CGTTCCTCTA CGTGGACGAC AAGCCCGAGC TGACGCAAGG CTGGTTGGTG GAGGGCAACG GCGGCATCAA CATGAAGACG AAGCTGCTGA CCGAGGCCGA CCTCAAGGAA GATGGCAAGT ACCAGCGCTT CATGGTGTGG GACGAGGCGA ACGAGCGTCT CACGTACTGG GATGCCGAGC TTGGCATGTG GGAGGGCGAG GAGCACCGCA TCCCCACCAC CGGCACATGG ATCGAGCATC CGTACAAGCC GCTTGTTGCC GATGCATGGC TGCCCGATCA GTCGACGTTC GCCGATCCGG CTACCGAGCC CGACCGCTTC CCCGACGGTT TCGAGGAATG CAATCCCAAG GGGTTGCCGA AGCGCCCATC TCTGCTTCCT GGTGAGGTAG AGGTAACGCT CAAAGACGGA ACCGTGCACA AGGCTCGCTC TGTGTGGGAT GCGTTCCATC AGATGACCAG CGAGTACACC ATGGAGAAGG CAGAGGAGAT CACCGGCGTG CCGGCGGCGA AGAGCGAGGA GGCTGTACGC GCCTGGACGA CGCGCGTCAA TCCGCTGCAT GGCAACGGCG GCATCCACTT CCAGCTGGCC ACCGACCAGA ACGGCAACTC CATCCAGAAC GTGCGTGCGC TGCAGATCCT GTCGTGCATC ACCGGCAACT CCGACGAGCC GGCCGGCAAC CGCGGCTCGT CGAAGGCGCA GTTCGACGGC AATCCAGGGC GTTCGAACAT GCAGGCCGGC GCGCCGTACG GCGACGAGGC CAAGATGTGG GACGGCCGCG ACGTCGACCT CGAAGGGCTC GCCGCGAAGA TGCAGGATTT CGTGCAGTAC CTCATCGACA ACGATTCGCC CCTGGCCGAG CGCTACGGCA ACAAGGTGCC CAGCCATGAG GAAGCCCTCG TTATCGCCAA GCGCATGGGC 
GGCGCATTCC GCACGTCGCA GGCGTGGCCG AACCCCAAGA CCGTGTTCGA GCGACAAGCC AACGAGGTTG ACGCCGAGCG ATTCCCCCTC AACCGCTACT GGGCGCGCTG GGCGGACGCG AACTCCATCT GGGACGGCTG CCTGGACAAC GACGTGCCGT ACCAGTTGCA TGCGGGCGTG TGCCAGTCGG GCGACTTCAT GAATATGGGC AACATCGACG TGGCGTGGGA GGCCATGACG AAGCTGGACT TCTTCACCGA CATCAACCTG TGGTTCTGCC CGAACAACGG CAACGCCGAC GTCATCTTTC CGTGCTACCA CTGGTTGGAA GTTGACACGA CGCGCGTCAG CCAGGGCGCC GGCGGGTTCT TCGGCTGCGG GTGCGCCGCT GTCGAGGCGC CGGGCGAGTG CATCTACGAT CCCGACTGGA ACGTGGGCAT GTACAAGGCC ATGGGCGTGC CATGGAACAC GAAGGACGAG TCGGCCGAGC CGTCCGAGAT CCTCAACTTC GGCGAGAAGA CCGACTACCG TTGGCCGAAC CGCAGCCGCG TGCTCAAGGA CAACGTGGAC GCGTGGAAGA CAGCCGAGTT CCCCGACGGT CCCACGTGGG AGCAAGCGAA GGAGCACTTC CAGAAGAACG GCTGGATGGA TTGCCGCACG TGGCATCCCG AGCGTTGGGG CACGTATCGC CGGCACGAGA TGGGCTGGCG TCGCCAGCAG GGCGGCTTCA ACCTGTTCCC GTTGGTGGAC GACCATCCGG GCTTCATGAC GCCGTCGGGC CTCATTGAAA TCTGGTCGCT TGTGTGCGAG GCGTACTTAG GCGACGAGGA CAAGTTCCCC GTGTACCGCG AACCCACGAA CTCGCCGGTG ACCACGCCCG AGTACTTCGA CGCCGCGAAG GTTGCCGACA TCGATGAGTC GAAGCTGCGC AATATGGATT ATCCCGCCAG CCTGCAGGAG CACGCCGATG CGACGTTCCT CATGACCACG GGCGCGCGCC AGCCGGTGTA CTTCCACAGC GAGCATCGCC AGCTGCCGTG GTGCCGCGAG CTGTGGCCCG CGCCGCGCCT GGAGATGAAC CCGAACGACG CTGCGCGTCT GGGGCTCGAG CAGGGCGACT GGGTGTGGAT CGAGAGCCCG TGGGGCAAGG TGCGCGAGGT GCTGGATCTG TACTACGGCA TCAGCAAGGG CGTGGTGAAC GCGAACCACG CGTGGTGGTT CCCCGAGATG GACACGGCCA GCCACGGCTA CGAGCTGGTG AACATCAACT GCGTGATGGA TCCTTACGGC CAGGATGTGG TGTGCGGTGC AGCAACGATG CGCTCGGTTC CCGTGCTGGT TTATAAGGCT ACGGCGGAAA ACTCGCCGTT CGGCAACCCG GTGCCGTGCG ATCCGCAGGG TAACCCGTGC ATCTGCGATG CGGGCGATCC GCGGTTGAAG GAATGGATGG GAACGGGATT GCGATCTCGC GTTGAAGGCG AAGAGGCGTG GACCGGAGAG GGGGCGTAG
|
Protein sequence | MTQLTLTRRS FMKAMAVTGA AASLTAVAEP MRALAEGVDA DAGEVKRVRS CCRACGKVEC GVWVTVRDGK VVKVEGDESA PQSRGHCCSK SQSSMQALYH PDRLRFPVKR TNPKGEDDPG WVRITLDEAF EICGEKLKEV KEKYGGESIF VMCGTSRVWS LGPYQGMKQL FGTPNAHLAY QVCKGPRHFG GIMTDEMGSP WMEVEAEPSV YVQWGTACEY SNYDSTNRTV TDVAHRASKH IVVDPRVTPL GKEADIWLPL RPGTDGALAL SWLNWIIENE AYDDTMVRRW SNASFLYVDD KPELTQGWLV EGNGGINMKT KLLTEADLKE DGKYQRFMVW DEANERLTYW DAELGMWEGE EHRIPTTGTW IEHPYKPLVA DAWLPDQSTF ADPATEPDRF PDGFEECNPK GLPKRPSLLP GEVEVTLKDG TVHKARSVWD AFHQMTSEYT MEKAEEITGV PAAKSEEAVR AWTTRVNPLH GNGGIHFQLA TDQNGNSIQN VRALQILSCI TGNSDEPAGN RGSSKAQFDG NPGRSNMQAG APYGDEAKMW DGRDVDLEGL AAKMQDFVQY LIDNDSPLAE RYGNKVPSHE EALVIAKRMG GAFRTSQAWP NPKTVFERQA NEVDAERFPL NRYWARWADA NSIWDGCLDN DVPYQLHAGV CQSGDFMNMG NIDVAWEAMT KLDFFTDINL WFCPNNGNAD VIFPCYHWLE VDTTRVSQGA GGFFGCGCAA VEAPGECIYD PDWNVGMYKA MGVPWNTKDE SAEPSEILNF GEKTDYRWPN RSRVLKDNVD AWKTAEFPDG PTWEQAKEHF QKNGWMDCRT WHPERWGTYR RHEMGWRRQQ GGFNLFPLVD DHPGFMTPSG LIEIWSLVCE AYLGDEDKFP VYREPTNSPV TTPEYFDAAK VADIDESKLR NMDYPASLQE HADATFLMTT GARQPVYFHS EHRQLPWCRE LWPAPRLEMN PNDAARLGLE QGDWVWIESP WGKVREVLDL YYGISKGVVN ANHAWWFPEM DTASHGYELV NINCVMDPYG QDVVCGAATM RSVPVLVYKA TAENSPFGNP VPCDPQGNPC ICDAGDPRLK EWMGTGLRSR VEGEEAWTGE GA
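
As a spot check that the protein sequence corresponds to the gene sequence under translation table 11, the sketch below translates only the first 30 nucleotides of the gene sequence. It is a minimal illustration, not the pipeline that produced this record: the codon table is built from the standard 64-letter amino-acid string (table 11 assigns the same amino acids to sense codons as the standard code and differs mainly in the permitted start codons), and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino-acid string.
# Table 11 uses the same sense-codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence, as printed in the record.
prefix = "ATGACTCAGC TCACGCTCAC GCGCCGCAGC"
print(translate(prefix))  # MTQLTLTRRS, the first block of the protein sequence

# Last nine bases of the gene sequence: two residues followed by the stop codon.
print(translate("GGGGCGTAG"))  # GA*
```

The same function could be applied to the full gene sequence once the 10-base blocks are joined; only the prefix and the final nine bases are checked here.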