Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0607 |
Symbol | |
ID | 8414897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 774769 |
End bp | 777501 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645023584 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003180981 |
Protein GI | 257790375 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.408169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGC TCAACATGTC GAGGCGCGGG TTCGTGAAAG CCGCAGCCGC CACCGGAGCG CTTGCCGCAT TCGGCGCGAC GGCGGTCGAC GGCGCGACGT TCCGCGAAGC GCATGCCGAC GAGACGACCT CCCAAACAAA GAAGGTCTAC ACCTCGTGCC AAGCGTGCAT CTGCAGCTGC GCCGTCATCG CGACGGTGCG CGACGGACGC GTCATTCGGC TCGAAGGCAA TCCGGAAAGC CCCATCAGCC GCGGAGGCCT GTGCGCGAAG GGCCTTTCGG GCATCCAAGC GCTGTACAAC CCCTGTCGCA ACAAGTACCC CATGAAGCGC GTGGGGGAGC GCGGCACGAA CAGCTTCGAG CGCATCAGCT GGGATCAGGC TATCGAGGAG ATCGCGCAGA AGCTCACGCA GGACTTCTTC AAGTACGGCG GCGAGTCGCT GGTGACCTCT ACGGGCGGCG GCGGGAACCC GCACTTCAGC TCGCCGTGTC GCTTCACGCA GGCGTTGGGA TCGCCGAACA TCTTCGAGCC GGGCTGCGCG CAGTGCTTCC TGCCGCGCAT GGCCACGTTC TCGCTCATGT ACGGCGGCAG CAAGTCGGGC ACCACGTCGA TGGCCGATTC CAAGTACGGC GGCTGTTCGT CGCTGTACTT CCCGGAGGAC AATCCTATCC AGACGCTGGT CATGTGGGGC ACCTGCACCA GCTACCACGC GCCCAGCGGT TCGGGGCGCG CCATTGCCGA GCTGCGCGCA CGCGAGCAGG GGCTCAACAT GGTAGTGGTG GACCCTCGCT TCACGCCCGA TGCGGCTATG GCCGACGTGT GGCTGCCCAT TCGTCCCGGA ACCGACGTGG CGCTTATGAT GACGTGGATC CGCTACATCA TCGAGAACAA GCTGTACGAC GAAGATTTCT GCAAGCGTTG GACGAACCTG CCGTACCTCA TCGACACCGA TACGAAGAAG ATGCTGCGCG CCTACGAAGT GGGCCTGGGC GAAGCCGATG CGGAAGATGC GGAAAAGACG TTCGTGGTGT GGGATCAGAA GACGAACTCC GCGAAGGCGC TGCCATGGCC GTACGACGAA GCGCTGGACC CCGCGTTCTT CGGCACGTAC GAGATCGACG GCGTCGAATA CCCCACGGCG TTCACGCTGC TGCAGGAGCG CGTGGACGAA TGGACGCTCG AAAAGGGCTG CGAGGTATGC TGCCTTGAGA AGGACCAGGT GGAGAAGGCC ATCCGCATCT ACGCGGAGAA CGCGCCGTCC GGCCTGGTTC TGGGCGTGGC CACCGACATG AGCCCTCAGT CGGCCCAGGG TACGGAAGGC GCCTGCATCC TGGAGTTCTT GCTGGGCAAC GTGGAGAAGC CGGGCGCGCT GCTGCAGCGT TTCCCCGACC CGCCGAACAA GGTGGATCTC GGCACTATGA ACACGCTGGT CACAGGCGAT ATGCTGGAGA AGCGCCTAGG GTATCGCGAG CACAAGGGCC TGGGCATCTG GTCGCATGCG CACATCCCCA CGGTGTTCAA GGCCATCACC ACAGGCGAGC CGTACCAGCC GCGCAACTGG ATGGAACGCT CTGGCAACAA GCATGCGATG ATCGGCAACG CGGGGCAGCT GTCCGAGATC ATCGACAAGA TGGAGATGAT CTGCCATCTG TACATGTACC CCACGGCCTT CACCATCGAA GCGGCCGATT ACGTTCTGCC CACGCAGGAG TGGCTGGAGA GCTATTTCAC CATCGCCCAT GCGAACAAGA TCATCATCCG CCAGCCGGTG GTGCACTTGT ACGAAACGGT GAACGAGGGC GTCATTTGGT CGGAGATCGC GCATCGTTGC GCCGAGCTGG GCAATCCGTT CGCTCAGAAG GCCTTCGACA AGGAATACCT GGCAACGCTG GGCACCGATC TGGTGTACTG GCGCGGCCAG CAGGAAATGA TGGACTTCCA TATGGGCTCG CTGCCTATGA GCTGGGACGA ACTGGCCGAG ATGGGCGCGT ACGAGTGGAT CTCCAAGGAA GACTACCTGA CGTATTACAC CTACAAGACC ATCGATCCGA AGACGGGCAA GGAGAAGGGC TGGAACACGC CGTCCAAGAA GGTCGAGCCG TATTCGGAAG GCACGCTCAT GCTAGGCCGC ACGGGCGAGC CGTGGGCGAG CGCCGAGGGC AAATCCTACG TCATGCCGCC GGCCGACGAA GATTACGATC CTTTGATCTA CTATCTGGAG CCCGAGGAGA CGAACCTCAC CGATACGGAA TATCCCATCA TGCTGACCCA GGGACGCATT CCGCACTACC ATCACGGCAC GTTGCGCAAC ATTCCGTATC TGCGCGAGCT GTATCCGGTG CCGCTGGTCA GCATCCATCC CGAGACCGCC GAGAAGTACG GCGTGGAGGA CGAGCAGTGG GTGTGGGTGG AAAGCCGTCG CGGCAAGGTG CGCGGGAAGG CGCATGTGAC GGCGGGCATC GCAAAAGACG CCGTGCACAT GGAACGCTTC TGGAATCCCG AGTACCTGGA CACCGACACG CCGAGCAAGG CCTGGACTGA GATGAACGTG AACATCCTCA CCAAGACCGA CGGGCGTTAC AGCCCCGAGC ACGGCACGTA CACGCTGCGC GGCTTCACGG TGAAGGTGTA CCCGGCTCCC GAAGGCGCGC CTGAAGGCGC ATGGATCAAC CCGACCGATT TCGAACCGTG GATGCCTGAA TTCAGCGAAT CCACTGAGGT GGTGTTCAAA TAA
|
Protein sequence | MSQLNMSRRG FVKAAAATGA LAAFGATAVD GATFREAHAD ETTSQTKKVY TSCQACICSC AVIATVRDGR VIRLEGNPES PISRGGLCAK GLSGIQALYN PCRNKYPMKR VGERGTNSFE RISWDQAIEE IAQKLTQDFF KYGGESLVTS TGGGGNPHFS SPCRFTQALG SPNIFEPGCA QCFLPRMATF SLMYGGSKSG TTSMADSKYG GCSSLYFPED NPIQTLVMWG TCTSYHAPSG SGRAIAELRA REQGLNMVVV DPRFTPDAAM ADVWLPIRPG TDVALMMTWI RYIIENKLYD EDFCKRWTNL PYLIDTDTKK MLRAYEVGLG EADAEDAEKT FVVWDQKTNS AKALPWPYDE ALDPAFFGTY EIDGVEYPTA FTLLQERVDE WTLEKGCEVC CLEKDQVEKA IRIYAENAPS GLVLGVATDM SPQSAQGTEG ACILEFLLGN VEKPGALLQR FPDPPNKVDL GTMNTLVTGD MLEKRLGYRE HKGLGIWSHA HIPTVFKAIT TGEPYQPRNW MERSGNKHAM IGNAGQLSEI IDKMEMICHL YMYPTAFTIE AADYVLPTQE WLESYFTIAH ANKIIIRQPV VHLYETVNEG VIWSEIAHRC AELGNPFAQK AFDKEYLATL GTDLVYWRGQ QEMMDFHMGS LPMSWDELAE MGAYEWISKE DYLTYYTYKT IDPKTGKEKG WNTPSKKVEP YSEGTLMLGR TGEPWASAEG KSYVMPPADE DYDPLIYYLE PEETNLTDTE YPIMLTQGRI PHYHHGTLRN IPYLRELYPV PLVSIHPETA EKYGVEDEQW VWVESRRGKV RGKAHVTAGI AKDAVHMERF WNPEYLDTDT PSKAWTEMNV NILTKTDGRY SPEHGTYTLR GFTVKVYPAP EGAPEGAWIN PTDFEPWMPE FSESTEVVFK
|
| |