Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2768 |
Symbol | |
ID | 8417094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3210159 |
End bp | 3212855 |
Gene Length | 2697 bp |
Protein Length | 898 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025743 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003183104 |
Protein GI | 257792498 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.81742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.229258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGC CTACCATCAC GCGACGCAGT TTCGTGAAGT CGGCGGCGGC GCTCGGCGCT GCATGCGGCC TGGGCGTCGC CGTGAGCGAC GACCTCGTGC ACGTCGACCC GGCACATGCC GATACGGGCG GCGACGTGAA GATCGTGAAG ACGGCCTGTC GCGCCTGCAT CGCAAGCTGC GCCGTGCTGG CGCATGTGAA GAACGGTCGC GTCATCAAGA TCGAGGGCAA TCCTGAAAGC CCCATGAGCC AAGGCGGCCT GTGCGCGAAG GGCATGGCCG GCATCCAGGC GCTGTACCAT CCCAACCGCA ACAAGTACCC CATGCGGCGC GTGGGCGAGC GCGGTCAGAA CCAGTGGGAG CGCATCACCT GGGACGAGGC CATCACCGAG ATCGCCGAGA AGCTCATCGA GATCGACGAG AAGTACGGCT CGGAATGCGT GGCCGTGTCC ACCGGCGGTG GCGGCAACCC GCACTTCAGC AACGTGAAGC GCTTCGGCGA GGCCGTCAAC ACGCCCAACG TGTGGGAGCC CGGCTGCGCG CAGTGCTACC TGCCGCGCAT GGGCGCCTCG CAGCTGTCGA ACGGCATGGG CAAGCCGAAC AACCTCTCGT TCGCCGACTC CAACGGCTGG GACTACTACT TCACCGACTC GCCGGTCGAC TCCCTCGTGC TGTGGGGCAC CGACCCGTCG AACAGCTCGG TTGCCACGGG CGGCCGCGCG TTGGCCGAGC TGCGCGCCCG CGAGCAGGGC TTGAAGACCG TGGTCATCGA CCCGCGCATG ACGATGGACG CCGCCAAAGC CGAAGTGTGG CTGCCCATCC GCCCCGGCAC CGATGCCGCG CTGCTCATGG CGTGGACGAA GTGGATCATC GACAACGAGA AGTACGACGA GGACTTCTGC ACGAACTGGA CCAACATGCC CTATATCGTG AACCCCGAGA CGCGCCTGAC CCTCAAGCCC ACCGAATGCG GCCTCGAGGG CACCGACAAG GACTACGTCA TCTGGGATCC GGCCGAGGGC AAGCCCGTCG TGTTGGAGTA CCCCATGAAC GAAGGCGTCA CCCCGCCGCT GTTCGGCACG TACGAGATCG ACGGCAAGCA GTATCCCACG GGCGGCCAGC TGCTCAAGGA GAGCGTGGAC GAGTTCACGC TGGCGAAAGC CGCCGAGATC TGCTGGCTCG ACGAGGGTCA GATCGAGAAG GCGCTGGAAA TCTACACGTC GGGCCAGTCC GGTATCATGC AGGGCGTGCC CATCGACCAG TACGAGCAGT CGCAGCAGTG CGCGCTCGGC GCCCTCAACC TCGAGTACCT CATGGGCAAC GTGCAGAAGC CCGGGGCCAT GCTGCAAAGC TTCAAGCCCT GCCCGGCCCG CGACCAGATT CCCAACACGC CGCGCTTCCT CAAGAAGGAG AAGATGCTCA AGCGGCTGGG CGTGCAGGAG CACAAGGGCC TGCTCGACTG GGACATGGCG TTCATCCCCG CGGTGTTCAA GGCCATGAAG GACGGCGATC CCTACCAGAT CCACGCGTGG TTCGAGCGCT CCGGCAACAA GCACGTGGTG CTGGGCAATG CAACGTGCCT CGACGAGATC GTGCCGAACC TCGACTTCGT CTGCCACATC TTCATGTACC CGACGGCGTT TTCCGTGCTG TGCGCCGACA TGCTGCTCCC CTCCGCGGAA TGGCTGGAAA CCAACCTGGT CATCCCGCAG CTCAACGCCA TCGTCATACG CCAGGCGGTC ACCCACCTGT TCGAGACGGT GGAGGAAGGC CTCATCTGGA CGCAGATCGT GCAGAAGATG GCCGAGATGG GCAACGTGCA TGCGCAGGAG GCCTTCACGG CCGAGGGCAC GAACGGCATC CCGATGTACA AGAACGAGTA CGAGAAGCAG AAGCTGCACC TCAACAACCT GAAGATGAGC TGGGAGGAAG CGGCCGAGAA GGGCGTGTTC GAGTGGTGCA CCGAAGAGGA GTACCGCACG TACTACGGCT ATAAGGCGGT TGACGAGGCC ACCGGCAAAG CCAAGGGCTT CGGGACGCCT TCCAAGAAGT GCGAGGTGTA CTGCGAGTCC AATACCATCC TGGGCCGCAC GGGCTTCCCC TGGTCGCAGT GCTTCGAGGA GAAGAACCTG GTGCTGGAGC CGGCGTCGAA GGACTACGAG CCGCTGTGCT ACTACAAGGA GCCGGCCGAA AGCCCGCTCA CGGACACCGA GTACCCGCTG GTGCTGACCG AGGGCCGTCT GCCCATGTAC CACCACGGCA CGCTACGCAA CATCCCGTAT CTGCGCGAGA TCTACCCCGT GCCCGAGATG TGGATCCATC CCGACGACGC GGCGACCTAC GGCGTGGAGG ACGGCCAGTG GTGCACCATC GCCAGCCGCC GCGGCGAGAC GCACGGCAAG GCGCGCGTTA CCACGGCCAT CGCCAAGGGC GTCATCTACC AGGAGCGCTT CTGGGCGCCC GAGCTGCTGG ACACCGACCC CGAGCGCGCT TACCGCGTGA TGAACATCAA CGTGCTGACC AAGAACGACG AGCCGTACAA CCCCGAGTAC GGCACCTATA CCCTGCGCGG CTTCCAGGTG AAGGTCACGC CTGACGCCGC TGCTCCGGAA GGCGTGTGGA CCGAGCCCGA GCAGTTCGAG CCCTGGATGC CGCAGCCGAG CGATCCTACC GAGGAGGTGT TCGATTATGG CGCATAA
|
Protein sequence | MTQPTITRRS FVKSAAALGA ACGLGVAVSD DLVHVDPAHA DTGGDVKIVK TACRACIASC AVLAHVKNGR VIKIEGNPES PMSQGGLCAK GMAGIQALYH PNRNKYPMRR VGERGQNQWE RITWDEAITE IAEKLIEIDE KYGSECVAVS TGGGGNPHFS NVKRFGEAVN TPNVWEPGCA QCYLPRMGAS QLSNGMGKPN NLSFADSNGW DYYFTDSPVD SLVLWGTDPS NSSVATGGRA LAELRAREQG LKTVVIDPRM TMDAAKAEVW LPIRPGTDAA LLMAWTKWII DNEKYDEDFC TNWTNMPYIV NPETRLTLKP TECGLEGTDK DYVIWDPAEG KPVVLEYPMN EGVTPPLFGT YEIDGKQYPT GGQLLKESVD EFTLAKAAEI CWLDEGQIEK ALEIYTSGQS GIMQGVPIDQ YEQSQQCALG ALNLEYLMGN VQKPGAMLQS FKPCPARDQI PNTPRFLKKE KMLKRLGVQE HKGLLDWDMA FIPAVFKAMK DGDPYQIHAW FERSGNKHVV LGNATCLDEI VPNLDFVCHI FMYPTAFSVL CADMLLPSAE WLETNLVIPQ LNAIVIRQAV THLFETVEEG LIWTQIVQKM AEMGNVHAQE AFTAEGTNGI PMYKNEYEKQ KLHLNNLKMS WEEAAEKGVF EWCTEEEYRT YYGYKAVDEA TGKAKGFGTP SKKCEVYCES NTILGRTGFP WSQCFEEKNL VLEPASKDYE PLCYYKEPAE SPLTDTEYPL VLTEGRLPMY HHGTLRNIPY LREIYPVPEM WIHPDDAATY GVEDGQWCTI ASRRGETHGK ARVTTAIAKG VIYQERFWAP ELLDTDPERA YRVMNINVLT KNDEPYNPEY GTYTLRGFQV KVTPDAAAPE GVWTEPEQFE PWMPQPSDPT EEVFDYGA
|
| |