Gene Information |
Locus tag | Elen_2249 |
Symbol | |
ID | 8416573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2643124 |
End bp | 2645835 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645025236 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003182599 |
Protein GI | 257791993 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
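The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal illustration rather than part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 2643124
end_bp = 2645835
gene_length_bp = 2712
protein_length_aa = 903

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon,
# which does not contribute a residue to the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the table: the span is 2712 bp, which gives 904 codons and 903 residues once the stop codon is excluded.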
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.580662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAGCA TCACGATAGA CGGGATCGCC CTCGACGTTG CCGAAGGCTC CACCATCCTC GATGCCGCGC GCGCTGCCGG CATCCGCATC CCCACGCTGT GCTTCCTGAA AGAGCGCAGC GCCATCGCCT CGTGCCGCGT GTGCGTCGTG GACGTGGAAG GGCTCGACCA GCCGGTGCCC TCGTGCGCCA CGCCCGTGCA AGACGGTATG AAGGTGACCA CCTCCTCGCC GCGCATCGAG GCGTACCGCC GCATCGCGCT CGAGCTCATC ATCGCCGATC ACGGCCTCGA TTCCACGAAC TACTGCTTCT CGTGCGATAA GAACGGCGCC TGCGAGCTGC AGGCCGTCTG CCGTGAGTAC GGCGTGCTGG AGTCTCCTTT CGAGGCAGCG CAGAAGCGCG AGCCCGTGCG CGACGAGAAC CCGTTCCTCG CTTACGACCC GAACCTGTGC ATACGCTGCC AGCGTTGCGT GGGCGCCTGC AACGACGCCG CGCGCAACCA CACGCTGGGA ACCGCGAAGC GCGGCGTTCG CACGCTGATC GAGGCGCCGT TCGGCGCGGA CTGGCGCGCC ACCGACTGCG AGTCGTGCGG CAACTGCGCG CAGGCGTGCC CCACCGGAGC GCTCACCGAG AAGCGCCGCG CGACGTACCG CTCGTGGGAG ACCGAGCGCG TGCGCACCAC GTGCCCGCAC TGCGGCGTGG GCTGCCAGCT GGATTTGGTG GTGAAGGACG GCGTCGTCGT GGACGCCGAG GCCGCGCCGG GCCCGTCGAA CCACGGGCTT CTGTGCGTGA AGGGGCGCTC GGCAAGCTTC GATTTCGTCG ACGCGCCCGA CCGCATACGA ACGCCGCTCG TGAAGAACCG CGAGACGGGA GAGTTCGAGC CCGCCACCTG GGACGAAGCG CTCGACCTCG TGGCGAGGCG CTTCACGGAG CTCCGCGATG TCCACGGCGG CGAGTCGCTG GCGGCGTTCG CGTGCTCGCG CTCCACGAAC GAGGACATCT ACCTGTTCCA GAAGATGGCG CGCATCGTGC TCCAAACGAA CAACGTGGAC TGTTGCGCGC GTGTTTGACA CGCCCCCACG GTCGCCGGTC TGGCGACCAT GCTTGGTTCC GGCGCGATGA CGAACTCCAT CGAAGACGTC ACGCGCAAGG CGGAGGTCAT CATGCTCGTG GGGTCGAACC CCGAGGAGGC CCACCCCGTC ATAGGCATGC AGATCCGCGC GGCGGTCGAA CGCGGCTGCC GGCTCATCGC GGTCGACCCG CGCGACATCG GGCTGGCCGC GCATGCCGAC ATTCATTTGA AGCTGAGGCC GGGAACGAAC GTCGCGTTCG CGAACGGCAT TGTGAACTAC CTGATTCAGC ACGAGCTGTA CGACGAGGAC TTCGTGCGCG AGCGCACCGA GGGCTTCGAG ATATTGGCCG CGACGGTGCG CGACTACACG CCCGAGCTGG TGGAGGACAT CTGCGGCATC GACCGTCGCG ACCTCGTGGC CGCGGCGAAG ATGTACGCGG CAGCCGACGC GGCGGCCATC ATGTACTGCC TCGGCGTGAC GGAGCACTCC ACGGGAACCG ACGGCGTCAT GGCGCTGTCG AACATCGCGA TGATCTCCGG CAACCTGGGC AAGCCGGGCG GCGGCGTGAA CCCGCTGCGC GGCCAGAACA ACGTGCAGGG CGCCTGCGAC ATGGGCGCCG GTCCCGACGA CCTGCCCGGC TACCAGAAGG TGGCTGATCC CGAGGTGGTG CGCCGCTTCG AGAAGGCGTG GGGCGCCGCG CTTCCGCGTG CGCGCGGCAT CAAGGCCACC 
GAGTGCTTCC CGGCCATGAT CGAGGGCGGC ATCAAGGGGC TGTTCCTGTT CGGCGAGGAT CCCGTGCGCA CCGACCCCGA TACGCATCAC GTGATCCGCT CGCTCGAGGC GCTCGAGTTC TTCGTGGTGG ACGACCTGTT CATGACCGAA ACGGCCAAGT ACGCCGACGT CATCCTGCCC GGGCGCAGCT ACGCCGAGAA GGAGGGCACG TTCACGAACA CCGAGCGGCG CGTCCAGCGC GTGCGCAAGG CCGTGGACGG GCCTGTGGGC GCGTGGCTCG ACACCGACAT CTTCACCGAG GTCATGAACC GCATGGGCTA TGCGCAGCCT CGTCTGAGCG CGGCCGAGGT CATGGACGAG ATCGTCTCGG TCACGCCCAC GTACGGCGGC ATGAGCCATG CGCGCCTCGA CGGCGACGAG ACGGCCGGCC GCGGCTTGCA GTGGCCGTGC CCAAGCGCCG AGCATCCCGG CACGCCCATC CTGCACATGG GCGAGTTCGC CCAGGGGCTG GGCGCGTTCT CCACACCCGA CTACCAGCCG TCCGCCGAGC TGCCCGATGC GGAGTATCCG CTGGTCATGA TGACGGGTCG CATCCTGTAC CAGTACAACG CCTGTGCCAT GACCGCTCGC ACCGACGGCG TGAACGAGAT AGCGAACCGC TCGTTCATCG AGCTGAACAC GTGCGATGCC GAGGCTCTCG GCATCGCCGA CGGGGACATC GTGCGCGTCT CATCGCGTCG CGGCTCCATC GAGTCCGTCG CGCATGTGTC CGAGAAGACG TCGCCGGGAC ATACGTGGAT GCCGTTCCAT TTCCAAGACG GCAACAGCAA CTGGCTCACC ATCGCCGCCC TCGACCGCGT GTCGAAGGCC CCCGAGTACA AGGTGTGCGC CGTGAAAGTG GAAAAGGCGT AA
|
Protein sequence | MPSITIDGIA LDVAEGSTIL DAARAAGIRI PTLCFLKERS AIASCRVCVV DVEGLDQPVP SCATPVQDGM KVTTSSPRIE AYRRIALELI IADHGLDSTN YCFSCDKNGA CELQAVCREY GVLESPFEAA QKREPVRDEN PFLAYDPNLC IRCQRCVGAC NDAARNHTLG TAKRGVRTLI EAPFGADWRA TDCESCGNCA QACPTGALTE KRRATYRSWE TERVRTTCPH CGVGCQLDLV VKDGVVVDAE AAPGPSNHGL LCVKGRSASF DFVDAPDRIR TPLVKNRETG EFEPATWDEA LDLVARRFTE LRDVHGGESL AAFACSRSTN EDIYLFQKMA RIVLQTNNVD CCARVUHAPT VAGLATMLGS GAMTNSIEDV TRKAEVIMLV GSNPEEAHPV IGMQIRAAVE RGCRLIAVDP RDIGLAAHAD IHLKLRPGTN VAFANGIVNY LIQHELYDED FVRERTEGFE ILAATVRDYT PELVEDICGI DRRDLVAAAK MYAAADAAAI MYCLGVTEHS TGTDGVMALS NIAMISGNLG KPGGGVNPLR GQNNVQGACD MGAGPDDLPG YQKVADPEVV RRFEKAWGAA LPRARGIKAT ECFPAMIEGG IKGLFLFGED PVRTDPDTHH VIRSLEALEF FVVDDLFMTE TAKYADVILP GRSYAEKEGT FTNTERRVQR VRKAVDGPVG AWLDTDIFTE VMNRMGYAQP RLSAAEVMDE IVSVTPTYGG MSHARLDGDE TAGRGLQWPC PSAEHPGTPI LHMGEFAQGL GAFSTPDYQP SAELPDAEYP LVMMTGRILY QYNACAMTAR TDGVNEIANR SFIELNTCDA EALGIADGDI VRVSSRRGSI ESVAHVSEKT SPGHTWMPFH FQDGNSNWLT IAALDRVSKA PEYKVCAVKV EKA