Gene Information |
Locus tag | Dole_2191 |
Symbol | |
ID | 5695037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 2662702 |
End bp | 2665665 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641264795 |
Product | formate C-acetyltransferase |
Protein accession | YP_001530072 |
Protein GI | 158522202 |
COG category | [C] Energy production and conversion |
COG ID | [COG1882] Pyruvate-formate lyase |
TIGRFAM ID | [TIGR01774] pyruvate formate-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCAA ACCGTAAAGG CATGACCAAT ATGCTGGCCA ACCTGTTTTT CCGGTTTTTT GCCGCCAACT TCAACCTTCG GCCCGCGTTG AACCGTTACC TTTCGTGCGA TGACGGGCCC ATCAATTTTA CCTTCGGCAT TCGCACCGAA AGCGGCAGCG TGGAACAGGC TGTTCAGTTT GAAAACGGCC GAGTCCGTGT ACTGAAAAAA ATGCCGGAAA AACCCGACGC TCAACTGGTG TTTGTGGACG AAGCCGCGGT AAAGGAGGCG GCCACCCAGC CCCCCAACAA GCTGATGCTG GCCCTGATGG AAAACCGCAT GGTCACCCGG GGCAACCTGG GCTACCTCCA GCTGATGAAC TTTTACCTGT CGCTGCTGTT AAAAAAGGTA CAGGTGGGAA AGCTGCAAAA AGAGGCCAAA CGGGAAAAGC GGGATTACGA TCCAGCCGAA GCGTCTCAGA AAAGCATTAA CAAAAAAAAC AAAGACCGGC TGTCGGCCAC GGCCACGGAC CCCGGGGTCC GGTTTCTTGC CGACCCCTAC CTGGCGGATT ACAGCCTGGA CGACTTTCCC CGGCTCAAAA CCTTTCTGGA TATTCATTTA AAGACCCGAC CCGCCCTCTG CCACGAGCGG CCCGAAATCC TGACCCGGTG GTACAGGGAA AATGGTTTTG AAACCGACAA AAACGGCGAG CCCTGGGTGG CCGAGATTCG CCAGGCCCAT GCCTTAAAAT ACCTGATGGA AAACCGCAAA CCCCTCATTC GGGAAAACGA CCTGGTGGCT GGCACCACCA CCGCCCAAGA GATCGGCGTG GTGCTCTACC CGGACGCCCA CGGCACCATG ATCTGGGGTG AACTGCTTAC CGCCCCCCAC AGGCCCTTAA ACCCCTACGA CGTTTCCCCG AACACCATTG ACGTGCTGCA CAACTCCGTG TTTCCCTTCT GGCTGAACCG CAATTTCCGC GAATGGGTGC GGGACAACAA AGGCTATCCT GACTGTCAGA AACTCGACGA GCGGTTTGCC GCCTGCTTTT TGTGGAAAAC CGTGGCCCTT TCCCACACCA TTCTTGATTA CCCGACCCTG CTCTCTCTGG GCACAAAAGG CATCATGGCC CAAATCGACG CCGAGATTGG CCGCACCAGC GAGGCGGACG CGGAAAAGCA CACCACCCTG CGGGCCATGA AGCTGACCCT GGAGGGCATC AACGCCTACG CGAAAAACCT GGCCGCTGAA GCCCACCGGC TGGCAGGCAT GGAAAAAGAC CCGGCCCGCA AAAAAGAGCT GGTCCGGCTG GCCGACATCT GCCGAAAGGT GCCGGAAAAC CCGGCCGCCA CCCTGGATGA AGCGATCAAC GCCATCTGGA TCGTCTGGGT GGGGGTTCAC ATGGAAAACA CCAACGCCGG TTTTTCCCTG GGCCGCATGG ACCAGTGGCT GCAACCCTAT TTTGAGGCGG ACATGGCAGC CCTGAAAAAC GAAAACGAAC AGAAGGACTA TATCCGCCGC GCCATCGAGC TTGTCGGGTG CTTCTACATG CGATGCACCG ACCACCTGCC CCTGATTCCG GACATCGGCA ACTACCTGTT CGGCGGCAGC TCATCGGACC AGGCCATCAC CCTGGGCGGG GTGACCCCGG ACGGCGAGGA CGCGGTCAAC GACATGACCT ACATCTTTTT AAAGGTCACG GAGATGCTCT CCATTCGGGA CCCCAACGTC AATGCCCGGT ACAACCGCGA AAAAAACAGC GACGCTTATC TTGCGCGGCT CTGCGAGGTG AACCTGAACA CCGCGGCCAC CCCCTCCATT 
CACAATGACG AGGCGGTCAT GGCCTCGCTG GCCGAATTCA ACTACCCGGC CGAGCACCTG CGGGACTGGG CCGCCACCGG GTGCGTGGAG CCCACCCTTT CCGGCCGTCA CATCGGCCAC ACCAACTGCA TGATGTTCAA CATGGTGGCG GCCCTTGAAA TGGCCTTGAA CAACGGGCGC CACCCCCTGA TGCGCTGGGA CCTGGGCCCC AAAACAGGGG ACGTGACCAC CGGCCATTTC AAGGATTTTG AATCGTTTTT CGAGGCCTTT GCAACCCAGC TTGGATTTCT GGCCGACCAG ACCTGCGAGT ACAACAACCT GCTGGGCCAG GCCCACCAGA CAATCCGGCC CACGCCATAC ATCTCGGCCC TGATCCAGGG CCCGAAAGAA AAAGGCAGGG ATGTCACCAA AGGCGGGGCC CTTTACAACT CGTCAGGCGT GGCCTGCATC GGTCTTGCCG ACATCACCGA CTCCATGATG GTGATCAAGA AACTGGTGTT CGACGGCAAG CAGGTCTCCT TTGGCGATCT TCACCGGGCC CTGGCCGCCA ATTTTGAAAA CGAACCGGCC CTGCTGGCGA TCATCAAAAA CAAAATTCCC CTGTTCGGGT CCGGCGACAA AGAGGCCCTG GCCATGGCCA ACCGCATCAT CCGGCTGGCC CACGACATTT TCGGGGCCCA CACCAACTAC CGGGGCGGCC CCTATACCGC GGGGTTCTGG TCCATGTCCA ACCACGTGGC CTTCGGCACC CTGACCGGAG CACTGCCGTC GGGACGACTG GCCGGCAAGG CCTTTACACC GGGCCTGACC CCTGAACCCC ATTCTTCCCC CAGTATTTTA GACAACCTGC GCGACGTGGC CGGCCTGGAC CCCACCGCCA TGAACAACAA CATCGCCTTT AACGTCAAGG TGGTGCCGGC GCCGGGCGAG TCCCATGCCC AGACCGTCAA CACCCTCTGC TCCTATACCA AGGCCTATAC CGGCCTGGGC GGCATGCAGA TGCAACTCAA CGTGGTGTCG TCGGACACGC TGCGCGACGC CATGGAACAC CCGGAAAACT ACCAGAACCT GCTGGTGCGC ATATCGGGAT ACAACGCCTA TTTTGTCTCG CTGAACCGGG AGATGCAGAT AGAGCTGATT GAACGGGCTG AATTCGGCAT GTGA
|
Protein sequence | MNANRKGMTN MLANLFFRFF AANFNLRPAL NRYLSCDDGP INFTFGIRTE SGSVEQAVQF ENGRVRVLKK MPEKPDAQLV FVDEAAVKEA ATQPPNKLML ALMENRMVTR GNLGYLQLMN FYLSLLLKKV QVGKLQKEAK REKRDYDPAE ASQKSINKKN KDRLSATATD PGVRFLADPY LADYSLDDFP RLKTFLDIHL KTRPALCHER PEILTRWYRE NGFETDKNGE PWVAEIRQAH ALKYLMENRK PLIRENDLVA GTTTAQEIGV VLYPDAHGTM IWGELLTAPH RPLNPYDVSP NTIDVLHNSV FPFWLNRNFR EWVRDNKGYP DCQKLDERFA ACFLWKTVAL SHTILDYPTL LSLGTKGIMA QIDAEIGRTS EADAEKHTTL RAMKLTLEGI NAYAKNLAAE AHRLAGMEKD PARKKELVRL ADICRKVPEN PAATLDEAIN AIWIVWVGVH MENTNAGFSL GRMDQWLQPY FEADMAALKN ENEQKDYIRR AIELVGCFYM RCTDHLPLIP DIGNYLFGGS SSDQAITLGG VTPDGEDAVN DMTYIFLKVT EMLSIRDPNV NARYNREKNS DAYLARLCEV NLNTAATPSI HNDEAVMASL AEFNYPAEHL RDWAATGCVE PTLSGRHIGH TNCMMFNMVA ALEMALNNGR HPLMRWDLGP KTGDVTTGHF KDFESFFEAF ATQLGFLADQ TCEYNNLLGQ AHQTIRPTPY ISALIQGPKE KGRDVTKGGA LYNSSGVACI GLADITDSMM VIKKLVFDGK QVSFGDLHRA LAANFENEPA LLAIIKNKIP LFGSGDKEAL AMANRIIRLA HDIFGAHTNY RGGPYTAGFW SMSNHVAFGT LTGALPSGRL AGKAFTPGLT PEPHSSPSIL DNLRDVAGLD PTAMNNNIAF NVKVVPAPGE SHAQTVNTLC SYTKAYTGLG GMQMQLNVVS SDTLRDAMEH PENYQNLLVR ISGYNAYFVS LNREMQIELI ERAEFGM
|
| |
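The coordinate, length, and GC fields in this record are related by simple arithmetic, and a short consistency check can make that explicit. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon, neither of which the record states outright. The constants are copied from the fields above.

```python
# Values copied from the record's Gene Information table.
START_BP = 2662702
END_BP = 2665665
GENE_LENGTH_BP = 2964
PROTEIN_LENGTH_AA = 987


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (hypothetical helper)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full gene sequence from the Sequence table to `gc_fraction` would give a value to compare against the listed GC content; the record reports that figure rounded to a whole percent.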