Gene Information |
Locus tag | Avin_03810 |
Symbol | fdhA |
ID | 7759341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 361140 |
End bp | 364196 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643803305 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002797616 |
Protein GI | 226942543 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
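
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table for Avin_03810 (fdhA).
START_BP = 361140
END_BP = 364196
GENE_LENGTH_BP = 3057
PROTEIN_LENGTH_AA = 1018


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length matches End bp - Start bp + 1, and the reported protein length matches the codon count minus the stop codon.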
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTTG GCAGAAGACA GTTCTTCAAG CTCTGTACCG CCGGGGTGGC CGGCGCCACC GCCGCCACCC TGGGCTTCGC CCCCGGCGTG GCCAACGCCA CCCAACCGCG CCAGTACAAG CTGCTGCGCG CCAAGGAGAC CCGCAACAAC TGCACCTACT GTTCGGTGGG CTGCGGGGTG CTGATGTACA GCCTGGGCGA CGGCGCGAAG AACGCCAAGC CGCGCATCTT CCACATCGAG GGCGACCCGG ACCATCCGGT CAGCCGCGGT TCGCTGTGCC CGAAGGGCGC CGGCCTGGTC GACTTCATCC ACAGCGAGCA GCGTCTGCAG TATCCCGAGT ACCGCGCGCC GGGCTCGGAC AAGTGGCAGC GGATAAGCTG GGAAGAGGCC ATCGAGCGCA TCGCCCGGCT GATGAAGGAC GACCGCGACG CCAACTTCGT CGAGAAGAAC GCCGCCGGCG TCACCGTGAA CCGCTGGCTG ACCACCGGCA TGCTCTGCTC CTCGGCGGCC AGCAACGAGA CCGGCGCCCT CGACTGGCGC TTCACCCGCG CCCTCGGCAT TCTCGGCATG GACTGCCAGG CGCGGCTCTG CCACGGTCCG ACGGTGTCCG CCCTGGCGCC CAGCTTCGGC CGCGGCGCGA TGACCAACAA CTGGGTGGAC ATCAAGAACG CCAACGTCGT GCTGGTGATG GGCGGCAATC CCGCCGAGGC CCACCCGGTG GGCTTCAAGT GGGCCATCGA GGCGAAGATC CGCAACGGCG CCAAGCTGAT CGTGGTCGAC CCGCGCTTCA ACCGCAGCGC CGCGGTCGCC GACCTCTACA CGCCGATCCG CGCCGGCTCC GACGTGACCT TCCTGATGGG CGTGGTCAAC TACCTGATCG CCAACGACAG GATCCAGCAC GAGTACGTGC GCGCCTACAC CAACGCGAGC CTGCTGGTGC GCGACGATTT CGGCTTCGAC GAGGGCCTGT TCAGCGGCTA CGACGAGGTG AAGAAGCAGT ACGACCGCAG TTCCTGGGCC TATCAGCTCG ACGAGAACGG CCACGCCCGG CGCGACCCGA CCCTCAGCCA TCCGCGCTGC GTGTGGAACC TCCTCAAGCA GCACGTCAGC CGCTACACCC CGGAAATGGT CACGCGCCTG TGCGGCACGC CGAAGGAGGA TTTCCTGGAG ATCTGCCGGC TGCTCGGCGA CACCAGCGCG CCGGACAAGA CCGCGACCTT CCTCTACGCT CTCGGCTGGA CCCACCACAC CAGCGGCGTG CAGATCATCC GCGGCGCGGC GATGATCCAG CTGTTGCTCG GCAACATCGG CATGCCCGGC GGCGGCATCA ACGCCCTGCG CGGCCACTCG AACATCCAGG GTTACACCGA CCTCGGCCTG CTCTCGGTGC GCATGCCCGG TTACCTGAAC CTGCCCTCGG AGCGCCAGCC CGACCTGGCC ACCTACCTCG CGCAGAGCAC GCCCAAGGCG CTGCTGCCCG GCCAGGTGAA CTACTGGCAG AACACCGAGA AGTTCTTCGT CAGCCTGATG AAGAGCGTCT GGGGCGACAA GGCGACCAGG GACAACGACT GGGGCTTCGA CTGGCTGCCC AAGTGGGACG TCGAGTACGA CGTGCTGCGC TATATGGAGC TGATGTACGA GGGCAAGGTC AACGGCTACC TCGCCCAGGG CTTCAACCCG ATCGCCGCCT TCCCGGACAA GAACAAGGCG GTCGCCGCCC TGTCCAGGCT CAAGTACCTG GTGGTGATCG ACCCGCTGGT CACCGAAAGC TCGAACTTCT GGCAGAACCA CGGCGAAGCC 
AACGACGTGA ATCCCGCCGA GATCCAGACC GAGGTGTTCC GCCTGCCGTC GTCCTGCTTC GCCGAGGAGA ACGGCTCGAT CGCCAACTCC GGGCGCTGGC TGCAGTGGCA CTGGGCCGGG GCGGCGCCGC CGGCCGAGGC CTGGCACGAC GGCAAGATCC TCGGCCACCT GTTCCTCCGG CTGCGCGAGC TGTACGAGCG GGAGGGCGGC GCCAATCCCG CGCCGCTGCT CAACATGGCC TGGAACTACA GCGATCCGGA GGACCCGGCG CCCGAGGAGG TCGCCCGGGA GGCCAACGGC TACGCCCTGG CCGACCTCTA CGATCCGCAG GGCAACCTGC TGGCGAAAAA GGGCGAGCTG CTGCGCGACT TCTCCCTGCT GCGCGCCGAC GGCAGCACCG CCAGCTTCTG CTGGGTCTTC GCCGGCTCCT GGACCGAGGC CGGCAACCAG ATGGCGCGGC GCGACAACGC CAGCGACGGC CAGGGCCTCG GCAGCACGCC GGGCTGGGCC TGGTCCTGGC CGCAGAACCG CCGCATCCTC TACAACCGCG CCTCGGTCGA CCCGCAGGGC AAGCCCTGGG ACCCGAAACG CAAGCTGATC GAGTGGAACG GCCAGCGCTG GAGCGGCATC GACGTCGCCG ACTTCGCCGC CACCGTCGCG CCGGGCAGCG AGGCCAATCC GTTCATCATG CTGCCCGAGG GCCTCGGCCG GCTGTTCGCG GTCGGCGCTC TGGGCGACGG GCCCTTCCCC GAACACTACG AGCCGATCGA GTCGCCGCTG GCGGAAAACC CGCTGCACCC GAAGGTCACC TTCAACCCGA CCACCCGGCT GTTCGACAAC GACCGCCAGC GCATGGGCAC CGCGGCGGAC TTCCCCTACG TCGCCACCAC CTACTCGATC ACCGAGATGT TCCGTCACTG GACCAAGGCC TCGCGGCTCA ACGCCATCGT CCAGCCCGAG CAGTTCGTCG AGATCGGCGA ACAGCTTGCG GCGCAGAAGG GCATCCGCGC CGGCGACACC GTCAAGGTCA GCTCGATGCG CGGCTTCATC AAGGCCAAGG CGGTGGTGAC CAAGCGCATC GCCACCCTGG AGATCGACGG CCAGCCGGTC GACACCGTCG GCATCCCCTG TCACTGGGGC TTTACGGGCA CCACCCGCAA GGGCTTTCTG GCCAACACCC TGACGCCCGC GGTTGGCGAC GCCAACGCGC AGACGCCGGA GTACAAGGCC TTCCTCGTGA ACATCGAAAA GGTCTGA
|
Protein sequence | MELGRRQFFK LCTAGVAGAT AATLGFAPGV ANATQPRQYK LLRAKETRNN CTYCSVGCGV LMYSLGDGAK NAKPRIFHIE GDPDHPVSRG SLCPKGAGLV DFIHSEQRLQ YPEYRAPGSD KWQRISWEEA IERIARLMKD DRDANFVEKN AAGVTVNRWL TTGMLCSSAA SNETGALDWR FTRALGILGM DCQARLCHGP TVSALAPSFG RGAMTNNWVD IKNANVVLVM GGNPAEAHPV GFKWAIEAKI RNGAKLIVVD PRFNRSAAVA DLYTPIRAGS DVTFLMGVVN YLIANDRIQH EYVRAYTNAS LLVRDDFGFD EGLFSGYDEV KKQYDRSSWA YQLDENGHAR RDPTLSHPRC VWNLLKQHVS RYTPEMVTRL CGTPKEDFLE ICRLLGDTSA PDKTATFLYA LGWTHHTSGV QIIRGAAMIQ LLLGNIGMPG GGINALRGHS NIQGYTDLGL LSVRMPGYLN LPSERQPDLA TYLAQSTPKA LLPGQVNYWQ NTEKFFVSLM KSVWGDKATR DNDWGFDWLP KWDVEYDVLR YMELMYEGKV NGYLAQGFNP IAAFPDKNKA VAALSRLKYL VVIDPLVTES SNFWQNHGEA NDVNPAEIQT EVFRLPSSCF AEENGSIANS GRWLQWHWAG AAPPAEAWHD GKILGHLFLR LRELYEREGG ANPAPLLNMA WNYSDPEDPA PEEVAREANG YALADLYDPQ GNLLAKKGEL LRDFSLLRAD GSTASFCWVF AGSWTEAGNQ MARRDNASDG QGLGSTPGWA WSWPQNRRIL YNRASVDPQG KPWDPKRKLI EWNGQRWSGI DVADFAATVA PGSEANPFIM LPEGLGRLFA VGALGDGPFP EHYEPIESPL AENPLHPKVT FNPTTRLFDN DRQRMGTAAD FPYVATTYSI TEMFRHWTKA SRLNAIVQPE QFVEIGEQLA AQKGIRAGDT VKVSSMRGFI KAKAVVTKRI ATLEIDGQPV DTVGIPCHWG FTGTTRKGFL ANTLTPAVGD ANAQTPEYKA FLVNIEKV