Gene Avin_03810 details


Gene Information

Locus tag: Avin_03810
Symbol: fdhA
ID: 7759341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 361140
End bp: 364196
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 68%
IMG OID: 643803305
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_002797616
Protein GI: 226942543
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
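
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the listed gene length agrees with that reading) and that the coding sequence ends with one stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 361140
END_BP = 364196


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3057
print(protein_length(length_bp))  # 1018
```

Both results match the Gene Length and Protein Length fields listed in the record.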


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACTTG GCAGAAGACA GTTCTTCAAG CTCTGTACCG CCGGGGTGGC CGGCGCCACC 
GCCGCCACCC TGGGCTTCGC CCCCGGCGTG GCCAACGCCA CCCAACCGCG CCAGTACAAG
CTGCTGCGCG CCAAGGAGAC CCGCAACAAC TGCACCTACT GTTCGGTGGG CTGCGGGGTG
CTGATGTACA GCCTGGGCGA CGGCGCGAAG AACGCCAAGC CGCGCATCTT CCACATCGAG
GGCGACCCGG ACCATCCGGT CAGCCGCGGT TCGCTGTGCC CGAAGGGCGC CGGCCTGGTC
GACTTCATCC ACAGCGAGCA GCGTCTGCAG TATCCCGAGT ACCGCGCGCC GGGCTCGGAC
AAGTGGCAGC GGATAAGCTG GGAAGAGGCC ATCGAGCGCA TCGCCCGGCT GATGAAGGAC
GACCGCGACG CCAACTTCGT CGAGAAGAAC GCCGCCGGCG TCACCGTGAA CCGCTGGCTG
ACCACCGGCA TGCTCTGCTC CTCGGCGGCC AGCAACGAGA CCGGCGCCCT CGACTGGCGC
TTCACCCGCG CCCTCGGCAT TCTCGGCATG GACTGCCAGG CGCGGCTCTG CCACGGTCCG
ACGGTGTCCG CCCTGGCGCC CAGCTTCGGC CGCGGCGCGA TGACCAACAA CTGGGTGGAC
ATCAAGAACG CCAACGTCGT GCTGGTGATG GGCGGCAATC CCGCCGAGGC CCACCCGGTG
GGCTTCAAGT GGGCCATCGA GGCGAAGATC CGCAACGGCG CCAAGCTGAT CGTGGTCGAC
CCGCGCTTCA ACCGCAGCGC CGCGGTCGCC GACCTCTACA CGCCGATCCG CGCCGGCTCC
GACGTGACCT TCCTGATGGG CGTGGTCAAC TACCTGATCG CCAACGACAG GATCCAGCAC
GAGTACGTGC GCGCCTACAC CAACGCGAGC CTGCTGGTGC GCGACGATTT CGGCTTCGAC
GAGGGCCTGT TCAGCGGCTA CGACGAGGTG AAGAAGCAGT ACGACCGCAG TTCCTGGGCC
TATCAGCTCG ACGAGAACGG CCACGCCCGG CGCGACCCGA CCCTCAGCCA TCCGCGCTGC
GTGTGGAACC TCCTCAAGCA GCACGTCAGC CGCTACACCC CGGAAATGGT CACGCGCCTG
TGCGGCACGC CGAAGGAGGA TTTCCTGGAG ATCTGCCGGC TGCTCGGCGA CACCAGCGCG
CCGGACAAGA CCGCGACCTT CCTCTACGCT CTCGGCTGGA CCCACCACAC CAGCGGCGTG
CAGATCATCC GCGGCGCGGC GATGATCCAG CTGTTGCTCG GCAACATCGG CATGCCCGGC
GGCGGCATCA ACGCCCTGCG CGGCCACTCG AACATCCAGG GTTACACCGA CCTCGGCCTG
CTCTCGGTGC GCATGCCCGG TTACCTGAAC CTGCCCTCGG AGCGCCAGCC CGACCTGGCC
ACCTACCTCG CGCAGAGCAC GCCCAAGGCG CTGCTGCCCG GCCAGGTGAA CTACTGGCAG
AACACCGAGA AGTTCTTCGT CAGCCTGATG AAGAGCGTCT GGGGCGACAA GGCGACCAGG
GACAACGACT GGGGCTTCGA CTGGCTGCCC AAGTGGGACG TCGAGTACGA CGTGCTGCGC
TATATGGAGC TGATGTACGA GGGCAAGGTC AACGGCTACC TCGCCCAGGG CTTCAACCCG
ATCGCCGCCT TCCCGGACAA GAACAAGGCG GTCGCCGCCC TGTCCAGGCT CAAGTACCTG
GTGGTGATCG ACCCGCTGGT CACCGAAAGC TCGAACTTCT GGCAGAACCA CGGCGAAGCC
AACGACGTGA ATCCCGCCGA GATCCAGACC GAGGTGTTCC GCCTGCCGTC GTCCTGCTTC
GCCGAGGAGA ACGGCTCGAT CGCCAACTCC GGGCGCTGGC TGCAGTGGCA CTGGGCCGGG
GCGGCGCCGC CGGCCGAGGC CTGGCACGAC GGCAAGATCC TCGGCCACCT GTTCCTCCGG
CTGCGCGAGC TGTACGAGCG GGAGGGCGGC GCCAATCCCG CGCCGCTGCT CAACATGGCC
TGGAACTACA GCGATCCGGA GGACCCGGCG CCCGAGGAGG TCGCCCGGGA GGCCAACGGC
TACGCCCTGG CCGACCTCTA CGATCCGCAG GGCAACCTGC TGGCGAAAAA GGGCGAGCTG
CTGCGCGACT TCTCCCTGCT GCGCGCCGAC GGCAGCACCG CCAGCTTCTG CTGGGTCTTC
GCCGGCTCCT GGACCGAGGC CGGCAACCAG ATGGCGCGGC GCGACAACGC CAGCGACGGC
CAGGGCCTCG GCAGCACGCC GGGCTGGGCC TGGTCCTGGC CGCAGAACCG CCGCATCCTC
TACAACCGCG CCTCGGTCGA CCCGCAGGGC AAGCCCTGGG ACCCGAAACG CAAGCTGATC
GAGTGGAACG GCCAGCGCTG GAGCGGCATC GACGTCGCCG ACTTCGCCGC CACCGTCGCG
CCGGGCAGCG AGGCCAATCC GTTCATCATG CTGCCCGAGG GCCTCGGCCG GCTGTTCGCG
GTCGGCGCTC TGGGCGACGG GCCCTTCCCC GAACACTACG AGCCGATCGA GTCGCCGCTG
GCGGAAAACC CGCTGCACCC GAAGGTCACC TTCAACCCGA CCACCCGGCT GTTCGACAAC
GACCGCCAGC GCATGGGCAC CGCGGCGGAC TTCCCCTACG TCGCCACCAC CTACTCGATC
ACCGAGATGT TCCGTCACTG GACCAAGGCC TCGCGGCTCA ACGCCATCGT CCAGCCCGAG
CAGTTCGTCG AGATCGGCGA ACAGCTTGCG GCGCAGAAGG GCATCCGCGC CGGCGACACC
GTCAAGGTCA GCTCGATGCG CGGCTTCATC AAGGCCAAGG CGGTGGTGAC CAAGCGCATC
GCCACCCTGG AGATCGACGG CCAGCCGGTC GACACCGTCG GCATCCCCTG TCACTGGGGC
TTTACGGGCA CCACCCGCAA GGGCTTTCTG GCCAACACCC TGACGCCCGC GGTTGGCGAC
GCCAACGCGC AGACGCCGGA GTACAAGGCC TTCCTCGTGA ACATCGAAAA GGTCTGA
 
Protein sequence
MELGRRQFFK LCTAGVAGAT AATLGFAPGV ANATQPRQYK LLRAKETRNN CTYCSVGCGV 
LMYSLGDGAK NAKPRIFHIE GDPDHPVSRG SLCPKGAGLV DFIHSEQRLQ YPEYRAPGSD
KWQRISWEEA IERIARLMKD DRDANFVEKN AAGVTVNRWL TTGMLCSSAA SNETGALDWR
FTRALGILGM DCQARLCHGP TVSALAPSFG RGAMTNNWVD IKNANVVLVM GGNPAEAHPV
GFKWAIEAKI RNGAKLIVVD PRFNRSAAVA DLYTPIRAGS DVTFLMGVVN YLIANDRIQH
EYVRAYTNAS LLVRDDFGFD EGLFSGYDEV KKQYDRSSWA YQLDENGHAR RDPTLSHPRC
VWNLLKQHVS RYTPEMVTRL CGTPKEDFLE ICRLLGDTSA PDKTATFLYA LGWTHHTSGV
QIIRGAAMIQ LLLGNIGMPG GGINALRGHS NIQGYTDLGL LSVRMPGYLN LPSERQPDLA
TYLAQSTPKA LLPGQVNYWQ NTEKFFVSLM KSVWGDKATR DNDWGFDWLP KWDVEYDVLR
YMELMYEGKV NGYLAQGFNP IAAFPDKNKA VAALSRLKYL VVIDPLVTES SNFWQNHGEA
NDVNPAEIQT EVFRLPSSCF AEENGSIANS GRWLQWHWAG AAPPAEAWHD GKILGHLFLR
LRELYEREGG ANPAPLLNMA WNYSDPEDPA PEEVAREANG YALADLYDPQ GNLLAKKGEL
LRDFSLLRAD GSTASFCWVF AGSWTEAGNQ MARRDNASDG QGLGSTPGWA WSWPQNRRIL
YNRASVDPQG KPWDPKRKLI EWNGQRWSGI DVADFAATVA PGSEANPFIM LPEGLGRLFA
VGALGDGPFP EHYEPIESPL AENPLHPKVT FNPTTRLFDN DRQRMGTAAD FPYVATTYSI
TEMFRHWTKA SRLNAIVQPE QFVEIGEQLA AQKGIRAGDT VKVSSMRGFI KAKAVVTKRI
ATLEIDGQPV DTVGIPCHWG FTGTTRKGFL ANTLTPAVGD ANAQTPEYKA FLVNIEKV
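
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation for the nucleotide sequence. The helper names are hypothetical, and the GC calculation assumes a plain A/C/G/T alphabet with no ambiguity codes.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = (
    "ATGGAACTTG GCAGAAGACA GTTCTTCAAG "
    "CTCTGTACCG CCGGGGTGGC CGGCGCCACC"
)
print(len(clean_sequence(first_row)))  # 60
```

Passing the full gene sequence to `gc_fraction` and rounding to a percentage should give a value comparable to the GC content field in the Gene Information table.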