Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_49950 |
Symbol | |
ID | 7763847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5061798 |
End bp | 5063423 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643807827 |
Product | Aldehyde dehydrogenase |
Protein accession | YP_002802061 |
Protein GI | 226946988 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.489954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG AGAGCCGTCT GCACACCCTT TTCCCCGCCG CCGCCGATAT CCCCGAGCAA TACCGCCCGG GCGCGCCCCT CGAACAGCGC GACTACCTGG TCGACGGTGA ACTGCGCCGC TGGGACGGCC CGCTGGCCGC CGTGCGCAGT CCCATCCACC TGAAGACCGC CAAGGGCGAC GAGCAGGTCG TCCTCGGCAG CACCCCGCTG CTCGACGCCG GCGCCGCGCT AAGCGCGCTG GACGCCGCGG TCAGGGCCTA CGACAACGGC CAGGGCCTGT GGCCGAGCCT GCCGGTGGCC GGGCGCATCC AGCACGTCGA GACCTTCCTG GCGCGTATGC GCGAGCAGCG CGAGGCGGTG GTCAAACTGC TGATGTGGGA GATCGGCAAG AACCTCAAGG ACGCCGAGAA GGAATTCGAC CGCACCTGCG ACTACATCGT CGACACCATC CACGAACTCA AGGAACTCGA CCGCCGCTCC AGCCGCTTCG AGCTGGAGCA GGGCACCCTC GGCCAGATCC GCCGCGTGCC GCTGGGCGTG GCGCTGTGCA TGGGCCCCTA CAACTACCCG CTGAACGAGA CCTTCACCAC CCTGATCCCG GCGCTGATCA TGGGCAACAC CGTGGTGTTC AAGCCGGCCA AGTTCGGCGT GCTGCTGATC CGCCCGCTGC TCGAGGCGTT CCGCGACAGC TTCCCGGCCG GGGTGATCAA CGTCATCTAC GGGCGCGGCC GCGAGACCGT CAGCGCGCTG ATGGAAAGCG GCAAGGTGGA CGTGTTCGCC TTCATCGGCA CCAACAAGGG CGCCAGCGAC CTGAAGAAGC TGCACCCACG CCCGCACCGC CTGCGCGCCG CGCTCGGCTT GGACGCCAAG AACCCCGGCA TCGTGCTGCC CGAGGTGGAC CTGGACAACG CGGTCGGCGA GGCGATCACC GGCGCGCTGT CGTTCAACGG CCAGCGCTGC ACGGCGCTGA AGATTCTCTT CGTCCACGAA CAGGTGGTCG ACGCCTTCCT CGAGAAATTC AACCAGAAGC TCGCCGCGCT CAAGCCGGGC ATGCCCTGGG AGCCGGGGGT GGCGCTGACC CCGTTGCCGG AGCCGGGCAA GACCGATTTT CTCGCCACCC TGGTGGCCGA CGCCCTGGCC AAGGGGGCGA AGGTGGTCAA CCCCGGCGGC GGCGAAGTGC GCGAGACCTT CTTCTACCCG GCGCTGCTCT ACCCGGTGAG CCCGCAGATG CGCGTCTACC AGGAGGAGCA GTTCGGCCCG CTGATCCCGG TGGTGCCCTA CCGCGACCTG CAGACGGTGA TCGACTACGT GCGCGAGTCG GACTTCGGCC AGCAGTTGTC GATCTTCGGC AACGATCCGA AGCAGGTCGG CCGGCTGGTG GACGCTTTCG CCAATCAGGT CGGACGGATC AACATCAACG CCCAGTGCCA GCGCGGCCCG GATAGCTTTC CGTTCAACGG CCGCAAGAAC TCGGCGGAAG GGACCCTGTC GGTGTACGAC GCGCTGCGCG TGTTCTCGAT CCGCACCCTG GTGGCGACCA AGTTCCAGGA GGATAACAAG CGGCTGATCA GCGAGATCCT GCGCCACCGC GCGTCGAGCT TCCTGAGTAC CGACTACATC TTCTGA
|
Protein sequence | MSTESRLHTL FPAAADIPEQ YRPGAPLEQR DYLVDGELRR WDGPLAAVRS PIHLKTAKGD EQVVLGSTPL LDAGAALSAL DAAVRAYDNG QGLWPSLPVA GRIQHVETFL ARMREQREAV VKLLMWEIGK NLKDAEKEFD RTCDYIVDTI HELKELDRRS SRFELEQGTL GQIRRVPLGV ALCMGPYNYP LNETFTTLIP ALIMGNTVVF KPAKFGVLLI RPLLEAFRDS FPAGVINVIY GRGRETVSAL MESGKVDVFA FIGTNKGASD LKKLHPRPHR LRAALGLDAK NPGIVLPEVD LDNAVGEAIT GALSFNGQRC TALKILFVHE QVVDAFLEKF NQKLAALKPG MPWEPGVALT PLPEPGKTDF LATLVADALA KGAKVVNPGG GEVRETFFYP ALLYPVSPQM RVYQEEQFGP LIPVVPYRDL QTVIDYVRES DFGQQLSIFG NDPKQVGRLV DAFANQVGRI NINAQCQRGP DSFPFNGRKN SAEGTLSVYD ALRVFSIRTL VATKFQEDNK RLISEILRHR ASSFLSTDYI F
|
| |