Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_15120 |
Symbol | mhpF |
ID | 7760447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1488179 |
End bp | 1489114 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643804409 |
Product | acetaldehyde dehydrogenase |
Protein accession | YP_002798702 |
Protein GI | 226943629 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4569] Acetaldehyde dehydrogenase (acetylating) |
TIGRFAM ID | [TIGR03215] acetaldehyde dehydrogenase (acetylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.179269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC TCAAAGTCGC CATCGTCGGC TCCGGCAACA TCGGCACCGA CCTGATGATC AAGATCCTGC GCCACGGCCA GCACCTGGAA ATGGGCGCCC TGGTCGGCAT CGACCCGGAT TCCGACGGCC TGGCCCGCGC CGCGCGCCTC GGCGTGGCCA CCACCGCCGA GGGCGTCGAG GGCCTGGCCC GCCTGCCGGG GTTCGGCGAG ATCGATTTCG TCTTCGACGC CACCTCGGCC GCCGCCCACG TGAAGAACGA CCCGTTCCTG CGCGGCCTCA GGCCCGGCCT GCGGCTGATC GACCTGACCC CGGCGGCCGT CGGCCCCTAC TGCGTGCCGG TGGTGAATCT CGAGCAGAAC CTGCGCGAAC CCAACGTCAA CATGGTCACC TGCGGCGGCC AGGCGACCAT CCCCATGGTC GCCGCGGTGT CGCGGGTGGC CAGGGTCCAC TACGCCGAGA TCGTCGCCTC GATCGCCAGC CGGTCGGCCG GCCCCGGCAC CCGCGCCAAC ATCGACGAAT TCACCGAGAC CACCTCGAAA GCCATCGAGG CGATCGGCGG GGCGCGCAAG GGCAAGGCGA TCATCGTCCT CAACCCGGCC GAGCCGCCGC TGATCATGCG CGACACCGTC TACGTGCTCT CCGCGCCGGC CGACCAGGCC CGGGTCGAGG CCTCCCTCGC GGAAATGGCC CAGGCGGTAC AGGGCTACGT GCCGGGCTAT CGCCTCAAGC AGCGGGTGCA GTTCGACGAG ATCCCCGACG CCGCGCCGCT GAACATCCCC GGCCTCGGCC GCCTGTCCGG CCTGAAGACC TCGGTGTTCC TCGAGGTCGA GGGCGCCGCC CATTACCTGC CGGCCTACGC CGGCAACCTG GACATCATGA CCTCCGCCGC GCTGGCTACC GCCGAGCGCA TGGCGCAATC CATGCTGAAC GCCTGA
|
Protein sequence | MKKLKVAIVG SGNIGTDLMI KILRHGQHLE MGALVGIDPD SDGLARAARL GVATTAEGVE GLARLPGFGE IDFVFDATSA AAHVKNDPFL RGLRPGLRLI DLTPAAVGPY CVPVVNLEQN LREPNVNMVT CGGQATIPMV AAVSRVARVH YAEIVASIAS RSAGPGTRAN IDEFTETTSK AIEAIGGARK GKAIIVLNPA EPPLIMRDTV YVLSAPADQA RVEASLAEMA QAVQGYVPGY RLKQRVQFDE IPDAAPLNIP GLGRLSGLKT SVFLEVEGAA HYLPAYAGNL DIMTSAALAT AERMAQSMLN A
|
| |