Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_13890 |
Symbol | mmsA |
ID | 7760326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1348188 |
End bp | 1349684 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804282 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_002798581 |
Protein GI | 226943508 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0371215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCTCA TTGGACACTT GATCGGCGGC GAGCGCCGCA ACGACGCGAC CCGCACTCAG GAGGTCTTCA ATCCCGCCAC CGGCCGGGCC GAAAAACAGG TGGCCCTGGC CGCCAAATCC ACCGTGGAAG AAGCCATCGC CGCCGCGCAG GCGGCCTTCC CCGCCTGGCG CGACACCCCG CCGATCAAGC GCACACGGAT CATGTTCCGC TTCAAGGAAC TGCTGGAGCA GAACGCCGAG CGGATCTGCC AACTGATCGG CGAGGAGCAC GGCAAGATCG TCCACGACGC CGCCGGCGAG CTGCAGCGCG GCATCGAGAA CGTCGAGTAC GCTTGCGGCG CGCCACAGTT GCTCAAGGGC GAGCACAGCC GCAGCGTCGG TCCGGGAATC GACTCCTGGA GCGAATTCCA GCCGCTCGGC GTGGTGGCCG GTATCACTCC GTTCAACTTC CCGGTGATGG TGCCGTTGTG GATGTTTCCC ATGGCCATCG TCTGCGGCAA CTGCTTCGTC CTCAAGCCTT CCGAGCGCGA TCCCGGCGCG ACCCTGTTCA TCGCCGAGCT GTTGCACGAG GCCGGCCTGC CGCCGGGCGT ACTGAACCTG GTCAACGGCG ACAAGGAGGC GGTCGACACC CTGCTGCACG ATCCGCGCGT GCAGGCGGTC AGCTTCGTCG GCTCGACGCC CATCGCCGAA TACATCTATG CGACCGCCGC GGCTAACGGC AAACGCTGCC AGGCGCTGGG CGGGGCGAAG AACCACGCCA TCCTGATGCC CGACGCCGAC CTGGACAACG CGGTCAACTC GCTGCTGGGC GCCGCCTTCG GTTCTTCCGG CGAGCGCTGC ATGGCGCTCT CGGTGGTGGT GGCGGTCGGC GACGCGGTGG CCGATGCGCT GGTCGCCCGT CTGCGGGAGG CCATGCGGGG CCTGAAGCTC GGCATCCACC ACGAGCGCGG TAACGACTTC GGCCCGCTGA TCACCCGCCA GCACAAGGAC AGGGTGGTCG GCCACATCGA CAGCGCCGAA CGCCAGGGCG CGCAGGTGGT GGTGGATGGA CGCGGCGTGC GGACTCCCGG CTGCGAGGAG GGTTTCTTCG TCGGCGCCAC CCTGCTCGAT CGGGTGAGCC CGGAGATGGA CAGCTACCGC GCGGAAATCT TCGGCCCGGT GTTGCAGGTG ATACGGGTGT CGAGCCTGGA GGAAGCCATG GCGCTGATCG ACGCCCACGA ATATGGCAAC GGCACCTGCA TCTACACCCG CGACGGCGAG GCGGCGCGCT ATTTCAGCGA CCGCATCCAG GTCGGCATGG TGGGCATCAA CGTGCCGCTG CCGGTGCCGG TGGCCTATCA CAGCTTCGGC GGCTGGAAGC GCTCGCTGTT CGGCGACCTG CACGCCTACG GCCCGGACGG GGTGCGCTTC TACACCCGGC GCAAGACCAT CACCCAGCGC TGGCCGTCGG CCAGCCTGCG TGAAAGCGCC GAGTTCTCGA TGCCGACCCT GAAGTGA
|
Protein sequence | MTLIGHLIGG ERRNDATRTQ EVFNPATGRA EKQVALAAKS TVEEAIAAAQ AAFPAWRDTP PIKRTRIMFR FKELLEQNAE RICQLIGEEH GKIVHDAAGE LQRGIENVEY ACGAPQLLKG EHSRSVGPGI DSWSEFQPLG VVAGITPFNF PVMVPLWMFP MAIVCGNCFV LKPSERDPGA TLFIAELLHE AGLPPGVLNL VNGDKEAVDT LLHDPRVQAV SFVGSTPIAE YIYATAAANG KRCQALGGAK NHAILMPDAD LDNAVNSLLG AAFGSSGERC MALSVVVAVG DAVADALVAR LREAMRGLKL GIHHERGNDF GPLITRQHKD RVVGHIDSAE RQGAQVVVDG RGVRTPGCEE GFFVGATLLD RVSPEMDSYR AEIFGPVLQV IRVSSLEEAM ALIDAHEYGN GTCIYTRDGE AARYFSDRIQ VGMVGINVPL PVPVAYHSFG GWKRSLFGDL HAYGPDGVRF YTRRKTITQR WPSASLRESA EFSMPTLK
|
| |