Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_10720 |
Symbol | |
ID | 7760016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1018121 |
End bp | 1019626 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643803976 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_002798278 |
Protein GI | 226943205 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.593646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCACT CCATCCCCAG CATCAAACTG CTGATCGACG GCCAGTTCGT CGAATCCACC ACCAGCCAAT GGCGCGAGGT GGTCGACCCG GCCACCCAGC AGGTCCTGGC CCGCGTCCCC TTCGCCAGCG AGGCGGAGCT GAACGCGGCG GTGGCCAGCG CCGCCGCGGC GTTCAAGACC TGGCGCAAGA CCTCCATCGG TACCCGCGCG CGGCTCTTCC TCAAGTACCA GCAACTGATC CGGGAGAACC TCAAGGAACT GGCGGCGATC CTCAGCGCCG AGCAGGGCAA GACCCTGGCC GACGCCGAGG GCGACGTGTT CCGCGGCCTG GAGGTGGTCG AGCACGCCGC CGGCATCGGC AACCTGCAGC TCGGCGAACT GGCCAACAAC GTCGCCGGCG GCGTCGACAC CTACACCCTG CTGCAGCCGC TCGGCGTCTG CGCCGGCATC ACCCCCTTCA ACTTTCCGGC GATGATCCCG CTGTGGATGT TCCCGATGGC CATCGCCACC GGCAACACCT TCGTCCTCAA GCCCTCCGAG CAGGACCCGA TGGTCACCAT GCGCCTGGTC GAGCTGGCCT TGGAAGCCGG CGTGCCGCCG GGAGTGCTCA ACGTGGTCCA CGGCGGCGCC GAGGTGGTCG ACCGGCTCTG CGACCACCCG GACATCAAGG CACTCTCCTT CGTCGGATCG AGTCGCGTCG GCGCCCACGT CTACCAGCGC GCCAGCCAGG CCGGCAAGCG CGTGCAGTGC ATGATGGGCG CGAAGAACCA CGCCGTCGTC CTGCCCGACG CGCACAAGGA ACAGACCCTC AACAGCCTGG CCGGTGCCGC CTTCGGCGCG GCCGGCCAGC GCTGCATGGC GATCTCGGTG GCGGTCCTGG TCGGCGCGGC CCGCGACTGG CTGCCGGAGC TGGTGGCCAA GGCCGCCACC CTCAAGGTCG GCGCCGGCAG CAAGCCCGGC ACCGACCTCG GCCCACTGAT CTCGCGCGCC GCCCTGGACC GGGTCGGCAG CCTGATCGAG CAGGGCGTGC GCGAAGGCGC GCGGCTGGAG CTGGACGGCC GCAACCCGGT CGTCGCCGGC TACGAGCAGG GCAATTTCGT CGGCCCGACC CTGTTCTCCG GCGTCACCCC TGGGATGAGC CTGTACCGCG AGGAGATCTT CGGCCCGGTG CTCTGCGTGA TGCAGGCCGA GACCCTGGAC GAGGCCATCG CCATCGTCAA CGCCAACCCC CACGGCAACG GCACCGCCCT GTTCACCCGC TCCGGCGCCG CGGCCCGGCA CTTCCAGGAG GAGATCGAGG TCGGCCAGGT CGGCATCAAC GTGCCGATCC CGGTGCCGGT GCCGATCTTC TCCTTCACCG GCTCGCGGGC CTCCAAGCTC GGCGACCTGG GGCCGTACGG CAAACAGGTG GTGCAGTTCT ACACCCAGAC CAAGACCGTC ACCCAGCGCT GGTTCGACGA GAACGAGGTC GGCGGCCCGG TCAACACCAC CATCACCCTC AAGTGA
|
Protein sequence | MTHSIPSIKL LIDGQFVEST TSQWREVVDP ATQQVLARVP FASEAELNAA VASAAAAFKT WRKTSIGTRA RLFLKYQQLI RENLKELAAI LSAEQGKTLA DAEGDVFRGL EVVEHAAGIG NLQLGELANN VAGGVDTYTL LQPLGVCAGI TPFNFPAMIP LWMFPMAIAT GNTFVLKPSE QDPMVTMRLV ELALEAGVPP GVLNVVHGGA EVVDRLCDHP DIKALSFVGS SRVGAHVYQR ASQAGKRVQC MMGAKNHAVV LPDAHKEQTL NSLAGAAFGA AGQRCMAISV AVLVGAARDW LPELVAKAAT LKVGAGSKPG TDLGPLISRA ALDRVGSLIE QGVREGARLE LDGRNPVVAG YEQGNFVGPT LFSGVTPGMS LYREEIFGPV LCVMQAETLD EAIAIVNANP HGNGTALFTR SGAAARHFQE EIEVGQVGIN VPIPVPVPIF SFTGSRASKL GDLGPYGKQV VQFYTQTKTV TQRWFDENEV GGPVNTTITL K
|
| |