Gene Information |
Locus tag | Avin_15130 |
Symbol | mphE |
ID | 7760448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1489125 |
End bp | 1490144 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804410 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_002798703 |
Protein GI | 226943630 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTCA ACCCCGGCAA GAAGCTGTAC ATCTCCGACG TGACCCTGCG CGACGGCAGC CACGCGATCC GCCACCAGTA CTCGATCGCC AACGTGCAGG CCATCGCCCG CGCGCTGGAC CAGGCGAAAG TCGACTCCAT CGAGGTCGCC CACGGCGACG GCCTGCAGGG TTCGAGCTTC AACTACGGCT TCGGCGCCCA CACCGACCTG GAATGGATCG AGGCGGTGGC CGAGGTGGTG ACTCATGCCA GGATCGCCAC CCTGCTGCTG CCCGGCATCG GCACCGTCCA CCACCTCAAG GAGGCTTACG AGGCCGGCGC GCGCATCGTC CGGGTGGCCA CCCACTGCAC CGAGGCGGAC GTGTCCAGAC AGCACATCGC CTACGCGCGC GAGTTGGGCA TGGACACCGT GGGCTTCCTG ATGATGAGCC ACATGACCAC GCCGCAGAAC CTCGCCGTCG AGGCGAAGAA GATGGAAAGC TACGGCGCCA CCTGCATCTA CGTGGTCGAC TCCGGCGGGG CCTTGAGCAT GCAGGACGTG CGCGAGCGCT TCCGCGCGGT CAAGGACCTG CTGGAGCCTT CGACCCAGAC CGGCATCCAC GCCCACCACA ACCTCAGCCT CGGGGTGGCC AACTCCATCG TCGCGGTGGA GGAGGGCTGC GACCGCATCG ACGCCAGCCT GGCCGGCATG GGCGCGGGGG CGGGCAATGC GCCGCTGGAG GTGTTCGTCG CCGCGGCCGA GCGGCTGGGC TGGAACCACG GCACCGACCT CTACACCCTG ATGGACGCCG CCGACGAGAT CGTCCGGCCG TTGCAGGACC GCCCGGTACG GGTCGACCGC GAGACGCTGG CGCTGGGGTA TGCCGGGGTC TATTCGAGCT TCCTGCGCCA CGCCGAGGTG GCGGCCGAGA AGTATGGCCT GAGCACCGTG GACATCCTGG TCGAGCTGGG CCGGCGGCGG ATGGTCGGCG GCCAGGAAGA CATGATCGTC GACGTGGCGC TGGATCTGCT CGAGCGCTGA
|
Protein sequence | MTFNPGKKLY ISDVTLRDGS HAIRHQYSIA NVQAIARALD QAKVDSIEVA HGDGLQGSSF NYGFGAHTDL EWIEAVAEVV THARIATLLL PGIGTVHHLK EAYEAGARIV RVATHCTEAD VSRQHIAYAR ELGMDTVGFL MMSHMTTPQN LAVEAKKMES YGATCIYVVD SGGALSMQDV RERFRAVKDL LEPSTQTGIH AHHNLSLGVA NSIVAVEEGC DRIDASLAGM GAGAGNAPLE VFVAAAERLG WNHGTDLYTL MDAADEIVRP LQDRPVRVDR ETLALGYAGV YSSFLRHAEV AAEKYGLSTV DILVELGRRR MVGGQEDMIV DVALDLLER
|
| |
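The coordinate and length fields in the record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon, which is not translated. The function names (`span_length`, `expected_protein_length`, `gc_fraction`) are hypothetical helpers introduced for this example.

```python
# Values copied from the record above.
START_BP = 1489125
END_BP = 1490144
GENE_LENGTH_BP = 1020
PROTEIN_LENGTH_AA = 339


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Consistency checks on the record's own fields.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Passing the Gene sequence value to `gc_fraction` and rounding to a whole percent should allow a comparison with the GC content field, assuming that field was computed over the same sequence.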