Gene Information |
Locus tag | Avin_42120 |
Symbol | |
ID | 7763090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4240593 |
End bp | 4241615 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807065 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_002801314 |
Protein GI | 226946241 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0602792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTCA ACCCCGGCAA GAAGCTGTAC ATCTCCGACG TGACCCTGCG CGACGGCAGC CACGCGATCC GCCACCAGTA CTCGATCAAG AACGTGCAGG CCATCGCCCG CGCGCTGGAC CAGGCGAAAG TCGACTCCAT CGAGGTCGCC CACGGCGACG GCCTGCAGGG TTCGAGCTTC AACTACGGCT TCGGCGCCCA CACCGATCTG GAATGGATCG AGGCGGTGGC CGAGGTGGTG ACTCATGCCA GGATCGCCAC CCTGCTGCTG CCCGGCATCG GCACCGTGCA CCACCTCAAG GAAGCCTACG AGGCCGGCGC GCGCATCGTC CGGGTGGCCA CCCACTGCAC CGAGGCGGAC GTGTCCAGGC AGCACATCGC CTACGCGCGC GAGCTGGGCA TGGACACCGT GGGCTTCCTG ATGATGAGCC ACATGACCAC GCCGCAGAAC CTCGCCGTCG AGGCGAAGAA GATGGAAAGC TACGGCGCCA CCTGCATCTA CGTGGTCGAC TCCGGCGGGG CCTTGAGCAT GCAGGACGTG CGCGAGCGCT TCCGCGCGGT CAAGGACCTG CTGGAGCCTT CGACCCAGAC CGGCATCCAC GCCCACCACA ACCTCAGCCT CGGGGTGGCC AACTCCATCG TCGCGGTGGA AGAGGGCTGC GACCGCATCG ACGCCAGCCT GGCCGGCATG GGCGCCGGCG CTGGCAACGC GCCGCTGGAG GTGTTCGTCG CCGCGGCCGA ACGGCTGGGC TGGAACCACG GCACCGACCT GTACACCCTG ATGGACGCCG CCGACGAGAT CGTCCGGCCG TTGCAGGACC GCCCGGTACG GGTCGACCGC GAGACGCTGG CGCTGGGCTA TGCCGGGGTC TATTCGAGCT TCCTGCGCCA TGCCGAGGTG GCGGCCGAGA AGTATGGCCT GAGCACCGTG GACATCCTGG TCGAGCTGGG CCGGCGGCGG ATGGTCGGCG GCCAGGAAGA CATGATCGTC GACGTGGCGC TGGATCTGCT CGAGCGTTCC TGA
|
Protein sequence | MTFNPGKKLY ISDVTLRDGS HAIRHQYSIK NVQAIARALD QAKVDSIEVA HGDGLQGSSF NYGFGAHTDL EWIEAVAEVV THARIATLLL PGIGTVHHLK EAYEAGARIV RVATHCTEAD VSRQHIAYAR ELGMDTVGFL MMSHMTTPQN LAVEAKKMES YGATCIYVVD SGGALSMQDV RERFRAVKDL LEPSTQTGIH AHHNLSLGVA NSIVAVEEGC DRIDASLAGM GAGAGNAPLE VFVAAAERLG WNHGTDLYTL MDAADEIVRP LQDRPVRVDR ETLALGYAGV YSSFLRHAEV AAEKYGLSTV DILVELGRRR MVGGQEDMIV DVALDLLERS
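The coordinate and length fields in this record can be cross-checked against each other. The sketch below is not part of the source record; it assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the stop codon (the gene is listed as unspliced, and the gene sequence ends in TGA). The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split a sequence into 10-character blocks."""
    return "".join(seq.split())


if __name__ == "__main__":
    n = gene_length(4240593, 4241615)  # Start bp, End bp from the record
    print(n, protein_length(n))
```

Under those assumptions the script prints 1023 and 340, matching the Gene Length and Protein Length fields. `strip_blocks` can be applied to the Gene sequence and Protein sequence values before measuring them with `len`.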