Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_23130 |
Symbol | aroG |
ID | 7761229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2307362 |
End bp | 2308438 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643805195 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002799476 |
Protein GI | 226944403 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.118954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATT TACCGATCAA TGACATCAAC GTTGCCTCCA ACGACCCCCT GATCACCCCA GAGCAGCTCA AGGCAGAGAT CCCCCTGAGC GCTGCCGCCA TGAACACCGT CTCCCGCGGC CGCGAAGTCA TCCGCAATAT CCTCGACGGC AAGGATCATC GCCTGTTCCT GGTGGTCGGC CCGTGCTCCA TCCACGATGT CAAGGCGGCA CACGAATACG CCGAACGGCT CAAGAAGCTC GCAGCCGAAG TTTCCGACAC CCTGTTTCTG GTGATGCGTG TCTATTTCGA GAAACCCCGC ACCACCGTCG GTTGGAAGGG CCTCATCAAC GATCCCTATC TGGACGACAC CTTCAAGATC CAGGAAGGGC TGCATATCGC CCGCCAATTG CTGCTCGACA TCGCCGAAAC GGGCTTGCCG AGCGCTGGCG AAGCCCTGGA CCCGATTTCC CCACAGTATC TGCAGGACCT GTTCAGTTGG TCGGCCATCG GTGCCCGCAC CACGGAATCC CAGACACACC GCGAGTTGGC CTCCGGCCTG TCTTCTGCCG TCGGTTTCAA GAACGGCACG GACGGCAGCC TGACCGTGGC GATCAACGCG CTGCAATCGG TATCCAGGCC CCACCGTTTC CTGGGCATCA ACCAGCAGGG CAGCGTTTCC ATCGTGACGA CCAAGGGCAA TACCTATGGG CACGTCGTTC TGCGCGGCGG CAATGGCAAA CCCAACTACG ACTCGGTCAA CGTCGCCATC TGCGAGCAGG AGCTGCGCAA GGCCGGTATC CTGCCGAATA TCATGGTGGA CTGCAGCCAC GCCAATTCGA ACAAGGATCC GGCCCTGCAA CCCCTGGTGA TGACCAACGT CGCCAACCAG ATTCTCGAAG GCAATTCATC CATCATAGGT CTGATGGTGG AGAGCAACCT GGGCTGGGGC AGCCAGTCGA TTCCCGACAA TCTGGACGAC CTCAAGTACG GAGTCTCCGT CACCGACGCC TGCATCGACT GGGACACCAC AGCCACGGCG ATACGCGACA TGCACGCCAA ACTCAAGGAT ATCCTGCCGA ACCGGAAACG CTCCTGA
|
Protein sequence | MADLPINDIN VASNDPLITP EQLKAEIPLS AAAMNTVSRG REVIRNILDG KDHRLFLVVG PCSIHDVKAA HEYAERLKKL AAEVSDTLFL VMRVYFEKPR TTVGWKGLIN DPYLDDTFKI QEGLHIARQL LLDIAETGLP SAGEALDPIS PQYLQDLFSW SAIGARTTES QTHRELASGL SSAVGFKNGT DGSLTVAINA LQSVSRPHRF LGINQQGSVS IVTTKGNTYG HVVLRGGNGK PNYDSVNVAI CEQELRKAGI LPNIMVDCSH ANSNKDPALQ PLVMTNVANQ ILEGNSSIIG LMVESNLGWG SQSIPDNLDD LKYGVSVTDA CIDWDTTATA IRDMHAKLKD ILPNRKRS
|
| |