Gene Information |
Locus tag | Avin_33210 |
Symbol | aroF |
ID | 7762216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3394870 |
End bp | 3395934 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643806186 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002800450 |
Protein GI | 226945377 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
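The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information table.
start_bp = 3394870
end_bp = 3395934
gene_length_bp = 1065
protein_length_aa = 354


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under these assumptions, 3395934 - 3394870 + 1 gives 1065 bp, and 1065 / 3 - 1 gives 354 aa, which agrees with the table.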
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCAT CCGTTTCCGC CCAGCTCGCC ACCGCCGAGT CCATTCCCGC CCGCCGCAGC GCCCAGCCGC TGCCCAGGCC CTCGGTGCTG CGCCAGCGCC TGCCGCTGAC TCCCGCGCTC ACCGAGCGCA TCCGGGCCGA CCGCGCCGCC ATCCGCGCGG TGCTCGACGG CCGGGACCCG CGCCTTCTGG TGGTGGTCGG CCCCTGCTCG CTGCACGACC CCGACTCCGC CCTGGATTAC ACCGCGCGCT TGGCCGAGCT GGCGCCGCAG GTCGACGACC GGTTGTTGCT GGTGATGCGC GCCTATGTCG AGAAGCCGCG CACCACCGTC GGCTGGAAGG GGCTGGTCTA CGATCCGCAC CTGGACGGCA GCGGCGACAT GGCCGAGGGC CTGCGGCTGT CGCGCCGACT GATGCTGGAC ATTCTGGAAC TGGGCCTGCC GCTGGCCAGC GAACTGCTGC AGCCGCTGGC GGCCAGCTAC TTCGACGACC TGCTGGGCTG GGCCGCCATC GGCGCGCGCA CCAGCGAGTC GCAGATCCAC CGCGAGATGG TCAGCGGCCT GGATCTGCCG GTGGGCTTCA AGAACGGCAC CGACGGCAGC CTGGGCATCG CCTGCGACGC CATGCGCTCG GCCGCCCATG CCCATCGGCA TTTCGGCATC GACGAACTGG GCCATCCGGC CCTGCTGCAG ACCCGCGGCA ACCCGGATAC CCATCTGGTG CTGCGCGGCG GCCACGGCGG ACCGAACCAC GACGCGGCCA GCGTCGCCGG CGCCCGTCAG GCCCTGGAGC GCCAGGGCAT CGCCGCGCGG ATCATGGTCG ACTGCAGCCA CGCCAACAGC GGCAAGGACC CGTTGCGCCA GCCGGCCGTG CTGGACGACG TGCTCGCGCA GCGCCTGGCC GGCGATACCA GCCTGCGCGG GGTGATGCTG GAAAGCCATC TGTTCGACGG CTGCCAGCCG CTGTCCGGCG AGCTGCGCTA CGGTGTCTCG ATCACCGACG GCTGTCTCGG CTGGAGCGCC ACCGAACGGA TGCTGCTGGA CGCCGCCCGG CGCCTGCGCG CTTGA
|
Protein sequence | MNASVSAQLA TAESIPARRS AQPLPRPSVL RQRLPLTPAL TERIRADRAA IRAVLDGRDP RLLVVVGPCS LHDPDSALDY TARLAELAPQ VDDRLLLVMR AYVEKPRTTV GWKGLVYDPH LDGSGDMAEG LRLSRRLMLD ILELGLPLAS ELLQPLAASY FDDLLGWAAI GARTSESQIH REMVSGLDLP VGFKNGTDGS LGIACDAMRS AAHAHRHFGI DELGHPALLQ TRGNPDTHLV LRGGHGGPNH DAASVAGARQ ALERQGIAAR IMVDCSHANS GKDPLRQPAV LDDVLAQRLA GDTSLRGVML ESHLFDGCQP LSGELRYGVS ITDGCLGWSA TERMLLDAAR RLRA
|
| |
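The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with them follows: it strips the spacing, computes the GC fraction, and translates the coding sequence. It assumes the gene sequence is given in coding-strand orientation (it begins with ATG even though the gene lies on the minus strand) and that translation table 11 assigns amino acids exactly as the standard code does. Alternative start codons are not handled. All function names are hypothetical.

```python
from itertools import product

# Standard amino-acid assignments in T, C, A, G codon order.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
gene_prefix = "ATGAATGCAT CCGTTTCCGC CCAGCTCGCC"
print(translate(gene_prefix))  # MNASVSAQLA
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared against the GC content and Protein sequence fields.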