Gene Information |
Locus tag | Avin_45240 |
Symbol | aroB |
ID | 7763392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4589710 |
End bp | 4590813 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807372 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_002801613 |
Protein GI | 226946540 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
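The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 4589710, 4590813

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1104, matching "Gene Length"
print(protein_length(length_bp))  # 367, matching "Protein Length"
```

Because the gene is reported as unspliced and is not a pseudogene, the simple "codons minus one stop" rule is enough here; a spliced or frameshifted feature would need a different calculation.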
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACGT TACAGGTTGA TCTCGGCGAG CGCAGCTATC CCATCTATAT TGGAGCCGGC CTGCTGGACC GGGCGGAATG CATGGTTCCG CACCTGGCCG GTCGCCAAGT GGCGGTGGTC ACCAACGAGA CCGTCGCGCC GCTCTATCTG GAGCGGTTGT CCCAAACCCT GGCCGGCCAT GAGCTGACGC CCATCGTGCT GCCGGACGGC GAGGCCTTCA AGCACTGGGA AACCCTGCAG AAGATTTTCG ACGGCCTGCT CGAGGCCCGG CACGACCGGC GCACCACCCT CATCGCCCTT GGCGGGGGCG TCGTCGGCGA CATGGCGGGG TTTGCCGCCG CCTGCTACCA GCGCGGGGTC GACTTCATCC AGATTCCGAC CACCTTGCTG TCCCAGGTGG ACTCCTCGGT CGGCGGCAAG ACCGGCATCA ATCACCCGCT GGGCAAGAAC ATGATCGGCG CTTTCTATCA GCCGAAGGCC GTGGTGATCG ATACCGCCAC GCTGGCGACC CTGCCGGAGC GGGAACTGTC CGCCGGCCTG GCCGAGGTGA TCAAGTACGG CCTGATCTGC GACGAGCCCT TCCTCGGCTG GCTGGAAACC CACATGGCGG CGCTTCGCGC CCTGGATGCG GCGGCCCTGA CCGAGGCCAT CGCCCGCTCC TGCGCGGCCA AGGCGCGGGT GGTCGGTGTC GACGAACGCG AGTCGGGCGT GCGCGCCACC CTGAACCTGG GGCACACCTT CGGTCATGCC ATCGAAACCC ACATGGGCTA TGGTGCATGG CTGCACGGCG AGGCGGTGGC CGCGGGGTCC GTCATGGCCC TGGAGATGTC CCGGCGCCTC GGCTGGCTCG GCGAGGCCGA GCGCGATCGC GGCATCCGCC TGCTGCAGCG CGCGGGACTG CCCGTGGTGC CGCCGGCGCA GATGTCGGCG GACGACTTCC TCGGGCACAT GGCCGTCGAC AAGAAGGTTC TGGGCGGCCG CCTGCGGCTG GTGCTGCTCA GGCGCCTGGG CGAGGCGGTG GTGACTGGAG ACTTCCCGCA CGAGGCCTTG CAGGCCACCC TGGATACCGA TTACCGGACT CTGATGGAAC AGATCAGTCA CTGA
|
Protein sequence | MQTLQVDLGE RSYPIYIGAG LLDRAECMVP HLAGRQVAVV TNETVAPLYL ERLSQTLAGH ELTPIVLPDG EAFKHWETLQ KIFDGLLEAR HDRRTTLIAL GGGVVGDMAG FAAACYQRGV DFIQIPTTLL SQVDSSVGGK TGINHPLGKN MIGAFYQPKA VVIDTATLAT LPERELSAGL AEVIKYGLIC DEPFLGWLET HMAALRALDA AALTEAIARS CAAKARVVGV DERESGVRAT LNLGHTFGHA IETHMGYGAW LHGEAVAAGS VMALEMSRRL GWLGEAERDR GIRLLQRAGL PVVPPAQMSA DDFLGHMAVD KKVLGGRLRL VLLRRLGEAV VTGDFPHEAL QATLDTDYRT LMEQISH
|
| |
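The sequence fields are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC fraction calculation, assuming plain A/C/G/T input with no ambiguity codes. The helpers `normalize` and `gc_fraction` are hypothetical names introduced for illustration, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def normalize(seq_field: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-normalized nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the "Gene sequence" field, used as a short demo.
excerpt = "ATGCAGACGT TACAGGTTGA"

joined = normalize(excerpt)
print(joined)               # ATGCAGACGTTACAGGTTGA
print(len(joined))          # 20
print(gc_fraction(joined))  # 0.45 for this excerpt only
```

Applying the same two functions to the complete "Gene sequence" field would give a value to compare against the reported "GC content"; the 0.45 shown here describes the 20-base excerpt, not the whole gene.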