Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_4051 |
Symbol | aroB |
ID | 7388859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 3408814 |
End bp | 3409959 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643652766 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_002550940 |
Protein GI | 222149983 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAG ATCAGGCATC CGAACGCCCC GAATCCTCAG AGCGCCTCGT CCATGTGCCG CTTGGTGAGC GCGCCTATGA TATCCTGATC GGCGACGGCC TGATCGGACG GGCGGGCGGC GAGATTTCCA CCCGTATCAA GGGCCGCAAG GCAGCCATCA TCACCGATGA GAATGTCGGG GCGCTTTACC ACGGCGCGCT GATGGACAGC CTGGAGGCGG ACGGCTTTGA AGCCGTGTCG CTGACCCTGC CCGCCGGTGA AAAGACCAAG AGCTTTGAGC ACCTGACCAA GGTTTGCGAC GTGCTGCTGG AAGCCCGCAT CGAGCGCAAT GATGTGGTGA TCGCGCTTGG CGGCGGCGTT ATCGGCGATC TCACCGGCTT TGCGGCGGGT ATCGTTCGGC GCGGGGTGCG GTTCGTGCAG ATCCCGACCT CGCTTCTGTC GCAGGTCGAT AGTTCCGTCG GTGGCAAGAC CGGTATCAAT GCAAGGCAGG GCAAGAATCT GGTTGGCATT TTCAACCAGC CGGATCTGGT TCTGGCCGAC ACCGCAGTGC TGAATACGTT GAGCGAAAGA GAGTTTCGCG CCGGCTATGC CGAAGTGGCG AAATATGGCC TGATCGACAA GCCGGAGTTT TTTGATTGGC TGGAGCGCAA TTGGCGCGAG GTGTTTGCCG GTGGAGCGGC CCGCACGCAG GCAATTGCGC TGTCCTGCCA GGCCAAGGCC GATGTGGTCG TGGCTGATGA GCGTGAACAT GGCCGCCGAG CCCTGCTCAA TCTCGGTCAC ACCTTCGGCC ACGCGCTGGA AGCGGCCACG GGCTATGATA GCCGCCGCCT TGTGCATGGG GAAGGTGTTG CCATCGGTAT GGTGCTGGCC CATGATTTTT CGGCCCGGCT CAATCTGGCC AGCCCTGATG ATGCAAAGCG GGTCGAGCAC CATTTGAAAG AGGTTGGGCT GCCCACCCGG ATCGCTGAGA TTCCCGGCGA TATGCCGCCC GCAGAAGAGT TGATGAAGGC CATTGCCCAG GACAAGAAGG TCAAGGGCGG TCAATTGACC TTCATTCTCA CCCGCGGTAT CGGACAGTCT TTCGTCGCCG ACGATGTGCC GTCCTCGGAA GTGCTGAGTT TTCTACAAGA CAATCTGCCC GGCTGA
|
Protein sequence | MSTDQASERP ESSERLVHVP LGERAYDILI GDGLIGRAGG EISTRIKGRK AAIITDENVG ALYHGALMDS LEADGFEAVS LTLPAGEKTK SFEHLTKVCD VLLEARIERN DVVIALGGGV IGDLTGFAAG IVRRGVRFVQ IPTSLLSQVD SSVGGKTGIN ARQGKNLVGI FNQPDLVLAD TAVLNTLSER EFRAGYAEVA KYGLIDKPEF FDWLERNWRE VFAGGAARTQ AIALSCQAKA DVVVADEREH GRRALLNLGH TFGHALEAAT GYDSRRLVHG EGVAIGMVLA HDFSARLNLA SPDDAKRVEH HLKEVGLPTR IAEIPGDMPP AEELMKAIAQ DKKVKGGQLT FILTRGIGQS FVADDVPSSE VLSFLQDNLP G
|
| |