Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_3266 |
Symbol | aroB |
ID | 4040101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | - |
Start bp | 3540614 |
End bp | 3541720 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637978672 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_585407 |
Protein GI | 94312197 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.966202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACGC TTGAAGTCGA TCTCGGGACG CGTAGTTACC CGATCCACAT TGGCAGCGGC CTGCTGGACC GCGCCGACCT GCTGGCCCCG CATGTACGCG GCCAGCATGC CGTCATTGTC ACCAACGAGA CCGTGGGCCC GCTGTACGCG GCGCGCGTCG AAGCTGTGCT TGCCGGCCTT GGCAAGACCG TGCGCACCGT GGTGCTGCCG GACGGCGAGC GCTTCAAGCA CTGGGAAACG CTGAACCTGA TTTTCGATGC GCTGCTCCAG GCAGGGGCCG ACCGCAAGAC CACGCTGATC GCGCTGGGCG GCGGGGTGGT GGGAGACATG ACCGGGTTTG CCGCCGCATG CTACATGCGT GGCGTGCCGT TCGTGCAGAT GCCGACCACG TTGCTGGCGC AAGTCGATTC GTCGGTGGGC GGCAAGACCG GCATCAATCA TCCGCTCGGC AAGAACATGA TCGGCGCGTT CCACCAGCCC AACGCGGTGA TCGCCGACAT TGACACGCTG CGCACGTTGC CCCCGCGCGA ACTGGCGGCG GGCATGGCCG AGGTCATCAA GCACGGGGCA ATCGCCGACG CCGACTACTT CGCTTGGATC GAGGACCATA TCGCCGGGCT GAACGCCTGC GACACCGGCC TGATGGCCGA AGCGGTGCGA CGCTCCTGCG AGATCAAGGC GTCCGTGGTG GCCCAGGACG AACGCGAAGG CGGCTTGCGC GCGATTCTCA ATTTCGGCCA TACGTTTGGT CACGCGATCG AAGCTGGCAT GGGCTATGGC GAATGGCTGC ATGGCGAGGC CGTGGGCTGT GGCATGGTGA TGGCCGCGGA CCTCTCGCAT CGGCTCGGTT TTATCGACGT CGAGACGCGC GCCCGCATTC GCACGCTGAC GCAGGCGGCG ATGCTGCCGG TGGTGGCGCC GGATCTGGGT ATCGATCGCT ATATCGAGTT GATGAAGGTG GACAAGAAGG CCGAAGCGGG CAGCATCAAG TTCATCCTGC TGCGCAAGCT GGGTTCTGCC TTCATCACCA CCGTGCCGGA CGCGGACCTG CGCGAGACGC TGCGTCACGC GATATTGAAA CCTCCCACCG AGGCCCCGGT CGCCTGA
|
Protein sequence | MITLEVDLGT RSYPIHIGSG LLDRADLLAP HVRGQHAVIV TNETVGPLYA ARVEAVLAGL GKTVRTVVLP DGERFKHWET LNLIFDALLQ AGADRKTTLI ALGGGVVGDM TGFAAACYMR GVPFVQMPTT LLAQVDSSVG GKTGINHPLG KNMIGAFHQP NAVIADIDTL RTLPPRELAA GMAEVIKHGA IADADYFAWI EDHIAGLNAC DTGLMAEAVR RSCEIKASVV AQDEREGGLR AILNFGHTFG HAIEAGMGYG EWLHGEAVGC GMVMAADLSH RLGFIDVETR ARIRTLTQAA MLPVVAPDLG IDRYIELMKV DKKAEAGSIK FILLRKLGSA FITTVPDADL RETLRHAILK PPTEAPVA
|
| |