Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0536 |
Symbol | aroB |
ID | 3909575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 601413 |
End bp | 602561 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637882424 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_484158 |
Protein GI | 86747662 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCGC CGCTGAACCA TTCCGCTCCG ATCAAGGTCG AGGTCGCGCT CGGCGATCGC GCCTACGACA TCGTGATCGG CCGCAACGTG CTCGGCACGC TCGGCGAGCG GATCGCCAAG CTGCGGCCCG GCGCGCGCAC CGCGATCGTC ACCGATCGCA CCGTGGCGCG GACCTGGCTG GCGCCGACCG AGGCGGCGCT GGATGCCGCC GGCATCGCGC ATGCGCGCGT CGTGGTCGGC GAGGGCGAAA GCTCCAAGAC CTATGCGGGG CTGGCGGAAG TCAGCGAGGC GCTGATCGCC GCCAAGATCG AACGCAACGA TCTGGTGATC GCGCTCGGCG GCGGCGTGGT CGGCGATCTC GCCGGCTTCG CGGCGTCGAT CCTGCGCCGC GGCGTCGATT TCGTGCAGGT GCCGACCTCG CTGCTGGCAC AGGTTGATTC GTCGGTCGGC GGCAAGACCG GCATCAACTC GCCGCAGGGC AAGAACCTGC TCGGCGCGTT CCATCAGCCC GTATTGGTGA TCGCCGACAC CGCGGTGCTC GACACGCTGT CGCCGCGCCA GTTCCGTGCC GGCTATGCCG AAGTGGCGAA ATACGGCGCG CTCGGCGACG AGGCGTTCTT CGCCTGGCTC GAAGCCAATC ACGCCGAGAT CGTCTCAGGC GGGCCGGCGC GCGAGCACGC CATCGCCACG TCGTGCCGGG CGAAGGCGGC GATCGTGGCG CGCGACGAGC GCGAAAACGG CGAGCGCGCG CTGCTCAATC TCGGCCACAC GTTCGGCCAT GCGCTGGAGG CCGCGACCGG CTTCTCCGAC CGGCTGTTTC ACGGCGAGGG CGTGGCGATC GGCATGGTGC TGGCGGCGCG GTTCTCCGCC GAGCGCGGCA TGATGCCGGA GGCCGACGCC ATCCGGCTGC AGCGCCATCT CGCCGATGTC GGCCTGCCGA CCCGGCTGCA GGACATCGCC GGCTTCGCCC AGGAAGGCCT CGCCGACGCC GACGCGCTGT TGGCGCTGAT GACTCAGGAC AAGAAGGTCA AACGCGGCCA GCTCACCTTC ATCCTGATGG AAGGGATCGG CCGCGCGGTG ATCGCCGACA AGGTCGAGCC GGCGCCGGTT CGCGATTTCC TGGCCCGGCA GCTCGCGCGC GCATCGTGA
|
Protein sequence | MTAPLNHSAP IKVEVALGDR AYDIVIGRNV LGTLGERIAK LRPGARTAIV TDRTVARTWL APTEAALDAA GIAHARVVVG EGESSKTYAG LAEVSEALIA AKIERNDLVI ALGGGVVGDL AGFAASILRR GVDFVQVPTS LLAQVDSSVG GKTGINSPQG KNLLGAFHQP VLVIADTAVL DTLSPRQFRA GYAEVAKYGA LGDEAFFAWL EANHAEIVSG GPAREHAIAT SCRAKAAIVA RDERENGERA LLNLGHTFGH ALEAATGFSD RLFHGEGVAI GMVLAARFSA ERGMMPEADA IRLQRHLADV GLPTRLQDIA GFAQEGLADA DALLALMTQD KKVKRGQLTF ILMEGIGRAV IADKVEPAPV RDFLARQLAR AS
|
| |