Gene Information |
Locus tag | Rsph17029_1478 |
Symbol | aroB |
ID | 4895071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 1540363 |
End bp | 1541475 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640112067 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001043360 |
Protein GI | 126462246 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
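The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1540363, 1541475)
print(length, protein_length_aa(length))  # prints: 1113 370
```

Both values agree with the Gene Length (1113 bp) and Protein Length (370 aa) fields in the record.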
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.389848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.464781 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTCG ATGCGGTGCG GGTAGAGCTG GGCGCGCGCG CCTACGAGGT GCGGATCGGA CCGGGGCTCA TCGCGCGGGC GGGGGCCGAG ATCGCGCCGC TCCTGCGGCG GCCGAAGGTG GCGATCCTCA CCGACGAGAC GGTGGCGGGG CTGCATCTCG ACCCCTTCCG GCAGGCGCTG GCCGAGGCGG GCATCGCCTC CTCGGCGCTG GCGCTGCCCG CGGGCGAGGC CACCAAGGGC TGGCCGCAGT TTGCCCGCGC CGTCGAATGG CTGCTCGAGG AGAAGGTCGA GCGGCGCGAC GTGGTGGTGG CGCTCGGCGG CGGGGTGATC GGCGATCTGG CGGGCTTCGC GGCCGCCGTC CTGCGCCGGG GCGTGCGCTT CGTGCAGGTG CCGACGACGC TTCTGGCGCA GGTCGACAGC TCGGTCGGCG GCAAGACCGG GATCAACACC GCCCAAGGCA AGAACCTCGT CGGCGCCTTC CACCAGCCCT CGCTGGTGCT GGCCGATATT GGCGTCCTCG AGACGCTGCC GCCCCGCGAC TTCCGCGCGG GTTACGGCGA GGTGGTGAAA TACGGCCTGC TCGGCGATGC CGATTTCTAC GAATGGCTGG AGGAGGCGGG CCCTCGGCTG GCCGCCGATA CCGAGGCCCG CCAGCGTGCC GTGCGCCGCT CGGTCGAGAT GAAGGCCGAG ATCGTGGCCC GCGACGAGAC CGAGGAGGGC GACCGCGCGC TGCTGAACCT CGGCCATACC TTCTGCCACG CGCTGGAAAA GGCCACCGGC TATTCCGATC GGCTCCTCCA TGGCGAGGGC GTGGCCATCG GCTGCGCGCT GGCTTTTGAG CTGAGCCAGC GTCTCGGCCT CTGCGCCCAG GAGGCGCCGA GCCGCCTGCG CGCCCATCTG CGGGCCATGG GCATGAAGGT CGACCTGCGC GACATCCCGG GCGATCTGCC CTCCGCCGAA GCGCTGCTCG CCCTCATGGC GCAGGACAAG AAGGTGGTGG ACGGCAAGCT GCGCTTCATC CTCGCCCGCG GCATCGGACA GGCCTTCGTC GCCGATGACG TGCCGGGCGA CGTGGTTCGC ACGCTGCTTG AGGATGCCCT GGCACAGCGT TGA
|
Protein sequence | MTVDAVRVEL GARAYEVRIG PGLIARAGAE IAPLLRRPKV AILTDETVAG LHLDPFRQAL AEAGIASSAL ALPAGEATKG WPQFARAVEW LLEEKVERRD VVVALGGGVI GDLAGFAAAV LRRGVRFVQV PTTLLAQVDS SVGGKTGINT AQGKNLVGAF HQPSLVLADI GVLETLPPRD FRAGYGEVVK YGLLGDADFY EWLEEAGPRL AADTEARQRA VRRSVEMKAE IVARDETEEG DRALLNLGHT FCHALEKATG YSDRLLHGEG VAIGCALAFE LSQRLGLCAQ EAPSRLRAHL RAMGMKVDLR DIPGDLPSAE ALLALMAQDK KVVDGKLRFI LARGIGQAFV ADDVPGDVVR TLLEDALAQR
|
| |
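The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any computation such as GC content. The following is a minimal sketch under that assumption; the helper names are hypothetical, and the stop-codon set is the one defined for translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustration on the first two blocks of the gene sequence only.
first_two_blocks = clean_sequence("ATGACGGTCG ATGCGGTGCG")
print(gc_fraction(first_two_blocks))  # prints: 0.65
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field, and `ends_with_stop` confirms that the coding sequence terminates as expected.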