Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_4018 |
Symbol | aroG |
ID | 5151767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 4222286 |
End bp | 4223371 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640558849 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001239990 |
Protein GI | 148255405 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGAGCA CCACAGACGA TCTTCGTATC AAGGAAATCA AGGAATTGAG CCTGCCGGTG GAGGTGATGG GTGAGATCCC GCGCACCCTC ACCGCCACCC GCGTCGTGAT GGCCGCGCGC AACGCGATCC ACGCCATCCT GAACGGCACT GACGATCGCC TGCTGGTCAT CGTCGGCCCC TGCTCCATTC ACGATCCGAA GGCCGCGATC GATTATGCCG GCCGCCTCGC CGCCTTGCGC GAGCGTCTCA ATGACCGGCT GGAAATCGTG ATGCGGGTCT ATTTCGAGAA GCCGCGCACC ACGGTGGGCT GGAAGGGCCT GATCAACGAC CCCGATCTCA ACAATTCCTT CGACATCAAC AAGGGCCTGC GGCTGGCGCG CAACGTGCTG TCCGCCGTCA ACAATCTCGG CCTGCCCGCG GGATGCGAAT TCCTCGACAT GACAACGCCG CAATACATTG CCGACCTCGT CGCCTGGGCC GCGATCGGCG CGCGCACCAC CGAGAGCCAG ATCCATCGCG AGCTGGCCTC CGGCCTGTCC TGCCCGGTCG GCTTCAAGAA CGGCACCGAC GGCAACATCC GCATCGCCGC CGACGCGGTG AAATCAGCCG CCCATCCGCA TCATTTCATG GCGGTCACCA AAGGCGGGCG CTCCGGCATC GCGGCCACCA CCGGCAATGA AGACTGCCAC ATCATCCTGC GCGGCGGCAG CGGCCCGAAC TACGACGCGG CCCATGTCGA GCAGGCCGCG AGCGAGCTGG TCAAGGCCGG CCTGCCGGCG CGGCTGATGA TCGACACCAG CCACGCCAAT TCCAGCAAGA AGCCGGAGAA CCAGCCGCTC GTCGCCGCAG ATATCGCCGG CCAGCTCGCG GCCGGCGAGC AGCGCATCAT GGGCGTGATG ATCGAGAGCC ATCTCGTCGC CGGTCGCCAG GACGTCAAGC CGGGCGTGCC GCTGACCTAT GGCCAGAGCA TCACCGACGG CTGTATCGGT TGGGACACGA CGGTCGCGGT GCTGGAGCAA CTCGCCGACG CCGTCACCAC GCGGCGGACA CGGCGCAACG AGTCCGTGCG CGAGCGCTCG GCGTAA
|
Protein sequence | MLSTTDDLRI KEIKELSLPV EVMGEIPRTL TATRVVMAAR NAIHAILNGT DDRLLVIVGP CSIHDPKAAI DYAGRLAALR ERLNDRLEIV MRVYFEKPRT TVGWKGLIND PDLNNSFDIN KGLRLARNVL SAVNNLGLPA GCEFLDMTTP QYIADLVAWA AIGARTTESQ IHRELASGLS CPVGFKNGTD GNIRIAADAV KSAAHPHHFM AVTKGGRSGI AATTGNEDCH IILRGGSGPN YDAAHVEQAA SELVKAGLPA RLMIDTSHAN SSKKPENQPL VAADIAGQLA AGEQRIMGVM IESHLVAGRQ DVKPGVPLTY GQSITDGCIG WDTTVAVLEQ LADAVTTRRT RRNESVRERS A
|
| |