Gene Information |
Locus tag | BBta_7572 |
Symbol | aroB |
ID | 5155541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 7955791 |
End bp | 7956939 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640562217 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001243325 |
Protein GI | 148258740 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
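The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 7955791
END_BP = 7956939

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1149
print(protein_length(length_bp))  # 382
```

Both results agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields. The strand value does not affect this arithmetic, since only the span of the interval is used.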
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.498964 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTGCCC CCCTGAAGAC TTTCGATCCG ATCATCGTCG ACGTCGCGCT GGGTGAGCGT GCCTATGACA TCGTCATCGG CCGCGGCGTG CTGGCCTCGC TCGGTCAGCG CATCGCGGCG CTGCGTCCGG GCGTGCGGAC AGCGATCGTC ACCGATCGGA CCGTGGCCGC ACACTGGCTG AAGCCGACCG AGGCGATCCT GGCGGAGGCG GGCATTCCGT CTTCGACCAT CGTCGTCGAG GAGGGCGAGG GCTCCAAGAC CTATGCCGGC CTGGAAAAGG TCAGCGAGGC GCTGATCGCA GCGAAGATCG AGCGCAACGA TCTCGTCATC GCGCTCGGCG GCGGCGTGGT CGGCGATCTC GCCGGCTTCG CGGCGGCCAT CCTGCGCCGC GGCGTCAACT TCGTGCAGGT GCCGACCTCG CTGCTGGCAC AGGTCGATTC CTCGGTCGGC GGCAAGACCG GCATCAACTC GCCGCAGGGC AAGAACCTGC TCGGCGCCTT CCACCAGCCG GTGCTGGTGA TCGCCGACAC CGCCGTGCTC GACACGCTGT CGCCGCGCCA GTTCCGCGCC GGCTATGCCG AGGTCGCCAA ATACGGCCTG CTCGGCGACG CCGGCTTCTT CACCTGGCTG GAGGCCAACC ACGCTGACAT CGTCACGGGC GGTGCGGCAC GCGAACATGC CGTCGCCACC TCTTGTCGCG CCAAGGCGGC GATCGTCGCC CGCGACGAGC GCGAGACCGG CGACCGCGCG CTGCTCAATC TCGGCCATAC TTTCGGTCAT GCCCTGGAGG CGATCACCGG CTTCTCCGAT CGCTTGTTCC ATGGCGAGGG CGTCGCGGTC GGCATGGTGC TGGCGGCGCA GTTCTCGGCC GAGCTCGGCA TGCTGCCGCC GGACGATGTC ACGCGGATCG AGCGTCACCT TGCCGCGGTC GGCCTGCCGA CCCATTTGCA GGATATCGCC GGCTTCGCCC AGGAGGGGAT CGGCGACGCC GACCGGCTTT TGGCCTTGAT GGCGCAGGAC AAGAAGGTCA AGCGCGGCAA GCTCACCTTC ATCCTGATGG AGGCGATCGG CCGCGCGGTC ATTGCCAAGG ACGTCGATCC GGCGCGCGTC CGCGACTTCC TGCAGGCCAA ATTGCACCGT CGCGGCTGA
Protein sequence | MTAPLKTFDP IIVDVALGER AYDIVIGRGV LASLGQRIAA LRPGVRTAIV TDRTVAAHWL KPTEAILAEA GIPSSTIVVE EGEGSKTYAG LEKVSEALIA AKIERNDLVI ALGGGVVGDL AGFAAAILRR GVNFVQVPTS LLAQVDSSVG GKTGINSPQG KNLLGAFHQP VLVIADTAVL DTLSPRQFRA GYAEVAKYGL LGDAGFFTWL EANHADIVTG GAAREHAVAT SCRAKAAIVA RDERETGDRA LLNLGHTFGH ALEAITGFSD RLFHGEGVAV GMVLAAQFSA ELGMLPPDDV TRIERHLAAV GLPTHLQDIA GFAQEGIGDA DRLLALMAQD KKVKRGKLTF ILMEAIGRAV IAKDVDPARV RDFLQAKLHR RG
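The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a sketch only, with hypothetical helper names: it builds the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The sequences in this record are printed in space-separated blocks of 10, so whitespace is stripped first.

```python
# Codon table in the conventional TCAG ordering (standard assignments,
# which translation table 11 also uses for elongation codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three blocks of the gene sequence from the record above.
print(translate("ATGACTGCCC CCCTGAAGAC TTTCGATCCG"))  # MTAPLKTFDP
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.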