Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2564 |
Symbol | aroB |
ID | 5323432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2661507 |
End bp | 2662649 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640791507 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001328229 |
Protein GI | 150397762 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.443147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCCA TGACCAGCGA TGTGATGCCA GCCGCCGAGC GTAAGGTCCG CGTCGATCTT GCGGAGCGTT CCTACGATAT CCTCATCGGC CCCGGCCTGA TCGCCGCCGC GGGCGGGGAG ATCGCCTCGC GGCTCAAGGG CCGGAAGATG GCGGTCATCA CCGACGAGAA TGTTGCGCCG CGCTATCTCG AACCGTTGAT GGCGAGCCTT GCGGGAAGCG GCATGGATCC GGTCTCCCTG ATTCTGCCGG CGGGTGAGAA GACCAAGAGC TTCGAGCACC TGATCCCGGT TTGCGAAGCG GTCCTGGGTG CCAGAATAGA GCGTAACGAC GCGGTGATCG CGCTCGGTGG CGGCGTGATC GGCGATCTCA CCGGCTTCGC CGCTGGTATC GTCCGCCGCG GCTCTCGCTT CATCCAGATC CCGACATCGC TGCTGGCGCA GGTCGATTCC TCCGTCGGCG GCAAGACCGG CATCAATTCG CCGCACGGCA AAAACCTGAT CGGCGTTTTC CACCAGCCGG ACCTCGTTCT CGCCGACACC GCCGCGCTCG ACACGCTCAG CCCGCGCGAA TTCCGTGCAG GCTATGCCGA GGTCGTGAAA TACGGCCTGA TCGACAAGCC GGATTTCTTC GAATGGCTTG AGCAGAACTG GCAGGCCGTG TTTGCCGGCG GACCCGCCCG GATCGAGGCG ATCGCCGTCA GCTGCCAGGC GAAAGCCGAC GTCGTCGCCG CGGATGAGCG CGAAAACGGG CTGCGTGCGC TGCTCAATCT CGGCCACACC TTCGGTCATG CCCTGGAAGC CGCCACCGGA TACGACAGCA AACGGCTGGT GCACGGGGAG GGCGTTGCGA TCGGTATGGT TCTGGCGCAC GAGTTCTCGG CCCGGATGAA CATTGCGAGT CCCGACGATG CGCGGCGCGT GGAAATGCAT CTGAAGACGG TCGGCCTCCC GACACGCATG GCCGACATTC CCGGTATGTT GCCGCCCGCC GATCGACTGC TGGAGGCGAT CGCCCAGGAC AAGAAGGTAA AGGGAGGCAA GTTCACCTTC ATTCTCACCA GGGGCATCGG ACAGTCCTTC ATCGCGGATG ACGTGCCCTC CTCGGAGGTC CTAAGCTTTC TTGAGGAAAG GATCCCGCGA TGA
|
Protein sequence | MKPMTSDVMP AAERKVRVDL AERSYDILIG PGLIAAAGGE IASRLKGRKM AVITDENVAP RYLEPLMASL AGSGMDPVSL ILPAGEKTKS FEHLIPVCEA VLGARIERND AVIALGGGVI GDLTGFAAGI VRRGSRFIQI PTSLLAQVDS SVGGKTGINS PHGKNLIGVF HQPDLVLADT AALDTLSPRE FRAGYAEVVK YGLIDKPDFF EWLEQNWQAV FAGGPARIEA IAVSCQAKAD VVAADERENG LRALLNLGHT FGHALEAATG YDSKRLVHGE GVAIGMVLAH EFSARMNIAS PDDARRVEMH LKTVGLPTRM ADIPGMLPPA DRLLEAIAQD KKVKGGKFTF ILTRGIGQSF IADDVPSSEV LSFLEERIPR
|
| |