Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3888 |
Symbol | aroB |
ID | 8014708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3957198 |
End bp | 3958328 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826458 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_002977670 |
Protein GI | 241206574 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0114459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00292677 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAATGCGA TAACCTCCGC CTCCGCCATC CGAACGGTGC ATGTGCCGCT CGGCGAGCGC GCCTACGACA TCCTGATCGG GCCGGGGCTG ATTGCGCGGG CCGGCGCCGA GATCGCCTCT CGCCTCAAGG GCCGCAAGGC GGCCGTTATC ACCGACGAAA ATGTCGCGCC GCTCTATCTC AAAGCTCTCG TCGCAAGTCT GGATGAAGCG GGTATCGCCT CGGCTGAGGT CGTCCTGCCG GCCGGCGAGA AGACCAAGAG CTTCGAACAT CTGATCACGG CCTGCGACAA GGTGCTTGAG GCGCGCGTCG AGCGTAACGA TTACGTCATC GCGCTCGGCG GCGGCGTCAT CGGCGATCTT TCGGGATTTG CGGCCGGCAT CGTCCGCCGC GGCGTGCGCT TCGTGCAGGT ACCGACCTCG CTGCTGTCGC AGGTCGATTC TTCCGTCGGC GGCAAGACCG GGATCAATTC CCGCCACGGC AAGAATCTGA TCGGCGTCTT CCACCAGCCG GACCTGGTTC TGGCCGATAC GGATGTGCTG AATTCGCTGA GCGCGCGCGA ATTCCGCGCA GGTTACGCCG AGGTCGCAAA ATACGGGCTG ATCGACAAGC CGGATTTCTT TGCCTGGCTG GAAGCGAACT GGAAGGCAGT TTTTACCGGC GGCTCCGCAC GCATCGAGGC GATTGCCGCC AGCTGCCAGG CGAAGGCCGA TGTCGTCGTT GCCGACGAGC GTGAGAACGG TCAGCGGGCG CTGCTCAATC TCGGCCATAC CTTCGGTCAT GCGCTGGAAG CGGCGACTGC CTATGACAGT TCCCGCCTTG TGCATGGCGA GGGCGTTTCG ATCGGCATGG TGCTGGCGCA TGAATTCTCC GCACGGATGA ACCTTGCAAG CCCCGACGAT GCGCGCCGCG TCGAGCGGCA TCTGAAGGAG GTCGGCCTGC CAACCCGCAT GTCCGACATT CCGGGCGAAC TGCCGCCGGC CGAAACGTTG ATGGACGCGA TCGCCCAGGA CAAGAAGGTC AAGAGCGGCA AGCTCACCTT CATCCTGACG CGCGGGATCG GCCAGTCCTT CGTCGCCGAC GACGTGCCGG CGTCCGAAGT GATCAGCTTT CTTCGGGAAA AACACGCCTG A
|
Protein sequence | MNAITSASAI RTVHVPLGER AYDILIGPGL IARAGAEIAS RLKGRKAAVI TDENVAPLYL KALVASLDEA GIASAEVVLP AGEKTKSFEH LITACDKVLE ARVERNDYVI ALGGGVIGDL SGFAAGIVRR GVRFVQVPTS LLSQVDSSVG GKTGINSRHG KNLIGVFHQP DLVLADTDVL NSLSAREFRA GYAEVAKYGL IDKPDFFAWL EANWKAVFTG GSARIEAIAA SCQAKADVVV ADERENGQRA LLNLGHTFGH ALEAATAYDS SRLVHGEGVS IGMVLAHEFS ARMNLASPDD ARRVERHLKE VGLPTRMSDI PGELPPAETL MDAIAQDKKV KSGKLTFILT RGIGQSFVAD DVPASEVISF LREKHA
|
| |