Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3595 |
Symbol | aroB |
ID | 6982356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3720779 |
End bp | 3721909 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398320 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_002283088 |
Protein GI | 209551171 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCGA TACCCTCCGC CTCATCAGTC CAGACGGTGC ACGTGCCGCT CGGCGAGCGC GCCTACGATA TCCTGATCGG GCCGGGGCTG ATCGCGCGGG CCGGCGCCGA AATCGCCTCC CGCCTCAAGG GCCGCAAGGC GGCTGTCGTC ACCGATGAAA ATGTCGCGCC GCTCTATCTC CAGGCGCTCG TCGCAAGTCT CGATGAAGCG GGCATCGCCT CGGCCGCGGT CGTCCTGCCG GCCGGTGAGA AGACCAAGAG CTTCGAGCAT CTGATGACCG CCTGCGACAA GGTGCTCGAA GCCCGCGTCG AGCGTAACGA TTGCGTCATC GCGCTCGGCG GCGGCGTTAT CGGCGACCTC TCGGGATTTG CGGCCGGCAT CGTGCGGCGC GGCGTGCGCT TCGTGCAGGT GCCGACCTCG CTGCTGGCGC AGGTCGATTC CTCCGTCGGC GGCAAGACCG GCATCAATTC CCGCCACGGC AAGAACCTGA TCGGCGTCTT CCATCAGCCG GACCTGGTCC TGGCCGATAC CGATGTGCTG AATACGCTAA GCGAGCGCGA ATTCCGCGCC GGTTACGCCG AGGTCGCCAA ATACGGGCTG ATCGACAAGC CGGATTTTTT CGCTTGGCTG GAAGCCAACT GGAAGGCGGT TTTCACAGGC GGCGCCGCCC GCATCGAGGC GATTGCCGCC AGCTGCCAGG CGAAGGCCGA TGTCGTCGTT GCCGACGAGC GCGAGAACGG TCCGCGGGCG CTGCTCAACC TCGGCCATAC GTTCGGCCAT GCGCTTGAAA CGGCGACAGC CTATGACAGC TCCCGTCTCG TGCATGGCGA GGGCGTTTCG ATCGGCATGG TGCTGGCGCA CGAATTCTCT GCGCGGATGA ACCTTGCAAG CCCCGATGAT GCGCGGCGCG TCGAGCGGCA TCTGCAGGAG GTCGGCCTTC CGACCCGGAT GTCCGACATT CCGGGCGCGC TGCCGCCGGC CGAAACGCTG ATGGATGCGA TCGCCCAGGA CAAGAAGGTC AAGAGCGGCA AGCTCACCTT CATCCTGACG CGCGGCATCG GTCAGTCCTT CGTCGCCGAC GACGTTCCTG CCTCCGAGGT GATCAGCTTT CTCAGGGAAA AACACCCCTA A
|
Protein sequence | MNAIPSASSV QTVHVPLGER AYDILIGPGL IARAGAEIAS RLKGRKAAVV TDENVAPLYL QALVASLDEA GIASAAVVLP AGEKTKSFEH LMTACDKVLE ARVERNDCVI ALGGGVIGDL SGFAAGIVRR GVRFVQVPTS LLAQVDSSVG GKTGINSRHG KNLIGVFHQP DLVLADTDVL NTLSEREFRA GYAEVAKYGL IDKPDFFAWL EANWKAVFTG GAARIEAIAA SCQAKADVVV ADERENGPRA LLNLGHTFGH ALETATAYDS SRLVHGEGVS IGMVLAHEFS ARMNLASPDD ARRVERHLQE VGLPTRMSDI PGALPPAETL MDAIAQDKKV KSGKLTFILT RGIGQSFVAD DVPASEVISF LREKHP
|
| |