Gene Information |
Locus tag | TBFG_12558 |
Symbol | aroB |
ID | 5223240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 2874906 |
End bp | 2875994 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640607320 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001288487 |
Protein GI | 148823733 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
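The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed gene length), and that the protein length excludes the stop codon. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 2874906
END_BP = 2875994


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1089 362
```

Under these assumptions the result agrees with the Gene Length (1089 bp) and Protein Length (362 aa) fields.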
Plasmid Coverage information |
Num covering plasmid clones | 229 |
Plasmid unclonability p-value | 3.54876e-10 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 194 |
Fosmid unclonability p-value | 0.184174 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATA TCGGCGCACC CGTGACCGTG CAGGTGGCCG TCGATCCGCC ATACCCGGTG GTCATCGGTA CCGGCCTGCT CGACGAGCTG GAAGACCTGC TGGCCGACCG GCACAAGGTC GCCGTCGTGC ATCAGCCCGG ACTAGCCGAG ACCGCGGAAG AGATCCGAAA GCGCTTGGCC GGCAAGGGCG TCGACGCGCA CCGCATCGAG ATCCCCGACG CCGAGGCCGG CAAGGACCTG CCCGTCGTGG GATTCATCTG GGAGGTGTTG GGCCGCATCG GAATCGGCCG CAAAGACGCC CTGGTCAGCC TCGGCGGCGG GGCCGCCACC GACGTCGCCG GGTTCGCGGC GGCCACCTGG CTGCGCGGCG TCTCGATTGT GCACCTGCCC ACCACACTGC TGGGCATGGT CGATGCGGCC GTCGGCGGCA AGACCGGCAT CAACACCGAC GCCGGCAAGA ACCTGGTCGG GGCGTTTCAT CAGCCGTTGG CGGTCCTGGT GGACCTGGCG ACGCTGCAAA CCTTGCCACG CGACGAAATG ATCTGCGGCA TGGCCGAAGT GGTCAAGGCC GGCTTCATCG CCGACCCGGT GATCCTGGAT CTCATCGAAG CTGACCCGCA GGCCGCACTC GACCCGGCCG GCGACGTGCT GCCCGAGCTG ATCCGGCGCG CGATCACCGT CAAGGCCGAG GTGGTCGCCG CCGACGAAAA GGAATCCGAG CTGCGCGAAA TCCTCAACTA CGGCCACACA TTAGGCCACG CGATCGAGCG CCGGGAACGC TACCGGTGGC GCCACGGCGC CGCCGTGTCG GTGGGGCTGG TGTTCGCGGC CGAGCTGGCC AGGCTTGCCG GGCGGCTCGA CGACGCGACC GCGCAGCGCC ACCGCACCAT CCTGTCCTCG TTGGGATTGC CGGTCAGCTA CGACCCGGAC GCGCTGCCCC AGCTGCTGGA AATCATGGCC GGCGACAAGA AGACTCGGGC GGGTGTGTTG CGGTTCGTGG TGCTCGACGG ATTGGCCAAG CCGGGCCGAA TGGTGGGACC GGACCCCGGT CTGCTGGTAA CCGCCTACGC CGGAGTTTGC GCCCCATGA
|
Protein sequence | MTDIGAPVTV QVAVDPPYPV VIGTGLLDEL EDLLADRHKV AVVHQPGLAE TAEEIRKRLA GKGVDAHRIE IPDAEAGKDL PVVGFIWEVL GRIGIGRKDA LVSLGGGAAT DVAGFAAATW LRGVSIVHLP TTLLGMVDAA VGGKTGINTD AGKNLVGAFH QPLAVLVDLA TLQTLPRDEM ICGMAEVVKA GFIADPVILD LIEADPQAAL DPAGDVLPEL IRRAITVKAE VVAADEKESE LREILNYGHT LGHAIERRER YRWRHGAAVS VGLVFAAELA RLAGRLDDAT AQRHRTILSS LGLPVSYDPD ALPQLLEIMA GDKKTRAGVL RFVVLDGLAK PGRMVGPDPG LLVTAYAGVC AP
|
| |
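The gene sequence above is shown in coding orientation (it starts with ATG and ends with the stop codon TGA), so it can be translated directly to compare against the protein sequence. The following is a minimal sketch using only the Python standard library. It assumes the codon-to-amino-acid assignments of translation table 11 (which are the same as those of the standard genetic code) and treats the spaces in the displayed sequence as formatting only. The helper names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGACCGATA TCGGCGCACC CGTGACCGTG"))  # prints: MTDIGAPVTV
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields. The last nine bases of the record, `GCCCCATGA`, translate to `AP`, matching the end of the listed protein.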