Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1307 |
Symbol | aroB |
ID | 4485462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1456217 |
End bp | 1457302 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639730087 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_873065 |
Protein GI | 117928514 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0261081 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGG TGGTCCGGGT CGGTGGCGAA GCGCCGTACG ACGTGCGCAT CGGCCGGGGC GTGCTGGCCG AGCTCGACAC GCTGGTGCCA GCCGTCGTCC GCCGGGTCGC CGTCCTCCAT CAGCCGACCG TCGCGTCCGT GGCGGACCGG ATCGCCGCCG GTCTTGCCGC CCCGAACCGT GAGGTCTCGC TCTTCCCCTT GCCGGACGGC GAGGCGGCCA AGACCGTGGC CACGGCCGCG AACCTCTGGG ACGGCCTGGC GTCCGCCGGC TTCACCCGCA CGGATCTGAT CATCGGAGTC GGCGGGGGGG CGGCGACCGA CCTCGCCGGT TTCGTTGCGG CGACCTGGCT GCGCGGGGTG GACGTCGTCC AGGTGCCCAC CACTCTCGCC GGGATGGTCG ATGCTGCGAT CGGCGGAAAG ACCGGAATCA ACCTGGACGC CGGCAAGAAC CTCGTCGGCG CCTTTCACCC GCCCCGCGCG GTGCTCTGCG ACGTCGGGCT CCTGGCCACC GTGCCGCCGG CGGACTACGC CGCCGGTCTG GCTGAAGTGA TTAAAACCGG GTTCATCGCC GATCCGGTGA TCCTCGAACT CGTCGAGGCG GACCCGGCAG CCGCCCGGAC GCCGGCCGGC CCGCACACCG AAGAGCTCAT CGTCCGGTCG GTGGCGGTGA AGGCGGCCGT GGTCGCCGCC GACCTTCGCG AGCGGATCGG CACCCAACTC GGCAGGGAAG TGCTGAACTA CGGGCACACC CTTGGCCATG CGATCGAGCG CCGGGAGCAG TACCGGTGGC GGCACGGCGA CGCGGTAGCG GTCGGCCTGG TGTTCGCCGC TGCGCTGGCC CGTCATGCTG GGCTGCTGGA CGACGCCACC GCCGAGCGGC ACCGGCGCAT CCTGTCCGCC GTCGGTCTGC CGACCCGGTA TCCGAAGGAA GCCTGGCCGG AACTCCGCGC TGCGATGGCG ATCGACAAGA AGGCCCGGGG GAGCACGCTG CGGTTCGTCG TCCTGGAAGC GATCGGCCGG CCGCGCATCC TTGCCGATCC CGATCCGGCC GTCCTGGAGG CCGCCTACGC GGAGGTAGCC GAATGA
|
Protein sequence | MTQVVRVGGE APYDVRIGRG VLAELDTLVP AVVRRVAVLH QPTVASVADR IAAGLAAPNR EVSLFPLPDG EAAKTVATAA NLWDGLASAG FTRTDLIIGV GGGAATDLAG FVAATWLRGV DVVQVPTTLA GMVDAAIGGK TGINLDAGKN LVGAFHPPRA VLCDVGLLAT VPPADYAAGL AEVIKTGFIA DPVILELVEA DPAAARTPAG PHTEELIVRS VAVKAAVVAA DLRERIGTQL GREVLNYGHT LGHAIERREQ YRWRHGDAVA VGLVFAAALA RHAGLLDDAT AERHRRILSA VGLPTRYPKE AWPELRAAMA IDKKARGSTL RFVVLEAIGR PRILADPDPA VLEAAYAEVA E
|
| |