Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2009 |
Symbol | aroB |
ID | 7293470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 2265615 |
End bp | 2266706 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643590413 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_002488072 |
Protein GI | 220912763 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000000160945 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACACCT CATCAACAGT CATCAAGGTC ACCGGTGAAT CCGCCGCCAA CAACTACGAC GTTGTGGTGG GGCGGGGCCT GCTGGGAACC CTTCCGGAAA TCCTGGGGGA GCGGGTGCGG CGCGTGCTGG TCATCCACCC CCGGGCCCTG CGCCTCACCG GTGACACCGT CCGCGATGAC CTTGAGTCCG CGGGCTTCAC TGCGCTGACC GCGGAAATCC CGGACGCCGA AGAAGGCAAG CACATCCAGG TCGCTGCCTT CTGCTGGCAG GTCCTGGGCC AGAACGACTT CACCAGGTCT GACGCCATCG TGGCTGTCGG CGGGGGAGCG GTCACCGACC TGGCGGGCTT CGTGGCCGCC ACCTGGCTCC GCGGCGTCAA GGTCATCCAC ATGCCCACCA GCCTGCTGGG GATGGTGGAT GCGTCCGTGG GCGGCAAGAC CGGCATCAAC ACCGCCGAGG GCAAGAACCT GGTGGGCGCC TTCCACCCTC CGGCGGCGGT CCTGGCGGAC CTGGACACCC TGGACACGCT GCCCCGGAAC GAACTCATTT CCGGTATGGC CGAAGTGGTC AAGTGCGGCT TCATCGCGGA CCCTGCCATC CTCGAACTGG TGGAGAAGGA CTTTGCTGCG GTCACCGATC CGCGGTCCGA GACCCTCCGC GAGCTCATTG AACGTGCCAT CGCCGTCAAA GCCAAAGTGG TTTCGGAAGA CCTCAAGGAA TCCGGGCTGC GCGAAATCCT CAACTACGGC CACACCCTGG GCCACGCCAT CGAACTGGTG GAACGCTACT CGTGGCGGCA CGGCGCCGCA GTCTCCGTCG GCATGATGTT CGCCGCGGAA CTCGCCCGCA GCGTGGGCCG GCTGAGCGAT GCCGACGCCG ACAGGCACCG AAGCATCCTC GAAGGACTTG GGCTCCCGGT CACCTACCGG CGGGACCGAT GGCAGGGCCT GCTGGACGGC ATGCGGCGGG ACAAGAAGTC CCGCGGCGAC CTGCTGCGGT TCGTGGTGCT GGACGGTGTG GCCAAACCGG GCATCCTGGA TGTCCCCGAC ACGTCCCTCC TGTTCGCCGC CTACCAGGAA GTCGCTTCCT GA
|
Protein sequence | MNTSSTVIKV TGESAANNYD VVVGRGLLGT LPEILGERVR RVLVIHPRAL RLTGDTVRDD LESAGFTALT AEIPDAEEGK HIQVAAFCWQ VLGQNDFTRS DAIVAVGGGA VTDLAGFVAA TWLRGVKVIH MPTSLLGMVD ASVGGKTGIN TAEGKNLVGA FHPPAAVLAD LDTLDTLPRN ELISGMAEVV KCGFIADPAI LELVEKDFAA VTDPRSETLR ELIERAIAVK AKVVSEDLKE SGLREILNYG HTLGHAIELV ERYSWRHGAA VSVGMMFAAE LARSVGRLSD ADADRHRSIL EGLGLPVTYR RDRWQGLLDG MRRDKKSRGD LLRFVVLDGV AKPGILDVPD TSLLFAAYQE VAS
|
| |