Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0331 |
Symbol | aroB |
ID | 3102062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 326282 |
End bp | 327361 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637169551 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_112863 |
Protein GI | 53802348 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.274632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCT TACACGTCGA GCTGGGGGAG CGCGGCTACC CCATTTATAT AGGACGGGGC CTGCTGGGCC ATCCCGACCT GATACAGGCC CATCTGCCGG GCGGGCAGGT CCTGGTGGTG ACCAACGAAG TGGTGGCGCC GCTGTACCTC GACCGCATGC TTGCATCCCT GGCCGGCAAG GACACGGGCA GTGTCGTGCT TCCCGACGGC GAGGCCCACA AGACCCTGGA CTCGGCGATG GCCGTGTTCG ATGCCTTGCT GGCCCGGCGT TTCGGCCGCA ACGCCGCCAT CGTGGCGCTC GGCGGCGGGG TGATCGGCGA TCTGGCCGGT TTCGCGGCAG CCTGCTATCA GCGCGGCGTG CCTTTCATCC AGGTGCCCAC CACCCTGTTG TCTCAGGTCG ACTCCTCGGT GGGAGGCAAG ACCGCGGTCA ACCATCCGCG CGGCAAGAAC ATGATCGGCG CCTTCTACCA GCCGCGCTGC GTTCTGGCCG ACACCGACAC TCTGGATACG TTGCCCGACC GCGAACTGAG CGCGGGTCTG GCCGAGGTCA TCAAGTACGG CTTCATCCGT GACCCGGAAT TCCTGGCCTG GCTCGAAGCG AACGTCGAGC GCTTGCTGCA GCGCGATCCC GAAGCGCTCG CCTATGCCAT CGAGCGGTCC TGCATCAACA AGGCGGAAAT CGTGGCGGAA GACGAGACCG AAACCGGGGT GCGGGCGACG CTGAACCTGG GGCACACTTT CGGCCACGCC ATCGAAACCG GCATGGGCTA TGGTGTATGT CTGCACGGCG AAGCGGTGGC GATCGGTATG TGCCAGGCGG CCGATCTGTC CCGTCGCTTG GGCTGGATCG GTGACGACGA GGTGGCGAGG GTGATCCGCC TGCTGGAGCG GGCGCGGCTG CCGGTCGTCC CGCCGCGCGA GTTGGATGCG GACGCCTTTC TCGAACACAT GGCGGTCGAC AAGAAGAACG TCGACGGCGG TCTGCGACTG GTTCTGCTCA AATCCCTGGG TGAGGCGACC CTGCCGGTGG CCGTGGACGC CGGACTGTTA CGGGCCACAT TGGAATGCTA CGGCCGCTGA
|
Protein sequence | MKTLHVELGE RGYPIYIGRG LLGHPDLIQA HLPGGQVLVV TNEVVAPLYL DRMLASLAGK DTGSVVLPDG EAHKTLDSAM AVFDALLARR FGRNAAIVAL GGGVIGDLAG FAAACYQRGV PFIQVPTTLL SQVDSSVGGK TAVNHPRGKN MIGAFYQPRC VLADTDTLDT LPDRELSAGL AEVIKYGFIR DPEFLAWLEA NVERLLQRDP EALAYAIERS CINKAEIVAE DETETGVRAT LNLGHTFGHA IETGMGYGVC LHGEAVAIGM CQAADLSRRL GWIGDDEVAR VIRLLERARL PVVPPRELDA DAFLEHMAVD KKNVDGGLRL VLLKSLGEAT LPVAVDAGLL RATLECYGR
|
| |