Gene Information |
Locus tag | Nmul_A0703 |
Symbol | aroB |
ID | 3786165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 808956 |
End bp | 810056 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637810785 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_411402 |
Protein GI | 82701836 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
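The length fields in this record are consistent with the coordinates if Start bp and End bp are read as 1-based and inclusive, and if the CDS ends in a single stop codon that is not counted in the protein. Both are assumptions about the database's conventions, not something the record states. A minimal Python sketch of that check follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(808956, 810056)
print(length, protein_length(length))  # 1101 366
```

The printed values match the Gene Length and Protein Length fields of the record.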
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACAG TCACTGTTAA TCTTGTCTCC ACGCCCGCTC AACGCGGTTA TCCGATCCAT ATCGGTGCAA ACATCCTGAC TCAGCCCGAG CTCATCCTGG ATTACCTGGA CCAGAAGCGA GTGGCCATTG TCACAAATAC TACAGTCGGG CCTTTATATC TCGAAAAGTT TCGTGCGGAT CTTTCAGTTC ATGGGATGGT TTCGGTTCCA ATCGTTCTTC CGGACGGTGA GGAATACAAG AACTGGGAAA CCCTCAATCT CATTTTCGAT GCTCTCCTGA CCCATCGCTG CGAACGCAAT ACCCCGCTCA TCGCCCTTGG CGGAGGAGTT GTGGGAGACC TGACCGGTTT CGCCGCCGCC ACTTATTTAC GGGGTGTGCC GTTCATTCAG GTACCCACCA CACTTCTGGC GCAAGTCGAT TCTTCAGTTG GCGGTAAGAC CGGAATCAAC CATCCGCTGG GTAAAAACAT GATTGGAGCG TTTTATCAAC CCCAGGCGGT CGTGGCGGAT ACGTCCACAC TCGATACGTT GCCCGATAGA GAACTGCGTG CCGGTATAGC GGAGGTCATC AAATATGGCC TTATTCGCGA TCCGGCATTC TTCGACTGGA TTGAATCTCA TATTGAACTC CTGTTGCGGC GTGATAATTC GATTCTGACC GACGCAATAA AAAGGAGTTG CCAGCATAAG GCAGAAGTTG TGGAAGAAGA CGAGCGCGAG AGCGGCATGC GTGCCTTGCT GAACCTTGGA CACACTTTTG GCCATGCAAT CGAGAATGCA ATGGGGTACG GCAACTGGCT TCACGGGGAA GCAGTGGCTG CAGGAACGAT GCTGGCAGCC GAGGTGTCAC GACGTATGGG CATGATAGGC GAGGAGGATG TGGATCGGGT CCGAAATCTC TATGTGAAGA CCGGGCTGCC GGTGATTGCC CCGAATCTCG GTCCGGAAAA ATATTTGCAT CTGATGGGAC TGGACAAGAA AGTACAAGGC GGGAAAATGC GTTTCATACT GCTCGAGAAT ATCGGTCGGG CGACGGTGCA CGCGGACGTG CCGGCTGCGA TACTGACCGA AGTTCTGACG GAATGCACGG CTGATGCATG A
Protein sequence | METVTVNLVS TPAQRGYPIH IGANILTQPE LILDYLDQKR VAIVTNTTVG PLYLEKFRAD LSVHGMVSVP IVLPDGEEYK NWETLNLIFD ALLTHRCERN TPLIALGGGV VGDLTGFAAA TYLRGVPFIQ VPTTLLAQVD SSVGGKTGIN HPLGKNMIGA FYQPQAVVAD TSTLDTLPDR ELRAGIAEVI KYGLIRDPAF FDWIESHIEL LLRRDNSILT DAIKRSCQHK AEVVEEDERE SGMRALLNLG HTFGHAIENA MGYGNWLHGE AVAAGTMLAA EVSRRMGMIG EEDVDRVRNL YVKTGLPVIA PNLGPEKYLH LMGLDKKVQG GKMRFILLEN IGRATVHADV PAAILTEVLT ECTADA
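The sequences above are displayed in blocks of 10 characters, so the spaces need to be stripped before any computation. The sketch below shows one way to do that in Python and to compute a GC percentage like the one in the GC content field. The helper names are hypothetical, and the demo uses only the first two blocks of the gene sequence, so its result is not the value for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used for display blocks and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First two display blocks of the gene sequence, used only as a small demo.
demo = "ATGGAAACAG TCACTGTTAA"
print(clean_sequence(demo))  # ATGGAAACAGTCACTGTTAA
print(gc_percent(demo))      # 35.0
```

Passing the full Gene sequence string to `gc_percent` would give the figure to compare against the rounded GC content field.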