Gene Information |
Locus tag | Nmul_A2192 |
Symbol | |
ID | 3786217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2490323 |
End bp | 2491390 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812279 |
Product | chorismate mutase |
Protein accession | YP_412876 |
Protein GI | 82703310 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase; [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 |
| |
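The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Minimal sketch: cross-check the length fields of the record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS length counts the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases spanned by inclusive coordinates start_bp..end_bp."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2490323, 2491390)  # Start bp, End bp from the record
    print(length, "bp")                     # 1068 bp
    print(protein_length(length), "aa")     # 355 aa
```

Under these assumptions the computed values match the listed Gene Length (1068 bp) and Protein Length (355 aa).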
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCTC AACTCAAACA GCTGCGCGAC AAGATAGATG CAATCGACAG CGAACTCCTG AAGCTGGTGA GCGCGCGGGC AGATCTCGCC CGCGAAATCG GAGAAATCAA GAATGGCACT GCGTATCGGC CCGAACGTGA AGCTCAGGTG CTGGCGCGGA TGCGCGAACT GAACCCGGGG CCCTTGGAGA ATGAACAGGT TGCGCGGTTA TTTACGGAGA TCATGTCTCT CTGTCGATCC ATGGAAGAAC CTCTGACAGT CGCATACCTG GGACCGAGGG GCACGTTTTC CGAAGAAGCT GCGCTCAAGC GCTTCGGTAG CGTTGTGACT TCGCTGCCGT GCAATTCGAT AGATGACGTA TTTAGCAAGG TGGAAGCCGG CAAGGCAAAT TATGGGGTTG TACCGGTAGA GAATTCGACC GAGGGCGCAG TGGGTAGGTC GCTCGATCTG TTGCTGCAAA CCCGCTTGAA GGTGTGCGGC GAAGTGGCGC TGGCCATACA TCAGCTTCTG CTGGCGCATC ATACCGACCT TGCACGCATT CGCAGGATTT ACTCTCATCC TCAATCGTTT GCTCAATGTC ACGAGTGGCT CAATGTCCAT TTGCCCCATT TACCCGCGTC GGCAAGAATC AACGCCGCGA GCAATGCGGA TGCTGCCAGA CTGGCGGCGG AAGATGAAAG CGCCGCGGCA GTGGCGGGAA AAAAGGCGGG AGAAGTATAT GGTCTCACTG TCTGTGCCGA GAACATCGAG GATGATCCCA GCAATACGAC CCGCTTCCTG GTAATAGGTG AGCAGGAAGT CGCCCCTTCC GGCAGGGATA AAACGTCGCT GGTAACGTCG GTCAGGAATC GGCCGGGCGC CATACATGAG TTACTGGCCC CGTTCGCCCA TCATGGAGTC AGCATGACCC GGCTTGAATC GCGCCCGTCC CGTGCGGGTT TATGGGAATA CGTGTTTTTT GTGGATGTCG AAGGCCACCA GCAGGAGCCG AAAGTCTCCC AGGCGCTGCG CGAACTGGTG GAAAAAGCGG CGTTTCTCAA AGTGCTCGGC TCATATCCCG CAGCCTGA
Protein sequence | MTAQLKQLRD KIDAIDSELL KLVSARADLA REIGEIKNGT AYRPEREAQV LARMRELNPG PLENEQVARL FTEIMSLCRS MEEPLTVAYL GPRGTFSEEA ALKRFGSVVT SLPCNSIDDV FSKVEAGKAN YGVVPVENST EGAVGRSLDL LLQTRLKVCG EVALAIHQLL LAHHTDLARI RRIYSHPQSF AQCHEWLNVH LPHLPASARI NAASNADAAR LAAEDESAAA VAGKKAGEVY GLTVCAENIE DDPSNTTRFL VIGEQEVAPS GRDKTSLVTS VRNRPGAIHE LLAPFAHHGV SMTRLESRPS RAGLWEYVFF VDVEGHQQEP KVSQALRELV EKAAFLKVLG SYPAA
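The gene and protein sequences are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, translate the DNA, and estimate GC content so the listed fields can be checked against the sequences. It is a sketch under stated assumptions, not the database's own method: the helper names are hypothetical, and the codon-to-amino-acid mapping used is the standard one, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
# Minimal sketch: clean a block-formatted sequence, translate it, and
# compute GC content. Helper names are hypothetical.

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 of dna, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in dna."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First blocks of the gene sequence from the record.
    prefix = "ATGACCGCTC AACTCAAACA G"
    print(translate(prefix))  # MTAQLKQ
```

Passing the full Gene sequence field to `translate` and `gc_fraction` should allow a comparison with the Protein sequence and GC content fields of the record.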