Gene Nmul_A2192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2192 
Symbol 
ID3786217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2490323 
End bp2491390 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content57% 
IMG OID637812279 
Productchorismate mutase 
Protein accessionYP_412876 
Protein GI82703310 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCTC AACTCAAACA GCTGCGCGAC AAGATAGATG CAATCGACAG CGAACTCCTG 
AAGCTGGTGA GCGCGCGGGC AGATCTCGCC CGCGAAATCG GAGAAATCAA GAATGGCACT
GCGTATCGGC CCGAACGTGA AGCTCAGGTG CTGGCGCGGA TGCGCGAACT GAACCCGGGG
CCCTTGGAGA ATGAACAGGT TGCGCGGTTA TTTACGGAGA TCATGTCTCT CTGTCGATCC
ATGGAAGAAC CTCTGACAGT CGCATACCTG GGACCGAGGG GCACGTTTTC CGAAGAAGCT
GCGCTCAAGC GCTTCGGTAG CGTTGTGACT TCGCTGCCGT GCAATTCGAT AGATGACGTA
TTTAGCAAGG TGGAAGCCGG CAAGGCAAAT TATGGGGTTG TACCGGTAGA GAATTCGACC
GAGGGCGCAG TGGGTAGGTC GCTCGATCTG TTGCTGCAAA CCCGCTTGAA GGTGTGCGGC
GAAGTGGCGC TGGCCATACA TCAGCTTCTG CTGGCGCATC ATACCGACCT TGCACGCATT
CGCAGGATTT ACTCTCATCC TCAATCGTTT GCTCAATGTC ACGAGTGGCT CAATGTCCAT
TTGCCCCATT TACCCGCGTC GGCAAGAATC AACGCCGCGA GCAATGCGGA TGCTGCCAGA
CTGGCGGCGG AAGATGAAAG CGCCGCGGCA GTGGCGGGAA AAAAGGCGGG AGAAGTATAT
GGTCTCACTG TCTGTGCCGA GAACATCGAG GATGATCCCA GCAATACGAC CCGCTTCCTG
GTAATAGGTG AGCAGGAAGT CGCCCCTTCC GGCAGGGATA AAACGTCGCT GGTAACGTCG
GTCAGGAATC GGCCGGGCGC CATACATGAG TTACTGGCCC CGTTCGCCCA TCATGGAGTC
AGCATGACCC GGCTTGAATC GCGCCCGTCC CGTGCGGGTT TATGGGAATA CGTGTTTTTT
GTGGATGTCG AAGGCCACCA GCAGGAGCCG AAAGTCTCCC AGGCGCTGCG CGAACTGGTG
GAAAAAGCGG CGTTTCTCAA AGTGCTCGGC TCATATCCCG CAGCCTGA
 
Protein sequence
MTAQLKQLRD KIDAIDSELL KLVSARADLA REIGEIKNGT AYRPEREAQV LARMRELNPG 
PLENEQVARL FTEIMSLCRS MEEPLTVAYL GPRGTFSEEA ALKRFGSVVT SLPCNSIDDV
FSKVEAGKAN YGVVPVENST EGAVGRSLDL LLQTRLKVCG EVALAIHQLL LAHHTDLARI
RRIYSHPQSF AQCHEWLNVH LPHLPASARI NAASNADAAR LAAEDESAAA VAGKKAGEVY
GLTVCAENIE DDPSNTTRFL VIGEQEVAPS GRDKTSLVTS VRNRPGAIHE LLAPFAHHGV
SMTRLESRPS RAGLWEYVFF VDVEGHQQEP KVSQALRELV EKAAFLKVLG SYPAA