Gene Nmul_A1958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1958 
Symbol 
ID3785136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2251312 
End bp2252493 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content56% 
IMG OID637812046 
Productchorismate synthase 
Protein accessionYP_412645 
Protein GI82703079 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGGCA ATACCTTTGG CAAACTGTTT TGCGTGACGT CTTTCGGGGA ATCCCATGGC 
CCTGCCTTAG GATGCGTAGT GGATGGTTGT CCCCCGGGAA TGGAACTGTC CGCTGAGGAT
ATCCAGCAGG ATCTTGATCG GCGCAAGCCC GGCACCTCCC GGCACGTAAC CCAGCGGCGC
GAGCCCGATA CGGTGGAAAT TCTTTCGGGC GTGTTCGAGG GCAAGACTAC GGGAACACCG
ATCGCGTTGT TGATTCGCAA CGAAGATCAA CGCAGCAAGG ACTACAGCAA AATCATGGAT
ACTTTCCGCC CCGGGCACGC GGACTATGTT TATTGGCAAA AATACGGCAT ACGCGATTAC
CGGGGAGGAG GACGTTCATC CGCGCGCGAA ACCGCCGTGC GGGTGGCGGC AGCAGCCATA
GCAAAAAAAT GGCTGCGTGC AAGATATGGA GTGGTAATAA GAGGTCATAT GGCGCAACTG
GGACCTGTCG AGATTCCATT CAAACAATGG GAAGCAGTGG GTGAGAATCC CTTTTTTTCG
GCTGACCCGG ACATCGTGCC CAGCCTGGAA GAATTCATGG ACAAACTGCG GAAATCGGGC
GACTCGGTGG GCGCCAGGAT TCGTGTGGTT GCCGAAGGTG TGCCCGTGGG GTGGGGCGAA
CCTGTATATG ATCGCCTCGA TGCCGAAATC GCCTATGGCA TGATGAGCAT CAACGCCGTT
AAAGGCGTCG AAATCGGAGC TGGATTTGCT TCTGTAAGTC AGAAGGGAAC GGAGCACTCC
GATGAAATCA GTCCCGGGGG TTTTCTCAGC AATAATGCGG GAGGCATATT GGGCGGGATT
TCCACCGGTC AGGATATCGT GGTGAATATC GCCGTCAAGC CCACTTCAAG CATACGTCTG
CCGCGCCGCT CAGTAGACAA GATGGGCAAT CCGGCAATAG TGGAAACTCA TGGAAGACAT
GACCCCTGTG TCGGTATCCG GGCTACGCCC ATTGCGGAAG CAATGCTGGC GCTGGTATTG
ATGGACCATG CGCTGCGCAA TAGGGCACAA AACGCGGATG TTGCGTGCAC AACTCCAAAA
ATTCCCGGCC ACACTGGCCC TAGGGAAGGT CAGGAAGAGG GCCCGTCAGA TAGCGAGCCA
AAAGTGGAGT TTGCGGATGA TCCCGAGCCG GATGAAGCGT GA
 
Protein sequence
MSGNTFGKLF CVTSFGESHG PALGCVVDGC PPGMELSAED IQQDLDRRKP GTSRHVTQRR 
EPDTVEILSG VFEGKTTGTP IALLIRNEDQ RSKDYSKIMD TFRPGHADYV YWQKYGIRDY
RGGGRSSARE TAVRVAAAAI AKKWLRARYG VVIRGHMAQL GPVEIPFKQW EAVGENPFFS
ADPDIVPSLE EFMDKLRKSG DSVGARIRVV AEGVPVGWGE PVYDRLDAEI AYGMMSINAV
KGVEIGAGFA SVSQKGTEHS DEISPGGFLS NNAGGILGGI STGQDIVVNI AVKPTSSIRL
PRRSVDKMGN PAIVETHGRH DPCVGIRATP IAEAMLALVL MDHALRNRAQ NADVACTTPK
IPGHTGPREG QEEGPSDSEP KVEFADDPEP DEA