Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1958 |
Symbol | |
ID | 3785136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2251312 |
End bp | 2252493 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812046 |
Product | chorismate synthase |
Protein accession | YP_412645 |
Protein GI | 82703079 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGCA ATACCTTTGG CAAACTGTTT TGCGTGACGT CTTTCGGGGA ATCCCATGGC CCTGCCTTAG GATGCGTAGT GGATGGTTGT CCCCCGGGAA TGGAACTGTC CGCTGAGGAT ATCCAGCAGG ATCTTGATCG GCGCAAGCCC GGCACCTCCC GGCACGTAAC CCAGCGGCGC GAGCCCGATA CGGTGGAAAT TCTTTCGGGC GTGTTCGAGG GCAAGACTAC GGGAACACCG ATCGCGTTGT TGATTCGCAA CGAAGATCAA CGCAGCAAGG ACTACAGCAA AATCATGGAT ACTTTCCGCC CCGGGCACGC GGACTATGTT TATTGGCAAA AATACGGCAT ACGCGATTAC CGGGGAGGAG GACGTTCATC CGCGCGCGAA ACCGCCGTGC GGGTGGCGGC AGCAGCCATA GCAAAAAAAT GGCTGCGTGC AAGATATGGA GTGGTAATAA GAGGTCATAT GGCGCAACTG GGACCTGTCG AGATTCCATT CAAACAATGG GAAGCAGTGG GTGAGAATCC CTTTTTTTCG GCTGACCCGG ACATCGTGCC CAGCCTGGAA GAATTCATGG ACAAACTGCG GAAATCGGGC GACTCGGTGG GCGCCAGGAT TCGTGTGGTT GCCGAAGGTG TGCCCGTGGG GTGGGGCGAA CCTGTATATG ATCGCCTCGA TGCCGAAATC GCCTATGGCA TGATGAGCAT CAACGCCGTT AAAGGCGTCG AAATCGGAGC TGGATTTGCT TCTGTAAGTC AGAAGGGAAC GGAGCACTCC GATGAAATCA GTCCCGGGGG TTTTCTCAGC AATAATGCGG GAGGCATATT GGGCGGGATT TCCACCGGTC AGGATATCGT GGTGAATATC GCCGTCAAGC CCACTTCAAG CATACGTCTG CCGCGCCGCT CAGTAGACAA GATGGGCAAT CCGGCAATAG TGGAAACTCA TGGAAGACAT GACCCCTGTG TCGGTATCCG GGCTACGCCC ATTGCGGAAG CAATGCTGGC GCTGGTATTG ATGGACCATG CGCTGCGCAA TAGGGCACAA AACGCGGATG TTGCGTGCAC AACTCCAAAA ATTCCCGGCC ACACTGGCCC TAGGGAAGGT CAGGAAGAGG GCCCGTCAGA TAGCGAGCCA AAAGTGGAGT TTGCGGATGA TCCCGAGCCG GATGAAGCGT GA
|
Protein sequence | MSGNTFGKLF CVTSFGESHG PALGCVVDGC PPGMELSAED IQQDLDRRKP GTSRHVTQRR EPDTVEILSG VFEGKTTGTP IALLIRNEDQ RSKDYSKIMD TFRPGHADYV YWQKYGIRDY RGGGRSSARE TAVRVAAAAI AKKWLRARYG VVIRGHMAQL GPVEIPFKQW EAVGENPFFS ADPDIVPSLE EFMDKLRKSG DSVGARIRVV AEGVPVGWGE PVYDRLDAEI AYGMMSINAV KGVEIGAGFA SVSQKGTEHS DEISPGGFLS NNAGGILGGI STGQDIVVNI AVKPTSSIRL PRRSVDKMGN PAIVETHGRH DPCVGIRATP IAEAMLALVL MDHALRNRAQ NADVACTTPK IPGHTGPREG QEEGPSDSEP KVEFADDPEP DEA
|
| |