Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2185 |
Symbol | |
ID | 3785996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2481462 |
End bp | 2482109 |
Gene Length | 648 bp |
Protein Length | 215 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637812272 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_412869 |
Protein GI | 82703303 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.279465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAACG CCGTTCTCTT CGATCTCGAC GGAACGCTTG CCGATACCGC CCCCGATCTC GGGTATGCCC TGAATCGGCA GCGCATCGCC CGCGGGCTGC TGCCCCTGCC GTTGGAGCTC ATCCGTACCG AGGCTTCCGC GGGGGCGCGC GGCCTGTTGG GCCTGGGATT CAACATCAAG CCCGGGGACG CCGGATATGA CGCCATGCGC ACCGAATTCC TCGACTTCTA CGCGGAACAC CTCTGCCGCG AGACATTCTT GTTCGCAGGG GTCGCGGATC TCCTCGATCA GCTCGATGAC CGGGGCCTGA TCTGGGGCAT CGTCACCAAC AAACCCGCTC GTTTTTCCGT ACCGCTCCTC GAGGCGCTGG GTTTGGGCAA TCGTGCTTCC TGCCTTATCA GCGGGGGCGA TACCACGCAC TCCAAACCCC ATCCCGAGCC TCTGCTTACA GCGAGCGGAG CAATAGCCGT CCCGCCGGAA GAATGCATCT ATCTGGGCGA CGACCTGCGC GACGTCCAGG CCAGCCTCGC CGCCGGCATG GAACCCATCA TCGCCAGATA CGGCTATCTC GGCAATGTAG GTGCCCCCGA AACCTGGGGT GCAAGATACC TCATAGACCG GCCCGAAGAA CTGCTCGGCT ATTTGTAA
|
Protein sequence | MINAVLFDLD GTLADTAPDL GYALNRQRIA RGLLPLPLEL IRTEASAGAR GLLGLGFNIK PGDAGYDAMR TEFLDFYAEH LCRETFLFAG VADLLDQLDD RGLIWGIVTN KPARFSVPLL EALGLGNRAS CLISGGDTTH SKPHPEPLLT ASGAIAVPPE ECIYLGDDLR DVQASLAAGM EPIIARYGYL GNVGAPETWG ARYLIDRPEE LLGYL
|
| |