Gene Nmul_A0407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0407 
Symbol 
ID3785400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp450563 
End bp451768 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content55% 
IMG OID637810483 
Productphosphatase kdsC 
Protein accessionYP_411107 
Protein GI82701541 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG1083] CMP-N-acetylneuraminic acid synthetase
[COG1778] Low specificity phosphatase (HAD superfamily) 
TIGRFAM ID[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA
[TIGR01670] 3-deoxy-D-manno-octulosonate 8-phosphate phosphatase, YrbI family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.784801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTGGG TAGCTTTCGC GCCTTTGCAT GGAAATTTCG GGCCGGTTTC CCGAAGCAGC 
GCACGCAGCA TGGCGGGAAG GCCATTATTT TCCTGGAGCT TGGAGCAGGC CGTCATATCG
GGATGTTTCG ACACCATTTA CGTAACTGCC GACCCGCCTG TCATTCGAAA ACGGATAGTG
GAAGAATTTT CGCGAGCCGA CACGATAATC GAGATTCTCG ATTGCAGTGG CGCAACCCGT
ACGGGCGTAG AGAATTGGAC CAGTCTCCTA CACACGTTTC AGCAGAAGAT CCCCTTCGAC
GTCGTCTGTT CAATACAGGC AGCCTCGCCC CTCACGCGTG CTGAAGATTT TCGTGCCGCT
AAGCGCAAAT TTCTTTCGGA AAATCTTGAC TCGCTTCTGA CAGCTGCGCC GTCCAGACGG
TTTCTGTGGA CAAGGATGGG AGAACCCGTC GGTCATGACC CGCTAAAATC CCGTGCGTCA
TGCGATGCAT CGAATCCGGA GGGATACCTG CTGGAAAATG GCGCCTTCTA TCTGACACAT
GAAAAATTAC TTCGAGACAA CGATCATTAT CTGGGTGGAC GCATGGGTAT TCACGAGATG
GCGCCCGAAA CTGCGATCGA GATCACCGGA GAGGCTGGCT GGAACATCGT GGAGCGTCTT
TTACGGGAGC AGGAACGGGG GTCGGTCCAA GCCCGCGCGT CACGAATCAA GTTTCTGGTA
CTCGATGTGG ACGGAACGTT GACGGATGCG GGAATGTATT ACGGCCCCGC CGGCGAAGCC
TTGAAAAAAT TCAATACTCG CGACGCCCAT GGTTTGCAAC GGTTGCGTGA ACATGGCCTC
GGGGTTTGCG TAATCACCAC CGAGACTAGT CCTTCCGTTG AAGCAAGGAT GAGAAAATTG
CGCATCGACG AATACTACCC GGGCATAAGC GATAAATTTC CTCTCCTCCT AAAGCTTTCC
AAACGCTGGG GGGTTCCTCT GGAAAATATC GGGTATGTGG GTGATGACCT CAGCGATCTG
GAATGCCTGA GCCGCGTAGG CGTTGCCTTC TGCCCGGCGG ATGCTGTCCC CCTGGTCGTG
CGGCAGGCCC ATTATATGTG TGAATATTCG GGTGGCCACG GCGCGGTTCG CGAGGTATGC
GACTTGATTC TCCGATCAAG AGAAACCATA CGACATGACT CCGCTGAAGC GGAAGCGTAC
TCATGA
 
Protein sequence
MRWVAFAPLH GNFGPVSRSS ARSMAGRPLF SWSLEQAVIS GCFDTIYVTA DPPVIRKRIV 
EEFSRADTII EILDCSGATR TGVENWTSLL HTFQQKIPFD VVCSIQAASP LTRAEDFRAA
KRKFLSENLD SLLTAAPSRR FLWTRMGEPV GHDPLKSRAS CDASNPEGYL LENGAFYLTH
EKLLRDNDHY LGGRMGIHEM APETAIEITG EAGWNIVERL LREQERGSVQ ARASRIKFLV
LDVDGTLTDA GMYYGPAGEA LKKFNTRDAH GLQRLREHGL GVCVITTETS PSVEARMRKL
RIDEYYPGIS DKFPLLLKLS KRWGVPLENI GYVGDDLSDL ECLSRVGVAF CPADAVPLVV
RQAHYMCEYS GGHGAVREVC DLILRSRETI RHDSAEAEAY S