Gene Nmul_A0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0172 
Symbol 
ID3785080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp179913 
End bp180842 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content59% 
IMG OID637810243 
Product2-hydroxy-3-oxopropionate reductase 
Protein accessionYP_410872 
Protein GI82701306 
COG category[I] Lipid transport and metabolism 
COG ID[COG2084] 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases 
TIGRFAM ID[TIGR01505] 2-hydroxy-3-oxopropionate reductase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.798657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGGCA GAACGAAAAT TGGTTTCATC GGTCTTGGCA TCATGGGTAA GCCAATGGCG 
AGCCATCTGC TCAGGGGCGG TCATACCTTA TTTCTGCATT CCCGCAGCGG CGTGCCGGGA
GAATTGCTCA CGCAGGGGGG ACAAGCGTGT TCTTCGCCGG CTGAAGTCGC GCAGAACGCT
GACCTTATCA TTACCATGCT TCCGGACACG GGGGATGTGG AGCGGGTGCT GTTCGGTGAC
AAGGGGGTGG CGGAGGGATT GAGCGCGGGG CGAAGTAAAG GCGGGATCGT GATGGACATG
AGTACCATCT CGCCAGTTGA GACGCGGAAA TTCGCCGCTG AAATCAATGA GCTTGGGTGC
GAGTACGTGG ATGCCCCCGT TTCAGGCGGA GATATCGGCG CCCGGAACGG CACTCTCACC
ATCATGGTCG GCGCCACTTC TTCGGCCTTC GTGCAGGTGA AGCCGGTTCT CGAGTTTATG
GGTAAAACGA TAACCCTGAT CGGTCCCACT GGCACGGGCC AGGTCTGCAA GCTTGCCAAT
CAGATCATCG CCGCGCTTAC TATCGAAGCG GTGGGTGAAG GTTTGCTATT CGCCTCGAAA
GCGGGGGCCG ATGTGCGGAA AGTGCGCCAG GCATTAATGG GCGGTTTCGC TTATTCCCGC
GTGCTGGAAG TACACGGCGA GCGCATGATC GAGCGCGCGT TTGAGCCGGG ATTTCGCGTC
GAGCTGCACC GGAAGGATCT GGGCCTTGCT TTGTCCCATG CCCGTACACT GGGCGTGAGC
CTGCCCGGTA CGGCGACTGT TCAGGAATTG CTGAATGCGT GCATCGCGCA TGGCGGGGCC
GGATGGGATA GTTCGGCCCT GGTACGGATG CTGGAAAAAC TGGCGAATCA TGAGATCGAG
ACCATCGAAG GCTCGCATGA TGAATCATGA
 
Protein sequence
MTGRTKIGFI GLGIMGKPMA SHLLRGGHTL FLHSRSGVPG ELLTQGGQAC SSPAEVAQNA 
DLIITMLPDT GDVERVLFGD KGVAEGLSAG RSKGGIVMDM STISPVETRK FAAEINELGC
EYVDAPVSGG DIGARNGTLT IMVGATSSAF VQVKPVLEFM GKTITLIGPT GTGQVCKLAN
QIIAALTIEA VGEGLLFASK AGADVRKVRQ ALMGGFAYSR VLEVHGERMI ERAFEPGFRV
ELHRKDLGLA LSHARTLGVS LPGTATVQEL LNACIAHGGA GWDSSALVRM LEKLANHEIE
TIEGSHDES