Gene Nmul_A2272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2272 
Symbol 
ID3785434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2583314 
End bp2584438 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content57% 
IMG OID637812360 
Productzinc-containing alcohol dehydrogenase superfamily protein 
Protein accessionYP_412956 
Protein GI82703390 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACGA TAAAAACCCG CGCGGCAGTA GCTTGGGGAC CAGGCCAGCC TTTGAAGATC 
GAGGAAGTGG ACTTGATGCC GCCAAGGAAA GGTGAGGTGC TGGTGCGCAT AGTCGCCACG
GGAGTCTGTC ATACAGACGC ATATACTTTG TCCGGAAAAG ACCCCGAAGG CAAATTTCCA
GCTATTCTGG GGCATGAAGG CGGCGGCATC GTCGAGGCGG TCGGCGACGG GGTAACCAGC
GTGGCGGTAG GCGATCATGT CATCCCGTTG TATACCCCCG AATGCCGCGA GTGCAAGTTC
TGCAGATCGG GGAAAACCAA CCTGTGCCAG GCGATAAGGA CAACCCAGGG ACAGGGGCTG
ATGCCGGACG GCACTACCCG TTTTTCGAAG GATGGAAAAC CCATTTATCA TTACATGGGC
ACCTCGACTT TTTCCGAATA TACGGTAATC CCTGAGATTG CATTGGCCAG GATCAGCAAG
GAAGCGCCGC TGGAAAAGGT CTGCCTGCTC GGCTGCGGCG TTACCACCGG CTTGGGCGCG
GTGACCCATA CTGCAAAGGT AAAGGCAGGC GATACGGTAG CGGTATTCGG GCTGGGCGGC
ATAGGCCTTG CCGTGATAAT CGGGGCCGTG ATGGCCAAAG CTGGACGCAT CATCGCTATC
GACATCAATC CGGACAAATT TGAAATTGCG CGCCAACTGG GCGCTACCGA CGTGGTGAAT
CCCCAAGATC ACGACAAGCC GATCCAGGAG GTGATCATCG GGATGACCGG GGGAGGCGTG
GATTTTTCCT TCGAATGCAT CGGCAACGTA AAGGTCATGC GATCGGCCCT GGAATGCTGT
CATAAGGGTT GGGGGGAGTC GGTGATTATC GGCGTTGCCG GGGCGGGAGA AGAAATCTGC
ACGCGCCCCT TTCAGCTCGT TACCGGCCGC GTCTGGCGCG GCTCCGCTTT TGGGGGAGTG
AAGGGCCGCA GCGAACTGCC CGGCTATGTA CAGCGCTATC TTGATGGAGA ATTCGAACTG
GATACCTTCA TTACCCACAC GATGGGATTG GAGGATATCA ATAAGGCATT CGACCTCATG
CATGAGGGCA AGTCCATCCG AACGGTGATA CATTTCGACA AATGA
 
Protein sequence
MDTIKTRAAV AWGPGQPLKI EEVDLMPPRK GEVLVRIVAT GVCHTDAYTL SGKDPEGKFP 
AILGHEGGGI VEAVGDGVTS VAVGDHVIPL YTPECRECKF CRSGKTNLCQ AIRTTQGQGL
MPDGTTRFSK DGKPIYHYMG TSTFSEYTVI PEIALARISK EAPLEKVCLL GCGVTTGLGA
VTHTAKVKAG DTVAVFGLGG IGLAVIIGAV MAKAGRIIAI DINPDKFEIA RQLGATDVVN
PQDHDKPIQE VIIGMTGGGV DFSFECIGNV KVMRSALECC HKGWGESVII GVAGAGEEIC
TRPFQLVTGR VWRGSAFGGV KGRSELPGYV QRYLDGEFEL DTFITHTMGL EDINKAFDLM
HEGKSIRTVI HFDK