Gene Nmul_A1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1371 
Symbol 
ID3786514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1557647 
End bp1558642 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content57% 
IMG OID637811459 
Productzinc-containing alcohol dehydrogenase superfamily protein 
Protein accessionYP_412066 
Protein GI82702500 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID[TIGR02824] putative NAD(P)H quinone oxidoreductase, PIG3 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.544729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCAA TTGAAATACA ACATCCGGGA GGGCCGGAGG TGCTGAGGCC TGCATTCCAT 
CCGGTTCCCC AGCCCGGCCC TGGTGAAATT CTGATCAAAG TGGCAACGGT CGGAGTAAAT
CGGCCCGATA TCCTGCAGCG CCGGGGCCTT TACCCTCCCC CTCCGGGCGC CTCGGAAATC
CCGGGACTGG AAGTCGCGGG GGAAATCGTT GAATCAGGCG AAGGCACAAT CCGATTCAGG
CCGGGCGAAA AGGTTTGCGC GCTGGTGGCG GGCGGTGGCT ATGCCGAATA CTGCGCCGTG
CACGAAAGCA ATGCCCTGCC GATACCATCA GGTCTCGGCA TGATCGAGGC AGCGGCATTG
CCGGAAACCT TTTTTACCGT TTGGACCAAC CTGTTCCAGC GCGGCAAGCT AAAATCGGGC
GAGACTGTAC TCATTCATGG TGGCACTTCG GGCATCGGCA CCACCGCCAC AATGCTGGCC
AAGGCTTTCG GCGCTCTTGT CCTGACAACC GCAGGCTCGG AGGAAAAATG CCGGGCATGC
GTTGCTCTGG GCGCTGATTT TGCCATCAAT TACCGCACCC AGGATTTCGT CGAGGAAGTC
CGGAAGTTTA CGGATGGCAA AGGAGTCGAT GTCATTCTCG ATGTTGTCGC CGGGGACTAC
GTGGCGAGAA ACTACAAGGC GGCTGCGCTC AATGGCCGTA TTCTCCAGGT CGGCATCCAG
AATGGGCCTG CCATGGAACT GAACCTGATG CCCATGCTGG CAAAAAGGCT GACTCATACC
GGGTCGACCC TGCGATCGCG CACGGTGCCT GAAAAGGCCC AGATTGCCCA GGAACTGGAG
CAGCAGGTCT GGCCATTATT GCATGAGGGA AAAATAAAAC CGCAAATATT CAAAACATTC
CGACTGGAGG AAGCTGCCGA GGCGCATGTA TTGATGGAAT CAGGCGCCCA TATCGGAAAA
ATCGTATTGA TGACAGGAGC AACTATCTCC GCTTGA
 
Protein sequence
MLAIEIQHPG GPEVLRPAFH PVPQPGPGEI LIKVATVGVN RPDILQRRGL YPPPPGASEI 
PGLEVAGEIV ESGEGTIRFR PGEKVCALVA GGGYAEYCAV HESNALPIPS GLGMIEAAAL
PETFFTVWTN LFQRGKLKSG ETVLIHGGTS GIGTTATMLA KAFGALVLTT AGSEEKCRAC
VALGADFAIN YRTQDFVEEV RKFTDGKGVD VILDVVAGDY VARNYKAAAL NGRILQVGIQ
NGPAMELNLM PMLAKRLTHT GSTLRSRTVP EKAQIAQELE QQVWPLLHEG KIKPQIFKTF
RLEEAAEAHV LMESGAHIGK IVLMTGATIS A