Gene Nmul_A1959 details


Gene Information

Locus tag: Nmul_A1959
Symbol:
ID: 3784982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2252646
End bp: 2253647
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 55%
IMG OID: 637812047
Product: zinc-containing alcohol dehydrogenase superfamily protein
Protein accession: YP_412646
Protein GI: 82703080
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
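
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2252646, 2253647)
print(length)                     # 1002
print(protein_length_aa(length))  # 333
```

Both results agree with the Gene Length and Protein Length fields of this record.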


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCGA TACTGATGAA TGCTCCGGGT GCACCAGAGG TGTTGACACC TGCCGACGTT 
CCCATGCCCG ATCTTGCCGG CGCATTTGAT GTGCGGGTAA AGTTGCACGC GGCGGGCGTA
AACCCGATCG ATACCAAGGT GCGCAAGGCC AATATGTATT ACCCGGACAG GCTTCCGTCC
ATTCTCGGAT GTGATGGGGC GGGTGTGGTT GAAGCCGTTG GCAGTTCAGT GACCCGGGTA
CGCGCAGGCG ATGAAGTCTT CTTTTTCAAT AACGGTTTGG GCGGAGCGCC CGGAAACTAT
GCGGAATATG CGGTAGTGCA TGAAGATTAT CTGGCATTGA AACCTGGGAA TCTGTCAATG
GTGGAAGCAG CCGCTGTTCC GTTGGCTCTG ATTACCGCCT GGGAAGCACT GATAAAGCGT
GGCAATCTCA AGGGGAGCCA GATTGCGCTG ATTCATGCCG GCGCGGGTGG CGTGGGTCAT
ATTGCCATCC AGCTTGCCCG ATACCTGAAG GCCCGGGTTG CAACGACAAT TTCGAGCGAG
GAAAAGGCTG CCTTCGTGCA ATCCCTGGGA GCCGAGCTTG CAATCGATTA TCGCGAAAAT
GACTTTGTGG ACACTGCGCT CGAATGGACG GAGGGACTGG GTGTGAACCT CGCTCTGGAT
ACTGTCGGTG GAGAGACGTT CTGCAAATCC TTCTCCGCCA TCCGGCTGTA TGGCAGGGTG
GTATCGCTGC TTTCAACGGT CTGTGATGCA AAGCAGCTCA ATACTGCCCG ACTGCGCAAC
CTGAGCATCG GCTATGTGCA AATGACTGCT CCCCTTTATT TCGGTTTACA TTCGGCGCGT
GTAGTCCAAA CCGGCATACT TGAACAAGGT GCAAGACTGC TCGAACAAGG TATTCTCAAG
ATTCACGTCA GCCGCACGCT GCCTCTGACG GAAGCCGCCG AAGCGCATCG TTTGATCGAA
GCGGGGCATA CTCTGGGCAA GATAGTGCTG AAGATTGTGT AG
 
Protein sequence
MKAILMNAPG APEVLTPADV PMPDLAGAFD VRVKLHAAGV NPIDTKVRKA NMYYPDRLPS 
ILGCDGAGVV EAVGSSVTRV RAGDEVFFFN NGLGGAPGNY AEYAVVHEDY LALKPGNLSM
VEAAAVPLAL ITAWEALIKR GNLKGSQIAL IHAGAGGVGH IAIQLARYLK ARVATTISSE
EKAAFVQSLG AELAIDYREN DFVDTALEWT EGLGVNLALD TVGGETFCKS FSAIRLYGRV
VSLLSTVCDA KQLNTARLRN LSIGYVQMTA PLYFGLHSAR VVQTGILEQG ARLLEQGILK
IHVSRTLPLT EAAEAHRLIE AGHTLGKIVL KIV
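
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below, using only the Python standard library, shows one way to strip the spacing, compute a GC fraction, and translate the gene sequence. The helper names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted; the sketch does not handle alternative start codons, which is adequate here because this gene begins with ATG.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases ordered T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), AMINO)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as displayed in this record.
first_blocks = clean_sequence("ATGAAAGCGA TACTGATGAA TGCTCCGGGT")
print(translate(first_blocks))  # MKAILMNAPG
```

The translated fragment matches the first block of the protein sequence. Applying `clean_sequence` and `translate` to the full gene sequence should reproduce the whole protein, and `gc_fraction` can be compared against the GC content field.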