Gene Nmul_A1400 details

Gene Information

Locus tag: Nmul_A1400
Symbol:
ID: 3786430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1595520
End bp: 1596560
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 57%
IMG OID: 637811488
Product: zinc-containing alcohol dehydrogenase superfamily protein
Protein accession: YP_412095
Protein GI: 82702529
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID:
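
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 1595520
end_bp = 1596560

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")      # 1041 bp
print(remainder)              # 0, i.e. the length is a multiple of three
print(protein_length, "aa")   # 346 aa
```

Under those assumptions the figures agree with the listed gene length of 1041 bp and protein length of 346 aa.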


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAAGA TTCTCGGATT CGCCGCGCAC GCCCCCGGGC AGAAACTCGA ACCATTCACT 
TATGACGCCG GGCCTCTCGC GCCGGAGGAA GTGGAAATTG CGGTGGAGCA TTGTGGTCTT
TGTCATTCCG ATCTTTCAAT ACTGAATAAT GACTGGGGTA TCACGCAGTA TCCCGTTATT
CCGGGGCACG AAGCCATTGG CCGGATTGTC GCGATGGGAG AACAGGCAAA GGGATTGCAA
ATCGGCCAGC GGGTAGGCGT TGGCTGGAAT GCGGGCAGCT GCATGCACTG CCATGAATGC
ATGAGCGGCG ATCACAACCT TTGTACCAGA GCCACCGCGA CAATCATCGG GCATTACGGG
GGATTTGCCG ACAAAGTGCG AGCCCACTGG GCGTGGACGA TTCCCATACC TGAGACCCTT
GAAAGTTCCT CCGCAGGCCC GTTACTTTGC GGAGGAATTA CTGTATTTGC GCCCCTTGCG
GCCTATGTAA AACCGACCGA TCATGTAGGT GTCGTTGGCA TTGGCGGCCT TGGTCATCTC
GCCCTGCAAT TTGCGCATGC CTGGGGTTGC GAAGTTACGG CCTTCTCTTC CAATCCCTCA
AAGGCGGAAG AAATGCGCAC CCTCGGTGCC CATCGTGTTC TCTCCAGTCG TAAGAGCGGC
GAAATTCGCT CGGCAGCACG CTCGCTCGAC TTTCTGCTGG TGACCGTCAA TGTCCCCCTT
GACTGGGCAT TGCTGCTCCA GACGCTGAAA CCGAAGGGAC GCATGCATCT CGTTGGCGCA
GTGCTCGAAC CCCTGCCTAT CCCCGCTTTC GAGCTTCTGA GCGGACAGAA GAATGTTTCA
GGGTCACCGA CGGGTGGGCC TGCGATGATG GCGGATATGC TGGATTTTGC CGCCCGTCAC
GGCATTCAGC CTCAGGTAGA GCGTTTTCCC ATGAGCAGGG TCAATGAAGC GGTTGCACAT
CTGGCTGCTG GAAAAGCGCG CTACCGCATA GTCCTGGATG CGAATTTCAA TCGGGAGCAC
CCAGGAAGTG CGAATGCATG A
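
A GC-content figure such as the one listed for this gene can be recomputed from a sequence in the space-separated layout used here. The following is a minimal sketch, not the method the database uses; the function name `gc_content` is hypothetical, and the example call uses only the first three 10-base blocks of the gene sequence.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGACGAAGA TTCTCGGATT CGCCGCGCAC"
print(f"{gc_content(first_blocks):.1f}% GC")
```

Passing the full gene sequence instead of the short fragment would allow a comparison with the GC content reported in the record.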
 
Protein sequence
MTKILGFAAH APGQKLEPFT YDAGPLAPEE VEIAVEHCGL CHSDLSILNN DWGITQYPVI 
PGHEAIGRIV AMGEQAKGLQ IGQRVGVGWN AGSCMHCHEC MSGDHNLCTR ATATIIGHYG
GFADKVRAHW AWTIPIPETL ESSSAGPLLC GGITVFAPLA AYVKPTDHVG VVGIGGLGHL
ALQFAHAWGC EVTAFSSNPS KAEEMRTLGA HRVLSSRKSG EIRSAARSLD FLLVTVNVPL
DWALLLQTLK PKGRMHLVGA VLEPLPIPAF ELLSGQKNVS GSPTGGPAMM ADMLDFAARH
GIQPQVERFP MSRVNEAVAH LAAGKARYRI VLDANFNREH PGSANA
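
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the usual compact TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence, copied from the record.
print(translate("ATGACGAAGA TTCTCGGATT CGCCGCGCAC"))  # MTKILGFAAH
print(translate("CCAGGAAGTG CGAATGCATG A"))           # PGSANA
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed in the record.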