Gene Nmul_A1473 details


Gene Information

Locus tag: Nmul_A1473
Symbol:
ID: 3785447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1681942
End bp: 1683048
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 56%
IMG OID: 637811561
Product: zinc-containing alcohol dehydrogenase superfamily protein
Protein accession: YP_412168
Protein GI: 82702602
COG category: [C] Energy production and conversion
COG ID: [COG1062] Zn-dependent alcohol dehydrogenases, class III
TIGRFAM ID:
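
The start and end coordinates, gene length, and protein length listed above are consistent with one another. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1681942, 1683048)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```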


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.420949
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGAGA TTCGGGCAGC GACGATTCGA CAGAAGGGTG GGCCTTTCAG GATCGAGAAT 
TTGATTCTGG ATGAGCCACG CCCAGATGAG GTGCTGGTTC GTATCGCGGC TACTGGCATG
TGTCATACCG ACATGGTAGC GCGTGATCAG CTCTATGATG TCCCCTTACC GATTGTGCTT
GGGCATGAAG GTGCGGGTGT TGTCGAACGG GTAGGCAGCA ACGTGAAAAA AGTGACGGCA
GGAGATCACG TAGTGCTGAC CTATATGTGG TGCGGCCATT GCAGGCCATG TCTCCATGGA
GATTTAACCT ATTGCCAGAA TTTCTATGCA CTGAATTTTG GCGGCGCCAG GGAAGACGGC
AGCAGCTCCG CCCGCGATGC GCATGGTTCG CTTCATGACC ATTTCTTCGG CCAGTCGTCA
TTCGGGACTT TTGCTCTTAC CCACGAACGT AATGCGATCA AGGTGCCGAG GGAAGCTCCG
CTGGAGCTTC TTGGTCCGCT TGGCTGCGGC ATTCAAACTG GCGCCGGTGC AGTGATAAAT
GCGCTTAAAG TCAATCCAGG CGCCAGTTTT GCGGCTTTTG GCGGGGGAGC GGTAGGACTG
AGTGCGGTAA TGGCGGCTCG CGTCACGGGC GCCACAACGA TTATTGCTGT GGATGTCGTT
CCATCCCGGC TCGAGCTGGC GAGAGAGCTC GGGGCAACTC ACACGGTTAA CAGCCGCGAA
ACCGATCCCG TCGCGACGGT GCGCAAGATC AGTGGCGGGG GGGTAGAATA TGCCCTTGAG
TCCAGTGGTC GGCCCCAGGT ATTGCGCCAG GCCATCGATG CGCTGGGCAT TCGCGGCACT
TGCGGCATTG TCGGCGCGCC CGCTCTTGGG ACAGAGGTCA GCTTTGACGT GAATGGCGTA
ATGACCACCG GCAAACGCAT CCTTGGGATC ATCGAAGGCG ATAGCATACC CGACCTCTTC
ATACCAGCCC TTGTCGAGCT TTACATGCAG GGACGCTTTC CATTCGACAA GCTCGTGAAG
TTTTACCCTC TTGACAGGAT CAATGAAGCG GCAGAGGATA GTGAGAAGGG TATTACCATC
AAGCCGATTA TCAGGGTGGC ATTATAA
 
Protein sequence
MMEIRAATIR QKGGPFRIEN LILDEPRPDE VLVRIAATGM CHTDMVARDQ LYDVPLPIVL 
GHEGAGVVER VGSNVKKVTA GDHVVLTYMW CGHCRPCLHG DLTYCQNFYA LNFGGAREDG
SSSARDAHGS LHDHFFGQSS FGTFALTHER NAIKVPREAP LELLGPLGCG IQTGAGAVIN
ALKVNPGASF AAFGGGAVGL SAVMAARVTG ATTIIAVDVV PSRLELAREL GATHTVNSRE
TDPVATVRKI SGGGVEYALE SSGRPQVLRQ AIDALGIRGT CGIVGAPALG TEVSFDVNGV
MTTGKRILGI IEGDSIPDLF IPALVELYMQ GRFPFDKLVK FYPLDRINEA AEDSEKGITI
KPIIRVAL
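
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal Python example, not the database's own method: it builds the codon table shared by NCBI translation tables 1 and 11 (they differ only in permitted start codons, which this sketch ignores) and translates the first and last lines of the gene sequence. Translating the last line on its own works because the 1080 bases preceding it are a multiple of 3, so the reading frame is preserved. The `translate` function is a hypothetical helper written for this illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments for NCBI translation table 11
# (identical to the standard table), with codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGATGGAGA TTCGGGCAGC GACGATTCGA CAGAAGGGTG GGCCTTTCAG GATCGAGAAT"
last_line = "AAGCCGATTA TCAGGGTGGC ATTATAA"

print(translate(first_line))  # MMEIRAATIRQKGGPFRIEN
print(translate(last_line))   # KPIIRVAL
```

Both results match the start and end of the listed protein sequence.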