Gene Nmul_A2448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2448 
Symbol 
ID3786405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2797284 
End bp2798642 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content57% 
IMG OID637812539 
Productaldehyde dehydrogenase 
Protein accessionYP_413129 
Protein GI82703563 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATACA TCAGTTTGAA TCCGGCGACC AATGAGACAC TGAAAACCTA TATGAGCTGG 
GATCGCCGCC ACTTGGCCGA AGCCCTGGAG CAAACATATC ATGCCCAGCC CGCCTGGGCA
CGACTGGGTT TTCCCCGACG CGCGGAGCTT ATGCACAGAG CCGCCAGCCT CTTGCATGAA
CGGGCGGCTG AATATGCCAT TCTTATAGCT GCGGAAATGG GCAAGCCTGT GCGCGAAGGT
CGCGCGGAGG TGGAAAAGTG CGCGCTTGCC TGCGATTATT ATGCCGTGCA CGCCGAGCAT
TTCATGCAGG TCGAGAGGGT TCAGACCGGG GCCTGCAAAA GTTACGTGTC CTATGAGCCT
CTTGGAGTCG TCCTGGGGGT GATGCCCTGG AATTTCCCTT TTTTCCAGGT GATCCGCTTC
GCCGCTCCCA CACTCATGGC GGGAAATGGC TGCGTGCTGA AACATGCGTC GAATGTTCCG
CAGTGCGCTT TGGCGCTCGA ACGGCTGTTC CAGGATGCTG GTTTTCCTCC GCACCTTTTC
AGCACGCTCA TGATAGAGCC GTCTGAGGTG GGGGAGGCTA TCGCCAGCCA GCATGTCCAT
GCTGTGACAC TCACGGGCTC GGAGCGGGCC GGCCGGGAAA TAGCTTCCCA CGCGGGACAG
CATTTGAAGA AGTGTGTGAT GGAACTCGGC GGGTCGGATG CGTTTATCGT TCTGAAAGAT
GCGGACCTGG AACTCGCCGC CACCTTTGCG GTTAGGTCGC GCTTTCAGAA TTGCGGACAG
TCATGCATTG CCGCAAAGCG CATTATTCCT GTGGCCGACA TTGCCGATGA GTTTGGTTCG
TTGTTCATAA AGAAAACAAA AGAATTGAAA ATGGGCGATC CTCTGGACGA GGCGACGCAG
ATCGGCCCGA TGGCAAGGCT CGATCTGCGC GAAAACCTGC ATCAGCAAGT GACCGATTCG
ATTGCACAGG GCGCGCATGC GGTTCTCGGG TGTGAACCGG GGGAGGGGGC ATTCTATCCC
CCGTCCATTC TGGATGACGT AACACCCGGG ATGAGAGCCT ATCATGAGGA ATTATTCGGG
CCGGTTGCGG CGGTGATACG CGCGGCGGAC GAGGAGGACG CCATTCGCAT TGCCAACGAT
ACCCGCTTCG GGCTCGGCTC CAGCATCTGG AGCCGCGATG CCGACCATGC CGAGGAACTG
GCCCATCGAA TTCAGGCGGG TTGCAGCTTT ATCAATGGCA TGGTCCGGTC AGATCCACGC
CTTCCCTTCG GAGGAACCAA GGATTCCGGA TTCGGCAGGG AACTGTCCTA TCATGGCATC
CGGGAATTCG TTAATGTAAA AACGGTATGG GTGCGATAA
 
Protein sequence
MPYISLNPAT NETLKTYMSW DRRHLAEALE QTYHAQPAWA RLGFPRRAEL MHRAASLLHE 
RAAEYAILIA AEMGKPVREG RAEVEKCALA CDYYAVHAEH FMQVERVQTG ACKSYVSYEP
LGVVLGVMPW NFPFFQVIRF AAPTLMAGNG CVLKHASNVP QCALALERLF QDAGFPPHLF
STLMIEPSEV GEAIASQHVH AVTLTGSERA GREIASHAGQ HLKKCVMELG GSDAFIVLKD
ADLELAATFA VRSRFQNCGQ SCIAAKRIIP VADIADEFGS LFIKKTKELK MGDPLDEATQ
IGPMARLDLR ENLHQQVTDS IAQGAHAVLG CEPGEGAFYP PSILDDVTPG MRAYHEELFG
PVAAVIRAAD EEDAIRIAND TRFGLGSSIW SRDADHAEEL AHRIQAGCSF INGMVRSDPR
LPFGGTKDSG FGRELSYHGI REFVNVKTVW VR