Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2448 |
Symbol | |
ID | 3786405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2797284 |
End bp | 2798642 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812539 |
Product | aldehyde dehydrogenase |
Protein accession | YP_413129 |
Protein GI | 82703563 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATACA TCAGTTTGAA TCCGGCGACC AATGAGACAC TGAAAACCTA TATGAGCTGG GATCGCCGCC ACTTGGCCGA AGCCCTGGAG CAAACATATC ATGCCCAGCC CGCCTGGGCA CGACTGGGTT TTCCCCGACG CGCGGAGCTT ATGCACAGAG CCGCCAGCCT CTTGCATGAA CGGGCGGCTG AATATGCCAT TCTTATAGCT GCGGAAATGG GCAAGCCTGT GCGCGAAGGT CGCGCGGAGG TGGAAAAGTG CGCGCTTGCC TGCGATTATT ATGCCGTGCA CGCCGAGCAT TTCATGCAGG TCGAGAGGGT TCAGACCGGG GCCTGCAAAA GTTACGTGTC CTATGAGCCT CTTGGAGTCG TCCTGGGGGT GATGCCCTGG AATTTCCCTT TTTTCCAGGT GATCCGCTTC GCCGCTCCCA CACTCATGGC GGGAAATGGC TGCGTGCTGA AACATGCGTC GAATGTTCCG CAGTGCGCTT TGGCGCTCGA ACGGCTGTTC CAGGATGCTG GTTTTCCTCC GCACCTTTTC AGCACGCTCA TGATAGAGCC GTCTGAGGTG GGGGAGGCTA TCGCCAGCCA GCATGTCCAT GCTGTGACAC TCACGGGCTC GGAGCGGGCC GGCCGGGAAA TAGCTTCCCA CGCGGGACAG CATTTGAAGA AGTGTGTGAT GGAACTCGGC GGGTCGGATG CGTTTATCGT TCTGAAAGAT GCGGACCTGG AACTCGCCGC CACCTTTGCG GTTAGGTCGC GCTTTCAGAA TTGCGGACAG TCATGCATTG CCGCAAAGCG CATTATTCCT GTGGCCGACA TTGCCGATGA GTTTGGTTCG TTGTTCATAA AGAAAACAAA AGAATTGAAA ATGGGCGATC CTCTGGACGA GGCGACGCAG ATCGGCCCGA TGGCAAGGCT CGATCTGCGC GAAAACCTGC ATCAGCAAGT GACCGATTCG ATTGCACAGG GCGCGCATGC GGTTCTCGGG TGTGAACCGG GGGAGGGGGC ATTCTATCCC CCGTCCATTC TGGATGACGT AACACCCGGG ATGAGAGCCT ATCATGAGGA ATTATTCGGG CCGGTTGCGG CGGTGATACG CGCGGCGGAC GAGGAGGACG CCATTCGCAT TGCCAACGAT ACCCGCTTCG GGCTCGGCTC CAGCATCTGG AGCCGCGATG CCGACCATGC CGAGGAACTG GCCCATCGAA TTCAGGCGGG TTGCAGCTTT ATCAATGGCA TGGTCCGGTC AGATCCACGC CTTCCCTTCG GAGGAACCAA GGATTCCGGA TTCGGCAGGG AACTGTCCTA TCATGGCATC CGGGAATTCG TTAATGTAAA AACGGTATGG GTGCGATAA
|
Protein sequence | MPYISLNPAT NETLKTYMSW DRRHLAEALE QTYHAQPAWA RLGFPRRAEL MHRAASLLHE RAAEYAILIA AEMGKPVREG RAEVEKCALA CDYYAVHAEH FMQVERVQTG ACKSYVSYEP LGVVLGVMPW NFPFFQVIRF AAPTLMAGNG CVLKHASNVP QCALALERLF QDAGFPPHLF STLMIEPSEV GEAIASQHVH AVTLTGSERA GREIASHAGQ HLKKCVMELG GSDAFIVLKD ADLELAATFA VRSRFQNCGQ SCIAAKRIIP VADIADEFGS LFIKKTKELK MGDPLDEATQ IGPMARLDLR ENLHQQVTDS IAQGAHAVLG CEPGEGAFYP PSILDDVTPG MRAYHEELFG PVAAVIRAAD EEDAIRIAND TRFGLGSSIW SRDADHAEEL AHRIQAGCSF INGMVRSDPR LPFGGTKDSG FGRELSYHGI REFVNVKTVW VR
|
| |