Gene Information |
Locus tag | Nmul_A2149 |
Symbol | |
ID | 3784775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2439775 |
End bp | 2440935 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637812237 |
Product | zinc-containing alcohol dehydrogenase superfamily protein |
Protein accession | YP_412834 |
Protein GI | 82703268 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
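The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for Nmul_A2149
gene_len, protein_len = cds_lengths(2439775, 2440935)
print(gene_len, protein_len)  # 1161 386
```

Both values agree with the Gene Length and Protein Length fields of the record.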
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.406335 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCAC TCACTTATCA TGGAAGTTAC GACGTTCGCG TGGATACCGT ACCCGACCCG ATACTTCAGG AACCGGACGA TATTGTTCTA AAAATAACTG CTACTGCGAT CTGTGGCTCC GACTTGCATC TGTACCGGGG CAAAATGCCG GAACTCAAGA ATGGCGACAT ACTCGGCCAT GAGTTTATGG GAACAGTCGT CGACGCGGGC CCGGAGGTTA CAGCGTTGCG AAAGGGCGAC CGGGTGGTGG TCCCCTTCGT CATCGCCTGC GGACAATGCT TCTTCTGCGA GAGGCAATTG TACGCCGCTT GTGAAACGAC CAACCCGGAT CGTGGCGCAA TCATGAACAA AAAAAACGTG CGTTCAGGCG CAGCGTTCTT TGGCTATACG CACCTGTATG GGGGAGTGCC GGGCGGCCAG GCTGAATATG TAAGAGTGCC GAAGGCGAAT GTAGGCCCCA TCAAGATCCC TGAAACGCTT CCGGATGAGA AAGTCCTGTT CCTGAGTGAT ATTCTGCCTA CCGGTTACCA GGCTGTACTG AATGCAGAGA TAGGCCCCGG TTCCAGCGTT GTGATTTTTG GCGCCGGTCC GGTGGGACTG ATGTCGGCGG CTTGCGCGCG GCTTCTGGGA GCGGAAACGA TTTTCATGGT CGACCATCAT CCGTATCGAC TCGAATTTGC CCGCCAAACA TATGACGTTA TCCCCCTCAA TTTCGATGAG GTCGACCCTG CCGAGGTCAT TGTCGAGAAA ACTTCTTATA GAGGTGTCGA TGCGGCCATC GATGCAATAG GATTTGAGGC CAAAGGCAGC GCGCTGGAAA CCGTCATGAC CAACCTGAAA CTGGAAGGCA GCAGCGGCGT GGCCTTGCGC CAATGCATCG CAGCAGTAAG GCGGGGCGGC ACAATCAGCG TGCCGGGAGT ATATGCCGGT TTTATCCATG CCTTCCTTTT CGGTGATGCT TTCGAGAAAG GCGTGACCTT CAGAATGGGG CAAACTCATG TCCAGCGCTT CCTCCCCGAA CTGCTGGAGC ATGTCGAATC CGGAAAGCTA CAACCGGATG TCATTATCAG TCACCGGATG CCGCTTGCTG AAGCAGCTAG CGCATATAAA ATCTTCGAAA AAAAAGAAGA CGACTGCCGC AAGGTTGTCC TGACCCCCTG A
|
Protein sequence | MRALTYHGSY DVRVDTVPDP ILQEPDDIVL KITATAICGS DLHLYRGKMP ELKNGDILGH EFMGTVVDAG PEVTALRKGD RVVVPFVIAC GQCFFCERQL YAACETTNPD RGAIMNKKNV RSGAAFFGYT HLYGGVPGGQ AEYVRVPKAN VGPIKIPETL PDEKVLFLSD ILPTGYQAVL NAEIGPGSSV VIFGAGPVGL MSAACARLLG AETIFMVDHH PYRLEFARQT YDVIPLNFDE VDPAEVIVEK TSYRGVDAAI DAIGFEAKGS ALETVMTNLK LEGSSGVALR QCIAAVRRGG TISVPGVYAG FIHAFLFGDA FEKGVTFRMG QTHVQRFLPE LLEHVESGKL QPDVIISHRM PLAEAASAYK IFEKKEDDCR KVVLTP
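The sequences above are printed in space-separated blocks of ten, so they need light cleaning before any analysis. The following is a minimal sketch, assuming plain A/C/G/T input; the helper names (`clean_sequence`, `gc_percent`, `ends_with_stop`) are hypothetical. It strips the block spacing, computes GC content, and checks for one of the stop codons shared by the standard code and translation table 11.

```python
# Stop codons in the standard code and in translation table 11
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First three blocks of the gene sequence, used here only as a short demo;
# pass the full Gene sequence field to evaluate the whole CDS.
fragment = clean_sequence("ATGCGCGCAC TCACTTATCA TGGAAGTTAC")
print(len(fragment), round(gc_percent(fragment), 1))
```

Running `gc_percent` on the full cleaned gene sequence gives a value that can be compared with the GC content field, and `ends_with_stop` offers a quick sanity check that the record contains the complete CDS.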
|
| |