Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1489 |
Symbol | |
ID | 3785366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1699601 |
End bp | 1700491 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811577 |
Product | short chain dehydrogenase |
Protein accession | YP_412184 |
Protein GI | 82702618 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACTT CTCTTAAACC TCTCGACCAG CAAGTGATTG TGATTACCGG TGCATCAAGC GGCATAGGCC TTGCTACAGC AATGCTTGCA GCAGAACGGG GGGCAAAGCT CGTTCTTATT GCTCGTAGTG CTAAAACACT CGAGCATCTG GTTGCAAGGA TAGCAAACAC TGGAGGGGAG GCTATCGATG TTGTCGCGGA TGTTGCCGAT CGGGAAAAAA TGCGTCTCGC GGCGCAAACT GCGGTTGACC GTTTTGGCCA TATCGATACC TGGATCAATA ACGCAGGTGT GGCAATTTAC GGACGTCTCG ATGAAGTCAA CGAAGCCGAC AGCCGGCGTC TTTTCGATAC CAACTTCTGG GGGGTGGTCA ATGGTTCGCT TGCCGCCTTG CCTTACCTGA AAAAGCAGGG CGGAGCGCTC ATAAATGTAG GCAGCGAAAC TTCCGAGGCT ATCGTGCCTC TTCTGGGAAT GTATTCCGCA TCCAAGCATG CGGTGAAAGG CTTCACTGAT GCATTACGCG TGGAAGTCCA GGAATTCGAC AAGGCACCCG TAGTGATTAC ACTGATTCAG CCTTCCGCCG TGAATACACC TTTTCCGCAG CATGCCAAGA ATTATATGGA CAAGGAGCCA AAATTGCCGC CCCCTCTGAT CAATCCCGAG CAGGTTGCTG AAGCGATACT GAAAGCAGCT ACTGAAGGAG GACGTGACGT AAAGGTCGGT GCAATGGCGG TCGTCAACAC GATGATATCC AAGCTTGCTC CGAGTTTCGG AGACAAGATG TCCGCCAAGC GCGGAAGCGG CCAGCGGGAA AGGGTCTTCC CGCTACATCC ACAAGGCACT TTATATGAAC CGGGAGAATC GGGATCGGCG CACGGTCACG CATCTTCATG A
|
Protein sequence | MQTSLKPLDQ QVIVITGASS GIGLATAMLA AERGAKLVLI ARSAKTLEHL VARIANTGGE AIDVVADVAD REKMRLAAQT AVDRFGHIDT WINNAGVAIY GRLDEVNEAD SRRLFDTNFW GVVNGSLAAL PYLKKQGGAL INVGSETSEA IVPLLGMYSA SKHAVKGFTD ALRVEVQEFD KAPVVITLIQ PSAVNTPFPQ HAKNYMDKEP KLPPPLINPE QVAEAILKAA TEGGRDVKVG AMAVVNTMIS KLAPSFGDKM SAKRGSGQRE RVFPLHPQGT LYEPGESGSA HGHASS
|
| |