Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0233 |
Symbol | |
ID | 3786315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 247736 |
End bp | 248833 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637810305 |
Product | Rieske (2Fe-2S) region |
Protein accession | YP_410933 |
Protein GI | 82701367 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0567601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGATC TGATTGATGT TTGCCAATTG GCTCCAACGC AACTGCCGGT TGACTGGTAT CTCGACTCTC AAATCCTTGA ACTGGAAAAA CGCATTCTTT TCGACCAGGG ACCGGGCTAT GTCGGTCATG AGATCATGGT GCCCAACATT GGTGACTATT ATGTGCCCGA ATGGATGGAT AACGCGAAAA TGCTGGTGCG TAATAAGGAT GGTATCGAGC TGCTGTCCAA CGTCTGCCGC CACAGGCAGT CACTGCTGCT CAAAGGGAGT GGCAACACCA GAAACATCGT TTGCCCGGTG CATCGCTGGA CGTATGACTT GAAGGGCACG TTGCTCGGGG CGCCCCATTT CCCCGAAAAC CCTTGCCTCA ATCTTTCCAA TTCACCTTTG CAGAACTGGA ACGGACTCCT GTTTCAAGGC AGGCGGAATG TTGCCCGCGA TCTGGGCAAC TTGCAAGTCC TGAAGAATTT CGACTTCTCG GGCCATGTGC TGGAACGCCT CCAGATAGAT GAATACGCCT GCAACTGGAA AACCTTCATA GAAGTTTATC TGGAGGACTA CCACGTCGAG CCATACCATC CGGGCCTGGG CAACTTTGTT GACACGGCTG CGCTGGAATG GGAGTTCGGG GAGTGGTACA ACGTGCAGAC TGTGGGGATC AATAACGCCC TGACCCGTCC GGGAACGCCC GTCTATGCAA AATGGCACGA GCAACTGTTG CTGCAGACAG CGGGCGAAAT ACCGAGGCAT GGCGCAATCT GGATGCTGTA TTATCCCAAT GTCATGATGG AATGGTATCC CCATGTATTG GTCGTCAGCA CGGTGCTTCC TACCGGAACA GAGCGTTGCA CCAATGTGGT GGAGTTCTAT TATCCTGAAG ATATCGCTTT GTTTGAGCGT GAATTCATTG AAGCAGAACA GGCCGCCTAT CGCGAAACCG CTGCCGAGGA TGATGAAATC TGCAGGCTGA TGACGGAAGG GCGCCGTGCG CTGTACAAAC AAGGCGTGAG CGAGGTCGGA CCGTACCAAT CGCCGATGGA AGATGGCATG GTGCACTTCC ACAAGTTTCT GCGGCGAGAA ATTGAACCGC ACATCTGA
|
Protein sequence | MVDLIDVCQL APTQLPVDWY LDSQILELEK RILFDQGPGY VGHEIMVPNI GDYYVPEWMD NAKMLVRNKD GIELLSNVCR HRQSLLLKGS GNTRNIVCPV HRWTYDLKGT LLGAPHFPEN PCLNLSNSPL QNWNGLLFQG RRNVARDLGN LQVLKNFDFS GHVLERLQID EYACNWKTFI EVYLEDYHVE PYHPGLGNFV DTAALEWEFG EWYNVQTVGI NNALTRPGTP VYAKWHEQLL LQTAGEIPRH GAIWMLYYPN VMMEWYPHVL VVSTVLPTGT ERCTNVVEFY YPEDIALFER EFIEAEQAAY RETAAEDDEI CRLMTEGRRA LYKQGVSEVG PYQSPMEDGM VHFHKFLRRE IEPHI
|
| |