Gene Information |
Locus tag | Nmul_A2688 |
Symbol | |
ID | 3785050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 3088267 |
End bp | 3089316 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812778 |
Product | hypothetical protein |
Protein accession | YP_413367 |
Protein GI | 82703801 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2959] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATGG AAACTCAGTC TGCTGATATT GATCAACCGC AGAAAGCAGG GGCGAGATGG CTTGCGCACG CCAATCCCCT GCTGGTTTTT GCTTTGATTC TGATCCTGGT TTTCGCCTGG CAATGGTACG ACACGCGCAG TGAAATCGCA GGGCTGCGGC ATGAGCTGGC AAGACGACTG GCCGAGGCGG ATACTTCGGG CAAGGAGGCG CGCAATGCCG CAGCCGATGC CGTCGAGGCG GGACGCCGTG CGGAAACGAA GCTCGACCTT CTTGAAGATA AACTGGCTGA ATCGCAGAGT CAGCAGGTTG CCCTGGAAGC GCTTTACCAG GAACTTACGC GAAGCCGCGA TGAAGCAATG CTGGAAGAAG TGGAGCAGCT ACTGCTTATC GCCAACCAGC AACTGCAACT AGCAAGCAAC GTCAAATCCG CCCTGATTGC AATGCAGGAG GCGGATAGCC GGCTCCTGCG CACGGATCGT CCGCAACTGG CTTCTTTGCG GAAAACCATT GCAAAGGATA TGGGTCTGCT GAAATCGGTG CCGTATGTTG ACACCACGGA AATCAGTTTG CGCCTGGATA ATCTTGCCGC CTCCGTCGAT ACGCTGCCGC TGGCAATGGA GTTTCGACCG CCTGAGAGCG CCTCTTCCCC GCCTCCCGTT CCCATCTCCG AAAATGTATG GCTGCGCTTT CTGCGTCAAG TATGGGAGGA TTTCAAACGG CTGGTGCGGA TACAGTATAT GGACAGCCCG GATGTCGCCC TGCTGTCGCC CTCGCAAGCG TATTTCCTGC GCGAGAATTT GAAGTTGCGG CTCTTATCCG CGCGCTATGC GTTGCTTGCG CACAACGGCC CCAGCTTCAA GGCTGACCTC GACGCGTCCA GGGAGCAGAT CGGGCGTTAT TACAACACTC AAGCCCAACC TGTCATCGAT TTCCGGGAAG CATTGCAGCA ATTGGGCGAG ACTGATATCG GGGCTGAGTT GCCGCGTATC TCCGCAAGTC TCGATGCGGT TCGTAACTAC AGGTTGACGC GTGACCGGGG AAACAGGTGA
|
Protein sequence | MGMETQSADI DQPQKAGARW LAHANPLLVF ALILILVFAW QWYDTRSEIA GLRHELARRL AEADTSGKEA RNAAADAVEA GRRAETKLDL LEDKLAESQS QQVALEALYQ ELTRSRDEAM LEEVEQLLLI ANQQLQLASN VKSALIAMQE ADSRLLRTDR PQLASLRKTI AKDMGLLKSV PYVDTTEISL RLDNLAASVD TLPLAMEFRP PESASSPPPV PISENVWLRF LRQVWEDFKR LVRIQYMDSP DVALLSPSQA YFLRENLKLR LLSARYALLA HNGPSFKADL DASREQIGRY YNTQAQPVID FREALQQLGE TDIGAELPRI SASLDAVRNY RLTRDRGNR
|
| |
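The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length), and that the protein length excludes the single terminal stop codon of an unspliced CDS. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above; all function names are hypothetical.
START_BP = 3088267
END_BP = 3089316


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1050 349
```

Passing the gene sequence string from the Sequence section to `gc_percent` would give a value to compare with the listed GC content; the 10-base grouping spaces are skipped by the filter.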