Gene Information |
Locus tag | Nmul_A0392 |
Symbol | |
ID | 3784087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 433312 |
End bp | 434640 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810468 |
Product | sun protein |
Protein accession | YP_411092 |
Protein GI | 82701526 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
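The coordinate and length fields in the table above are related by simple arithmetic, sketched below as a consistency check. The function names are hypothetical. The sketch assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself.

```python
# Values copied from the Gene Information table above.
START_BP = 433312
END_BP = 434640
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both computed values can be compared with the fields reported in the record.
lengths_agree = (
    computed_gene_length == GENE_LENGTH_BP
    and computed_protein_length == PROTEIN_LENGTH_AA
)
```

Under these assumptions, 434640 - 433312 + 1 gives 1329 bp, and 1329 / 3 - 1 gives 442 residues, which agrees with the Gene Length and Protein Length fields.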
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGGA CGCAGCGTCT CGCCGCCACC ACTGTTGAAA GAGTGGTGAG TGGCGCAAGT CTGACAACGG TTTTGCAGGA AACCTGGCGC AGCCACAGCG GCCTCTCCGA CCAGCAGCGC GGTGCGATTC AGGATTTAAG CTATGGCGTA CTGCGCTTTT ATGGTCAACT GGACATATTG CTTGGCTTAT TGCTGAACAA ACCCCTGAAG GACCAGAGTT TGCGCTACCT GCTACTGGTG GGACTGTATC AGCTTCAATA CAGCAAAACG GCGCCGCACG TCGTGGTGGA CAGTGCAGTA TCCGCTTCTT GCAGCCCCTC GATCGATGGC AGGAATAGCA GGAACATGCT GGATCCCCGC CATGCGAAAA GTATAGGCGG ACTGGTCAAT GCGATATTGC GCAACTTCTT GCGCAAACGT CCCGCCCTGC TTGAAAAAAC GGCGGCAACC GAAGTAGGAA GGTATTCCCA TCCACAATGG TGGATCGACA AACTCCGTGC GCAATATCCC TCCCGCTATC AGGCTGTACT GGAGACAGCC AATATACGAC CACCTATGGC GCTCAGAGTC AATCAGCGGA GGACAAACGT TGAAGCATAT CAGAAGCTGC TGCACAATGC CGGACTGAAT GCCCGGCGAC CGGAAAACGA GTGGACAGAA GAGGCGCTTG AATTATTTCA CCCGGTCCCG GTGGAAAAAC TGCCGGGCTT CGGTCAAGGA CTGGTATCAG TTCAGGATGT GGCCGCTCAA ATGGCAGCGC CGCTACTCTG TCCCCAGAGC GGAATGCGGG TGCTGGACGC CTGCGCTGCC CCGGGCGGGA AAAGCGCGCA TCTGCTTGAA TTGACCGATC TGGAACTGAC CGCGGTTGAC AATGATGGCG AGCGGTTAGA ACGGCTAAAG CAGAATTTCA CTCGTCTCGG CCTCAAGCCT TATCGCATCA TTCACGGCGA CGCCACGCAC CCGGGGGAGT GGTGGGATGG AAGGCCATTC GATCGTATCC TGGCTGATGC CCCCTGCTCT GCGTCCGGGG TGGTGCGCCG TCACCCCGAC ATCAAATGGC TGCGACGCGA AAGCGACCTG GCGCAGTTTG CCGAGATACA ACGTAAAACC CTCGATGCCC TTTGGCAAAC CCTGGTCAAA GGCGGTAAAT TGCTCTATGT CACTTGTTCC ATATTCGCAG AGGAAAACGG TCTTCAGGTG GAAAGCTTCC TGAATCATCA TCCGGATGCT CGTCTACTGC CCTTTTCCGT GCCGGAAATC GTGGAGGGCC AGTTGTTGCC GGACCTCCAT CACGACGGTT TTTTTTATGC ATTGCTTGAA AAGCTCTGA
|
Protein sequence | MIRTQRLAAT TVERVVSGAS LTTVLQETWR SHSGLSDQQR GAIQDLSYGV LRFYGQLDIL LGLLLNKPLK DQSLRYLLLV GLYQLQYSKT APHVVVDSAV SASCSPSIDG RNSRNMLDPR HAKSIGGLVN AILRNFLRKR PALLEKTAAT EVGRYSHPQW WIDKLRAQYP SRYQAVLETA NIRPPMALRV NQRRTNVEAY QKLLHNAGLN ARRPENEWTE EALELFHPVP VEKLPGFGQG LVSVQDVAAQ MAAPLLCPQS GMRVLDACAA PGGKSAHLLE LTDLELTAVD NDGERLERLK QNFTRLGLKP YRIIHGDATH PGEWWDGRPF DRILADAPCS ASGVVRRHPD IKWLRRESDL AQFAEIQRKT LDALWQTLVK GGKLLYVTCS IFAEENGLQV ESFLNHHPDA RLLPFSVPEI VEGQLLPDLH HDGFFYALLE KL
|
| |
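The gene and protein sequences above can be cross-checked with a short translation routine. The following is a minimal sketch, not the database's own method. It assumes that the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the amino acid assignments of NCBI translation table 11, which match the standard code. Table 11 also permits alternative start codons, which this sketch does not handle. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon order used by NCBI genetic code listings: T, C, A, G at each position.
BASES = "TCAG"
# Amino acid assignments for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First seven and last three codons of the gene sequence shown above.
first_residues = translate("ATGATCAGGA CGCAGCGTCT C")
last_residues = translate("AAGCTCTGA")
```

Here `first_residues` is `MIRTQRL` and `last_residues` is `KL`, matching the start and end of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the reported GC content to be compared in the same way.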