Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2061 |
Symbol | |
ID | 3784379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2353857 |
End bp | 2354900 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812150 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_412747 |
Protein GI | 82703181 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTCGTAC TCGGAATTGA AACCTCCTGC GATGAAACGG GTGTCGCACT TTATCACACC CAGCAGGGCT TACTCAGCCA TGCGCTATAT TCGCAAATCG AAATGCATGG GGAATATGGC GGAGTAGTTC CCGAACTCGC CTCCCGCGAC CACATCAGGC GGTTGCTGCC CCTCATCCGG CAGATATTTG CGGAAGCCGG CGTCTCACTG CGAGATCTGG ATGCCATTGC ATACACCCAG GGTCCCGGAC TTGCCGGGGC CTTGCTCGTG GGTGCAAGCG TGGCAGCCGC GCTTGGTTTT GCGCTCAAAG TTCCCGTGCT GGGGATTCAC CACCTCGAAG GCCATCTTTT ATCTCCTCTG CTCTCCGACC CCGCTCCTGC TTTTCCATTC GTAGCCCTGC TTGTATCGGG GGGTCATACG CAACTGATGG AAGTGACCGG TCTGGGGCAG TACCGATTGC TTGGCGAAAC TGTGGACGAT GCGGCAGGAG AAGCCTTCGA CAAGACGGCC AAGCTGCTCG GTCTGGGTTA TCCAGGAGGG CCCGCGCTTT CCCGTCTGGC GGATGAATTT ACCCGATCAG GCCAATCCGC GCGCTTCGAG CTTCCACGAC CCATGCTCCA TAGCGGCGAT TTCAATTTCA GCTTCAGTGG CCTCAAAACC GCCGTCCTGA CGCTGGTGAA CAAACACGAG ATGACACCCC AGATTCGCGG TGCGATCGCA CAAGCTTTCC AGGAGGCGGC AGTCGAAGTG CTGACCGAGA AATCGCTTGC GGCACTTGCA AAAACGGGAC TTACCCAACT GGTAGTGGCT GGCGGGGTAG GAGCCAATCG TCAATTGAGG AGTAATCTCG ACCGCAGGGC AGGAACGATT GGCGCGACAG TATATTATCC GAAACTCGAA TTCTGCACTG ACAACGGGGC AATGATCGCA TTTGCGGGAG CAATGCGTCT GGAATCTGGA GAGTCTGAAA GTTCAAGGCT TGGCAAGTTC ACGATAAACG CCCGCTGGGA TCTGGAAATG CAGGAAATCG GGCGAGAAGC ATAA
|
Protein sequence | MLVLGIETSC DETGVALYHT QQGLLSHALY SQIEMHGEYG GVVPELASRD HIRRLLPLIR QIFAEAGVSL RDLDAIAYTQ GPGLAGALLV GASVAAALGF ALKVPVLGIH HLEGHLLSPL LSDPAPAFPF VALLVSGGHT QLMEVTGLGQ YRLLGETVDD AAGEAFDKTA KLLGLGYPGG PALSRLADEF TRSGQSARFE LPRPMLHSGD FNFSFSGLKT AVLTLVNKHE MTPQIRGAIA QAFQEAAVEV LTEKSLAALA KTGLTQLVVA GGVGANRQLR SNLDRRAGTI GATVYYPKLE FCTDNGAMIA FAGAMRLESG ESESSRLGKF TINARWDLEM QEIGREA
|
| |