Gene Information |
Locus tag | Nmul_A0586 |
Symbol | |
ID | 3783984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 670292 |
End bp | 672013 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810668 |
Product | hypothetical protein |
Protein accession | YP_411286 |
Protein GI | 82701720 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
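The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends with a single stop codon; the function names are hypothetical.

```python
# Sketch: re-derive the record's lengths from its coordinates.
# Assumption: Start/End are 1-based, inclusive positions on the replicon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(670292, 672013)
print(length)                  # 1722
print(protein_length(length))  # 573
```

Both values match the Gene Length and Protein Length fields of this record.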
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTTA AAGTTCCCGG CGCATTACCG CTGCTCTTTC TTGCCGCCTG TTCACATCTT CCTGCCAAGA CTCCTCTCAC GGCGGAAAAA CCGGCAGAGA AGAAGGAGTC AGAGCAGTCA AGAGTACCCA GTCAGGATTT GACCCCCTCC ATGCTGTTCG ATTTTTTGCT GGCGGAGACA GCGCTGCAGC GGGGCGACAC GGAAATCGGT CTTCGTACCT ATCTCAAGCT TGCAAAAAAC ACTCAGGATC CGCGCGTGGC CCAGCGCGCC ACGGAAGCTG CGCTTCAGGC GCGCCAGCCG GCATTCGCCC TGGAAGCGGC AAAAATCTGG ACGGAACTCG ATCCGGAATC GATTCCAGCC CGTCAGATGA TGGCTGCATT GCTGGTGCAT TTTGACAGAT TAGATGAGGC CCGTCCCCAT CTGGAAAAAT TGCTGGCGGT TGCGGGAGAC AAGATCGATG ATGCTTTCAT GCAACTGAAC AGCCTGCTGG TGCGCAGTCC CAACAAGAAT GCAATCTTCG AATTGGTAAA GCAGCTCGCG CAACCTTATC CCGATCTGCC GGAAGCGCAT TTTGCCGAGT CCCAGGCTGC ATGGTTTGCC GAGCGTTTCG ACATCGCTCT GGAAGAGATG AAAAAAGCGC TGGCGCTGAG GCCTGAGTGG GAGATGGCGG CAATTTACGA AGGACGTATT CTGTCGCGCG AATCCAACGC CCGCGCAATC GAATTTTTCG ATGATTATCT CAAGCGCTAT CCCAAAGCCA ACGATACACG CATCACCTAT GCACGCTTGC TGCTAGCTGA AAGAGATTAT AGCAAAGCCC GCGAGCAGTT CCAGAAGTTG CTGACAGAAA ATCCGGATAA CCCGGATGTT GCCATTGCCG TCGGCCTGTT ATCCCTGGAG CTGCAGAACT ACGATGTAGC CGAATCGAAT TTCAAAAGAG CGCTGGAGCT GGGTTACCGG GACCCGGGAA TGGTGCGCTT TTATCTCGGC GGCATCAGTG AAAAAAAACA GCAGATCCCG CAGGCTTTGA ATTGGTACCG CTCGGTTACG GAGGGAACCC AGTTTATCCC GGCACAGATC AAATATGCCA TTCTGTTGAG CAGAACCGGC AAAACCAAGG AGGGTCTCCA TCATTTGCAG CAACTGCCAG TGGCCAATGA CCAGCAGCGT GCGCAGGTGA TCATCGCCGA GGCGCAATTG TTGCGCGAGT CCGGCGCTTA CAAGAAAGCC TTTCAGCTCC TGAGCAGCAG TCTCGAAAAA CTTCCCGATT CCCCCGAACT GCTTTATGAC CGCGCCCTGG CTGCGGAGAA AATAGGCAAG GCGGATATCA TGGAGCAGGA TCTGCGCAAA CTGATCGAAC TGAGGCCCGA CCATGCTCAC GCCTACAACG CCCTGGGTTA CGGTATTGCC GAGCACTCCA GCAAGCGCCT GCCGGAGGCG CTGGAATTGA TCGAGAAGGC GATCAAGCTT TCGCCTATCG ATCCGTACAT CATCGACAGT CTGGGATGGG TGCATTACCG CATGGGGGAC ATCAACCAGG GATTGAGCTA CCTGCGACAG GCGTTTGCAA TGAATCCCGA TCCGGAAATT GCTGCTCACC TGGGGGAAGT GTTGTGGGTG CAGGGAATGA AGGACGAAGC GAAAGAGATC TGGCAGACGG CCCTCAAGAA TCATCCCGGT AACGAAGCGT TGATTGGCGT GATGAAGAAG TTCATGAAGT AG
Protein sequence | MSLKVPGALP LLFLAACSHL PAKTPLTAEK PAEKKESEQS RVPSQDLTPS MLFDFLLAET ALQRGDTEIG LRTYLKLAKN TQDPRVAQRA TEAALQARQP AFALEAAKIW TELDPESIPA RQMMAALLVH FDRLDEARPH LEKLLAVAGD KIDDAFMQLN SLLVRSPNKN AIFELVKQLA QPYPDLPEAH FAESQAAWFA ERFDIALEEM KKALALRPEW EMAAIYEGRI LSRESNARAI EFFDDYLKRY PKANDTRITY ARLLLAERDY SKAREQFQKL LTENPDNPDV AIAVGLLSLE LQNYDVAESN FKRALELGYR DPGMVRFYLG GISEKKQQIP QALNWYRSVT EGTQFIPAQI KYAILLSRTG KTKEGLHHLQ QLPVANDQQR AQVIIAEAQL LRESGAYKKA FQLLSSSLEK LPDSPELLYD RALAAEKIGK ADIMEQDLRK LIELRPDHAH AYNALGYGIA EHSSKRLPEA LELIEKAIKL SPIDPYIIDS LGWVHYRMGD INQGLSYLRQ AFAMNPDPEI AAHLGEVLWV QGMKDEAKEI WQTALKNHPG NEALIGVMKK FMK
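The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes a GC fraction, and translates a CDS. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
prefix = "ATGAGTCTTA AAGTTCCCGG"
print(translate(prefix))    # MSLKVP
print(gc_fraction(prefix))  # 0.45
```

The translated prefix agrees with the first six residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would allow a comparison with the GC content field.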