Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1066 |
Symbol | |
ID | 3784886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1233764 |
End bp | 1234714 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811150 |
Product | peptidase S49 |
Protein accession | YP_411761 |
Protein GI | 82702195 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.609608 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGATT CACAGAATCG AAACGAAGGT TGGGACAGGC AGGTGCTGGA AAAGCTGGTG TTCTCCACCT TGCAGGAACA GCGCCGCACG CGGCAGTGGG GGGTGTTTTT CAAGTCCCTT ACTTTTATCT GGCTGTTTAT CCTCCTGTTT TTCGGTCTTG GCTGGTTTGG AGACAGCAGT ATGTCCATTT CCGGCAAGCA TACCGCTCTC GTCGATTTGC GCGGTGTAAT CTCTCCCGAT AGCATCAGCA GCGCGGAAAA CATCACTGCC GGCCTGCAGC AGGCATTCAA GGACGCAAAA ACGCAGGGGG TGATCCTGCG CATCAACAGT CCCGGGGGGA GTCCCGTTCA GGCAGGATAC ATCAACGATG AGATACGCCG CCTGCGTGCA GAATATCCTG AAATACCCCT CTACGCTGTC GTGGAAGATA TCTGCGCTTC GGGCGGCTAT TATGTAGCGG TTGCCGCCGA CAAGATATAT GTGGACAAGG CAAGTATCAT TGGTTCCATC GGCGTCCTGA TAAACGGGTT CGGTTTTACA AAAGCAATGG AAAAACTTGG CATCGAAAGG CGCTTGATCA CGGCAGGAGA AAACAAGGCT TTTCTCGATC CATTTTCTCC CAACAATCGC GAGCAGGAGG AATATGCCAA GAAAATGCTG GGTGATATCC ATGAGCAATT CATTCAGGTA GTTCAGCAAG GCCGGGGCGA ACGCCTGAAG GAAAAGCCGG AAATATTCAG CGGCAAGGTG TGGACAGGTC AAAAAAGTGT CGAACTGGGA CTTGCTGACG GAATGGGCAG CGCGGAATAC GTGGCGCGGG AAATTATCAA GGCGGAACAC ATCGTCGACT ATACGACCCG GGAGGGAGTT GCCGAGCGCC TCGCCAAACG CTTTGGAGGA GTCCTGGCGG AAACGCTGAG TGGTTTGGGA ATGAGTGCGG AACTCCACTA A
|
Protein sequence | MSDSQNRNEG WDRQVLEKLV FSTLQEQRRT RQWGVFFKSL TFIWLFILLF FGLGWFGDSS MSISGKHTAL VDLRGVISPD SISSAENITA GLQQAFKDAK TQGVILRINS PGGSPVQAGY INDEIRRLRA EYPEIPLYAV VEDICASGGY YVAVAADKIY VDKASIIGSI GVLINGFGFT KAMEKLGIER RLITAGENKA FLDPFSPNNR EQEEYAKKML GDIHEQFIQV VQQGRGERLK EKPEIFSGKV WTGQKSVELG LADGMGSAEY VAREIIKAEH IVDYTTREGV AERLAKRFGG VLAETLSGLG MSAELH
|
| |