Gene Information |
Locus tag | Nmul_A1882 |
Symbol | |
ID | 3786532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2168084 |
End bp | 2169205 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637811968 |
Product | hypothetical protein |
Protein accession | YP_412569 |
Protein GI | 82703003 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
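The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, and the helper names span_length and expected_protein_length are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2168084
end_bp = 2169205
gene_length_bp = 1122
protein_length_aa = 373


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values listed in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

If either comparison prints False for another record, the stated assumptions (inclusive coordinates, a single terminal stop codon, an unspliced gene) may not hold for that entry.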
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACGC TGGATTTTCG CCGGGCGGGT GCGCACATTG CCTTGGTCGT GCTGTTCGCT GTTTGTCCGG TTGCGTTTGC TGATGAAAAC GATGAAGTTG CCAAACTCTA CCAGCAGGGC AATCTGGACA AGGCACTGGA ACAGGCAAAT GCGTACCTCG CGCTCAAGCC AAAGGATCCG CAGATGCGGT TTCACAAGGG GTTGATACTG ACCGAGCAGC AGAAAATCCC TGATGCGATC AAGGTCTTTT CTTCACTGTC GGAAGATTAC CCCAACCTGC CCGAGCCGTA TAATAACCTG GCGGTACTCT ATGCCAGCCA GGGCCTGTAT GAAAAGGCGA GGGGAGCGCT GGAGGCCGCT ATACGCACTC ACCCCAGCTA CTCCATTGCG CATGAGAACC TGGGTGATAT CTATGCAAAA CTGGCCAGCG AGGCTTACGG CAAGGCCTTG CAACTGGACC AGGGCAACGC CGCGGCCCAG ACCAAGCTTG CAATGATCAA GGATTTATTC ATCGGCAAAT CCGGCGCGAT AAAAACTGCA TCCGCTGCCA CTGCGCCGGC ACCGCCTGCT TCAGCCGCCC CCCGCTCCCC TGCGGCCACT ACGCCATCTC CTCCCCGATC CGCTGCTCCC GCAACCCCGC CTGCTTCCGT CCACCCCGCG CCCGGTAAAG CCCCGCAAAA ATCAGCGGAA AAAAAAACTG AAAAGGCGGA AAAATCAGCA CCGGGAAAGA TAGCCGCGGT TGAAACACAT TCACCAGAAA AGAAAACCGA TGTTCCGGAC GAATCCGACG AAATCATCAA AACTGTCAAT GCGTGGGCCA GGGCATGGTC CGACAAGAAC GTGACGGCAT ACTTCGCATT CTACGCGGCT GACTTTCAAA CTCCTCGTGG CGTCAAGCGC ACAACATGGG AAAAAACGCG ACGCGACCGC ATCATCAAGC CAAAAGCCAT CCAGGTGGAG ATCACTCACC CCAAGGTAAC TTTAATCAAT CCAGCACGCG CGAGGGTAAG CTTCAGGCAA CTCTATCACT CCGACGCATT CAAACACGAT TCGTCCAAAA CACTTGAGAT GGTTAAAACG GACGGGAAGT GGCAGATCCG ACAGGAGCGC TCCGCGAAAT GA
|
Protein sequence | MKTLDFRRAG AHIALVVLFA VCPVAFADEN DEVAKLYQQG NLDKALEQAN AYLALKPKDP QMRFHKGLIL TEQQKIPDAI KVFSSLSEDY PNLPEPYNNL AVLYASQGLY EKARGALEAA IRTHPSYSIA HENLGDIYAK LASEAYGKAL QLDQGNAAAQ TKLAMIKDLF IGKSGAIKTA SAATAPAPPA SAAPRSPAAT TPSPPRSAAP ATPPASVHPA PGKAPQKSAE KKTEKAEKSA PGKIAAVETH SPEKKTDVPD ESDEIIKTVN AWARAWSDKN VTAYFAFYAA DFQTPRGVKR TTWEKTRRDR IIKPKAIQVE ITHPKVTLIN PARARVSFRQ LYHSDAFKHD SSKTLEMVKT DGKWQIRQER SAK