Gene Nmul_A0534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0534 
Symbol 
ID3784523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp613763 
End bp615859 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content57% 
IMG OID637810616 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_411234 
Protein GI82701668 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.709674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCTT CCCACCAGCC CCCGGAAAGT CAGCCCACTG TCAAAGTTTT TGACCCGCTG 
GAGCTTGACC GTTCCGTGAT TGCCATTCCT CTTCTGCGGC AGATGGAGGA GGAATTAAGA
GGAATTGAAG CTTTTCGGGC TGCACATCCG TTCCCCGAAG ATGGAGAATT CAATACGCTG
ATCGAATACA ACAGGGAATT CGATGGAAAG CCGGAGGAGA TGCGGGAGTT GGTCGTCAAG
ATGGCCGAAG AGGCGGCAGC GAAGGCATTG GAAGCATCGA AAAAACGAGT CGAAGAGAGC
AAGGAGATAG CCCCAGCGGG AACACGGGGC ACGCCGGTGC GCGGTGGCTC GACAAGGAGT
CGGGCTGATA TTTACGGAAA TCCTTTTTTG ATGGAGCAAC GGCGCGACAA GGATCTCCAG
GAAGCGGTCG GGAATCAGAA AATCGGACCG GTCTCGGAGG ACGGCGCCTA TAGCTTTGCC
AGCTTGCACG CAACCATAAT ACGGCGAATA TTAGCCGCAA ACGAACGGCT GTCACAGGGT
GAGGAAGCTT CAAAGAAACC ACCTATCCTT CGGATCCACC CTACACGTTA TGAGGTCATC
ATCGATATCA ATCTCGAATA TCCGGGTGGA CGGGAAGAGG CCCGGCGCTG GATTTACCAG
AATATCGAAG AGGCCAAGAA CAAAGCGAAA GTGTACGACG CGGGTCAGGA TATTCATCTC
AAGAAGGAAC AGCGCGAAAG CGGCTACATG TTCGCCTGCC TGGAAGCTCG TACCATCAAG
GCGCTTATCG AACTCGATGT TGCCCAGGCA AAGGTGAAGG CGCGCGAAGC TCAGGCAAAA
GTGGGGGATA CGAGCCATGA AGCGAAGGCT CGTGCAGCAA AAATCAACCC GGCCAAGTTT
CGCGCCATCT TCCGCATATG GCCGGATTTC GAGATTTCTG CCTCCATCAC CCACTCCATC
GCCACGATAA AAGCCGATGC AGCGCAGAAT TCTTTTTCCG CGCGGGGCGC CGGAATCACC
TGGGCGGTGA TGGATTCAGG AATAAAGCAG GATCATGGGC ACTTTCGCAA ACACAACAAT
GTCGATAAAG ACTCCACATG GCATAAAGAC TTCACGGCGG ACGGCAGCGA TCCGTTCAAC
GATGAGAACG GGCATGGAAC GCATGTGGCA GGCATCATTG CCGGCGAATG GCGTCCGGCA
GGTGCACCAG GAGGGACACA AGTGGCCGCG CCAGCGGAGG TTTCACCTCC GGCGGGCGAT
TCCGCCGCAC CATCCCGAAT TCCTGTAGCG GTTTCCCGTT ATCTGAAAAC GGGTTCCGAA
GATGTCGAAT ACCAGCAGAT AAAACTCGTG GACGGCATCA GCGGGATGGC GCCGCGCTGC
AAGCTGATAA GCCTGCGGGT GTTGGACGAA AACGGAAAAG GGAGTGTGAG CAATCTCATT
GCGGCCATCG CCCATGTGCA GGAGATCAAC GGTTACGGCC GAAAGCTTCT GATCCACGGA
GTCAACATGA GCCTGGGATA TACCTTCGAG CCCGAATGGT TCGCCTGCGG CCAGAGTCCG
CTCTGCGTGG AGGTTGACCG GCTGGTAAAA ACCGGCGTCG TGGTCGTGGT CGCCGCAGGC
AATACAGGTT ACGGCACGTT GAAGGCCGCC ACAGGCGCGC TCTCGGCAGG AATGGCGCTG
ACCATCAATG ATCCCGGCAA TGCGGAACTC GCCATTACAG TGGGTTCGAC CCACCGCGAC
ATGCCGCACG TTTACGGGGT TTCCTATTTT TCCTCGAAAG GCCCGACAGG CGACGGCCGC
CTCAAGCCGG ACCTCGTCGC GCCCGGCGAA AAAATCATCT CCTGCGCCAC GGGTCAGTTA
AAAAGCGAGG GCGCACAGGG GATCGAATGC GACTACGTCG AAACCAGCGG CACGAGTATG
GCAGCTCCCC ATGTCAGCGG CGCGATCGCA GCGTTTCTCT CGGTGCGCAC CGAGTTCATC
GGCAAGGCAG AACGTGTTAA AGAGATTTTC GTTGATACGA CTACTGACCT GCGGCGCGAC
CGCTACTTTC AGGGCAGTGG TCTCGTTGAT CTGATGCGCG CTATTCAGTC GGTGTGA
 
Protein sequence
MSPSHQPPES QPTVKVFDPL ELDRSVIAIP LLRQMEEELR GIEAFRAAHP FPEDGEFNTL 
IEYNREFDGK PEEMRELVVK MAEEAAAKAL EASKKRVEES KEIAPAGTRG TPVRGGSTRS
RADIYGNPFL MEQRRDKDLQ EAVGNQKIGP VSEDGAYSFA SLHATIIRRI LAANERLSQG
EEASKKPPIL RIHPTRYEVI IDINLEYPGG REEARRWIYQ NIEEAKNKAK VYDAGQDIHL
KKEQRESGYM FACLEARTIK ALIELDVAQA KVKAREAQAK VGDTSHEAKA RAAKINPAKF
RAIFRIWPDF EISASITHSI ATIKADAAQN SFSARGAGIT WAVMDSGIKQ DHGHFRKHNN
VDKDSTWHKD FTADGSDPFN DENGHGTHVA GIIAGEWRPA GAPGGTQVAA PAEVSPPAGD
SAAPSRIPVA VSRYLKTGSE DVEYQQIKLV DGISGMAPRC KLISLRVLDE NGKGSVSNLI
AAIAHVQEIN GYGRKLLIHG VNMSLGYTFE PEWFACGQSP LCVEVDRLVK TGVVVVVAAG
NTGYGTLKAA TGALSAGMAL TINDPGNAEL AITVGSTHRD MPHVYGVSYF SSKGPTGDGR
LKPDLVAPGE KIISCATGQL KSEGAQGIEC DYVETSGTSM AAPHVSGAIA AFLSVRTEFI
GKAERVKEIF VDTTTDLRRD RYFQGSGLVD LMRAIQSV