Gene Nmul_A1891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1891 
Symbol 
ID3784263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2177846 
End bp2179726 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content60% 
IMG OID637811977 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_412578 
Protein GI82703012 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCG ACAAGAGGGA AGAGCAGGCC GGTGTAGCGG CCCGGCTTGC CGAGAGGTTC 
ATATCAAACA TACGCGCAAA GGTTGCCTCA TCCAGCTGCC GTTTTGCAGG GGCGATACTG
GCAGTTTTGC TGGGATTGGG TCCGGTCATT TCCAGTGCAG AGCCTCCGGA GGGCAGGGGG
CCTGGAATGA ACAAGCCGGT TGGCTGGGCG AAGGGTCGCA TCCTGGTCAT GCCGCGTGCC
GGCTTGCCGG AAAAGGAACT GGCCAAGGTT CTGGGCGAAC ATGGCGGCAA GGGCAGGAAG
ATCGGGCAGA GCGATCTGTA TATCGTCGAC TTGCCGGGTA ATGCCTCCGA GAAGGCGGTG
GCGGCAAGGC TTGCGCATCA TCCGGCGCTC AAGTTTGCCG AGATCGACCA GGAAGTCGAA
CCCGCGCTCA TTCCGAACGA TCCCTACTAT GGCAGCGCCT GGCATTTGCC CAAGATTGGC
GCTCCGTCCG CCTGGGACAG TTCCCTGGGC AGCGGCGTGA CCATCGCCAT CCTGGATTCA
GGAGTCGATA GCACTCACCC TGATCTGGCT ACGGAACTGG TGCCAGGCTG GAATTTCTAC
GAGAACAATT CCAATACATC GGATGTTTAC GGGCATGGCA CCAAGGTTGC GGGTGCGGCG
GGAGCGGCAG GCAATAACGC GCTGGGGGTG GCCTCGGTGG CGGCACGGGC GAAGATCATG
CCGATCCGGG TAAGTGGCTC TGATGGGTAT GCCACCTGGA GTGCGATTTC CCAAGGGCTT
ATCTATGCGG CAGACCGCGG GGTTCGCGTC GCCAATGCCA GCTTTCTCGG ACTGACCGAC
AGTGCGAGTA CTCGCAGTGC GGCGCAATAC CTGAAGAACA AAGGCGGCCT TGTCATTGTC
AGCGGGGGCA ACACTGGCGT ACAGCAAAAT TATGCCGCAA CCACCAGCAT GATTCCTGTC
TCTGCCACTG ACGGAAATGA CGTACGGACG AGCTGGTCTA GTTATGGTAA TTACATTGCT
CTCGCGGCTC CCGGTGCAGG AATCTGGAGC ACTACCAAGG GTGGCGGCTA TGGCGCGGTT
TCCGGCACGT CGTTCTCAAG CCCGGTAACG GCTGGGGTGG TGGCGCTCAT GATGGCGGCA
AAGCCGACAT TATCCAATAC CCAGATCGAG AGTCTGCTGT ATTCCACGGC GGTGGATCTT
GGAACGCCAG GGCGCGATCC TTACTACGGC TATGGGCGAG TGAATGCGGC GCGTGCCGTA
CAGGCCGCCG CCGGCACCAC GCTGACAGCA GATACGCAGG CTCCGGCGGT TTCCATCACT
TCCCCTGCTG GCGGATCAAG CGTAACAGGC CTGGTCGGCG TAAACGTGGC TGCGAGCGAT
AACGTCGGCG TGACTCGTGT CGAGTTGCGG GTCAACAATA CCACAGTCGC AGTCGACACC
ACGGCGCCGT TCGCCTTCAC CTGGGATTCG GCAGGCGTAG CCAACGGTAT GGCGAACCTG
ACCGCTTATG CATTCGATGC GGCCGGCAAC TCCAAGGCTT CGACCACCGT CTCGGTGAAT
GTGGCAAACG GAACGACAAC GGTGGCCAGG GATACTACCG CGCCACAGGT GAAGATCGTA
AATCCGGTCA CAGGCAATGT TTCGGGCAGC AATGTGGCCA TCAGCGTGAA TGCAAGCGAT
GATAGCGGCG CTTCCGGAAT CACCTGTACG CTCTACATCG ACGGCGTGCT CAAGGCTACC
GGAAAAGGAA GTACGTTAGG ATATAGCTGG AATACCCGCC CAAGCAATGT GCGCGCGGGA
GCACACACTA TCTGGACGGT CGCCAGAGAT GCGGCGGGCA ATACAGCATC TGCTTCGGTG
AATGTGACAG TGATCAAGTA A
 
Protein sequence
MNADKREEQA GVAARLAERF ISNIRAKVAS SSCRFAGAIL AVLLGLGPVI SSAEPPEGRG 
PGMNKPVGWA KGRILVMPRA GLPEKELAKV LGEHGGKGRK IGQSDLYIVD LPGNASEKAV
AARLAHHPAL KFAEIDQEVE PALIPNDPYY GSAWHLPKIG APSAWDSSLG SGVTIAILDS
GVDSTHPDLA TELVPGWNFY ENNSNTSDVY GHGTKVAGAA GAAGNNALGV ASVAARAKIM
PIRVSGSDGY ATWSAISQGL IYAADRGVRV ANASFLGLTD SASTRSAAQY LKNKGGLVIV
SGGNTGVQQN YAATTSMIPV SATDGNDVRT SWSSYGNYIA LAAPGAGIWS TTKGGGYGAV
SGTSFSSPVT AGVVALMMAA KPTLSNTQIE SLLYSTAVDL GTPGRDPYYG YGRVNAARAV
QAAAGTTLTA DTQAPAVSIT SPAGGSSVTG LVGVNVAASD NVGVTRVELR VNNTTVAVDT
TAPFAFTWDS AGVANGMANL TAYAFDAAGN SKASTTVSVN VANGTTTVAR DTTAPQVKIV
NPVTGNVSGS NVAISVNASD DSGASGITCT LYIDGVLKAT GKGSTLGYSW NTRPSNVRAG
AHTIWTVARD AAGNTASASV NVTVIK