Gene Nmul_A0807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0807 
Symbol 
ID3785851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp924964 
End bp926160 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content55% 
IMG OID637810893 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_411506 
Protein GI82701940 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACAAGC TCTGGCTAGT TTTCGCGCAA GCTACCACGA TAGTTCTGGC AGCCCTGTTT 
GTCGTTTCGA CTTTACGCCC GGGTCTATTA CCCTGGCAAT CCGGAGATGG CGGTGTCGTA
ACCATAAAGG AAGCTCCGGT TGACAAGTCA CACAAGGCAG AGGTCGAGCC TGCCCCTGGG
AGCTTCAGTA GCGCGGCGAA AAAGGCCATG CCCTCCGTGG TGAATGTATT CACCACCAAG
GAAATCAAGG CTGCCCCCCA TCCCTTCATG GAGGATCCTT TCTTCCGGCG TTTCTTCGGA
GATCGTTTCG AATCCCCTCA GAGCCGCCGC GCCGCCAGCC TGGGATCCGG GGTGATCGTG
AGTCCGCAGG GATATATCCT TACGAACCAT CATGTCATTG AAGCGGCGGA TGAAATCGAG
ATTGCGCTGG CGGACGGGAG AAAAACGAAA GCGCGGGTCA TCGGCTCCGA TCCCGAAACC
GATCTTGCAG TGGTAAGAGT GGATATGGAA GGACTTCCGG CCATCACTTT CGGATACTCC
GACAATGCCC TGGTCGGCGA TATTGTCCTT GCGATCGGTA ATCCTTTCGG TGTGGGCCAG
ACGGTAACGA TGGGGATTAT CAGCGCACTC GGACGAACCC ATCTGGGTAT CAACACCTTC
GAAAATTTCA TTCAGACCGA TGCTGCCATC AATCCGGGAA ATTCTGGCGG TGCACTGGTG
GATGCGTCGG GTAACCTCAT CGGCATCAAT ACCGCCATAG TCTCCAGAAC GGGAGGATCA
CTTGGGATAG GCTTTGCCAT TACAGCAGGG GTAGCCAAGC AGATCATGGA GCAGATTATC
CGGACAGGGG GCGTGACCCG TGGCTGGATC GGCGTGGAAG TACAGGATAT GACGCCGGAA
CTTGCGGAGT CGTTCAAGCG CTCGACTACC AGCGGGGCGT TGATTGCAGG TGTGCTCAAG
GGAGGACCTG CGGATCGTGC CGGAGTGAAG CCGGGTGACA TTATTGTAGG AGTGGGAGGA
AAAGAGGTAA CGGATTCATC CGGCATGCTC AATCTGGTAG CGGCATTACC TCCCGGAAAC
ATGGCGACCA TTACAGTCAT GCGTAACCAG AACAAGAAGG CAATCGAGAT CAATGTTGGA
AAACGTCCCA AGCCTCAGCC TCAGGAGCAG TTTCAGGAGC CCGAGGAGCT GGAATAA
 
Protein sequence
MHKLWLVFAQ ATTIVLAALF VVSTLRPGLL PWQSGDGGVV TIKEAPVDKS HKAEVEPAPG 
SFSSAAKKAM PSVVNVFTTK EIKAAPHPFM EDPFFRRFFG DRFESPQSRR AASLGSGVIV
SPQGYILTNH HVIEAADEIE IALADGRKTK ARVIGSDPET DLAVVRVDME GLPAITFGYS
DNALVGDIVL AIGNPFGVGQ TVTMGIISAL GRTHLGINTF ENFIQTDAAI NPGNSGGALV
DASGNLIGIN TAIVSRTGGS LGIGFAITAG VAKQIMEQII RTGGVTRGWI GVEVQDMTPE
LAESFKRSTT SGALIAGVLK GGPADRAGVK PGDIIVGVGG KEVTDSSGML NLVAALPPGN
MATITVMRNQ NKKAIEINVG KRPKPQPQEQ FQEPEELE