Gene Nmul_A1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1749 
Symbol 
ID3786051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2001686 
End bp2003107 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content54% 
IMG OID637811835 
Productpeptidase S1C, Do 
Protein accessionYP_412438 
Protein GI82702872 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGCTA AATTTTTACT GGTGGTGCTG CTTGGATTAT CCACGAGCGT GTTTGCCCAG 
GCGCCGGCGC GTGAATTGCC GGATTTTACC GGGTTGGTGG AAAGAGAAGG ACCCTCGGTC
GTAAACATCA GCACCGTTCA GTCAGGCAAC TCAACCACGG AACGGGCTTT TCCGGGAATA
CCGAACATAC CCGAGGATGA TCCGTTTTTC GAGTTTTTCC GGAGACACAT GCAGCCGCAT
GGAGGAATGC CACGAGATTT CGAATCCAGA TCGGTAGGCT CCGGCTTTAT TATCAGTTCG
GATGGCTATA TTCTGACAAA TACCCATCTG GTGGATGGAG CCGATGAAAT CAATGTGAAG
CTTACGGACA AACGGGAGTT CCGGGCAAAG CTCATCGGCG CGGATCGCAA GACGGATATT
GCTCTCCTCA AGATCGATGC TACCGGACTG CCGAAAGTCA CCCAGGGGGA TCCCAATAAC
ATGAAGGTAG GAGAATGGGT GGTAGCGATC GGATCACCGT TCGGCTTTGA AAACAGCGTC
ACTGCCGGAA TTGTCAGCGC CAAAGGACGC TCTCTGGCAC AGGAGAATTT CGTGCCCTTC
ATCCAGACGG ATGTCGCGAT CAACCCCGGG AATTCCGGCG GACCGCTATT CAATATGAAC
GGTGAGGTGG TCGGTGTCAA TTCCCAGATC TACAGTCGCA CGGGTGGATT CATGGGTTTG
TCATTCGCCA TTCCCATCGA TGTTGCCCGG GATATATCGA ATCAGTTGAT TGCGAGCGGC
AAGGTAAGCC GAGGCAGAAT CGGCGTGCTG ATTCAGGAAA TCACAAAGGA ACTGGCGGAA
TCGTTCGGTT TACCCAAGCC AGCCGGCGCG CTGGTAGCTT CTGTCCAGAA AGGCGGCCCG
GCAGACAAGG CAGGCATTCA AGCACGGGAT GTCATTCTCA AGTTCGACGG CAAGACCGTG
AATTCCTCGG GCGATCTGCC ACGCATAGTA GGCTCGACCA AGCCAGGCAC GAAAGTGCAG
ATGCAGGTGT GGCGAAATGG GTCGACCAAG GAGTTCACGA TTACAGTGGA TGAGCTTCCG
GAGGATGAAA AACCGGCAGC GCGTTCTGGA AAGCGGGGCA AGACGCCGGA TACAGCGAAT
CGCATCGGCT TGAGCCTGAT CGAGTTGACT CCCGATCAGA AGAAGGAACT GGAAACTGAA
AGCGGGCTTC TGGTCGAGGA CATGGTGCCT GGTATCGCGA GTCGCGCGGG GGTCAGACCC
GGTGATGTCA TTCTGAGCAT CAATAATCAG GATGTCAAAA CGGTTGATCA ATTCAACCAG
TTGCTCAACA AGGTCGAGAA AGGACGCAAT ATCGCCCTGC TGGTCAAACG CGGTGATACA
GCCACCTTCA TTACCATGAA GATGAATGGT GAACACAGAT AA
 
Protein sequence
MIAKFLLVVL LGLSTSVFAQ APARELPDFT GLVEREGPSV VNISTVQSGN STTERAFPGI 
PNIPEDDPFF EFFRRHMQPH GGMPRDFESR SVGSGFIISS DGYILTNTHL VDGADEINVK
LTDKREFRAK LIGADRKTDI ALLKIDATGL PKVTQGDPNN MKVGEWVVAI GSPFGFENSV
TAGIVSAKGR SLAQENFVPF IQTDVAINPG NSGGPLFNMN GEVVGVNSQI YSRTGGFMGL
SFAIPIDVAR DISNQLIASG KVSRGRIGVL IQEITKELAE SFGLPKPAGA LVASVQKGGP
ADKAGIQARD VILKFDGKTV NSSGDLPRIV GSTKPGTKVQ MQVWRNGSTK EFTITVDELP
EDEKPAARSG KRGKTPDTAN RIGLSLIELT PDQKKELETE SGLLVEDMVP GIASRAGVRP
GDVILSINNQ DVKTVDQFNQ LLNKVEKGRN IALLVKRGDT ATFITMKMNG EHR