Gene Nwi_1391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1391 
Symbol 
ID3677029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1515861 
End bp1517351 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content62% 
IMG OID637712942 
Productpeptidase S1C, Do 
Protein accessionYP_318004 
Protein GI75675583 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.407109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTGT TTTGGCGCGT TTTCTTCACG CGAGCCGGTA TCCACTTCGC TCCAAAACAC 
TATATATCTG CGATCCTGAT CCCGGGAATC AAATTCATGA CGCTGCTCCG CCTGTCCGCC
GCTCTGCTCC TCTCTCTGAC GATGCTGGCG CCTGCTCATG CCCAGGACCG TCGCGTCCCG
GCCTCCCAGG CCGAACTGCG GCTTTCCTAT GCGCCGATCG TGCAGCGGGT GGTCCCGGCC
GTGGTCAACG TCTACGCCGC TAAAATGATT CAGCATCGCC ATCCGCTGCT GGATGATCCG
ATATTTCGTC GTTTCTTCGG CGTTCCGGGC GGGCAGCCGG AACAGATGCA GCGCTCGCTC
GGCTCCGGCG TGATGGTCGA TCCGTCCGGC CTCGTCGTGA CGAACAATCA TGTGATCGAG
GGCGCGGATC AGGTCAAGAT CGCGCTGCCG GACAAGCGTG AGTTCGAGGC GGAGATCGTG
CTCAAGGATA CGCGCACTGA TCTTGCGGTC CTGCGGTTGA AGAATGTGCA TGAGAAGTTT
CCGACGCTCG ATTTCGCGAA TTCGGATGAG TTGCAGGTCG GCGATGTCGT CATGGCGATC
GGAAATCCGT TCGGCGTCGG GCAGACGGTG ACGCAAGGCA TCGTTTCGGC GCTGGCGCGC
ACGCAGGTCG GGATTACGGA TTATCAATTC TTCATCCAGA CCGATGCGGC GATCAATCCC
GGAAACTCGG GCGGGGCGCT GGTTGACATG ACGGGCCATC TCGTCGGCAT CAACACCGCG
ATTTTCTCGC GCACCGGCGG GTCGCTTGGC ATCGGCTTCG CCATTCCGGC CAACATGGTG
CGGGTGGTGG TGGCGTCGGC CAAGAGCGGC GGCAAGGCGG TGCGCCGGCC CTGGCTCGGA
GCCCGGCTGC AGGCGGTAAC CCCGGAGATT GCCGAAACCA TTGGCCTCAA GGCGCCGGCC
GGTGCGCTGG TGGCCAGCGT TGCGGCTGGA AGTCCGGCGG CGCGCGCCGG TCTCAAACTG
TCCGACCTGA TCGTCTCGAT CGACGGTCAG CCTGTCGATG ACCCCAATGC GTTTGACTAT
CGGTTCGCAA CGCGTCCGCT CGGCGGGACC TCGCAGCTGG AAGTCCAGCG CAACGGAAAG
CCGCTCAAGC TCACTGTTCC GCTCGAAACA GCGCCGGATA CCAGTCGCGA CGAGGTCGTG
ATTACCGCAC CATCACCGTT CCAGGGCGCC AAGGTGGTGA ATGTCTCGCC GGGGCTGGCC
GATGAGCTTC ATCTCGATTA CGGCGCCGAA GGGGTCGTGA TCGTCGCATT GGCGGAGAGT
GGAATGGCCG CGAATGTCGG GTTCAGGAAG GGCGACATCA TCCTCGCGGT CAACAACAAG
AAGATTGTCA AGACCACCGA TCTCGAACAA GCCGTTCGCG AAGCGAGGCG GCTCTGGCGG
ATCACGCTGA TGCGCAGCGG CCAGCAGATT CAAGTCATGC TGGGCGGATG A
 
Protein sequence
MSLFWRVFFT RAGIHFAPKH YISAILIPGI KFMTLLRLSA ALLLSLTMLA PAHAQDRRVP 
ASQAELRLSY APIVQRVVPA VVNVYAAKMI QHRHPLLDDP IFRRFFGVPG GQPEQMQRSL
GSGVMVDPSG LVVTNNHVIE GADQVKIALP DKREFEAEIV LKDTRTDLAV LRLKNVHEKF
PTLDFANSDE LQVGDVVMAI GNPFGVGQTV TQGIVSALAR TQVGITDYQF FIQTDAAINP
GNSGGALVDM TGHLVGINTA IFSRTGGSLG IGFAIPANMV RVVVASAKSG GKAVRRPWLG
ARLQAVTPEI AETIGLKAPA GALVASVAAG SPAARAGLKL SDLIVSIDGQ PVDDPNAFDY
RFATRPLGGT SQLEVQRNGK PLKLTVPLET APDTSRDEVV ITAPSPFQGA KVVNVSPGLA
DELHLDYGAE GVVIVALAES GMAANVGFRK GDIILAVNNK KIVKTTDLEQ AVREARRLWR
ITLMRSGQQI QVMLGG