Gene Nwi_1195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1195 
Symbol 
ID3674423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1301419 
End bp1302969 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content63% 
IMG OID637712745 
Productpeptidase S1C, Do 
Protein accessionYP_317809 
Protein GI75675388 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC TGCCGTCTTC CCAGTTAAAG CCCATTCGTT CTGGGGTCAC CGCGCGCAGG 
TTCGCGTTGA TGGCGTCTGT CGTCGCGGGT GTTGGCGTAG CCGCTTACGG GCTAAGCCCT
TCCGCAGACA AGGCCGAGTT CTTCATAAGT CCCGCTTATG CGCAGGTCAA CACCGACGTG
AGCAAGGTCG CGCAGCCGAT CGGCTTTGCC GACATCGTCG AGCGGGTGAA GCCGTCGGTA
ATTTCGGTCA AGATCCGTAT GAAGGACAGG GCGGTGGCCA GCGGGGACGA TTCGCCGTTC
CAGCCGGGCT CGCCCATGGA GCGGTTTTTC CGCCGCTTTG GCGGACCGGA TGGCCTACCG
CCAGGAATGC GCGGCCGGCG CGGTCCTGGC CTCGTGACAG GGCAGGGCTC CGGCTTCTTC
ATCTCGGCTG ATGGGTATGC TGTGACCAAC AACCATGTCG TCGACGGCGC GGACAAGGTC
GAAGTCACGA CCGACGACGG CAAGACCTAC ACCGCCAAGG TGATCGGGGC TGATCCGCGC
ACGGACATTG CCCTGATCAA GGTCGAGGGC GGCTCCAGTT TCCCGTTTGC GAAACTGTCG
GATGACAAGG TGCGGATCGG CGATTGGGTT CTCGCGGTCG GCAATCCGTT CGGTCTTGGC
GGTTCGGTGA CCGCCGGCAT CGTGTCGGCC AGCGGCCGTG ACATTGGCAC CGGCCCCTAT
GACGACTTCA TCCAGATTGA CGCCCCGGTC AACAAAGGTA ATTCCGGCGG CCCGGCTTTC
AACATGAAGG GCGACGTCAT CGGCGTGAAT ACCGCGATCT ATTCGCCGTC CGGCGGCAGC
GTCGGCATTG CGTTCTCGAT TCCCGCGAAT ACGGTGAAGA GTGTCGTTGC TCAGCTCAAG
GACAAAGGGT CGGTGAGCCG CGGATGGATC GGCGTCCAGA TCCAGCCGGT GACTTCCGAC
ATCGCCGACA GCCTCGGTCT CAAGAAAGCC GAAGGCGCGC TGGTGGCTGA GCCGCAGGCC
AACAGTCCGG CGGCCAAGGC GGGTATAAAG TCCGGCGACG TGATCACGGC CGTCAATGGT
GATCCGGTCA AGGACGCGCG CGAACTGGCG CGGACCATCG GTAGCTTGGC GCCGGGAACG
TCGATCAAGC TCAACGTCCT GCGCAAGGGC ACGGACAAGG TCGTCAACCT GACGCTTGGC
ACTCTGCCCG ACACCGTCGA AGCCCGAGCT GACCACTATG GCGATCGTGG CTCATCGCGC
GACAACGATA TTCCCAGGCT TGGTTTGACC GTTGCGCCGG CAGGATCGGT TGCGGGTGCC
GGCGGGGAAG GGGTCGTCGT GGTTGACGTG GATCCGAAAA GCCCCGCGGC CGAGCGCGGG
TTCAGGGAAG GCGACGTGAT TTTGGAGGTC GCGGGAAAGA GCGTGTCCAG CTCCGCCGAC
ATACGCGAGG CAATCAAGGC GGCGCGCGCG GAAGACAAGA ACAGTGTTTT GATGCGGGTT
CGCACCGGCA GCACGGCGCG TTTCGTCGCA ATGCCAATTG CGAAGGGTTA A
 
Protein sequence
MTDLPSSQLK PIRSGVTARR FALMASVVAG VGVAAYGLSP SADKAEFFIS PAYAQVNTDV 
SKVAQPIGFA DIVERVKPSV ISVKIRMKDR AVASGDDSPF QPGSPMERFF RRFGGPDGLP
PGMRGRRGPG LVTGQGSGFF ISADGYAVTN NHVVDGADKV EVTTDDGKTY TAKVIGADPR
TDIALIKVEG GSSFPFAKLS DDKVRIGDWV LAVGNPFGLG GSVTAGIVSA SGRDIGTGPY
DDFIQIDAPV NKGNSGGPAF NMKGDVIGVN TAIYSPSGGS VGIAFSIPAN TVKSVVAQLK
DKGSVSRGWI GVQIQPVTSD IADSLGLKKA EGALVAEPQA NSPAAKAGIK SGDVITAVNG
DPVKDARELA RTIGSLAPGT SIKLNVLRKG TDKVVNLTLG TLPDTVEARA DHYGDRGSSR
DNDIPRLGLT VAPAGSVAGA GGEGVVVVDV DPKSPAAERG FREGDVILEV AGKSVSSSAD
IREAIKAARA EDKNSVLMRV RTGSTARFVA MPIAKG