Gene Nwi_2346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2346 
Symbol 
ID3674492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2566954 
End bp2568444 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content61% 
IMG OID637713910 
Productpeptidase S1C, Do 
Protein accessionYP_318952 
Protein GI75676531 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.466246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGAG CGATGCAAGC ATTGAGTCTT CGCCGGCGTT TCCTTCTGAG CGCGTTTTGC 
CTCGGCGTCG CGAGCGCCGT GGTATCCGCG CCGGCGCTGG CCCGCGGTCC CGATGGCATC
GCCGACGTGG CGGAAAAGGT GATCGACGCG GTGGTCAATA TCTCGACGAC CCAAAAGGTG
GAAGCCAAGA GCGGCGAGGG CCGCCGCATG ATTCCGCAGC TTCCGCCGGG ATCACCGTTC
GAGGAGTTCT TCGACGACTT CTTCAAGAAT CGCCGCGGCG GCAAGAGTGA CGGCCTGCAG
ACGCACAAGA CGAACTCGCT TGGCTCCGGA TTCATCGTCG ATGCCGCCGG AATTGTGGTC
ACCAATAATC ACGTCATCGC CGATTCCGAT GAGATCAACG TCATTTTGAA TGACGGCACC
AAGGTCAAGG CCGAGACCGT CGGGGTCGAC AAGAAAACTG ACCTTGCGGT GCTGAAGTTC
AAGCCGCCGC ATCCGCTGAT GGCCGTTAAA TTCGGCGATT CCGACAAGTT GCGGCTCGGC
GAATGGGTGA TCGCGATCGG CAACCCTTTC AGTCTGGGCG GCTCCGTTTC CGCCGGAATC
GTCTCGGCGC GCAACCGCGA CATCAATAAC GGCCCTTACG ACAGCTACAT CCAGACCGAT
GCCGCCATCA ATCGCGGCAA CTCCGGCGGC CCGTTGTTCA ACCTTGAAGG CGAGGTCGTG
GGAGTGAATA CGCTGATCAT CTCGCCATCC GGCGGCTCGA TCGGTATCGG TTTCGCGGTG
CCGTCGAAGA CGGTAGCGGG CGTCGTCGAT CAGTTGCGGA AATTCGGCGA GCTGCGTCGC
GGCTGGCTCG GCGTTCGCAT TCAGCAGGTG ACCGACGAAA TCGCCGACAG CCTCAATATC
AAGCCGGCTC GCGGAGCGCT GGTTGCCGGT GTGGAGGATA AGGGGCCGGC CAAGCCTGCC
GGCATCGAAC CCGGCGATGT CGTCATCGCT TTCGACGGCA AGGACATCAA GGAGCCTAAG
GATCTGTCGC GCATGGTCGC GGATACGGCG GTGGGCAAGA CCGTCAGCGT CGTGGTCATC
CGCAAGGGCA AGGAAGAGAC CAAGAAGGTG ACGCTGGGAC GTCTCGACGA CAGCGACAAG
GCCATCCCCG TTTCGATCAA GACCAAGCCG GAAGATGACG ACAAGCCCGT CACGCAAAAG
GCGCTCGGCC TCGATCTCGC CACGCTAAGC AAGGATCTGC GCGCGCGCTA CAAGATCAAG
GACAGCGTCA AGGGCGTCAT CATCACCGAG GTGGAAGCAG ATTCCGACGC CGCCGAGAAG
CGGCTCAGCG CCGGCGACGT GATCGTCGAG GTGGCGCAGG AAGCTGTGAA CAGTGCGTCC
GATATCCGCA AGCGCATCGA TCAGCTGAAA AAGGACGGCA AGAAAGCTGT GCTGCTGATG
GTGTCGAACG GCGCCGGCGA ACTGCGTTTC GTGGCGCTGA GCCTGAAGTA G
 
Protein sequence
MTRAMQALSL RRRFLLSAFC LGVASAVVSA PALARGPDGI ADVAEKVIDA VVNISTTQKV 
EAKSGEGRRM IPQLPPGSPF EEFFDDFFKN RRGGKSDGLQ THKTNSLGSG FIVDAAGIVV
TNNHVIADSD EINVILNDGT KVKAETVGVD KKTDLAVLKF KPPHPLMAVK FGDSDKLRLG
EWVIAIGNPF SLGGSVSAGI VSARNRDINN GPYDSYIQTD AAINRGNSGG PLFNLEGEVV
GVNTLIISPS GGSIGIGFAV PSKTVAGVVD QLRKFGELRR GWLGVRIQQV TDEIADSLNI
KPARGALVAG VEDKGPAKPA GIEPGDVVIA FDGKDIKEPK DLSRMVADTA VGKTVSVVVI
RKGKEETKKV TLGRLDDSDK AIPVSIKTKP EDDDKPVTQK ALGLDLATLS KDLRARYKIK
DSVKGVIITE VEADSDAAEK RLSAGDVIVE VAQEAVNSAS DIRKRIDQLK KDGKKAVLLM
VSNGAGELRF VALSLK