Gene Nwi_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0449 
Symbol 
ID3676624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp505255 
End bp506610 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content62% 
IMG OID637711990 
Productpeptidase S41A 
Protein accessionYP_317068 
Protein GI75674647 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.143572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.477373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGCA AGGTTTCCCT TATTGTTCTC AGCGCTGCCG CGGGTGCCGC GTTGACGCTG 
TTCGTGACGC AGCCGCGATC CGTGCTGATG GGATCGAGCG CGCGCGCCGC GACGTCGGAT
ACCTATCGCC AGCTCAACCT GTTCGGCGAC GTGTTCGAGC GTGTCCGCAC CGATTACGTC
GAAAAACCCG ATGACAGCAA GCTGGTCGAA TCGGCCATCA GCGGCATGCT CACGGGTCTC
GATCCGCATT CGAGCTACAT GGATGCAAAG AGCTTCCGCG ACATGCAGGT CCAGACCCGC
GGTGAATTCG GCGGCCTCGG CATCGAGGTG ACGATGGAGG ATGGCCTGGT CAAGGTGGTC
TCGCCGATCG ACGACACGCC GGCGTCAAAG GCGGGGATCC TCGCGAACGA CATCATCACC
AATCTCGACG ACGAGGCGGT GCAGGGCCTG ACCCTCAACC AGGCCGTCGA CAAGATGCGC
GGCCCGATCG GCACCAAGAT CAAGCTGAAG ATCATCCGTA AGGGACAAGA TAATCCGATC
GACGTGACGC TGGTGCGCGA CAACATTCGT GTCCGCTCGG TTCGTTCGCG GACGGAGTCC
GACGACATCG CCTATATCCG CATCACGACC TTCAACGAGC AGACCACCGA GGGACTGAAG
AAGTCGGTTT CCGACCTCCA GAGCCAGATC GGTGACAAGC TCAAGGGGTA TATCATCGAT
CTGCGAAACA ACCCCGGCGG GTTGCTCGAG GAAGCCGTGA CCGTTTCAGA TGCCTTCCTC
GATCGCGGCG AGATCGTGTC GACGCGCGGG CGCAACGCCG AGGAAACCCA GCGGCGCAGC
GCGCATCCGG GTGACCTCGC CAAAGGCAAG CCGGTGATCA TTCTGGTCAA CGGCGGATCG
GCATCCGCGT CGGAAATCGT CGCCGGCGCG TTGCAGGACC ACAAGCGGGC GACCCTGATC
GGCACCCGCT CGTTCGGCAA GGGCTCGGTG CAGACCATCA TTCCCCTCGG CTCCGGCAAT
GGCGCGCTCC GGCTGACGAC CGCGCGCTAT TATACGCCCT CGGGCCGCTC GATTCAGGCC
AAGGGCATCG TTCCGGATAT CGAGGTGCTT CAGGATGTGC CCGATGAATT GAGGTCACGG
ACCGACACCA AGGGCGAGGC TTCGCTGCGG GGACACCTGC GAAACGGCAA CGACGAGAAG
ACAGGCTCGC AATCCTACGT CCCGCCGGAC GCCAAGAACG ACAAGGCGCT CAAGATGGCC
GGAGACCTTC TGCGCGGCGT CAAGATCAAC GCCTCTTCGC CGCCCTCCAA CAATAAAGCG
GCGATCGAAA AGCCCGCGAA CAAGGCGGCG AACTGA
 
Protein sequence
MMRKVSLIVL SAAAGAALTL FVTQPRSVLM GSSARAATSD TYRQLNLFGD VFERVRTDYV 
EKPDDSKLVE SAISGMLTGL DPHSSYMDAK SFRDMQVQTR GEFGGLGIEV TMEDGLVKVV
SPIDDTPASK AGILANDIIT NLDDEAVQGL TLNQAVDKMR GPIGTKIKLK IIRKGQDNPI
DVTLVRDNIR VRSVRSRTES DDIAYIRITT FNEQTTEGLK KSVSDLQSQI GDKLKGYIID
LRNNPGGLLE EAVTVSDAFL DRGEIVSTRG RNAEETQRRS AHPGDLAKGK PVIILVNGGS
ASASEIVAGA LQDHKRATLI GTRSFGKGSV QTIIPLGSGN GALRLTTARY YTPSGRSIQA
KGIVPDIEVL QDVPDELRSR TDTKGEASLR GHLRNGNDEK TGSQSYVPPD AKNDKALKMA
GDLLRGVKIN ASSPPSNNKA AIEKPANKAA N