Gene Nwi_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2471 
Symbol 
ID3674688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2694494 
End bp2696581 
Gene Length2088 bp 
Protein Length695 aa 
Translation table11 
GC content63% 
IMG OID637714037 
Productpeptidase S9, prolyl oligopeptidase 
Protein accessionYP_319076 
Protein GI75676655 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.511382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGA CACCGACACC TCCGAAAGCC GACCGCCGTT CACATACATA TTCGCATCAC 
GGTATCACCC TGACGGACGA CTACGCCTGG CTGAAGGATG CGAACTGGCA GCAGGTGCTT
CGCGAGCCTT CATTACTCGA ACCGGATATT CGCGCCTACC TTGAGGCCGA AACCGCCTAC
TCGAACGCCG TGCTGGCGCA GACCGAGCCG CTGCAAAGGC AACTGGTCGC GGAAATGCGC
GGCCGCATCA AGGAGGACGA TTCCACCGTT CCCTCCCCCG ACGGTCCTTT TGCCTACTTC
ACGAAGTACC GGGAGGGCGG CCAGCACCGG ATGATCGGCC GGCAGCCGCG CGACGGCGGC
GACGCCCGCT TTCTCATCGA CGGCGATGCG CTCGCGGAAA ACAAAAAGTT CTTTCAGCTT
GGCGAGGCCG ATCATTCTCC GGATCATCGC CTGGAAGCGT GGAGCTGCGA CGATCGCGGT
TCGGAATATT TCACGATCCG TATCCGCGAC TGGGACAGCG GCGCCGATCT CGCCGACGTC
ATCGAGGAAA CAGACGGCGG CGTGGTGTGG AGCGCGGACT CACGCGCGTT CTACTACGTC
AAGCTCGACG ACAACCATCG GCCGATGCAG GTCTATCGCC ACCAGATCGG CACGGTTCAG
GCCGATGATA CGCTCATCTA TGAAGAACAG GACATCGGCT GGTTCACGCA TATTCATGAG
AGCGCCAGCG GCCGTTTCTG CGTGATAGCA GGCGGCGATC ATGAAACGTC CGAACAGCGG
CTGATCGATC TGGCTGCGCC ACAGGCCGTG CCGAGCCTGA TCGCGCCACG CGAGGACGGC
GTGCAGTATT CCGTCGCCGA TCGCGGCGAC GAACTGTTCA TCCTCACAAA CGCCGACGAC
GCGATCGACT TCAAGATCAT GACGGCGCCG CTCTCGTCGC CCGACCGCGC GAACTGGCGC
GACCTCATCC CGTATCGTCC CGGCACCTAT GTGATCGACT TTGAGCTTTA TTCCGGTCAC
ATGGCGCGAC TCGAACGAAC CAATGCGCTG CCCTCGATCA TCATCCGTGA TCTGGCGACC
GGCGATGAGC ACGCAATCGC CTTCGACGAG GCCGCCTATT CGCTCGACAC CCATGGCGGC
TACGAGTTCG ACACCACGAC CTTGCGCTTC AGCTATTCGT CGATGACGAC GCCGTCCGAG
GTGTTCGATT ATGACATGGC GAGCCGCTCG CGCACTTTGC GCAAGCGGCA GGAGATTCCG
TCAGGGCATG ATCCCGCGAA CTATGTCACC ACCCGCATCA TGGCGAAGGC GGACGATGGC
GCGGAGGTGC CGGTCTCGAT CCTGCATCGC AAGGATTTCG TGCTGGATGG CCGCGCGCCG
CTTCTGCTCT ACGGCTACGG CTCCTACGGT CATTCCATGC CCGCCTCGTT CTCGGCCAAC
CGGCTGTCGC TGGTCGATCG CGGTTTCGCT TATGCCATCG CGCACATCCG AGGCGGCGCG
GACAAGGGCT GGGGCTGGTA TCTCGACGGC AAGCGCGAGA AGAAAACCAA CAGCTTCGAC
GATTTCGCCG CCTGCGCCCG CGCGCTGATC GCGGCGAACT ATACCTCAGC CAAGAGGATC
GTCGGCCACG GCGGCAGCGC TGGGGGCATG TTGATGGGTG CGGTGGCCAA CCGCGCCGGC
GAGTTGTTCG CCGGCATCGT CGCGGAAGTG CCGTTCGTCG ACGTGCTCAA CACCATGCTC
GACGACACGC TGCCGCTGAC CCCGCCGGAG TGGCCGGAAT GGGGCAACCC CATCGTGAGC
GCGGAGGATT TCCGCACCAT CCTGTCCTAC TCGCCATACG AGAATGTCGC GGCGAAGGAC
TATCCGGCGA TCCTCGCCTT GGGCGGACTG ACCGATCCGC GCGTCACCTA TTGGGAGCCG
GCGAAATGGA TCGCGCGGCT GCGCGCCACC ATGACGTCAG GCGGCCCCGT CCTGCTGCGG
ACCAATATGG GCGCGGGGCA CGGAGGCGCG TCGGGCCGTT TCAACCGGCT CGACGAGGTC
GCACTCGCCT ATGCCTTCGC TCTATGGGCT GTCGAATATG ACAGTTGA
 
Protein sequence
MTSTPTPPKA DRRSHTYSHH GITLTDDYAW LKDANWQQVL REPSLLEPDI RAYLEAETAY 
SNAVLAQTEP LQRQLVAEMR GRIKEDDSTV PSPDGPFAYF TKYREGGQHR MIGRQPRDGG
DARFLIDGDA LAENKKFFQL GEADHSPDHR LEAWSCDDRG SEYFTIRIRD WDSGADLADV
IEETDGGVVW SADSRAFYYV KLDDNHRPMQ VYRHQIGTVQ ADDTLIYEEQ DIGWFTHIHE
SASGRFCVIA GGDHETSEQR LIDLAAPQAV PSLIAPREDG VQYSVADRGD ELFILTNADD
AIDFKIMTAP LSSPDRANWR DLIPYRPGTY VIDFELYSGH MARLERTNAL PSIIIRDLAT
GDEHAIAFDE AAYSLDTHGG YEFDTTTLRF SYSSMTTPSE VFDYDMASRS RTLRKRQEIP
SGHDPANYVT TRIMAKADDG AEVPVSILHR KDFVLDGRAP LLLYGYGSYG HSMPASFSAN
RLSLVDRGFA YAIAHIRGGA DKGWGWYLDG KREKKTNSFD DFAACARALI AANYTSAKRI
VGHGGSAGGM LMGAVANRAG ELFAGIVAEV PFVDVLNTML DDTLPLTPPE WPEWGNPIVS
AEDFRTILSY SPYENVAAKD YPAILALGGL TDPRVTYWEP AKWIARLRAT MTSGGPVLLR
TNMGAGHGGA SGRFNRLDEV ALAYAFALWA VEYDS