Gene Nwi_2594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2594 
Symbol 
ID3675023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2817155 
End bp2819008 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content62% 
IMG OID637714160 
ProductTPR repeat-containing protein 
Protein accessionYP_319199 
Protein GI75676778 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.390942 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCCGG CGCGGGTTGC GTACCGAGAA ACCCTGATGT TCGTGCCAAG AAATCTGGAG 
CTGCGCATCC CGATGTTGTC CATTCGTTTC AATCGCCTGG CGATGGCTGT TGCCGCTGCG
GCGTTGTTGG TTCCGTCTGG TCCGCTGGCG GCTCAGACGC CGGACCATCC GACCGAAACC
TCGCCATTTC CCAGCGCCAA GGATCTGCGC AGCATGACGA CGTCGGGCAG CTATCTCGCC
GCGCGTCATG CCAGCGTCGA GCGCGACGCC AGGTCCGCGG CGACGTTCTA TCGTTCGGCG
TTACGCACCG ATCCGAAAAA CAACGAGCTT CTGGATCGCG CGTTCATCTC GTCGCTGGCC
GAAGGCAATA TCGCCGCCGC GGTGAAGCTG GCCGATCGCA TTCTCGCCAT CGACAAATCC
AACCGCGTCG CGCGGCTCGT CGTCGGCGTT GATAATCTCA AGCAGAAAAA ATACGCGGCC
GCCCGGAAAA ACATCAAATT GTCCATTCGC GGACCGATCA CCGACCTGGT GGCGACGTTG
CTGTCGGGAT GGGCCGACTA TGGCGCCGGC GACACCAGGC AGGCCGTCGC TCTTATCGAT
GCGCTGACAG GTCCCGAATG GTACCCGATC TTCAAGGATC TGCATTCGGG GATGCTGCTC
GATCTTGCGG GCCGGAAGGA AGAGGCTGGG GCTCGTCTCG AGCGCGCTTA CAAGCTCGAC
GATTCGGCGC TCCGCGTGTC GGATGCTTAT GCGCGGTGGT TGTCGCGCAA CAAGGACGCC
GCGGCGGCGC TCGCCGTGTA CGAGTCGTTC GACAAGAAAC TGGCGCGGCA TCCTCTGATT
CAGGAAGGAA TCAAGGAGCT CAAGGCTCGC AAGAAGCTGC CGCAGCTCGT CAATTCGCCG
CAGACCGGCG CCGCCGAAGC TCTTTACGGC ATCGGCGCGT CCCTGACCCG GCGCGGCGGC
GAGGATCTGG CGCTGGTCTA TCTGCAACTC GCGCTTTATC TCTCGCCTGA TCATCCGATG
GCGCTGTTGT CGCTCGCCGA CCTCTATGAA TCGGTGAAGA AGCCGGAGAT GGCGATCGGC
GTTTATGAGC GTGTGCCAGC CGGATCGCCG CTGCGGCGCA ATGCCCAGAT TCAACTCGCC
ACCAATCTCG ACGCCGTCGA TCGCAGCGAC GAGGCGATCA AGATTCTCAA GGGCGTGACG
GAGCAGGACC CGAACGACCT GGAGGCAATC ATGGCGCTCG GCAACGTCGA GCGCGGCCGC
AAGAAGTTCG CCGATTGTGC GAAGACCTAC GGTAAAGGCA TCGACGTCAT CGCCGAAGCC
AAGGACAAGC CGAACTGGGT CTACTACTAT TTCCGCGGGA TCTGCCTGGA ACGGTCGAAG
AACTGGGCCA AGGCGGAAGT CGACATGAAG AAGGCGCTCG AGATTCAACC CGAGCAGCCG
CATGTGCTGA ACTACCTCGG CTACTCCTGG ATCGACCGTG GCCTCAATCT CGATGAAGCC
ATGAAGATGA TCAAGCGCGC CGTCGATCAG CGGCCGGACG ACGGCTATAT CGTCGACAGC
CTCGGCTGGG CCTATTATCG CATCGGCAAT TATGAGGAGG CGGTCAAGAC GCTGGAGCGG
GCCATCAACC TCAAGCCCGA GGATCCGACG GTGAACGATC ACCTCGGCGA CGCTTACTGG
CGCGTCGGCC GGACGCTGGA GGCGAAATTT CAGTGGGCGC ATGCGCGCGA TCTGAAGCCG
GAACAGGAAG AACTGCCGAA AATCGAGGCC AAGATCAAGA ACGGCCTGCC GGCGGACGAT
ACGCCGTCCG CCGCCTCGGC GGACAAGAAA AAGGACGCCG GCAAGGGCGG CTGA
 
Protein sequence
MVPARVAYRE TLMFVPRNLE LRIPMLSIRF NRLAMAVAAA ALLVPSGPLA AQTPDHPTET 
SPFPSAKDLR SMTTSGSYLA ARHASVERDA RSAATFYRSA LRTDPKNNEL LDRAFISSLA
EGNIAAAVKL ADRILAIDKS NRVARLVVGV DNLKQKKYAA ARKNIKLSIR GPITDLVATL
LSGWADYGAG DTRQAVALID ALTGPEWYPI FKDLHSGMLL DLAGRKEEAG ARLERAYKLD
DSALRVSDAY ARWLSRNKDA AAALAVYESF DKKLARHPLI QEGIKELKAR KKLPQLVNSP
QTGAAEALYG IGASLTRRGG EDLALVYLQL ALYLSPDHPM ALLSLADLYE SVKKPEMAIG
VYERVPAGSP LRRNAQIQLA TNLDAVDRSD EAIKILKGVT EQDPNDLEAI MALGNVERGR
KKFADCAKTY GKGIDVIAEA KDKPNWVYYY FRGICLERSK NWAKAEVDMK KALEIQPEQP
HVLNYLGYSW IDRGLNLDEA MKMIKRAVDQ RPDDGYIVDS LGWAYYRIGN YEEAVKTLER
AINLKPEDPT VNDHLGDAYW RVGRTLEAKF QWAHARDLKP EQEELPKIEA KIKNGLPADD
TPSAASADKK KDAGKGG