Gene Nwi_0183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0183 
Symbol 
ID3676639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp207838 
End bp211017 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content67% 
IMG OID637711721 
ProductSel1 repeat-containing protein 
Protein accessionYP_316803 
Protein GI75674382 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.536181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCGC GGGGATCGTC GAGCGACGAC GGAATCGAAA CATCCGATCG CGAGGGAGCG 
GACGCTGCCG CGCGGCGCGC CGGCACATCC CGCGACGACG GACCGGGTTC CGGTTCCGGC
GGCACCCCGC CGGGTCCGAC GCAGCCGGCG GCATCTCCCC AGAACAGCCC CGAGATGAGG
GACATTCATC AACGGCTAGA TTCCATCACC CGCCAGATCG ATCAGTTGGC AAGCCCGGCC
GAGAACGGCG AACCCGCGGT CGCCCGTCAG CTCAACGACG CCATTTCGCG GCTCGATGCC
CGGCTCGCCC GGGTCACGGC CCAGACACCC GCAGCGAACC CGCAGCAACG GGGCGCGAGC
GGAAGCTACG ACCGACCGAC ATCACCGGAC CTCGTCACGC TCGAATCCGC GATCGCGGAG
ATCGCCGCGC GCCAGCATGA ACTCGATGAG ACGCCGCGCG CGCCATCCTC ATACAGCTCA
GCCCCCGTTG CGTCCGCGAT GAGAACGTCG CGACAGGCGG ATTTCTCCAG CCTCGAACGG
CAACTCTTCA AGATCACGAG CCAGATCGAA TCCCTGCAAC GCCCCGACGG GATCGAGCAG
TCGATCGCCG CATTCCGCAT CGAGCTTGCG GATATCCGTC ATGTGATCAC CGAGGCCATG
CCGCGCAGGG CGATCGAATC CCTCGAAAAC GAGATTCGCT CGCTTTCGCA GCGCATCGAC
GAAGTCCGCC AGAACGGCAG CGATGGTCAG GCGCTGGCGA ATATCGAACG CGCCCTCAAG
GAGATCTACG ACGCTTTGCG ATCGCTGAAG CCGGCGGAAC AGCTCTCAGG ATTCGACGAG
GCCATCCGCA ATCTCGGCAA CAAGATCGAT TCGATCGTGC GGGGCAGCGG CGACAGCGGC
ATGATGCAGC AGCTTGAGAG CGCGATCGGC GCGCTTCGCG GCATCGTCTC GAACGTCGCC
TCCAACGACG CGCTGGAGCG CCTCGGCAAC GATCTCAACA TGCTGTCGTC CAAGGTCGAG
CAGCTCGGCC GGCCCGCGGG CGACGGGGAT TTCTACGCCG CGCTCGAACA GCGCATGGCC
GCATTGACGC AGACCCTGGA AAACCGCGAA AGCCCGGCCT CGGGCAGCAG CTTCGAACAA
CTTGAAGAGA CGGTGCGAGC GCTGTCCGAG CGTCTCGACC GCCTGCCGGC CGGTCACGAT
TCGTCCTCGG CGCTGGCGCA CCTCGAACAG CGGGTCTCGA TGCTGCTCGA ACGCCTGGAG
ACCGCAGGCG AATATTCAGG ATCCCATCTC GGGCGCGTCG AGGAAGGTTT GCAGGACATT
CTGCATTGCC TGGAACGGCA GCAGGCCGGA CTCGCGGCGA TGTCCGAAAG CGGTCCGCGA
AGCGCTGTCC CGACCATGGA TAGCGAGGTC GTGGAAGCGA TCAAGCGCGA ACTATCCGAG
ATGCGCTTCT GCCAGTCGGA AACCGACCGC CATACCCAGG ACTCGCTCGA GGCCGTTCAC
AACACCCTGG AGCATGTGGT CGACCGGCTG GCGATGATCG AAAGCGACCT TCGCGCGGTA
CACACCATGC CCGCCCGTGC CGAGCCGTCC CGCGGCGGAA TAATGCCGGA ACGGACGGCA
AATCTGCCGC CCAAACCCGA ATTGCCGAAT CCGGTACTGT CGCAAGCGGC CACGCAGCCC
ACGCCGGCCC GCACCGCCAC GGCGGCGCCG ATCCCGCGCG CCATAGCCGA CGCCCTGATC
CCCAAAGAGA CCTTCGATCC GGATCGCGTC GTGCCATCCA CGACAGCGCC CTCGCCGCGC
CCCGCGATCG ACCCGAAGCT GCCGCCCGAT CATCCACTCG AGCCCGGCAC CCGTCCGGCG
GGACGCGCGG CAACGCCGTC GGAGCGCATC GCGGCCTCCG AGAGCGAGAC CGGCGACGTC
GCCGAAACCC CGCGCGAGCA GTCGGGCTCG TCCTTCATCG CCGCCGCGCG CCGCGCGGCG
AAGGCGGCCG CCGCCTCCAC GCCGTCGCCC GACAAGGCGG GTCGAACCAA GGTCGCCATC
GAACCGGCGC GCCCCGCCAC GGGCGGATCG GACATCACCT CCAAGATCCG TTCCCTGCTC
GTTGCGGCGA GCGTGGTGGT GATCGTCCTC AGCAGTTTCA AGTTCGCGAT GACCCTGCTC
GACAGTTCTC CGCGCGCCAC CCTCAGCGAA AGCGACCACG CGGCTCCCGT GACCAAGCCG
CCAGCCGACC CCGAGGGCAG CGCCGGCCCA GAGATACCGC AGCCGCCCTC GATGATCGCG
CCGACGCCGA TCGATCGGCA ATCGATGATC GCACCGCCCG CCGGTGACAG CGCGGCTCCG
GCCAAAAACC CCTCGGATGC CATGCCGGCG AGCCGGCCGG ACACGCCGTC CGCCGACGTC
ACCGGTGCGA TCCCGAACGG ACTCACGCCG GAACCGGCTG CAACTCCACC CGCCGCATCG
CTTCCCGACA GCATCGGCGG CACGACGCTG CGTTCCGCCG CGCTCCAGGG CGACCCCGCA
GCCTCGTTCG AGGTCGGTGT TCGCTACGCC GAAGGAAAGG GCGTGACCGT CAACTACGAC
GAAGCCGCGA AGTGGTACGA ACGGGCGGCT CATGCCGGCA TCGTGCCCGC CATGTTCAGG
CTTGGCGCCC TGCATGAAAA GGGGCTCGGC ACGAGCAAGG ACGTCGATAC CGCGCGGCGC
TATTACCTGC AGGCGGCGGA CCGGGGCAAT GCCAAGGCGA TGCACAACCT CGCCGTGCTC
GATGCCGATG GTGGCGGCAA GGGCGCCGAC TATGTGAGCG CCGCGCAATG GTTCAGCAAG
GCGGCCGAGC GTGGAATAGC CGACAGCCAG TACAATCTCG GCATTCTTTA CGCCCGCGGC
ATCGGCGTCG AACAGAATCT CGCGAAATCC TACAAGTGGT TCAGCCTCGC CGCCGCCCAG
GGTGACGTGG ACTCAGGGCG CAAGCGCGAT GAAGTCGCCA AGCGGCTCGA TCCGCCGTCG
CTTGCCGCCG CCAAGCTCGC GGTTCAAACC TTCGTGGTCA CGCCGCAGCC CAGCGATGCG
GTCAGCGTTC CCGGCCCCGC GGGCGGATGG GACAGCGCGG CTGCAAAGCC CCGCGCCAAA
CCGGCGCCGG GCAAATCCGC GAGGGCGAAC CACGCGATGG CGCGGCATAC CGCGCATTAA
 
Protein sequence
MNSRGSSSDD GIETSDREGA DAAARRAGTS RDDGPGSGSG GTPPGPTQPA ASPQNSPEMR 
DIHQRLDSIT RQIDQLASPA ENGEPAVARQ LNDAISRLDA RLARVTAQTP AANPQQRGAS
GSYDRPTSPD LVTLESAIAE IAARQHELDE TPRAPSSYSS APVASAMRTS RQADFSSLER
QLFKITSQIE SLQRPDGIEQ SIAAFRIELA DIRHVITEAM PRRAIESLEN EIRSLSQRID
EVRQNGSDGQ ALANIERALK EIYDALRSLK PAEQLSGFDE AIRNLGNKID SIVRGSGDSG
MMQQLESAIG ALRGIVSNVA SNDALERLGN DLNMLSSKVE QLGRPAGDGD FYAALEQRMA
ALTQTLENRE SPASGSSFEQ LEETVRALSE RLDRLPAGHD SSSALAHLEQ RVSMLLERLE
TAGEYSGSHL GRVEEGLQDI LHCLERQQAG LAAMSESGPR SAVPTMDSEV VEAIKRELSE
MRFCQSETDR HTQDSLEAVH NTLEHVVDRL AMIESDLRAV HTMPARAEPS RGGIMPERTA
NLPPKPELPN PVLSQAATQP TPARTATAAP IPRAIADALI PKETFDPDRV VPSTTAPSPR
PAIDPKLPPD HPLEPGTRPA GRAATPSERI AASESETGDV AETPREQSGS SFIAAARRAA
KAAAASTPSP DKAGRTKVAI EPARPATGGS DITSKIRSLL VAASVVVIVL SSFKFAMTLL
DSSPRATLSE SDHAAPVTKP PADPEGSAGP EIPQPPSMIA PTPIDRQSMI APPAGDSAAP
AKNPSDAMPA SRPDTPSADV TGAIPNGLTP EPAATPPAAS LPDSIGGTTL RSAALQGDPA
ASFEVGVRYA EGKGVTVNYD EAAKWYERAA HAGIVPAMFR LGALHEKGLG TSKDVDTARR
YYLQAADRGN AKAMHNLAVL DADGGGKGAD YVSAAQWFSK AAERGIADSQ YNLGILYARG
IGVEQNLAKS YKWFSLAAAQ GDVDSGRKRD EVAKRLDPPS LAAAKLAVQT FVVTPQPSDA
VSVPGPAGGW DSAAAKPRAK PAPGKSARAN HAMARHTAH