Gene Nwi_0537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0537 
Symbol 
ID3676945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp599987 
End bp602029 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content63% 
IMG OID637712082 
ProductSel1 repeat-containing protein 
Protein accessionYP_317156 
Protein GI75674735 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.590266 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCAT CCAGATCGCT TCTCCAGTGG GTTCGTCGTC CGGTTCTTGG GCCTTCTCGC 
GGTGGCCTCT TCGAGCATGG AATGGGCGCG TATGAGCGCG GCGAGTATCT TGAAGCGGTC
AGGTGCTGGA AGCTGGCGCT GGAGGCGGGC GAGGCCGAGG CCGCCTATCG GATCGGAAGG
CTTTATGTGA AAGGCGAAGG GGTCGTCAGG AGCGTGCCAG ATGCCGCGAT CTGGTATGAG
CGCGCGGCCC GGCTGGGCCA TGTCGACGCG CAGTTTCAAC TCGGCCTGAT CTTTCTGCAG
GGGGAGAAGC CGCTGCTGGG CCCGCATCGC CACGAAACAT GGCGCCAGTC CTCGGCCATA
CGTCTCGGCG ACAAGGCGTC GAACATACAC CAGCTTGTGT TTCCGCATGG AACGAGCGTT
GAGCGGAATG TTGACGCTGC GTTTCACTGG ATATCGGCCG CGGCGGACAA AGGCAAAGCC
GAAGCGCAGA CGGTCCTCGG CAATATGTAC AGTGAGGGAT TAGGCTGCGA GAAGAATCTC
CAGATAGCGC TTGCATGGTA TGGGGTTGCG GCCGAGCAGA ACTGTGCTGC CGCGGAATTC
GCTCTCGGCG ATATTCATTT TCAGGGCAAA GGGGTTCCCG TCGACTTCGA GCAGGCCGCG
GTCTGGTATC GCAAGGCGGC CGAGCAGGAT CACGTCAGGG CGCAGGTCGC GCTCGCGTTC
ATGAACCTGA AGGGAACGGG AATGCCGGAG AACCCTGCGG AAGCGGCACG CCTGTTCCAG
GGCGCTGCGA TGCATGACGA TATCATCGCG CTGTACAACA TCGGCTTGTT ACGCCTGAAG
GGCCATGGCG TCGCGAAGGA TATCGACAAG GCTGAAACCG CGCTGCGCAA GGCGGCGCGC
AAGGATTACT TTCCCGCGAT TCAGGCGCTG GCGGAATTCT ATTCGCACGG CGGAGGATTT
GCTCCGGATC TTCGGGAGGC CGCGGTCTGG TATGAGAAGG CCGCCGAGCG GGACGACGTA
CAGGCGCAAT TCTTCATGGG CCGATTCTAT GCGATGGGCA CCGGCGTCGG CCCGAACATT
CGTCAGGCTG CAAAATGGTT CGAGCGCGCG GCCCGCAACG GTCACGCGAC TGCCGCGTTC
AACATCGCCA TTTTTTATCT CAACGGATCC GGCGTCGAAC GCGATGTAGA CCGCGCGATC
GAATGGTTCG AGCGCGCGTC CGAAGGCGGC ATCCGCGCCG CGCAGCTGCA GCTCGGAAAA
CTTTATTCGG CAGGCAACGG TGTGCCGCGC GATCAGAAGC TGGCAAGAGA ATGGCTCGGC
AAGGCCGCCA ACGGCGGAGA CCCGGACGCC CAGACGGCCT ATGCGCTTTT CCTGCTTCGT
CAGGATGGTT CAGCGGAACA GCTCGAGCAG GCAAAGGCGC TGCTGGTCGA GGCGGCCGAG
GCCGATCACG CGCCGGCTGC GTTCCAGCTC GGCGTGCTGC ATATGGGCAA GTTCGGGGGA
GAAACAGACA TCGCGGCCGC CGTTCCATGG TTCGCACGCG CCGCCGGCGC GGGGCATGTC
GATGCACAAT ATACCCTTGC GCTGCTTCAT CTCGATCCGG GCAGCGGCAT GAGTGACGCG
AAGGCTGCCG CGTCATGGAT GACGAAGGCC GCTCATGCGG GCCATGCCGG CGCGCAGTTT
CAGCTCGCCG TGCTGTATTG CACCGGCGCC GGCCTGGCTC AGGATGTCGC GCAAGGGGTG
AGGTGGTACG AGGCCGCCGC GCAACAGGGA CACAGGATCG CGCAGTTCAA CCTCGCGGTG
ATGCTTGGAA AAGGACAGGG ATGCGAGGTC GACCTCGGGA AGGCGGTCGA ATGGTTCGAG
AAGGCGGCCC GGCAGGATGT GGCGGAAGCG CAGATTGCGC TCGGCGACGC GTTGATGTCG
GGAAGCGGCG TGACGAAGGA TCAGGATGCG GCGGTGCAGT GGTATCGCCG GGCGGCCGGT
CACAACCATG AAGGCGCCCG GCACCGTCTG AATGCGATCG GCGTAACGAC GGTCGATGGC
TGA
 
Protein sequence
MLSSRSLLQW VRRPVLGPSR GGLFEHGMGA YERGEYLEAV RCWKLALEAG EAEAAYRIGR 
LYVKGEGVVR SVPDAAIWYE RAARLGHVDA QFQLGLIFLQ GEKPLLGPHR HETWRQSSAI
RLGDKASNIH QLVFPHGTSV ERNVDAAFHW ISAAADKGKA EAQTVLGNMY SEGLGCEKNL
QIALAWYGVA AEQNCAAAEF ALGDIHFQGK GVPVDFEQAA VWYRKAAEQD HVRAQVALAF
MNLKGTGMPE NPAEAARLFQ GAAMHDDIIA LYNIGLLRLK GHGVAKDIDK AETALRKAAR
KDYFPAIQAL AEFYSHGGGF APDLREAAVW YEKAAERDDV QAQFFMGRFY AMGTGVGPNI
RQAAKWFERA ARNGHATAAF NIAIFYLNGS GVERDVDRAI EWFERASEGG IRAAQLQLGK
LYSAGNGVPR DQKLAREWLG KAANGGDPDA QTAYALFLLR QDGSAEQLEQ AKALLVEAAE
ADHAPAAFQL GVLHMGKFGG ETDIAAAVPW FARAAGAGHV DAQYTLALLH LDPGSGMSDA
KAAASWMTKA AHAGHAGAQF QLAVLYCTGA GLAQDVAQGV RWYEAAAQQG HRIAQFNLAV
MLGKGQGCEV DLGKAVEWFE KAARQDVAEA QIALGDALMS GSGVTKDQDA AVQWYRRAAG
HNHEGARHRL NAIGVTTVDG