Gene Nwi_2739 details


Gene Information

Locus tag: Nwi_2739
Symbol:
ID: 3676333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2974610
End bp: 2975650
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 66%
IMG OID: 637714306
Product: Sel1 repeat-containing protein
Protein accession: YP_319344
Protein GI: 75676923
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
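
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2974610
end_bp = 2975650
gene_length_bp = 1041
protein_length_aa = 346

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three checks agree with the listed values: a 1041 bp span, 347 codons, and 346 residues after dropping the stop codon.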


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0269187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.433518
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTGC TGCGTCCGGT CGCCATCGCC GCCGGCCTTC TCATGCTCGG GGAAAGCGCG 
GCTGCGCAGC TCCAGCCCTC GCCGTCGGCC GATGCGCCGT CCGCGGGCAA GAGCGTCAAG
ACCAAGGCCA TCAAACCGCC GCCGCCAACG CCGCCCGCCG GGCTGGAGCC GGACAACAAG
GCGAACGCGC AGGTTGCTGA CGATCCGAAC GCGGATCCGG TTTACGGCGC CTATCAGCGC
GGCCTTTACA AGACGGCGTT CGATCTCGCC TTGAAACGAG CACAGGAGGA CAAGAACCCC
GCCGCCATGA CCATGCTCGG CGAACTCTAC GCCAATGGGC TCGGCGTCAG GCGCGACTAC
GGCAAGGCCA TCGAATGGCA TCAACGTGCG GCCGATCTGG GCGATCGAGA GGCCATGTTC
GCGCTCGCCA TGCTTCGCAT CAGCGGACGC GGCGGACCTC CCGACAGGAC GGGCGCGGTG
AAATGGCTGG CGGCGTCGGC CAAGCTCGGC CAGCCCAAGG CCGCCTACAA TCTGGCGCTT
CTCCACATGG ACGGGCAAAC GCTGCCGCAG GATTTCAAGC GCGCCGCCGA ACTGTTGCGA
TTCGCAGCCG ACGCCGGCAG TCCGGAAGCG CAGTATGCGC TGGCCACTTT CTACAAGGAA
GGCACCGGCG TCGAAAAGAA CCTCTACAAG TCGGTGCGGC TGTTGCAGGC CGCCTCGCTC
GCCGGCAACG TCGACGCCGA GGTCGAATAT GCAATCGCGT TGTTCAACGG CAGCGGCACC
GGGAAAAACG AGGCGGCCGC GGTATCGCTG CTGCGCAAAG CCGCCAGGCG AAACAGCGCG
ATCGCCCAAA ATCGTCTCGC CCACGCCCTT GTCGAAGGCA TGGGCGTCCC GATGGACAAG
GTCGAAGGCC TGAAATGGCA CATCGTGGCG AAAACCGGCG GCAAGGGCGA TCTGAAGCTC
GACGCGGCGA TGGCGCAGGC GACGCCCGAA GAACGCGCCG GTGCGGAGAG CGCCGCGCGC
AAATGGCTTG GAATCAAATG A
 
Protein sequence
MSLLRPVAIA AGLLMLGESA AAQLQPSPSA DAPSAGKSVK TKAIKPPPPT PPAGLEPDNK 
ANAQVADDPN ADPVYGAYQR GLYKTAFDLA LKRAQEDKNP AAMTMLGELY ANGLGVRRDY
GKAIEWHQRA ADLGDREAMF ALAMLRISGR GGPPDRTGAV KWLAASAKLG QPKAAYNLAL
LHMDGQTLPQ DFKRAAELLR FAADAGSPEA QYALATFYKE GTGVEKNLYK SVRLLQAASL
AGNVDAEVEY AIALFNGSGT GKNEAAAVSL LRKAARRNSA IAQNRLAHAL VEGMGVPMDK
VEGLKWHIVA KTGGKGDLKL DAAMAQATPE ERAGAESAAR KWLGIK
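
The relationship between the gene and protein sequences can be spot-checked by translating the ends of the gene sequence. This is a rough sketch, not the tool used to produce this record: it builds the standard codon-to-amino-acid mapping (translation table 11 uses the same amino-acid assignments as the standard code and differs only in permitted start codons), and `translate` is a hypothetical helper name. Only the first 30 and last 21 bases listed above are used.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Hypothetical helper: whitespace is ignored and trailing bases that
    do not fill a whole codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 21 bases of the gene sequence listed above.
gene_head = "ATGAGTTTGC TGCGTCCGGT CGCCATCGCC"
gene_tail = "AAATGGCTTG GAATCAAATG A"

print(translate(gene_head))  # compare with the start of the protein sequence
print(translate(gene_tail))  # compare with the end of the protein sequence
```

The head fragment translates to `MSLLRPVAIA` and the tail to `KWLGIK*`, which agrees with the first and last residues of the protein sequence above, with the trailing `TGA` acting as the stop codon.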