Gene Nwi_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1661 
Symbol 
ID3675396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1806745 
End bp1807989 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content62% 
IMG OID637713219 
Productcysteine desulphurases, SufS 
Protein accessionYP_318274 
Protein GI75675853 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.278494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATC CGGCGGTCGC CAACGGTTCC TATGATGTCA TGCGCGTGCG CGAGGATTTT 
CCGGCGCTGG CGATGAAGGT TTATGGCAAA CCGCTGGTCT ATCTCGACAA CGCCGCCTCG
GCGCAGAAGC CGAACGCGGT GCTCGACCGG ATGGCTGAGG CTTACAAGAC CGAGTACGCC
AACGTTCATC GCGGGCTGCA TTATCTCGCC AACGCCGCAA CCGAAGCCTA TGAAGGCGGC
CGCGCGCGGG TCGCCCGGTT TTTGAACGCA GGACGGACCG AAGAGATCAT CTTTACGCGT
AACGCCAGCG AGGCGATCAA CCTCGTGGCG TCGTCATGGG GCGAGCCGAA CATCAAGGCG
GGCGACGAGA TCGTGCTCTC CATCATGGAG CATCATTCCA ACATCGTTCC CTGGCATTTC
CTGCGCGAGC GCCACGGCGC CGTGATCAAG TGGGCGCCCG TCGACGACGA CGGCAACTTC
CTGATCGAGG AATTCGAGAA GCTGCTGACG CCGAAGACGA AGCTCGTGGC CATCACGCAG
ATGTCCAACG CGCTCGGCAC CCTCGTTCCA GTCAAGGACG TGGTGAAGCT GGCTCATGCG
CGCGACATAC CGGTGCTGGT GGACGGCAGC CAGGCCGCGG TGCACCTTGC GATCGACGTG
CAGGACATCG ACTGCGATTT CTATGTCTTC ACCGGACACA AGCTCTACGG CCCGACCGGG
ATCGGCGCGC TGTATGCGAA GCATGACCAT CTCGTCTCCA TGCGTCCCTT CAACGGCGGC
GGCGAGATGA TTCGCGAGGT CGCGCAGGAT TGGGTCACCT ACGGCGATCC ACCGCACAAG
TTCGAGGCAG GCACGCCCGC GATTGTCGAG TCGATCGGGC TTGGCGCCGC CATCGATTAC
GTCAATTCGA TCGGGAAGGA ACGCATCGCA GCGCACGAAC ACGATCTTCT GACCTATGCC
CAGGAGCGGC TGCGCGAGAT CAACTCGCTG CGCATCATCG GCACCGCGCG CGACAAGGGG
CCGGTGATCT CGTTCGAGAT GAAAGGCGCG CATCCACACG ATGTCGCGAC CGTGATCGAC
CGCGCCGGAA TCGCGGTTCG CGCCGGGACG CATTGCGTGA TGCCGCTTTT AGAGCGTTTC
AATGTTTCGG CGACCTGCCG CGCTTCGTTC GGCATGTATA ATACGCGCGA GGAAATCGAT
CATCTGGCGC AGGCGCTGAT CAAGGCCCGG GAGTTGTTCT CATGA
 
Protein sequence
MTHPAVANGS YDVMRVREDF PALAMKVYGK PLVYLDNAAS AQKPNAVLDR MAEAYKTEYA 
NVHRGLHYLA NAATEAYEGG RARVARFLNA GRTEEIIFTR NASEAINLVA SSWGEPNIKA
GDEIVLSIME HHSNIVPWHF LRERHGAVIK WAPVDDDGNF LIEEFEKLLT PKTKLVAITQ
MSNALGTLVP VKDVVKLAHA RDIPVLVDGS QAAVHLAIDV QDIDCDFYVF TGHKLYGPTG
IGALYAKHDH LVSMRPFNGG GEMIREVAQD WVTYGDPPHK FEAGTPAIVE SIGLGAAIDY
VNSIGKERIA AHEHDLLTYA QERLREINSL RIIGTARDKG PVISFEMKGA HPHDVATVID
RAGIAVRAGT HCVMPLLERF NVSATCRASF GMYNTREEID HLAQALIKAR ELFS