Gene Rxyl_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0166 
Symbol 
ID4117871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp170396 
End bp171622 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content71% 
IMG OID638034957 
Productcysteine desulfurase 
Protein accessionYP_642956 
Protein GI108803019 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily
[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGACG TCCTCAAGGT GCGGCGCGAC TTTCCCATCC TCGAGCGGGA GGTGAACGGC 
CGCCCGCTCG TCTACCTGGA CAACGCGGCG ACCTCCCAGA AGCCGCTGCA GGTCATCCGG
ACCCTCTCCC GGTACTACGA GCGGCACAAC GCCAACATCC ACCGCGGCGT CCACCGCCTG
GCCGAGGAGG CCACCGGGCT CTACGAGGAG GCGCGCGGCA AGGTGGCCCG CTTTATCGGG
GCTCCCGACC CCCGCGGCCT CGTCTTCACG CGGGGGACCA CCGAGTCCAT AAACCTCGTC
GCCCACGCCT GGGGCAGAAA GAACCTGCGC GAGGGCGACG AGGTGGTGCT CACCGAGGCC
GAGCACCACT CCAACCTGGT GCCCTGGCAG CTCGCCGCGC GGGACACCGG CGCGAGGCTT
CGCTTCATCC CGGTCCTGGA CGACGGCACG CTGGACATGG AGGCGGCGGA GCACCTGATC
GGGCCGCGGA CGCGGCTCGT GGGCTGCGTG CACGCCTCCA ACGTGCTCGG CACGGTCAAC
CCGGTGGAGC GGCTGGCGGA GCTGGCGCAC GAGGCGGGGG CCCTGATGCT GGTCGACGGG
GCGCAGAGCG CGCCGCACCT GCCGGTGGAC GTAACCTCTC TGGGCTGCGA CTTCTTCGCG
GCGAGCGGGC ACAAGATGCT CGGGCCGACC GGGGTGGGAT TCCTGTGGGC CCGCCCCGAG
CTTCTCGAGG AGATGGAGCC GTTCCTCGGC GGGGGCGAGA TGATCCGGGA GGTCCGCCTG
GAGCGTTCCA CCTGGAACGA GATCCCCTAC AAGTTCGAGG CCGGGACGAT GAACATCGCC
CAGGCGATCG GGCTGGGGGC CGCCGTGGAC TACCTGGGCT CCCTGGGGAT GGAGAGCGTC
CGGGAGCACG AGCGGCGCCT CGGGGCGTAC GCCTACCGCC GGCTCGCCGG GGTCGAGGGG
ATCACCCTCT ACGGCCCGGC GGAGAACCGG ACGGGGGTCG TGGCCTTCAA CCTCCCCGAG
GTGCACCCGC ACGACCTCTC CCAGCTCCTG GACCAGGAGG GCGTCGCCAT AAGGAGCGGC
CACCACTGCT GCCAGCCGCT GATGCGCCGC CTCGGGGTGG TGGCGACCGC CCGGGCCAGC
CTCTACCTGT ACAACACGGA GGAGGAGGTG GAGGCGCTGG TCGAGGCCAT CGCCCGCGCC
CGCGAGTTCT TCGAGGCGCC GGCATGA
 
Protein sequence
MYDVLKVRRD FPILEREVNG RPLVYLDNAA TSQKPLQVIR TLSRYYERHN ANIHRGVHRL 
AEEATGLYEE ARGKVARFIG APDPRGLVFT RGTTESINLV AHAWGRKNLR EGDEVVLTEA
EHHSNLVPWQ LAARDTGARL RFIPVLDDGT LDMEAAEHLI GPRTRLVGCV HASNVLGTVN
PVERLAELAH EAGALMLVDG AQSAPHLPVD VTSLGCDFFA ASGHKMLGPT GVGFLWARPE
LLEEMEPFLG GGEMIREVRL ERSTWNEIPY KFEAGTMNIA QAIGLGAAVD YLGSLGMESV
REHERRLGAY AYRRLAGVEG ITLYGPAENR TGVVAFNLPE VHPHDLSQLL DQEGVAIRSG
HHCCQPLMRR LGVVATARAS LYLYNTEEEV EALVEAIARA REFFEAPA