Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0166 |
Symbol | |
ID | 4117871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 170396 |
End bp | 171622 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638034957 |
Product | cysteine desulfurase |
Protein accession | YP_642956 |
Protein GI | 108803019 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGACG TCCTCAAGGT GCGGCGCGAC TTTCCCATCC TCGAGCGGGA GGTGAACGGC CGCCCGCTCG TCTACCTGGA CAACGCGGCG ACCTCCCAGA AGCCGCTGCA GGTCATCCGG ACCCTCTCCC GGTACTACGA GCGGCACAAC GCCAACATCC ACCGCGGCGT CCACCGCCTG GCCGAGGAGG CCACCGGGCT CTACGAGGAG GCGCGCGGCA AGGTGGCCCG CTTTATCGGG GCTCCCGACC CCCGCGGCCT CGTCTTCACG CGGGGGACCA CCGAGTCCAT AAACCTCGTC GCCCACGCCT GGGGCAGAAA GAACCTGCGC GAGGGCGACG AGGTGGTGCT CACCGAGGCC GAGCACCACT CCAACCTGGT GCCCTGGCAG CTCGCCGCGC GGGACACCGG CGCGAGGCTT CGCTTCATCC CGGTCCTGGA CGACGGCACG CTGGACATGG AGGCGGCGGA GCACCTGATC GGGCCGCGGA CGCGGCTCGT GGGCTGCGTG CACGCCTCCA ACGTGCTCGG CACGGTCAAC CCGGTGGAGC GGCTGGCGGA GCTGGCGCAC GAGGCGGGGG CCCTGATGCT GGTCGACGGG GCGCAGAGCG CGCCGCACCT GCCGGTGGAC GTAACCTCTC TGGGCTGCGA CTTCTTCGCG GCGAGCGGGC ACAAGATGCT CGGGCCGACC GGGGTGGGAT TCCTGTGGGC CCGCCCCGAG CTTCTCGAGG AGATGGAGCC GTTCCTCGGC GGGGGCGAGA TGATCCGGGA GGTCCGCCTG GAGCGTTCCA CCTGGAACGA GATCCCCTAC AAGTTCGAGG CCGGGACGAT GAACATCGCC CAGGCGATCG GGCTGGGGGC CGCCGTGGAC TACCTGGGCT CCCTGGGGAT GGAGAGCGTC CGGGAGCACG AGCGGCGCCT CGGGGCGTAC GCCTACCGCC GGCTCGCCGG GGTCGAGGGG ATCACCCTCT ACGGCCCGGC GGAGAACCGG ACGGGGGTCG TGGCCTTCAA CCTCCCCGAG GTGCACCCGC ACGACCTCTC CCAGCTCCTG GACCAGGAGG GCGTCGCCAT AAGGAGCGGC CACCACTGCT GCCAGCCGCT GATGCGCCGC CTCGGGGTGG TGGCGACCGC CCGGGCCAGC CTCTACCTGT ACAACACGGA GGAGGAGGTG GAGGCGCTGG TCGAGGCCAT CGCCCGCGCC CGCGAGTTCT TCGAGGCGCC GGCATGA
|
Protein sequence | MYDVLKVRRD FPILEREVNG RPLVYLDNAA TSQKPLQVIR TLSRYYERHN ANIHRGVHRL AEEATGLYEE ARGKVARFIG APDPRGLVFT RGTTESINLV AHAWGRKNLR EGDEVVLTEA EHHSNLVPWQ LAARDTGARL RFIPVLDDGT LDMEAAEHLI GPRTRLVGCV HASNVLGTVN PVERLAELAH EAGALMLVDG AQSAPHLPVD VTSLGCDFFA ASGHKMLGPT GVGFLWARPE LLEEMEPFLG GGEMIREVRL ERSTWNEIPY KFEAGTMNIA QAIGLGAAVD YLGSLGMESV REHERRLGAY AYRRLAGVEG ITLYGPAENR TGVVAFNLPE VHPHDLSQLL DQEGVAIRSG HHCCQPLMRR LGVVATARAS LYLYNTEEEV EALVEAIARA REFFEAPA
|
| |