Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2554 |
Symbol | |
ID | 4598411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2723366 |
End bp | 2724652 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639777160 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_923745 |
Protein GI | 119716780 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACGA CCCAGCTGGC CGGCCTGCTC CCCGAGCTGG AGGTGATCCG CGCGGACTTC CCGATCCTGG ATCGGACCCT CGCGGGTGGT CTGCCGCTGG TCTACCTCGA CAGCGCCAAC ACCTCGCAGA AGCCGCAGGT CGTGATCGAC ACCATGGTCG ACCACCTCGA ACGGCACAAC GCGAACGTCG CCCGGGCCAT GCACCAGCTC GGCGCGGAGT CGTCGGAGGC GTTCGAGGCG GCGCGGGACA AGGTCGCGGC GTTCATCAAC GCGCCGAGCC GGGACGAGGT GATCTTCACC AAGAACGCCT CCGAGGCGCT CAACCTGGTG GCCAACACCC TGTCCTGGGC GCGGGGCGAC CGGGTGAACG GAGTCGGGGC TCTCGGCGCG GGCGACGAGG TCGTCATCAC CGAGATGGAG CACCACTCCA ACATCGTGCC GTGGCAGCTG CTGACCGAGC GCACGGGCGC GACCCTGCGC TGGTTCGGTC TCACCGACGA CGGCCGGCTC GACCTCTCGA ACATCGACTC GCTGATCACC GAGCGGACCA AGGTGGTCGC CCTCACCTGG GTCTCGAACA TGCTCGGCAC GATCAACCCG GTCGCCGAGA TCGCTCGGCG GGCCCACGAG GTCGGCGCGC TCGTGGTGGT CGACGCCTCC CAGGCCGCTC CTCAGCTGCC CGTGGACGTG GTCGCGTCGG GTGCCGACCT GCTGGCGTTC ACCGGCCACA AGGTCGTCGG GCCGACCGGC ATCGGCGTGC TCTGGGGCCG CCGCGAGGTG CTCGACCAGC TACCGCCCTT CCTCGGTGGC GGCGAGATGA TCGAGACGGT GCGGATGGAG CGCTCGACGT ACGCCCCGAT CCCGCACAAG TTCGAGGCCG GTACGCCGCC CATCGTCGAG GCCGTGGGCC TCGGCGCAGC GGTCGACTAC CTCTCGATGA TCGGCCTGGA CGCGATCCAC CGGCACGAGC AGGCGATCAC GGCGTACGCG CTCGACGGGC TCGCCACCGT GCCCGGCCTG CGAGTCCTCG GTCCGCTCTC CGCGAAGGAC CGCGGCGGAG CGATCGCCTT CGAGATCGAC GGGGTGCACC CGCACGACGT CGCGCAGGTG CTCGACTCGC GCGGGGTCGC GGTCCGGGCC GGCCACCACT GCGCGAAGCC CGCGCACGCC CGCTTCGGCG TGCAGAGCTC CACCCGGATG TCGTCGTACC TCTACACGAC GCCGGCGGAG ATCGACGCGC TCGTCGAGGC CCTGGAATAC ACCCGCTCGT ACTTCAAGTT GGGTTGA
|
Protein sequence | MTTTQLAGLL PELEVIRADF PILDRTLAGG LPLVYLDSAN TSQKPQVVID TMVDHLERHN ANVARAMHQL GAESSEAFEA ARDKVAAFIN APSRDEVIFT KNASEALNLV ANTLSWARGD RVNGVGALGA GDEVVITEME HHSNIVPWQL LTERTGATLR WFGLTDDGRL DLSNIDSLIT ERTKVVALTW VSNMLGTINP VAEIARRAHE VGALVVVDAS QAAPQLPVDV VASGADLLAF TGHKVVGPTG IGVLWGRREV LDQLPPFLGG GEMIETVRME RSTYAPIPHK FEAGTPPIVE AVGLGAAVDY LSMIGLDAIH RHEQAITAYA LDGLATVPGL RVLGPLSAKD RGGAIAFEID GVHPHDVAQV LDSRGVAVRA GHHCAKPAHA RFGVQSSTRM SSYLYTTPAE IDALVEALEY TRSYFKLG
|
| |