Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1235 |
Symbol | |
ID | 4446264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1358585 |
End bp | 1359916 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639689043 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_830729 |
Protein GI | 116669796 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTCATGG TATCCACTCC CGCTACCGCA GACACCCAGG TCGCGCTAAG TGATGTTGAA GTCCTGCGTA TCCGCAACGA CTTTCCCGTC CTGGACCAGG AAGTCAACGG CAGGCCCCTT GTTTACCTGG ATTCCGGCGC CACTTCTCAG AACCCCCGCA GCGTCCTCGA AGCCGAGCAG GAGTTCTACG AACTGAGGAA TGCCGCGGTG CACCGCGGTG CCCACCACCT TGCCGTCCAG GCCACCGACG CCTTCGAGGA TGCCCGCGCC ACTGTGGCCG GTTTCGTCGG CGTGGCGGAG GACGAACTGG TCTGGACAGC GAACGCCACC GCCGGGCTCA ATCTCCTCGC GTACGCCTTC TCGAACGCCA GTGTCGGCAC CGTCCGGGGC GAGGCCGGCC GCTTCGCCCT CGGTCCGGGG GACGAGATTG TGGTGACGGA GATGGAACAC CACGCGAACC TGATTCCCTG GCAGGAGCTC TGCCGGCGCA CCGGTGCCAC CTTGAAATTC ATCCCCATCG ACGACGACGG CGCGTTGCGG CTTGAGGAGG CGGCGCGGCT TATCACCGGG CGTACCAAGG TCCTGGCGTT CACCCATGCG TCAAATGTGC TTGGAACCAT CAATCCGGTG CCCGAGCTCG TGCGGCTGGC CCGGGCGGCA GGGGCCCTCG TTGTGCTGGA CGCCTGCCAG TCGGCGCCGC ACCTGCCCTT GGACTTCAAG GCCCTGGACG TGGACTTCGC GGTATTCTCC GGCCACAAGA TGCTGGCGCC CACGGGGATC GGCGGCGTGT ACGGGCGGCG CGAATTGTTG AATGCCATGC CTCCGTTCCT GACCGGGGGT TCCATGATCA CGACCGTGAC GATGGAAAAG GCCGAGTACC TTCCGGCGCC CCAGCGGTTC GAGGCCGGCA CCCAGCCCAT CTCGCAGGCT GTGGCGCTCG CGGCGGCCGC GAACTACCTG CGCGAAACCA GCATGGAACG AATCGCCGGC TGGGAAGCGT CCCTGGGCCA GCGGCTGGTC ACGGGGCTGA GCGCCATTGA CGGAGTCCGG GTCGTGGGGC CCGCTGCCGG CGTAGAACGG CTCGGCCTGG CAGCCTTCGA CGTGGCCGGC GTGCATGCGC ACGACGTCGG GCAGTACCTG GACAGCATGG GCATCGCCGT CCGCGTGGGT CACCACTGCG CGCAACCGCT CCACCGCCGG CTGGGCCTGA CCGCCACCAC GCGGGCGAGC ACCTATTTGT ACAACACAAC GGAAGAAGTG GACCTGCTGA TCGAAGCTGT GGCCCAGGTC CGGCCCTACT TCGGCGTAGA AGGCACGGGG ACATCCAAAT GA
|
Protein sequence | MVMVSTPATA DTQVALSDVE VLRIRNDFPV LDQEVNGRPL VYLDSGATSQ NPRSVLEAEQ EFYELRNAAV HRGAHHLAVQ ATDAFEDARA TVAGFVGVAE DELVWTANAT AGLNLLAYAF SNASVGTVRG EAGRFALGPG DEIVVTEMEH HANLIPWQEL CRRTGATLKF IPIDDDGALR LEEAARLITG RTKVLAFTHA SNVLGTINPV PELVRLARAA GALVVLDACQ SAPHLPLDFK ALDVDFAVFS GHKMLAPTGI GGVYGRRELL NAMPPFLTGG SMITTVTMEK AEYLPAPQRF EAGTQPISQA VALAAAANYL RETSMERIAG WEASLGQRLV TGLSAIDGVR VVGPAAGVER LGLAAFDVAG VHAHDVGQYL DSMGIAVRVG HHCAQPLHRR LGLTATTRAS TYLYNTTEEV DLLIEAVAQV RPYFGVEGTG TSK
|
| |