Gene Information |
Locus tag | BBta_5928 |
Symbol | sufS |
ID | 5152249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 6143849 |
End bp | 6145090 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640560653 |
Product | cysteine desulfurase |
Protein accession | YP_001241775 |
Protein GI | 148257190 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
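The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 6143849
end_bp = 6145090

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not add an amino acid to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1242 0 413
```

Under these assumptions the result agrees with the listed Gene Length (1242 bp) and Protein Length (413 aa).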
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCATGCAG CGGTGAACGG CAGCGGCTAT GATGTGGGCC GCATCCGCGC GGATTTCCCC ATCCTCGGCA TGCAGGTGCA CGGCAAGCCG CTGGTCTATC TCGACAACGC CGCCTCGGCG CAGAAGCCCC AGGCTGTGCT CGACCGGATG AACCAGGTCT ACACCGCGGA ATACGCCAAT GTGCATCGCG GGCTGCATTA TCTCGCCAAT GCCGCGACCG AGGCCTATGA GGGCGCGCGC GCCAAGGTCG CGGCCTTCCT CAACGCCGAA AGCCCCGAGC AGATCATCTT CACCCGCGGC GCCACCGAAG CCGTGAACCT GGTGGCGCAG ACCTTCGGCC GCGAGAGGAT CGGGCCGGGT GACGAGATCG TGCTCTCGAT CATGGAGCAC CACGCCAACA TCATCCCCTG GCATTTCCTC AGGGAGCGCC AGGGCGCCGT GATCAAATGG GCCCCTGTCG ACGACGCCGG CAACTTCCTG ATCGACGAGT TCGAAAAGCT GCTGACGCCG CGCACCAAGA TGGTGGCCAT GACGCAGATG TCGAACGTGC TCGGCACGGT CGTTCCGGTC AAGGACGTCA TCCGCATCGC GCATGCGCGC GGCATCCCCG TGCTGATCGA CGGCTCGCAG GCCGCGGTGC ATATGGAGGT CGACGTCCGC GATCTCGATG CGGATTTCTA CTGCATCACT GGCCACAAGA TCTACGGCCC GACCGGGATC GGCGCGCTCT ATGGCAAGCG CGAACTCCTG GCATCGATGC CGCCCTTCAA TGGCGGCGGC GAGATGATCC GTGAGGTGTC GCGGGACTGG GTCACCTATG GCGATCCGCC GCACCGCTTC GAGGCCGGCA CGCCGCCGAT CGTCGAGGCC GCGGGGCTTG GGGCGGCGAT CGACTACGTC GCCTCGATCG GCAAGCAGCG CATCGCCGCG CACGAGCATG ATCTCGTCAC TTACGCCACC GAGCAGCTGC GCGAGATCAA TTCGCTGCGC ATCATCGGCA ATGCCGCCGA CAAGGGGGCC ATCGTCTCGT TCGAGATGCA GGGCGCCCAC GCCCACGACT TCGCCACCGT CATCGACCGC TCCGGCGTTG CGGTGCGCGC CGGCACCCAT TGCGCCATGC CGCTGCTGGA GCGCTTCGGC GTCACCGCCA CCTGCCGCGC GTCCTTCGCG ATGTACAATA CGCGGGAAGA GGTCGATGCG CTGGTGCGCG CGCTGAAGAA GGCGCAGGAG TTTTTCTCCT AG
Protein sequence | MHAAVNGSGY DVGRIRADFP ILGMQVHGKP LVYLDNAASA QKPQAVLDRM NQVYTAEYAN VHRGLHYLAN AATEAYEGAR AKVAAFLNAE SPEQIIFTRG ATEAVNLVAQ TFGRERIGPG DEIVLSIMEH HANIIPWHFL RERQGAVIKW APVDDAGNFL IDEFEKLLTP RTKMVAMTQM SNVLGTVVPV KDVIRIAHAR GIPVLIDGSQ AAVHMEVDVR DLDADFYCIT GHKIYGPTGI GALYGKRELL ASMPPFNGGG EMIREVSRDW VTYGDPPHRF EAGTPPIVEA AGLGAAIDYV ASIGKQRIAA HEHDLVTYAT EQLREINSLR IIGNAADKGA IVSFEMQGAH AHDFATVIDR SGVAVRAGTH CAMPLLERFG VTATCRASFA MYNTREEVDA LVRALKKAQE FFS
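The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a hedged illustration, not the database's own pipeline: it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11 also permits alternative start codons, which this sketch does not handle because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11
# assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases of the gene sequence, copied from the record.
prefix = "ATGCATGCAG CGGTGAACGG C"
print(translate(prefix))  # MHAAVNG
```

The printed prefix matches the first seven residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.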