Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_3969 |
Symbol | sufS |
ID | 5151531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4176796 |
End bp | 4178043 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640558804 |
Product | cysteine desulfurase |
Protein accession | YP_001239945 |
Protein GI | 148255360 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0413583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGC ATCCGGCAGT CGCCAACGGG ACCTATGACG TCGCCCGCAT CCGCGAGGAT TTCCCGGCAC TTGCGCTGCA GGTCTATGGC AAGCCGCTCG TCTATCTCGA CAACGCCGCC TCGGCGCAGA AGCCGCGCGC AGTGCTCGAC CGCATGACGC AGGCCTACAC CAGCGAATAC GCCAACGTGC ATCGCGGCCT GCATTATCTC GCCAATGCCG CGACCGAAGC CTATGAAGGC GGCCGCGAGC GGGTGACGCG CTTCCTCAAT GCAAGACGCA ACGAGGAGAT CATCTTCACC CGCAACGCGA CCGAGGCGAT CAACCTCGTG GCGTCGTCGT GGGGCGGGCC GAATATCGGC GAGGGCGATG AGATCGTGCT CTCGATCATG GAGCACCATT CCAACATCGT GCCGTGGCAT TTCTTGCGCG AGCGCCAGGG TGCGGTGATC AAGTGGGCGC CGGTCGACGA TGAAGGCAAC TTCCTGATCG ACGAGTTCGA AAAGCTGCTG ACCGCCAAGA CCAAGCTGGT CGCGATCACG CAGATGTCGA ACATGCTCGG CACGCTCGTG CCGGTGAAGG AGGTCGTGCG CATTGCGCAT GCGCGCGGCA TTCCTGTGCT GATCGATGGT AGCCAGGCAG CGGTGCATCT TGCCGTCGAT GTCCAGGACA TCGACTGTGA TTTCTACGTC TTCACCGGTC ACAAGCTCTA TGGGCCGACC GGGATCGGCG TGCTTTACGG GAAGTATGAT CGCCTCGCCG CGATGCGTCC CTTCAATGGC GGCGGCGAGA TGATCCGCGA GGTCGCGCGC GACTGGGTCA CCTATGGCGA TCCGCCGCAT AGGTTCGAGG CGGGCACGCC GATGATCGTC GAGGCGGTCG GACTGGGTGC TGCGATCGAC TACGTGAATT CCGTCGGCAA GGATCGCATT GCCGCTCACG AGCACGACCT GCTGACCTAT GCGCAAGAGC GGCTGCGCGA GATCAACGCG TTGCGGATCA TCGGTACGGC CAAAGGCAAG GGGCCGGTGA TCTCGTTCGA GATGAAGGGC GCGCATGCCC ACGATATCGC GACCGTGATC GACCGGCAGG GCGTCGCGGT GCGCGCGGGC ACGCATTGCG TCATGCCGCT TTTGGAGAGA TTCAACGTCA CTGCGACCTG CCGAGCCTCG TTTGGCATGT ACAATACGAG AGAAGAGGTC GATCAGCTGG CACAGGCTCT GATCAAGGCG CGGGATTTGT TCGCATGA
|
Protein sequence | MSTHPAVANG TYDVARIRED FPALALQVYG KPLVYLDNAA SAQKPRAVLD RMTQAYTSEY ANVHRGLHYL ANAATEAYEG GRERVTRFLN ARRNEEIIFT RNATEAINLV ASSWGGPNIG EGDEIVLSIM EHHSNIVPWH FLRERQGAVI KWAPVDDEGN FLIDEFEKLL TAKTKLVAIT QMSNMLGTLV PVKEVVRIAH ARGIPVLIDG SQAAVHLAVD VQDIDCDFYV FTGHKLYGPT GIGVLYGKYD RLAAMRPFNG GGEMIREVAR DWVTYGDPPH RFEAGTPMIV EAVGLGAAID YVNSVGKDRI AAHEHDLLTY AQERLREINA LRIIGTAKGK GPVISFEMKG AHAHDIATVI DRQGVAVRAG THCVMPLLER FNVTATCRAS FGMYNTREEV DQLAQALIKA RDLFA
|
| |