Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_1715 |
Symbol | sufS |
ID | 4892159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009080 |
Strand | + |
Start bp | 1708734 |
End bp | 1709975 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640150370 |
Product | cysteine desulfurase SufS |
Protein accession | YP_001081256 |
Protein GI | 126450214 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.023757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGACA TGCTGCGTTC GGCCACGATG ATGGCGAGCG CGTTTCCGGC GCTTGCGCAG CGCGTGAACG GCGCGCCGCT CGCGTATCTC GACAACGCGG CGACGACGCA CGTGCCGCAG CCCGTCCTCG CCGCGATGCG CGGCTTCGAC GAGCGCGATC GCGCGAACAT CCACCGCGGC GTCCACACGC TCAGCCAGCG GGCGACCGAC GCTTACGAAC GCGCGCGCGA CACGCTCGCG CGCTTCGTCG GCGCGAACGG CGAGCACCTG CTCGTCTTCA CGTCGGGCGC AACCGATGCG CTGAATCTCG TCGCGAATGG GCTATCGGTT GCCGGCCACA CGCAAGCCGT GCTGCAGGAA GGCGACGAGA TCATCGTCAG CGCGCTCGAG CATCACGCGA ACCTCGTGCC GTGGCAGATG GCCGCGCGCC GTTGCGGCGC GAAGCTGCGG ATCCTGCATC CGGATCCGCA GGGCAGCCTG CATGTTCGGG ACCTCGAACG ATTGCTGGGG CCGCGCACGC GCGTGTTCGC GGTCACCGCG TGCTCGAACG CGACGGGCGA GCGGCCGCCG TACGAAGCGC TGCTCGCGGC CGCGCGCGCG GGCGGCGCGC TGACGGTGCT CGACGCGGCA CAGGCGGTCG GCCACGACGT GCCCGATCTG TCCGCGCTCG CGTGCGACTT CGCCGCGTTC TCCGGCCACA AGATGTACGG GCCGATGGGC ACCGGCGCGC TCGTCGGCCG CCGGGACGCG CTCGAGCGGC TCGTCCCGCT GCGCTTCGGC GGCGACATGG TGAGCTGGGT GGGCGAGACG GACGCGACAT TCGACGCGCT GCCCGCGCGG CTCGAAGGCG GCACGCCGAA CGTCGCGGGC GCGGTCGGCA TCGCGGCGGC CGCCGACTAT ATCGACGCGA TCGGCCGCGC CCGGATCGAC GCCCACGTGC GCGCACTGCG CGATCACGCG GCCGCGGGCC TCGCGGCGCT CGACGGCGTG ACCGTGCTCT CGCCGCGCAC GTCGTCGGCG ATCGTATCCT TCGTCGTCGA CGGCGTGCAT CCGCACGACA TCGGCACATT GCTCGACGAG CGCGGCATCG CCGTGCGCAC GGGCTTTCAC TGCGCGCAGC CGCTGCTCGA ACGGCTCGGC TGCGGGCCGA CGACGCGCGC GTCGTTCGCG CTCTACAACA CGCACGACGA AGTCGAGCGC CTCGTCGCGG GCGTCGCGCA AGCACTGAAG GTGTTGAGAT GA
|
Protein sequence | MGDMLRSATM MASAFPALAQ RVNGAPLAYL DNAATTHVPQ PVLAAMRGFD ERDRANIHRG VHTLSQRATD AYERARDTLA RFVGANGEHL LVFTSGATDA LNLVANGLSV AGHTQAVLQE GDEIIVSALE HHANLVPWQM AARRCGAKLR ILHPDPQGSL HVRDLERLLG PRTRVFAVTA CSNATGERPP YEALLAAARA GGALTVLDAA QAVGHDVPDL SALACDFAAF SGHKMYGPMG TGALVGRRDA LERLVPLRFG GDMVSWVGET DATFDALPAR LEGGTPNVAG AVGIAAAADY IDAIGRARID AHVRALRDHA AAGLAALDGV TVLSPRTSSA IVSFVVDGVH PHDIGTLLDE RGIAVRTGFH CAQPLLERLG CGPTTRASFA LYNTHDEVER LVAGVAQALK VLR
|
| |