Gene BURPS1710b_2826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2826 
SymbolsufS 
ID3690188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3138406 
End bp3139647 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content72% 
IMG OID637729282 
Productcysteine desulfurase SufS 
Protein accessionYP_334210 
Protein GI76809234 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGACA TGCTGCGTTC GGCCACGATG ATGGCGAGCG CGTTTCCGGC GCTTGCGCAG 
CGCGTGAACG GCGCGCCGCT CGCGTATCTC GACAACGCGG CGACGACGCA CGTGCCGCAG
CCCGTCCTCG CCGCGATGCG CGGCTTCGAC GAGCGCGATC GCGCGAACAT CCACCGCGGC
GTCCACACGC TCAGCCAGCG GGCGACCGAC GCTTACGAAC GCGCGCGCGA CACGCTCGCG
CGCTTCGTCG GCGCGAACGG CGAGCACCTG CTCGTCTTCA CGTCGGGCGC AACCGATGCG
CTGAATCTCG TCGCGAATGG GCTATCGGTT GCCGGCCACA CGCAAGCCGT GCTGCAGGAA
GGCGACGAGA TCATCGTCAG CGCGCTCGAG CATCACGCGA ACCTCGTGCC GTGGCAGATG
GCCGCGCGCC GTTGCGGCGC GAAGCTGCGG ATCCTGCATC CGGATCCGCA GGGCAGCCTG
CATGTTCTGG ACCTCGAACG ATTGCTGGGG CCGCGCACGC GCGTGTTCGC GGTCACCGCG
TGCTCGAACG CGACGGGCGA GCGGCCGCCG TACGAGGCGC TGCTCGCGGC CGCGCGCGCG
GGCGGCGCGC TGACGGTGCT CGACGCGGCA CAGGCGGTCG GCCACGACGT GCCCGATCTG
TCCGCGCTCG CGTGCGACTT CGCCGCGTTC TCCGGCCACA AGATGTACGG GCCGATGGGC
ACCGGCGCGC TCGTCGGCCG CCGGGACGCG CTCGAGCGGC TCGTCCCGCT GCGCTTCGGC
GGCGACATGG TGAGCTGGGT GGGCGAGACG GACGCGACAT TCGACGCGCT GCCCGCGCGG
CTCGAAGGCG GCACGCCGAA CGTCGCGGGC GCGGTCGGCA TCGCGGCGGC CGCCGACTAT
ATCGACGCGA TCGGCCGCGC CCGGATCGAC GCCCACGTGC GCGCACTGCG CGATCACGCG
GCCGCGGGCC TCGCGGCGCT CGACGGCGTG ACCGTGCTCG CGCCGCGCAC GTCGTCGGCG
ATCGTATCCT TCGTCGTCGA CGGCGTGCAT CCGCACGACA TCGGCACATT GCTCGACGAG
CGCGGCATCG CCGTGCGCAC GGGCTTTCAC TGCGCGCAGC CGCTGCTCGA ACGGCTCGGC
TGCGGGCCGA CGACGCGCGC GTCGTTCGCG CTCTACAACA CGCACGACGA AGTCGAGCGC
CTCGTCGCGG GCGTCGCGCA AGCACTGAAG GTGTTGAGAT GA
 
Protein sequence
MGDMLRSATM MASAFPALAQ RVNGAPLAYL DNAATTHVPQ PVLAAMRGFD ERDRANIHRG 
VHTLSQRATD AYERARDTLA RFVGANGEHL LVFTSGATDA LNLVANGLSV AGHTQAVLQE
GDEIIVSALE HHANLVPWQM AARRCGAKLR ILHPDPQGSL HVLDLERLLG PRTRVFAVTA
CSNATGERPP YEALLAAARA GGALTVLDAA QAVGHDVPDL SALACDFAAF SGHKMYGPMG
TGALVGRRDA LERLVPLRFG GDMVSWVGET DATFDALPAR LEGGTPNVAG AVGIAAAADY
IDAIGRARID AHVRALRDHA AAGLAALDGV TVLAPRTSSA IVSFVVDGVH PHDIGTLLDE
RGIAVRTGFH CAQPLLERLG CGPTTRASFA LYNTHDEVER LVAGVAQALK VLR