Gene BURPS668_2706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2706 
SymbolsufS 
ID4885346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2678763 
End bp2680004 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content72% 
IMG OID640128634 
Productcysteine desulfurase SufS 
Protein accessionYP_001059730 
Protein GI126441575 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.994563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGACA TGCTGCGTTC GGCCACGATG ATGGCGAGCG CGTTTCCGGC GCTTGCGCAG 
CGCGTGAACG GCGCGCCGCT CGCGTATCTC GACAACGCGG CGACGACGCA CGTGCCGCAG
CCCGTCCTCG CCGCGATGCG CGGCTTCGAC GAGCGCGATC GCGCGAACAT CCACCGCGGC
ATCCACACGC TCAGCCAGCG GGCGACCGAC GCTTACGAAC GCGCGCGCGA CACGCTCGCG
CGCTTCGTCG GCGCGAACGG CGAGCACCTG CTCGTCTTCA CGTCGGGCGC AACCGATGCG
CTGAATCTCG TCGCGAATGG GCTATCGGTT GCCGGCCACA CGCAAGCCGT GCTGCAGGAA
GGCGACGAGA TCATCGTCAG CGCGCTCGAG CATCACGCGA ACCTCGTGCC GTGGCAGATG
GCCGCGCGCC GTTGCGGCGC GAAGCTGCGG ATCCTGCATC CGGATCCGCA GGGCAGCCTG
CATGTTCGGG ACCTCGAACG ATTGCTGGGG CCGCGCACGC GCGTGTTCGC GGTCACCGCG
TGCTCGAACG CGACGGGCGA GCGGCCGCCG TACGAGGCGC TGCTCGCGGC CGCGCGCGCG
GGCGGCGCGC TGACGGTGCT CGACGCGGCA CAGGCGGTCG GCCACGACGT GCCCGATCTG
TCCGCGCTCG CGTGCGACTT CGCCGCGTTC TCCGGCCACA AGATGTACGG GCCGATGGGC
ACCGGCGCGC TCGTCGGCCG CCGGGACGCG CTCGAGCGGC TCGTCCCGCT GCGCTTCGGC
GGCGACATGG TGAGCTGGGT GGGCGAGACG GACGCGACAT TCGACGCGCT GCCCGCGCGG
CTCGAAGGCG GCACGCCGAA CGTCGCGGGC GCGGTCGGCA TCGCGGCGGC CGCCGACTAT
ATCGACGCGA TCGGCCGCGC CCGGATCGAC GCCCACGTGC GCGCACTGCG CGATCACGCG
GCCGCGGGCC TCGCGGCGCT CGACGGCGTG ACCGTGCTCG CGCCGCGCAC GTCGTCGGCG
ATCGTATCCT TCGTCGTCGA CGGCGTGCAT CCGCACGACA TCGGCACATT GCTCGACGAG
CGCGGCATCG CCGTGCGCAC GGGCTTTCAC TGCGCGCAGC CGCTGCTCGA ACGGCTCGGC
TGCGGGCCGA CGACGCGCGC GTCGTTCGCG CTCTACAACA CGCACGACGA AGTCGAGCGC
CTCGTCGCGG GCGTCGCGCA AGCACTGAAG GTGTTGAGAT GA
 
Protein sequence
MGDMLRSATM MASAFPALAQ RVNGAPLAYL DNAATTHVPQ PVLAAMRGFD ERDRANIHRG 
IHTLSQRATD AYERARDTLA RFVGANGEHL LVFTSGATDA LNLVANGLSV AGHTQAVLQE
GDEIIVSALE HHANLVPWQM AARRCGAKLR ILHPDPQGSL HVRDLERLLG PRTRVFAVTA
CSNATGERPP YEALLAAARA GGALTVLDAA QAVGHDVPDL SALACDFAAF SGHKMYGPMG
TGALVGRRDA LERLVPLRFG GDMVSWVGET DATFDALPAR LEGGTPNVAG AVGIAAAADY
IDAIGRARID AHVRALRDHA AAGLAALDGV TVLAPRTSSA IVSFVVDGVH PHDIGTLLDE
RGIAVRTGFH CAQPLLERLG CGPTTRASFA LYNTHDEVER LVAGVAQALK VLR