Gene BMA10229_A3103 details


Gene Information

Locus tag: BMA10229_A3103
Symbol: iscS-1
ID: 4792758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 3140750
End bp: 3141973
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: cysteine desulfurase
Protein accession: YP_001029044
Protein GI: 124385796
COG category:
COG ID:
TIGRFAM ID:
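
The coordinates and lengths listed for this gene can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; this page states neither convention, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(3140750, 3141973)
print(gene_len)                  # 1224
print(protein_length(gene_len))  # 407
```

Under those assumptions the numbers agree with the record: 1224 bp is 408 codons, and dropping the stop codon leaves 407 residues.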


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.412431
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAACG ATATCCCCCA CCTGCCCATC TACATGGACT ACAGCGCGAC GACCCCCGTC 
GATCCGCGCG TCGTCGACAA GATGGTGCCG TATCTGCGCG AGCAGTTCGG CAACCCGGCG
TCGCGCAGCC ACGCATACGG CTGGGACGCG GAGCGCGCGG TCGAAGAGGC GCGCGAGCAG
GTGGCCGCCC TCGTGAACGC CGATCCGCGC GAGATCATCT GGACCTCCGG CGCGACCGAG
TCCGACAACC TCGCGATCAA GGGCGCCGCG CACTTCTATC AGGGCAAGGG CAAGCACATC
GTCACCGTGA AGACCGAGCA CAAGGCGGTG CTCGACACCT GCCGCGAGCT CGAGCGCGAA
GGCTTCGAGG TGACCTATCT CGACGTGAAG GACGACGGGC TCGTCGATCT CGACGTGTTC
AAGGCCGCGC TGCGCCCGGA CACGATCCTC GTGTCGGTGA TGCATGTGAA CAACGAGATC
GGCGTGATCC AGGACATCGC GACGATCGGC GAGATCTGCC GCGAGAAGGG CATCATCTTC
CACGTCGACG CCGCGCAGGC GACGGGCAAG GTCGAAATCG ACCTCGCGAA GCTGAAGGTC
GACCTGATGT CGTTCTCCGC GCACAAGACC TACGGCCCGA AGGGCATCGG CGCGTTGTAT
GTGCGCCGCA AGCCGCGCGT GCGCATCGAG GCGCAGATGC ACGGCGGCGG CCACGAGCGC
GGCATGCGCT CGGGCACGTT GCCGACGCAC CAGATCGTCG GCATGGGCGA GGCGTTTCGC
ATCGCGCGCG AAGAGATGGC GACCGAGAAC GAGCGCATCC GGATGCTGCG CGACAAGCTG
CTGCGCGGCC TGTCGGAAAT CGACGAAACC TACGTGAACG GCGATCTCGA GCACCGGATT
CCGCACAACC TGAACATCAG CTTCAATTTT GTCGAAGGCG AATCGCTGAT CATGGCGATC
AAGGACGTCG CGGTGTCGTC GGGTTCCGCG TGCACGTCGG CGTCGCTCGA GCCGTCCTAC
GTGCTGTGCG CGCTCGGCCG CAACGACGAG CTCGCGCACA GCTCGATCCG CTTCACGGTC
GGCCGCTTCA CGACGGAGCA GGAAGTCGAC TACGTGATCG ACCTGCTGAA GAGCAAGATC
GCGAAGCTGC GCGACCTGTC GCCGCTTTGG GAGATGCATC AGGAAGGCAT CGATCTGTCG
ACGATCGAAT GGGCGGCGCA CTGA
 
Protein sequence
MNNDIPHLPI YMDYSATTPV DPRVVDKMVP YLREQFGNPA SRSHAYGWDA ERAVEEAREQ 
VAALVNADPR EIIWTSGATE SDNLAIKGAA HFYQGKGKHI VTVKTEHKAV LDTCRELERE
GFEVTYLDVK DDGLVDLDVF KAALRPDTIL VSVMHVNNEI GVIQDIATIG EICREKGIIF
HVDAAQATGK VEIDLAKLKV DLMSFSAHKT YGPKGIGALY VRRKPRVRIE AQMHGGGHER
GMRSGTLPTH QIVGMGEAFR IAREEMATEN ERIRMLRDKL LRGLSEIDET YVNGDLEHRI
PHNLNISFNF VEGESLIMAI KDVAVSSGSA CTSASLEPSY VLCALGRNDE LAHSSIRFTV
GRFTTEQEVD YVIDLLKSKI AKLRDLSPLW EMHQEGIDLS TIEWAAH
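
The relationship between the two sequence blocks can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first 60 bases of the gene are embedded here.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base T, C, A, G, then second, then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGAACAACG ATATCCCCCA CCTGCCCATC TACATGGACT ACAGCGCGAC GACCCCCGTC"
print(translate(first_line))  # MNNDIPHLPIYMDYSATTPV
```

Passing the full Gene sequence block to `translate` is expected to reproduce the Protein sequence block, and `gc_percent` on the same input can be compared against the GC content reported in the Gene Information section.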