Gene BMA10229_1755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_1755 
SymboliscS-2 
ID4789763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008835 
Strand
Start bp1797240 
End bp1798361 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID 
Productcysteine desulfurase 
Protein accessionYP_001025553 
Protein GI124382124 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.983783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCATA TCTACCTCGA CTACAACGCA AGCACCCCCA TCGACCCGGC GGTCGCGAAT 
GCGATGCGCC CGCTGCTGCC TGAAGCCTTC GGCAATCCGT CCAGCGGACA TTGGGCAAGC
GTGCCGGCAA AGGTTGCACT GGATCGGGCG CGCGCCCAGG TCGCGACGCT GATCGGCGCC
GCACCGGACG AAATTGTCTT CACGAGCGGC GGCAGTGAAG CGAACAACCT TGCGCTGAAA
GGAACTGTAC TCGCATCGTC GAACGCAGGC GGCCACGTGA TCACCCAGTC GACCGAACAT
CCGGCAATCC TGAAGCCGCT GCAATTCCTT GAGCGACTGG GCACTAAAGT CACCTGCGTT
CCCGTGGACG GCAACGGCCG CGTCGATCCC GACGCCATCC GTCGCGCAAT CACGCCGCGC
ACCACGCTCA TCACGGTCAT GCATGCGAAC AACGAAACCG GCACCATTCA GCCGATCGAG
GATATCGGCA CGATCGCCCG CGAGCACGGC ATCCGGTTTC ACACGGACGC GGCCCAGTCG
ATCGGCAAGA TCCCGGCCGA CGTCGATGCG CTCGGCGTCG ATCTGCTGTC GATCGCCGGT
CACAAGCTCT ACGCGCCGAA GGGCATCGGC GCGCTCTACG TTCGAAAGGG AGTCGGGCTC
GAGCCGCTGA TTCACGGCGC AGGCCATGAA AATGGCCGGC GTGCGGGGAC GGAATCCGCA
CTCCTCGCGG CAGGGTTCGG CGAGGCCTGT GCGCTCGCGC GCGATCTCGC GGCGATGGAC
GGCATTCGCG CGTTGCGCGA CCGGCTGTGG CACGCGTTGC AGGATCGCTT CGACGATCGC
GTTCGCCTGA ACGGACATCT CGAGTACCGC TTGCCGAACA CGGTGAACGT GTCGTTCGTC
GGCCGCGTCG GCACTGACGT GCTTGCCGCA CTCGACGGCG TCGCCGCGTC CACCGGTTCG
GCCTGCCACG CCGGACGCAT GACGCTGTCG CCGGTGCTGG CCGCGATGGG CATCGCCGAG
CAGGTCGGGA TGGGGGCCAT CCGATTCAGC CTCGGCCGGC ACACGACAGC CGACGAGATC
GATACCGTCA TCGCGAAATT GAGCGCGATC GCCCGTGGCT GA
 
Protein sequence
MTHIYLDYNA STPIDPAVAN AMRPLLPEAF GNPSSGHWAS VPAKVALDRA RAQVATLIGA 
APDEIVFTSG GSEANNLALK GTVLASSNAG GHVITQSTEH PAILKPLQFL ERLGTKVTCV
PVDGNGRVDP DAIRRAITPR TTLITVMHAN NETGTIQPIE DIGTIAREHG IRFHTDAAQS
IGKIPADVDA LGVDLLSIAG HKLYAPKGIG ALYVRKGVGL EPLIHGAGHE NGRRAGTESA
LLAAGFGEAC ALARDLAAMD GIRALRDRLW HALQDRFDDR VRLNGHLEYR LPNTVNVSFV
GRVGTDVLAA LDGVAASTGS ACHAGRMTLS PVLAAMGIAE QVGMGAIRFS LGRHTTADEI
DTVIAKLSAI ARG