Gene BMA10247_A0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_A0422 
SymboliscS-2 
ID4890438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009079 
Strand
Start bp369293 
End bp370414 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID640146700 
Productcysteine desulfurase 
Protein accessionYP_001077625 
Protein GI126447472 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCATA TCTACCTCGA CTACAACGCA AGCACCCCCA TCGACCCGGC GGTCGCGAAT 
GCGATGCGCC CGCTGCTGCC TGAAGCCTTC GGCAATCCGT CCAGCGGACA TTGGGCAAGC
GTGCCGGCAA AGGTTGCACT GGATCGGGCG CGCGCCCAGG TCGCGACGCT GATCGGCGCC
GCACCGGACG AAATTGTCTT CACGAGCGGC GGCAGTGAAG CGAACAACCT TGCGCTGAAA
GGAACTGTAC TCGCATCGTC GAACGCAGGC GGCCACGTGA TCACCCAGTC GACCGAACAT
CCGGCAATCC TGAAGCCGCT GCAATTCCTT GAGCGACTGG GCACTAAAGT CACCTGCGTT
CCCGTGGACG GCAACGGCCG CGTCGATCCC GACGCCATCC GTCGCGCAAT CACGCCGCGC
ACCACGCTCA TCACGGTCAT GCATGCGAAC AACGAAACCG GCACCATTCA GCCGATCGAG
GATATCGGCA CGATCGCCCG CGAGCACGGC ATCCGGTTTC ACACGGACGC GGCCCAGTCG
ATCGGCAAGA TCCCGGCCGA CGTCGATGCG CTCGGCGTCG ATCTGCTGTC GATCGCCGGT
CACAAGCTCT ACGCGCCGAA GGGCATCGGC GCGCTCTACG TTCGAAAGGG AGTCGGGCTC
GAGCCGCTGA TTCACGGCGC AGGCCATGAA AATGGCCGGC GTGCGGGGAC GGAATCCGCA
CTCCTCGCGG CAGGGTTCGG CGAGGCCTGT GCGCTCGCGC GCGATCTCGC GGCGATGGAC
GGCATTCGCG CGTTGCGCGA CCGGCTGTGG CACGCGTTGC AGGATCGCTT CGACGATCGC
GTTCGCCTGA ACGGACATCT CGAGTACCGC TTGCCGAACA CGGTGAACGT GTCGTTCGTC
GGCCGCGTCG GCACTGACGT GCTTGCCGCA CTCGACGGCG TCGCCGCGTC CACCGGTTCG
GCCTGCCACG CCGGACGCAT GACGCTGTCG CCGGTGCTGG CCGCGATGGG CATCGCCGAG
CAGGTCGGGA TGGGGGCCAT CCGATTCAGC CTCGGCCGGC ACACGACAGC CGACGAGATC
GATACCGTCA TCGCGAAATT GAGCGCGATC GCCCGTGGCT GA
 
Protein sequence
MTHIYLDYNA STPIDPAVAN AMRPLLPEAF GNPSSGHWAS VPAKVALDRA RAQVATLIGA 
APDEIVFTSG GSEANNLALK GTVLASSNAG GHVITQSTEH PAILKPLQFL ERLGTKVTCV
PVDGNGRVDP DAIRRAITPR TTLITVMHAN NETGTIQPIE DIGTIAREHG IRFHTDAAQS
IGKIPADVDA LGVDLLSIAG HKLYAPKGIG ALYVRKGVGL EPLIHGAGHE NGRRAGTESA
LLAAGFGEAC ALARDLAAMD GIRALRDRLW HALQDRFDDR VRLNGHLEYR LPNTVNVSFV
GRVGTDVLAA LDGVAASTGS ACHAGRMTLS PVLAAMGIAE QVGMGAIRFS LGRHTTADEI
DTVIAKLSAI ARG