Gene BMA10229_A2937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A2937 
SymbolcysI 
ID4792920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp2959291 
End bp2960970 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content66% 
IMG OID 
Productsulfite reductase (NADPH) hemoprotein beta-component 
Protein accessionYP_001028881 
Protein GI124385066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.655156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACCAGT ACGACCAATA CGATCAGACG ATCGTCGACG AGCGCGTCGC GCAGTACCGC 
GACCAGGTCC GCCGCCGCTT GTCCGGCGAA TTGAGCGAGG ACGAGTTCCG TCCGCTGCGG
CTGCAGAACG GCCTGTACAT GCAGCGCCAC GCCTACATGC ACCGCATCGC GATTCCCTAC
GGCAATCTCC GTAGCGTCCA GTTGCGCATG CTCGCCCGCA TCGCGCGCGA GCACGATCGC
GGCTACGGCC ATTTCTCGAC GCGCTCGAAT ATCCAGTACA ACTGGGTGAA GCTCGAGGAA
ACGCCCGAGA TTCTCGCAAA GCTCGCGTCG GTGCAGATGC ACGCGATCCA GACCTCGGGC
AACTGCATCC GCAACATCAC GGCCGACCAG TTCGCGGGCG TGGCTCAGGA CGAGGAGCTC
GATCCGCGTC CGTGGGCCGA GATCCTGCGC CAGTGGTCGA CCTTCCATCC CGAATTCGCA
TGGCTGCCGC GCAAGTTCAA GATCGCGGTG TCCGGCTCGA AGGCCGATCG CGCCGCCGTG
CAGATCCACG ATCTCGGCGT CTACCTGAAG AAGAACGACG CGGGCGAAGT GGTCGCGAGC
ATTCTCGCGG GCGGTGGTCT CGGCCGCACG CCGATCGTCG GCGCGATTAT CCGCGAGAAT
CTGCCGTGGC AGCACCTGCT CACTTACTGC GAGGCGGTGC TGCGCGTCTA CAACCGTTAT
GGCCGCCGCG ACAACCTGTA CAAGGCACGG ATCAAGATCC TCGTGAAGGC GCTGTCGCCC
GCGAAGTTCG CGCAACAGGT CGAAGCGGAG TGGCAGCACC TGAAGGACGG CCCGTCCACG
CTCACGCAGG CCGAACTCGA GCGCGTGTCG CAGTTCTTCC AGCCGCCCGC CTATGAGAAG
CTCGCCGATA CCGACGCGTC GTTCGAGCAG CATCTGCTCG AGAATCGCGC GTTCGCGCGC
TGGGTCGAGC GCAACGTCGC GCCGCACAAG GTGCCGGGCT ATGCGGCCGT CACGCTGTCG
TTGAAGAACC ACCGCGTCGC GCCCGGAGAC GCGAGCGCCG AGCAGATGGA GCAGGTCGCC
GACTGGGCCG ACGCCTATTC GTTCGGCGAG CTGCGCGTGT CGCACGAACA GAACCTGATT
CTCGCGAACG TGAAGAAGCG CGACCTGTTC GCGGTATGGG AAAAGGCGAA GGCGGCCGGT
TTCGCGACGC CGAACGTCGG CTTGCTGACC GACATCATCG CGTGCCCGGG GGGCGACTTC
TGCTCGCTCG CGAACGCGAA GTCGATCCCG ATCGCGCTCG CGATCCAGCA GCGCTTCGAC
GATCTCGACT ACGTGTACGA CCTGGGCGAC GTGTCGCTCA ACATCTCGGG CTGCATGAAC
TCGTGCGGGC ACCACCACGT CGGCAACATC GGCATCCTTG GCGTCGACAA GGACGGCGCG
GAGTGGTATC AGGTGTCGCT CGGCGGCGAG CAGGGCACGG GAGCGGGCGG CGCGCGCCTC
GGCCGCGTGA TCGGCCCGTC GTTCTCGGCC GAGGAAGTGC CCGACGTGAT CTCGAAGCTG
ATCGACACGT TCGTCGAATC GCGCATCGAC GGCGAGCGCT TCATCGACAC GTACGAGCGC
ATCGGCATCG CGCCGTTCAA GGAGCGCGTC TACGCGGCGC GCCAGACCGC GCACGCGTAA
 
Protein sequence
MYQYDQYDQT IVDERVAQYR DQVRRRLSGE LSEDEFRPLR LQNGLYMQRH AYMHRIAIPY 
GNLRSVQLRM LARIAREHDR GYGHFSTRSN IQYNWVKLEE TPEILAKLAS VQMHAIQTSG
NCIRNITADQ FAGVAQDEEL DPRPWAEILR QWSTFHPEFA WLPRKFKIAV SGSKADRAAV
QIHDLGVYLK KNDAGEVVAS ILAGGGLGRT PIVGAIIREN LPWQHLLTYC EAVLRVYNRY
GRRDNLYKAR IKILVKALSP AKFAQQVEAE WQHLKDGPST LTQAELERVS QFFQPPAYEK
LADTDASFEQ HLLENRAFAR WVERNVAPHK VPGYAAVTLS LKNHRVAPGD ASAEQMEQVA
DWADAYSFGE LRVSHEQNLI LANVKKRDLF AVWEKAKAAG FATPNVGLLT DIIACPGGDF
CSLANAKSIP IALAIQQRFD DLDYVYDLGD VSLNISGCMN SCGHHHVGNI GILGVDKDGA
EWYQVSLGGE QGTGAGGARL GRVIGPSFSA EEVPDVISKL IDTFVESRID GERFIDTYER
IGIAPFKERV YAARQTAHA