Gene BMA10229_A1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A1648 
Symbol 
ID4792415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp1690756 
End bp1693086 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content69% 
IMG OID 
Productputative methyl-accepting chemotaxis protein 
Protein accessionYP_001027628 
Protein GI124383350 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.562276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCCCT TCATGCTCTC CTCGATCCGG TCCCGCATTC TCGTCGCGTG CCTCGCCATC 
GTCATCGGCT CGCTCGTCAT CAACACGGCG CTCAACTATT TCGTCGCCAA CCGCTATAAC
CGCGAGTCCA TCAGCCAGAA CCTCAGCGCG GTGCTGACCG GCCACGAAGC CGGCATCGCC
GATTGGGTCG CGTCGAAGAC GCAGATGATC GTGTCGGTCG AAGACGCCGC GATCTCGCCG
GATCCGATTC CGGCGCTCAA GCAGATCGCC GCCGCCGGCG GGTTCACCAA TGTCTATGTC
GGCTACGCGG ACAAGACCGC GAAATTCTCC GATCCCACCG GTATTCCGCC CGACTACGAT
CCGACCGGCC GCCCGTGGTA CAAGCAGGCC GCGCAGGCGG GCAAGCCGGT CGTCACGCCG
CCCTATGTCG ACGTCGGCAC GGGCAAGCTC GTCGTCGCGT TCGCCGCGCC GATCATGCGC
GACGGCGCGC TGAAGGGCGT CGTGTCCGGC GACGTCGCGA TGGACAGCGT GATCGCGAAC
GTCAAGGCGA TCCACCCGAC GCCCGGCAGT TTCGGGATGC TCGTCGACCG CAGCGGCCAT
ATCGTCGCGC ATGCCGATTC GAAGCTCACG CTCAAGCCCG TCACCGATCT CTCCGACGAT
TTGAGCATCG ACGCGCTCGC CGCGGCGTCG GCCGACGAGA ACGCCGCGCC GATCGACGCG
CACGTCGCGG GCGCGGCGAA GCTGATGCGC GCGCGCGCCG TGCCGGGTAC CGACTGGCTG
ACCGTCGTCG CGCTCGACAA GTCCGATGCG ACGGCCGGCA TGCATTCGCT GCTGCTCGTC
TCGATCGGCA CGCTCGTCGC GCTTGCCGCC GTCGCCGCGC TGATCGTCGG CGCGATCACG
GGCGTCGCGT TCCGCGGGCT CGCGCGCATT CGCGACGCGA TGGAATCGAT CGGCTCCGGC
ACGGGCGATC TGACGCAGCG CCTGCCCGAT AGCGGCCGCG ACGAAGTGGC GCAGATCGCG
CGCTCGTTCA ACGCGTTCGT CAGCAAGCTG CAGGAGGTGA TGCGCGTGAT CCGCGACGCG
AGCGAATCGG TGCGGCATGC GGCGGGCGAG ATCGCGTCGG GCAATCACGA TCTGTCGCGC
CGCACGGAAT CGGCGGCGGC GAGCCTGCAG CAGACGGCCG CGTCGATCGA GGAGATCACG
TCGACGGTCA CGCAATCGGC AGGCGCCGCG CGCCAGGCGA ACGACATCGC GACGAACGCG
GCGAGCGTCG CGTCGCGCGG CGGCACGGTC GTGTCCGACG TCGTGTCGAC GATGCACGAG
ATCGAAGGCG CGTCCGGCAA GATCGCCGAC ATCATCGGCG TGATCGACGG CATCGCGTTC
CAGACCAATA TCCTCGCGCT GAACGCGGCC GTCGAGGCGG CGCGCGCGGG CGAGGAGGGG
CGCGGCTTCG CGGTGGTCGC GGGCGAGGTG CGCTCGCTCG CGCAGCGCGC AGGCGGCGAA
GGAAATCAAG GCGCTGATCG ATTCGAGCGT GACGAGCGTG TCGACGGGCG CCACGCTCGT
CCAGCAGGCC GGGCAGACGA TGAGCGACAT CGTCGGCACG GTTTCGAACG TGACGACGAT
CATGCGCGAG ATCTCGAATG CCGCCGACGA ACAGACGCGC GGCATCCAGG AAGTGAATCG
CGCGGTCGCG CAGCTCGACG AGATGGTTCA GCAGAACGCG GCGCTCGTCG AGCAGTCGGC
CGCGGCCGCC TCGGCGTTGC AGACTCAGGC GGTCGAGCTC GCCGATGCGG TCGGGCAGTT
CAAGGTCGCG TGATCGGCGC GCCGATCGCA TATCGGTCAC GCCTCGACCG TCGATCGCGC
ATCGGTCACG CGCGGGCCGC GCCGCGCCCG AGCCAATCGC TCAGCGCCTG CGTGACGAAT
GCCGGATTCT CGAGATTCGA GATATGTCCG GCATTCGGCA CGAACGCCTT TTCGCAGCCG
ATGAGCGCCG CGATTTCGTC GGCTTCCTCG GGCGGCCGCG CGACGTCGTT CGCGCCGCAC
ATCACGAGCG TGCGGTCCGC CGGCAGCGCG CTCAGTTGCG CGCGCGCGTC TTCGCGGCCG
AACGTGATCT TGCCGAGCGG TATCACCGAG TCGCGCAGCC GCTCGGTCGT GAACGCCTGC
AGCGCGCGCC GGAAGCCCGT GGGCAGCTCG CTCGCCGGAT CGATGCCGGG GCGGAAGAAG
ATCGGCACGA TCGCGTCGAG CAGCGGCGCC GGAATCGCGC CTTGCGCGTC GATGGCCTCG
AGCATCTGGA AGTACTGGTT GCGCGTCGCG TCGGGCTCGA CGCCGACGTA G
 
Protein sequence
MSPFMLSSIR SRILVACLAI VIGSLVINTA LNYFVANRYN RESISQNLSA VLTGHEAGIA 
DWVASKTQMI VSVEDAAISP DPIPALKQIA AAGGFTNVYV GYADKTAKFS DPTGIPPDYD
PTGRPWYKQA AQAGKPVVTP PYVDVGTGKL VVAFAAPIMR DGALKGVVSG DVAMDSVIAN
VKAIHPTPGS FGMLVDRSGH IVAHADSKLT LKPVTDLSDD LSIDALAAAS ADENAAPIDA
HVAGAAKLMR ARAVPGTDWL TVVALDKSDA TAGMHSLLLV SIGTLVALAA VAALIVGAIT
GVAFRGLARI RDAMESIGSG TGDLTQRLPD SGRDEVAQIA RSFNAFVSKL QEVMRVIRDA
SESVRHAAGE IASGNHDLSR RTESAAASLQ QTAASIEEIT STVTQSAGAA RQANDIATNA
ASVASRGGTV VSDVVSTMHE IEGASGKIAD IIGVIDGIAF QTNILALNAA VEAARAGEEG
RGFAVVAGEV RSLAQRAGGE GNQGADRFER DERVDGRHAR PAGRADDERH RRHGFERDDD
HARDLECRRR TDARHPGSES RGRAARRDGS AERGARRAVG RGRLGVADSG GRARRCGRAV
QGRVIGAPIA YRSRLDRRSR IGHARAAPRP SQSLSACVTN AGFSRFEICP AFGTNAFSQP
MSAAISSASS GGRATSFAPH ITSVRSAGSA LSCARASSRP NVILPSGITE SRSRSVVNAC
SARRKPVGSS LAGSMPGRKK IGTIASSSGA GIAPCASMAS SIWKYWLRVA SGSTPT