Gene BMAA1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA1040 
Symbol 
ID3087879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp1077614 
End bp1079614 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content70% 
IMG OID637564942 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_105706 
Protein GI53716929 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.643588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGCG AAACCGGATA CATGGGATTC GTCGTCGTGC TCGTGTTCAT GGTGCTGCTC 
GCGCTGCAGC TCGCCACGCT CAGCGCGCCG GCCACGCAGA TCGCGTACAG CGACTTTCGC
AAGCTCGCCG CGGCCGCGCA GCTCGACGAT CTCGAAGTCA GCCCGACGCG CATCACGGGC
GTGCTGCGCA GTGCGTCCGC GGCGGCGGCG CTGCCCGCCT CCGACGCGGA GGCGATCAAG
CGCGCGGGCA CGCCGTGGCG CTTCTCGACA AAGCGCGTGA CCGACGAGCG CCTGATCGAC
ACGCTCGCCG CGACGGGCAC CCGCTATCGC GGCGCCGACG ACGACACGTG GATCGGCACG
CTCGCATCGT GGATCGTGCC GATCGCGGTG TTCGCGCTCG TCTGGAACCT GATGCTGCGG
CGCCCGCGCG GCGGCCTGCA GGACTGGTCG GGCGTCGGCA AGAGCAAGCC GCGCGTCTAT
GTGGAGGCGA AGACCGGCAT CGATTTCGAC GACATCGCGG GCATCGACGA GGCGAAGGCC
GAGCTCCAGC AGATCGTCGC GTTTCTGCGC GCGCCCGCGC GCTACCAGCG GCTCGGCGGC
AAGATCCCGA AGGGCGTGCT GATCGTCGGC GCGCCCGGCA CCGGCAAGAC GCTGCTCGCG
AAGGCGGTGG CGGGCGAGGC GGGCGTGCCG TTCTTCTCGA CGAGCGGCTC GTCGTTCGTC
GAGATGTTCG TCGGCGTCGG CGCGGCGCGC GTGCGCGATC TGTTCGAGCA GGCGCAGCAA
AAGGCGCCGT GCATCATCTT CATCGACGAG CTCGACGCGC TCGGCAAGGT GCGCGGCGCG
GGGCTCGCGT CGGGCAACGA CGAGCGCGAG CAGACGCTGA ACCAGTTGCT CGTGGAGATG
GACGGCTTCC AGGCGAACTC CGGCGTGATC CTCATGGCGG CGACCAATCG TCCGGAGATT
CTCGATCCCG CGCTGCTGCG CCCGGGCCGC TTCGACCGCC ACATCGCGAT CGACCGGCCG
GACTTGACGG GGCGCCGGCA GATCCTGTCG GTCCACGTGA AGCACGTGAA GCTCGGCCCG
GACGTCGATC TCGGCGAGCT CGCGTCGCAC ACGCCCGGCT TCGTCGGCGC GGATCTCGCG
AACATCGTCA ACGAGGCGGC GCTGCACGCG GCCGAGCTCG ACAAGCCCGC GATCGACATG
TCCGATTTCG ACGAGGCGAT CGACCGCGCG ATGACCGGCA TGGAACGCAA GAGCCGCGTG
ATGAGCGAGC GCGAGAAGAT CACGATCGCG CATCACGAGG CGGGGCACGC GCTGATCGCG
CAGACGCGCG CGCACAGCGA TCCGGTGAAG AAGGTGTCGA TCATTCCGCG CGGCATCGCG
GCGCTCGGCT ACACGCAGCA GGTGCCGACC GAGGATCGCT ACGTGCTGCG CAAGAGCGAG
CTGCTCGACC GGCTCGACGT GCTGCTCGGC GGGCGCGTCG CCGAGGAGAT CGTGTTCGGC
GACGTGTCGA CGGGCGCGGA GAACGATCTC GAGCGCGCGA CCGAAATGGC GCGGCACATG
GTCGCCCGCT ACGGGATGAG CGAGCGGATC GGCCTCGCGA CGTTCGGCGA CGCGGACACC
CAGGGGCTGT CGCCCCTCGT CTGGCAGCGC GGCGGCGAGC GCTGCAGCGA GAGCACCGCG
ACGCGGATCG ACGACGAGAT CCAGCGGCTC CTCGCCGAGG CGCACGATCG CGTGTCGCGT
ACGCTGAAGG AGCGGCGCGG CGCGCTCGAA CGGATCGCCG GGTATCTGCT CGAGCACGAG
GTGGTCGATC ACGACAAGCT CGTGAGGCTC GTCAACGACG AGCCGACGCC CGAGCCCGGC
GCGCGCGATC CGGGCGGCGA CGCGGCGAAG CGAAGCGGCA TCGGCGCCGC GCCGGCGAAG
CCGCCGGCGG AAGTCGGGAG CGCCGAGCTT CGCGATCCGG CTCGAAAGGC CGACAACGCG
GACCACTCCG TGCCGCAGTG A
 
Protein sequence
MKSETGYMGF VVVLVFMVLL ALQLATLSAP ATQIAYSDFR KLAAAAQLDD LEVSPTRITG 
VLRSASAAAA LPASDAEAIK RAGTPWRFST KRVTDERLID TLAATGTRYR GADDDTWIGT
LASWIVPIAV FALVWNLMLR RPRGGLQDWS GVGKSKPRVY VEAKTGIDFD DIAGIDEAKA
ELQQIVAFLR APARYQRLGG KIPKGVLIVG APGTGKTLLA KAVAGEAGVP FFSTSGSSFV
EMFVGVGAAR VRDLFEQAQQ KAPCIIFIDE LDALGKVRGA GLASGNDERE QTLNQLLVEM
DGFQANSGVI LMAATNRPEI LDPALLRPGR FDRHIAIDRP DLTGRRQILS VHVKHVKLGP
DVDLGELASH TPGFVGADLA NIVNEAALHA AELDKPAIDM SDFDEAIDRA MTGMERKSRV
MSEREKITIA HHEAGHALIA QTRAHSDPVK KVSIIPRGIA ALGYTQQVPT EDRYVLRKSE
LLDRLDVLLG GRVAEEIVFG DVSTGAENDL ERATEMARHM VARYGMSERI GLATFGDADT
QGLSPLVWQR GGERCSESTA TRIDDEIQRL LAEAHDRVSR TLKERRGALE RIAGYLLEHE
VVDHDKLVRL VNDEPTPEPG ARDPGGDAAK RSGIGAAPAK PPAEVGSAEL RDPARKADNA
DHSVPQ