Gene BMA3301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA3301 
SymboldppA 
ID3088537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006348 
Strand
Start bp3412225 
End bp3413853 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content63% 
IMG OID637563853 
Productdipeptide ABC transporter, periplasmic didpeptide-binding protein 
Protein accessionYP_104773 
Protein GI53724885 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.511584 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACATA ACCGTCTGTT GCGCGCACTG CGTGCTACCG CCATCGCGGG CGTTGCAGCG 
GCATCGTTCG GCATCGCGGG TTCTGCATTC GCACAGATCC TGAACAAAAC GCTCGTCTAC
TGCTCAGAAG GCAGCCCGGC GGGCTTCGAT TCCGCGCAAT TCACGACGAG CGTCGATTTC
ACCGCGTCGA CGTTCCCGAT CTACAACCGC CTCGTCGAAT TCGAGCGCGG CGGCACGAAG
GTCGAGCCCG GCCTCGCCGA GAAGTGGGAC ATCTCGGCCG ACGGCAAGGT CTACACGTTC
CATCTGCGCC ACGGCGTCAA GTTCCATACG ACCGATTTCT TCAAGCCCAC GCGCGAATTC
AACGCGGACG ACGTCGCGTT CACGTTCGAG CGGATGCTCG ATCCGAATCA GGCGTTTCGC
AAGGCGTACC CGGTGTCGTT CCCGTACTTC ACCGACATGG GCCTCGACAA GCTGATCGTG
AAGATCGAGA AGCTCGATCC GTACACGATC CGCTTCACGC TGAAGGAGCC GAACGCGCCG
TTCATCCAGA ACCTCGCGAT GGAATTCGCG TCGATCCTCT CGGCCGAATA CGCGGACCAA
CTGATGAAGG CGGGCAAGGC GGCCGACATC AACCAGAAGC CGATCGGCAC GGGCCCGTTC
ATCTTCCGCA GCTACACGAA GGACGCGACG ATCCGCTTCG ACGGCAATCC TGATTATTGG
AAGAAGGGCG CGGTGAAGAT CTCGAAGCTG ATCTTCTCGA TCACGCCCGA CCCGGGCGTG
CGCGTGCAGA AGATCAAGCG CAACGAGTGC CAGGTGATGA GCTATCCGCG GCCCGCGGAC
ATCGCGACGC TGAAGGCCGA TTCGAACGTC GACATGCCGT CGCTGCCGGG CTTCAACCTC
GGCTACCTCG CGTACAACGT GCAGCACAAG CCCGTCGACA AGCTCGAAGT GCGCCAGGCG
CTCGACATGG CGATCAACAA GAAGGCGATT CTCGAATCCG TCTATCAGGG CGCGGGCCAG
GCGGCGAGCG CGCCGATGCC GCCGACCCAA TGGTCGTACG ACAAGAACCT GAAGGCCGCC
GCCTACGATC CGGCGAAGGC GAAGGCGCTG CTCGCGAAGG CGGGCTACCC GAACGGCTTC
CCGATCACGC TGTGGGCGAT GCCCGTGCAG CGCCCGTACA ACCCGAACGC GAAGCTGATG
GCCGAGATGA TCCAGGCCGA TTGGGCGAAG ATCGGCGTGC AGGCGAAGAT CGTCACGTAC
GAGTGGGGCG AGTACATCAA GCGCGCGCAT GCGGGCGAGC ACGATACGAT GCTGATCGGC
TGGAACGGCG ACAACGGCGA TCCCGACAAC TGGCTCGGCA CGCTGCTCGG CTGCGAGGCG
GTCAAGGGCA ACAACTTCTC CGAGTGGTGC TACAAGCCGT TCGACGAGCT GATCCAGAAG
GGCCGCGTGA CGACGTCGCA GGACGGCCGC ACGAAGATCT ACATGCAGGC GCAGCAGATC
TTCGCGCAGC AACTGCCGTT CTCGCCGATC GCGAACTCGA CCGTCTATCA GCCGGTGCGC
AAGAACATCG TCGACATGCG GATCGAGCCG CTCGGCTATG CGCGCTTCGA CGGCGTCAGC
GTGAAATAA
 
Protein sequence
MEHNRLLRAL RATAIAGVAA ASFGIAGSAF AQILNKTLVY CSEGSPAGFD SAQFTTSVDF 
TASTFPIYNR LVEFERGGTK VEPGLAEKWD ISADGKVYTF HLRHGVKFHT TDFFKPTREF
NADDVAFTFE RMLDPNQAFR KAYPVSFPYF TDMGLDKLIV KIEKLDPYTI RFTLKEPNAP
FIQNLAMEFA SILSAEYADQ LMKAGKAADI NQKPIGTGPF IFRSYTKDAT IRFDGNPDYW
KKGAVKISKL IFSITPDPGV RVQKIKRNEC QVMSYPRPAD IATLKADSNV DMPSLPGFNL
GYLAYNVQHK PVDKLEVRQA LDMAINKKAI LESVYQGAGQ AASAPMPPTQ WSYDKNLKAA
AYDPAKAKAL LAKAGYPNGF PITLWAMPVQ RPYNPNAKLM AEMIQADWAK IGVQAKIVTY
EWGEYIKRAH AGEHDTMLIG WNGDNGDPDN WLGTLLGCEA VKGNNFSEWC YKPFDELIQK
GRVTTSQDGR TKIYMQAQQI FAQQLPFSPI ANSTVYQPVR KNIVDMRIEP LGYARFDGVS
VK