Gene BMASAVP1_A2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A2109 
Symbol 
ID4679293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp2085757 
End bp2088567 
Gene Length2811 bp 
Protein Length936 aa 
Translation table11 
GC content69% 
IMG OID639846373 
ProductABC transporter, periplasmic substrate-binding protein 
Protein accessionYP_993422 
Protein GI121599743 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.556934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCATGACT TCGGCCGTGA CGCCGCTCGC GAGGTCCGAC AGCAGAAACG CGCCCGCGTT 
GCCGACCTGC TCGATCGTCA CGTTGCGCTT GAGCGGCGAG TTGCTCTCGA CGAAATCGAG
GATCTTGCCG AAGCTCTTGA TGCCGCTTGC CGCGAGCGTC TTGATCGGGC CCGCCGAGAT
CGCGTTCACG CGCACGCCCT TCGCGCCGAG CGACACCGCG AGATAGCGCA CGCTCGCCTC
GAGCGCCGCC TTCGCGAGGC CCATCGTGTT GTAGTTCGGG ATCGCCCGCT CCGCGCCGAG
ATACGACAGC GTGAGCAGCG ACGCATCGTC CGACAGCATC GGCAGCGCCG CCTTCGCGAG
CGCGGGGAAG CTGTATGCGG AGATGTCGTG CGCGATGCGG AAGTTCTCGC GCGTGAGGCC
GTCGAGGAAG TCGCCCGCGA TCGCCTCGCG CGGCGCGAAG CCGATCGAGT GGACGAGGCC
GTCGAGCGAA TCCCAGTGTG TCTTCAGCGA CGCGAAGAGC GCATCGATCT GCGCATCGTC
GGCGACATCG CACGGGAACA CGAGTTCGCT GCCGAACTCG GCCGCGAACT CGGTGATGCG
ATCCTTGAAG CGATCGCCGA CGTAGGTGAA CGCCAGCTCG GCGCCTTCGC GCTTGCACGC
CTTGGCGATG CCGTAAGCGA TCGAACGGTT CGACAAGAGG CCCGTCAGCA GAATACGTTT
ACCGTCGAGA AAGCCCATGA ATTCTCCTAT GTGTGGCGCC CGCGCCCGCA CGGTCGGCGG
GCTGCGTTTG GCTGCAACCG GTTGCGAATG GGTAGAATTC TCTCACATCG CCATCCTCGA
ACGACTGCCA ACACTTCCAT GACGATCGGT TCGCCCCGGG CCCGCCGGCC GCGATCGCCG
CAACGCGCGG CGCCATCAAA ACAGGCCGCG CGCGCCGCCG CGCCGCGACA GGCGGCCCGC
GCGCGCGCCG CGCTCGCGCG CTTCGCGCGC CGGGCGGCGG CGGGCGTCAC GCTCGCCTTC
GTCGCGGTGC CCGCGGCGCA CGCCGTCTAC GCGATCGCGC AGTACGGCGA GCCGAAGTAT
CCGGCGGGCT TCGCGCATTT CGACTACGTG AACCCCGACG CGCCGAAGGG CGGCACGCTC
GTGCTCGCGA ACCCGAACCG GCTCACGACG TTCGACAAGT TCAATCCGTT CACGATGCGC
GGCAACCCGG CGCCCGGAAT CGACCTGCTG TTCGAGAGCC TGACGACGGG CAGCGCCGAC
GAGCCCGCCT CCGCGTACGG CCTGCTCGCG GACGACATCG CCGTCGCGCC GGACGGCCTG
TCGGTCACGT TCCATCTGAA TCCGCGCGCG CGCTTCTCGA ACGGAGAACC CGTCACCGCG
GCGGACGTCA AGTATTCGTT CGACACGCTG AAGAGCCCGA AGGCGGCGCC GCAATACCCG
GCGTACTACG CGGACATCGC GCGCGCGGTG ATCGTCGACG CGGCGACCGT GCGCTTCGAG
TTTCGCCGCA AGAACCGCGA GCTGCCGCTG ATCGCGGGCG GCATCCCGGT GTTCTCGCGC
AAATGGGGCG TGCGCGCGGA CGGCTCGCAC ATCGCGTTCG ACCAGATCGC GTTCGAGCAG
CCGATCGGCA GCGGCCCGTA CCTGATCGAG CGCTACGACA GCGGGCGCAC GATCACGTAC
CGGCGCAATC CCGCCTACTG GGGCGCGGCG CTGCCCGTGC GGATCGGCAC GAACAACTTC
GAGCGCATCG TCTACAAGCT GTACGGCGAC GGCGTCGCGC GGCTCGAGGC GTTCAAGGCC
GGCGAATACG ACGTGCTCGT CGAGTACATC GCGCGCAACT GGGCGCGGCG CGACGTCGGC
AAGCGCTTCG ACAGCGGCGA GCTCGTCAAG CGCGAGTTCC GCCAGCACAA CGGCGCGGGA
ATGCAGGGCT TCTTCATGAA CCTGCGCCGG CCGCTGTTCC AGGACGTGCG CGTGCGCCAC
GCGCTCGATC TCGCGTTCGA TTTCGAATGG CTGAACCGGC AGCTCTTCTA TGGCGCGTAC
ACGCGCCTGA ACAGCTATTT CGCCGATACC GACCTGCAGG CGACGGGCAC GCCGAGCGCG
GGCGAGCTCG CGCTGCTCGC CCCGTTGCGC GCGCAGCTCG ACCCGGCCGT GTTCGGGCCG
ATGACCGTGC AGCCGAGCAC CGATTTGCCC GCGTCGCTGC GCGCGAACCT GCTGAAGGCG
CGCGCGCTGC TCGCCGAGGC CGGCTGGACC TACCGCGACG GCGCGCTGCG CAACGCGAAG
GGCGAGCCGT TCGTGTTCGA GATTCTCGAC GATTCGGGCT CGGCGTTCGA GCCGGTGGTC
GCCGCGTACA TCCGCAATCT CGCGAAGCTC GGGATCGTCG TGAAGTACCG GACGGCCGAT
TTCGCGCTGC TGCAAAAGCG CCTCGACGCG TTCGACTACG ACATGACGAC GGTCCGCTAC
CCGGGCGTCC AGGTGCCGGG CGCCGAGCAG GTCGCACGCT TCGCGAGCCG CTATGCGGAC
GAGCCGGGCT CGGACAACCT GACGGGGCTC AAGTCGCCCG CGGTCGACGC GATCCTGAAG
GCGCTCACGC AGGCCGAGAC GCGCGACGAA CTGCTCGACG CGACGCACGC GCTCGACCGC
GTGCTGATGC ACGGCTACTA TGCGGTGCCG CAGTGGTACA GCGCCGTGCA CCGGATCGCG
TTCAAGCGCA CGCTCGCCTA CCCGTCGGTG CTGCCGCTGT ACTATTCGGC GGAAGGCTGG
GTCGCCTCGA CGTGGTGGGC GAGGCCCGAG CATGGCGCGT CCGCGCGTTA G
 
Protein sequence
MHDFGRDAAR EVRQQKRARV ADLLDRHVAL ERRVALDEIE DLAEALDAAC RERLDRARRD 
RVHAHALRAE RHREIAHARL ERRLREAHRV VVRDRPLRAE IRQREQRRIV RQHRQRRLRE
RGEAVCGDVV RDAEVLAREA VEEVARDRLA RREADRVDEA VERIPVCLQR REERIDLRIV
GDIAREHEFA AELGRELGDA ILEAIADVGE RQLGAFALAR LGDAVSDRTV RQEARQQNTF
TVEKAHEFSY VWRPRPHGRR AAFGCNRLRM GRILSHRHPR TTANTSMTIG SPRARRPRSP
QRAAPSKQAA RAAAPRQAAR ARAALARFAR RAAAGVTLAF VAVPAAHAVY AIAQYGEPKY
PAGFAHFDYV NPDAPKGGTL VLANPNRLTT FDKFNPFTMR GNPAPGIDLL FESLTTGSAD
EPASAYGLLA DDIAVAPDGL SVTFHLNPRA RFSNGEPVTA ADVKYSFDTL KSPKAAPQYP
AYYADIARAV IVDAATVRFE FRRKNRELPL IAGGIPVFSR KWGVRADGSH IAFDQIAFEQ
PIGSGPYLIE RYDSGRTITY RRNPAYWGAA LPVRIGTNNF ERIVYKLYGD GVARLEAFKA
GEYDVLVEYI ARNWARRDVG KRFDSGELVK REFRQHNGAG MQGFFMNLRR PLFQDVRVRH
ALDLAFDFEW LNRQLFYGAY TRLNSYFADT DLQATGTPSA GELALLAPLR AQLDPAVFGP
MTVQPSTDLP ASLRANLLKA RALLAEAGWT YRDGALRNAK GEPFVFEILD DSGSAFEPVV
AAYIRNLAKL GIVVKYRTAD FALLQKRLDA FDYDMTTVRY PGVQVPGAEQ VARFASRYAD
EPGSDNLTGL KSPAVDAILK ALTQAETRDE LLDATHALDR VLMHGYYAVP QWYSAVHRIA
FKRTLAYPSV LPLYYSAEGW VASTWWARPE HGASAR