Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_A2109 |
Symbol | |
ID | 4679293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | - |
Start bp | 2085757 |
End bp | 2088567 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639846373 |
Product | ABC transporter, periplasmic substrate-binding protein |
Protein accession | YP_993422 |
Protein GI | 121599743 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.556934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCATGACT TCGGCCGTGA CGCCGCTCGC GAGGTCCGAC AGCAGAAACG CGCCCGCGTT GCCGACCTGC TCGATCGTCA CGTTGCGCTT GAGCGGCGAG TTGCTCTCGA CGAAATCGAG GATCTTGCCG AAGCTCTTGA TGCCGCTTGC CGCGAGCGTC TTGATCGGGC CCGCCGAGAT CGCGTTCACG CGCACGCCCT TCGCGCCGAG CGACACCGCG AGATAGCGCA CGCTCGCCTC GAGCGCCGCC TTCGCGAGGC CCATCGTGTT GTAGTTCGGG ATCGCCCGCT CCGCGCCGAG ATACGACAGC GTGAGCAGCG ACGCATCGTC CGACAGCATC GGCAGCGCCG CCTTCGCGAG CGCGGGGAAG CTGTATGCGG AGATGTCGTG CGCGATGCGG AAGTTCTCGC GCGTGAGGCC GTCGAGGAAG TCGCCCGCGA TCGCCTCGCG CGGCGCGAAG CCGATCGAGT GGACGAGGCC GTCGAGCGAA TCCCAGTGTG TCTTCAGCGA CGCGAAGAGC GCATCGATCT GCGCATCGTC GGCGACATCG CACGGGAACA CGAGTTCGCT GCCGAACTCG GCCGCGAACT CGGTGATGCG ATCCTTGAAG CGATCGCCGA CGTAGGTGAA CGCCAGCTCG GCGCCTTCGC GCTTGCACGC CTTGGCGATG CCGTAAGCGA TCGAACGGTT CGACAAGAGG CCCGTCAGCA GAATACGTTT ACCGTCGAGA AAGCCCATGA ATTCTCCTAT GTGTGGCGCC CGCGCCCGCA CGGTCGGCGG GCTGCGTTTG GCTGCAACCG GTTGCGAATG GGTAGAATTC TCTCACATCG CCATCCTCGA ACGACTGCCA ACACTTCCAT GACGATCGGT TCGCCCCGGG CCCGCCGGCC GCGATCGCCG CAACGCGCGG CGCCATCAAA ACAGGCCGCG CGCGCCGCCG CGCCGCGACA GGCGGCCCGC GCGCGCGCCG CGCTCGCGCG CTTCGCGCGC CGGGCGGCGG CGGGCGTCAC GCTCGCCTTC GTCGCGGTGC CCGCGGCGCA CGCCGTCTAC GCGATCGCGC AGTACGGCGA GCCGAAGTAT CCGGCGGGCT TCGCGCATTT CGACTACGTG AACCCCGACG CGCCGAAGGG CGGCACGCTC GTGCTCGCGA ACCCGAACCG GCTCACGACG TTCGACAAGT TCAATCCGTT CACGATGCGC GGCAACCCGG CGCCCGGAAT CGACCTGCTG TTCGAGAGCC TGACGACGGG CAGCGCCGAC GAGCCCGCCT CCGCGTACGG CCTGCTCGCG GACGACATCG CCGTCGCGCC GGACGGCCTG TCGGTCACGT TCCATCTGAA TCCGCGCGCG CGCTTCTCGA ACGGAGAACC CGTCACCGCG GCGGACGTCA AGTATTCGTT CGACACGCTG AAGAGCCCGA AGGCGGCGCC GCAATACCCG GCGTACTACG CGGACATCGC GCGCGCGGTG ATCGTCGACG CGGCGACCGT GCGCTTCGAG TTTCGCCGCA AGAACCGCGA GCTGCCGCTG ATCGCGGGCG GCATCCCGGT GTTCTCGCGC AAATGGGGCG TGCGCGCGGA CGGCTCGCAC ATCGCGTTCG ACCAGATCGC GTTCGAGCAG CCGATCGGCA GCGGCCCGTA CCTGATCGAG CGCTACGACA GCGGGCGCAC GATCACGTAC CGGCGCAATC CCGCCTACTG GGGCGCGGCG CTGCCCGTGC GGATCGGCAC GAACAACTTC GAGCGCATCG TCTACAAGCT GTACGGCGAC GGCGTCGCGC GGCTCGAGGC GTTCAAGGCC GGCGAATACG ACGTGCTCGT CGAGTACATC GCGCGCAACT GGGCGCGGCG CGACGTCGGC AAGCGCTTCG ACAGCGGCGA GCTCGTCAAG CGCGAGTTCC GCCAGCACAA CGGCGCGGGA ATGCAGGGCT TCTTCATGAA CCTGCGCCGG CCGCTGTTCC AGGACGTGCG CGTGCGCCAC GCGCTCGATC TCGCGTTCGA TTTCGAATGG CTGAACCGGC AGCTCTTCTA TGGCGCGTAC ACGCGCCTGA ACAGCTATTT CGCCGATACC GACCTGCAGG CGACGGGCAC GCCGAGCGCG GGCGAGCTCG CGCTGCTCGC CCCGTTGCGC GCGCAGCTCG ACCCGGCCGT GTTCGGGCCG ATGACCGTGC AGCCGAGCAC CGATTTGCCC GCGTCGCTGC GCGCGAACCT GCTGAAGGCG CGCGCGCTGC TCGCCGAGGC CGGCTGGACC TACCGCGACG GCGCGCTGCG CAACGCGAAG GGCGAGCCGT TCGTGTTCGA GATTCTCGAC GATTCGGGCT CGGCGTTCGA GCCGGTGGTC GCCGCGTACA TCCGCAATCT CGCGAAGCTC GGGATCGTCG TGAAGTACCG GACGGCCGAT TTCGCGCTGC TGCAAAAGCG CCTCGACGCG TTCGACTACG ACATGACGAC GGTCCGCTAC CCGGGCGTCC AGGTGCCGGG CGCCGAGCAG GTCGCACGCT TCGCGAGCCG CTATGCGGAC GAGCCGGGCT CGGACAACCT GACGGGGCTC AAGTCGCCCG CGGTCGACGC GATCCTGAAG GCGCTCACGC AGGCCGAGAC GCGCGACGAA CTGCTCGACG CGACGCACGC GCTCGACCGC GTGCTGATGC ACGGCTACTA TGCGGTGCCG CAGTGGTACA GCGCCGTGCA CCGGATCGCG TTCAAGCGCA CGCTCGCCTA CCCGTCGGTG CTGCCGCTGT ACTATTCGGC GGAAGGCTGG GTCGCCTCGA CGTGGTGGGC GAGGCCCGAG CATGGCGCGT CCGCGCGTTA G
|
Protein sequence | MHDFGRDAAR EVRQQKRARV ADLLDRHVAL ERRVALDEIE DLAEALDAAC RERLDRARRD RVHAHALRAE RHREIAHARL ERRLREAHRV VVRDRPLRAE IRQREQRRIV RQHRQRRLRE RGEAVCGDVV RDAEVLAREA VEEVARDRLA RREADRVDEA VERIPVCLQR REERIDLRIV GDIAREHEFA AELGRELGDA ILEAIADVGE RQLGAFALAR LGDAVSDRTV RQEARQQNTF TVEKAHEFSY VWRPRPHGRR AAFGCNRLRM GRILSHRHPR TTANTSMTIG SPRARRPRSP QRAAPSKQAA RAAAPRQAAR ARAALARFAR RAAAGVTLAF VAVPAAHAVY AIAQYGEPKY PAGFAHFDYV NPDAPKGGTL VLANPNRLTT FDKFNPFTMR GNPAPGIDLL FESLTTGSAD EPASAYGLLA DDIAVAPDGL SVTFHLNPRA RFSNGEPVTA ADVKYSFDTL KSPKAAPQYP AYYADIARAV IVDAATVRFE FRRKNRELPL IAGGIPVFSR KWGVRADGSH IAFDQIAFEQ PIGSGPYLIE RYDSGRTITY RRNPAYWGAA LPVRIGTNNF ERIVYKLYGD GVARLEAFKA GEYDVLVEYI ARNWARRDVG KRFDSGELVK REFRQHNGAG MQGFFMNLRR PLFQDVRVRH ALDLAFDFEW LNRQLFYGAY TRLNSYFADT DLQATGTPSA GELALLAPLR AQLDPAVFGP MTVQPSTDLP ASLRANLLKA RALLAEAGWT YRDGALRNAK GEPFVFEILD DSGSAFEPVV AAYIRNLAKL GIVVKYRTAD FALLQKRLDA FDYDMTTVRY PGVQVPGAEQ VARFASRYAD EPGSDNLTGL KSPAVDAILK ALTQAETRDE LLDATHALDR VLMHGYYAVP QWYSAVHRIA FKRTLAYPSV LPLYYSAEGW VASTWWARPE HGASAR
|
| |