Gene BURPS1710b_1176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1176 
SymbolpepA 
ID3689671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1220223 
End bp1221734 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content68% 
IMG OID637727632 
Productleucyl aminopeptidase 
Protein accessionYP_332587 
Protein GI76809201 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.624268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTTA GCATAAAAGG CTGTGATTGG AGCAAAGGCA CGGCGAACGG GTTCCTGACG 
GGGAAATCCG ACTGCATCGT GCTGGGCGTG TTCGAGGCGC AAACCTTGTC CGGCGCGGCG
CTCGACATCG ACGAAGCCAC GAAGGGGCTC GTCTCGCGCG TGATCAAGGC GGGCGACATC
GACGGCAAGC TCGGCAAGAC CTTGTTTTTG CACGAGGTTT CGGGCATCGG CGCATCGCGC
GTGCTGCTCG TCGGCCTGGG CAGGCAGGAT GCTTTCAGCC AGAAAGCCTA CGGCGACGCG
GCAAAGGCCG CATGGCGCGC GCTGCTCGGC ACGAAAGTGG TTCAGGTCAC GTTCACGCTC
GCGCAGTTGC CCGTGCCCGA GCGCGCGTCC GACTGGGGCG TGCGCGCGGC GATTCTCGCG
CTGCGCAATG AAACGTACAA GTTCACGCAG ATGAAGAGCA AGCCTGACGC GGGCGCGCCG
GCGCTCAAGC GCGTCGTGTT CAGCGTCGAT CCGGCCGACG ACAAGGCGGC GAAGGTCGCC
GCGAAGCAGG CGGTCGCGCT CGCGAACGGG ATGGACCTCA CGCGCGACCT CGGCAATCTG
CCCGGCAACG TCTGCACGCC GACCTACCTC GCGAACACCG CGAAGAAGAT CGCGAAGGAC
TGGGGCCTGA AAGTCGACGT GCTGGGCCTG AAGCAGATCC AGGCGCTCAA GATGGGCTCG
TTCCTGTCGG TCGCGAAGGG CTCGGTCGAG CCGCCGCAGT TCATCGTGCT GCAGTACCGG
GGCGCGGCCG CGAAGGCGGC GCCCGTCGTG CTCGTCGGCA AGGGCATCAC GTTCGACTCC
GGCGGCATTT CGCTGAAGCC GGGCGAGGGA ATGGACGAGA TGAAGTACGA CATGTGCGGC
GCGGGCTCGG TGCTCGGCAC GATGCGCGCG GTCGCCGAAA TGGGCCTGAA GATCAACGTC
GTCGCGATCG TGCCGACCTG CGAGAACATG CCGGCCGGCA ACGCGAACAA GCCGGGCGAC
ATCGTCACGA GCATGAAGGG CCTGACGATC GAGGTGCTCA ACACCGACGC GGAGGGCCGC
CTCATCCTGT GCGACGCGCT CACGTACGCG GAGCGCTTCA AGCCGGCCGC CGTGATCGAC
GTCGCGACGC TGACGGGCGC GTGCATCATC GCGCTCGGCC ACCACAACAC CGGCCTCTTC
TCGAAGGACG ACGCGCTCGC GGGCGAGCTG CTCGACGCGT CGCGCGAAGC GGGCGATCCG
GCGTGGCGCC TGCCGCTCGA CGACGAGTAT CAGGATCAGC TGAAGTCGAA CTTCGCGGAT
CTCGCGAACA TCGGCGGGCG CCCGGCCGGC AGCGTGACGG CCGCGTGCTT CCTGTCGCGC
TTCGCGGAAA ACTATCCGTG GGCGCACCTC GACATCGCGG GCACCGCCTG GAAGAGCGGC
GCGGCGAAGG GGGCGACGGG CCGCCCCGTG CCGCTCCTCG CGCAATTCCT GATCGACCGC
GCCGGCGCGT GA
 
Protein sequence
MDFSIKGCDW SKGTANGFLT GKSDCIVLGV FEAQTLSGAA LDIDEATKGL VSRVIKAGDI 
DGKLGKTLFL HEVSGIGASR VLLVGLGRQD AFSQKAYGDA AKAAWRALLG TKVVQVTFTL
AQLPVPERAS DWGVRAAILA LRNETYKFTQ MKSKPDAGAP ALKRVVFSVD PADDKAAKVA
AKQAVALANG MDLTRDLGNL PGNVCTPTYL ANTAKKIAKD WGLKVDVLGL KQIQALKMGS
FLSVAKGSVE PPQFIVLQYR GAAAKAAPVV LVGKGITFDS GGISLKPGEG MDEMKYDMCG
AGSVLGTMRA VAEMGLKINV VAIVPTCENM PAGNANKPGD IVTSMKGLTI EVLNTDAEGR
LILCDALTYA ERFKPAAVID VATLTGACII ALGHHNTGLF SKDDALAGEL LDASREAGDP
AWRLPLDDEY QDQLKSNFAD LANIGGRPAG SVTAACFLSR FAENYPWAHL DIAGTAWKSG
AAKGATGRPV PLLAQFLIDR AGA