Gene BURPS668_1015 details


Gene Information

Locus tag: BURPS668_1015
Symbol: pepA
ID: 4883681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 989975
End bp: 991486
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 68%
IMG OID: 640126943
Product: leucyl aminopeptidase
Protein accession: YP_001058065
Protein GI: 126441940
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
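
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(989975, 991486)
print(length, protein_length_aa(length))  # prints: 1512 503
```

Under those assumptions the result agrees with the Gene Length (1512 bp) and Protein Length (503 aa) fields of this record.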


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.490931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTTTA GCATAAAAGG CTGTGATTGG AGCAAAGGCA CGGCGAACGG GTTCCTGACG 
GGGAAATCCG ACTGCATCGT GCTGGGCGTG TTCGAGGCGC AAACCTTGTC CGGCGCGGCG
CTCGACATCG ACGAAGCCAC GAAGGGGCTC GTCTCGCGCG TGATCAAGGC GGGCGACATC
GACGGCAAGC TCGGCAAGAC CTTGTTTTTG CACGAGGTTT CGGGCATCGG CGCATCGCGC
GTGCTGCTCG TCGGCCTGGG CAGGCAGGAT GCTTTCAGCC AGAAAGCCTA CGGCGACGCG
GCAAAGGCCG CATGGCGCGC GCTGCTCGGC ACGAAAGTGG TTCAGGTCAC GTTCACGCTC
GCGCAGTTGC CCGTGCCCGA GCGCGCGTCC GACTGGGGTG TGCGCGCGGC GATTCTCGCG
CTGCGCAATG AAACGTACAA GTTCACGCAG ATGAAGAGCA AGCCGGACGC GGGCGCGCCG
GCGCTCAAGC GCGTCGTGTT CAGCGTCGAT CCGGCCGACG ACAAGGCGGC GAAGGTCGCC
GCGAAGCAGG CGGTCGCGCT CGCGAACGGG ATGGACCTCA CGCGCGACCT CGGCAATCTG
CCCGGCAACG TCTGCACGCC GACCTACCTC GCGAACACCG CGAAGAAGAT CGCGAAGGAC
TGGGGCCTGA AAGTCGACGT GCTGGGCCTG AAGCAGATCC AGGCGCTCAA GATGGGCTCG
TTCCTGTCGG TCGCGAAGGG CTCGGTCGAG CCGCCGCAGT TCATCGTGCT GCAGTACCGG
GGCGCGGCCG CGAAGGCGGC GCCCGTCGTG CTCGTCGGCA AGGGCATCAC GTTCGACTCC
GGCGGCATTT CGCTGAAGCC GGGCGAGGGA ATGGACGAGA TGAAGTACGA CATGTGCGGC
GCGGGCTCGG TGCTCGGCAC GATGCGCGCG GTCGCCGAAA TGGGCCTGAA GGTCAACGTC
GTCGCGATCG TGCCGACCTG CGAGAACATG CCGGCCGGCA ACGCGAACAA GCCGGGCGAC
ATCGTCACGA GCATGAAGGG CCTGACGATC GAGGTGCTCA ACACCGACGC GGAGGGCCGC
CTCATCCTGT GCGACGCGCT CACGTACGCG GAGCGCTTCA AGCCGGCCGC CGTGATCGAC
GTCGCGACGC TGACGGGCGC GTGCATCATC GCGCTCGGCC ACCACAACAC CGGCCTCTTC
TCGAAGGACG ACGCGCTCGC GGGCGAGCTG CTCGACGCGT CGCGCGAAGC GGGCGATCCG
GCGTGGCGCC TGCCGCTCGA CGACGAGTAT CAGGATCAGC TGAAGTCGAA CTTCGCGGAT
CTCGCGAACA TCGGCGGGCG CCCGGCCGGC AGCGTGACGG CCGCGTGCTT CCTGTCGCGC
TTCGCGGAAA ACTATCCGTG GGCGCACCTC GACATCGCGG GCACCGCCTG GAAGAGCGGC
GCGGCGAAGG GGGCGACGGG CCGCCCCGTG CCGCTCCTCG CGCAATTCCT GATCGACCGC
GCCGGCGCGT GA
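
The GC content field can be recomputed from a gene sequence written in this layout. The following is a minimal sketch, not the method the database itself uses: `gc_content` is a hypothetical helper, and it assumes the sequence contains only the letters A, C, G and T plus the whitespace used to group it into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that group the sequence into blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above; pass the full sequence to
# compare the result against the GC content field of the record.
first_line = "ATGGACTTTA GCATAAAAGG CTGTGATTGG AGCAAAGGCA CGGCGAACGG GTTCCTGACG"
print(f"{gc_content(first_line):.1%}")
```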
 
Protein sequence
MDFSIKGCDW SKGTANGFLT GKSDCIVLGV FEAQTLSGAA LDIDEATKGL VSRVIKAGDI 
DGKLGKTLFL HEVSGIGASR VLLVGLGRQD AFSQKAYGDA AKAAWRALLG TKVVQVTFTL
AQLPVPERAS DWGVRAAILA LRNETYKFTQ MKSKPDAGAP ALKRVVFSVD PADDKAAKVA
AKQAVALANG MDLTRDLGNL PGNVCTPTYL ANTAKKIAKD WGLKVDVLGL KQIQALKMGS
FLSVAKGSVE PPQFIVLQYR GAAAKAAPVV LVGKGITFDS GGISLKPGEG MDEMKYDMCG
AGSVLGTMRA VAEMGLKVNV VAIVPTCENM PAGNANKPGD IVTSMKGLTI EVLNTDAEGR
LILCDALTYA ERFKPAAVID VATLTGACII ALGHHNTGLF SKDDALAGEL LDASREAGDP
AWRLPLDDEY QDQLKSNFAD LANIGGRPAG SVTAACFLSR FAENYPWAHL DIAGTAWKSG
AAKGATGRPV PLLAQFLIDR AGA
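
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below is a hedged illustration with hypothetical names (`CODON_TABLE`, `translate`). It uses the standard codon assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons), and it assumes the reading frame starts at the first base.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGACTTTA GCATAAAAGG CTGTGATTGG AGCAAAGGCA CGGCGAACGG GTTCCTGACG"
print(translate(first_line))  # prints: MDFSIKGCDWSKGTANGFLT
```

The output corresponds to the first 20 residues of the protein sequence above (MDFSIKGCDW SKGTANGFLT).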