Gene BURPS1106A_1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1022 
SymbolpepA 
ID4899334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp999071 
End bp1000582 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content67% 
IMG OID640134252 
Productleucyl aminopeptidase 
Protein accessionYP_001065302 
Protein GI126452701 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTTA GCATAAAAGG CTGTGATTGG AGCAAAGGCA CGGCGAACGG GTTCCTGACG 
GGGAAATCCG ACTGCATCGT GCTGGGCGTG TTCGAGGCGC AAACCTTGTC CGGCGCGGCG
CTCGACATCG ACGAAGCCAC GAAGGGGCTC GTCTCGCGCG TGATCAAGGC GGGCGACATC
GACGGCAAGC TCGGCAAGAC CTTGTTTTTG CACGAGGTTT CGGGCATCGG CGCATCGCGC
GTGCTGCTCG TCGGCCTGGG CAGGCAGGAT GCTTTCAGCC AGAAAGCCTA CGGCGACGCG
GCAAAGGCCG CATGGCGCGC GCTGCTCGGC ACGAAAGTGG TTCAGGTCAC GTTCACGCTC
GCGCAGTTGC CCGTGCCCGA GCGCGCGTCC GACTGGGGCG TGCGCGCGGC GATTCTCGCG
CTGCGCAATG AAACGTACAA GTTCACGCAG ATGAAGAGCA AGCCTGACGC GGGCGCGCCG
GCGCTCAAGC GCGTCGTGTT CAGCGTCGAT CCGGCCGACG ACAAGGCGGC GAAGGTCGCC
GCGAAGCAGG CGGTCGCGCT CGCGAACGGG ATGGACCTCA CGCGCGACCT CGGCAATCTG
CCCGGCAACG TCTGCACGCC GACCTACCTC GCGAACACCG CGAAGAAGAT CGCGAAGGAC
TGGGGCCTGA AAGTCGACGT GCTGGGCCTG AAGCAGATCC AGGCGCTCAA GATGGGCTCG
TTCCTGTCGG TCGCGAAGGG CTCGGTCGAG CCGCCGCAGT TCATCGTGCT GCAGTACCGG
GGCGCGGCCG CGAAGGCGGC GCCCGTCGTG CTCGTCGGCA AGGGCATCAC GTTCGACTCC
GGCGGCATTT CGCTGAAGCC GGGCGAGGGA ATGGACGAGA TGAAGTACGA CATGTGCGGC
GCGGGCTCGG TGCTCGGCAC GATGCGCGCG GTCGCCGAAA TGGGCCTGAA GGTCAACGTC
GTCGCGATCG TGCCGACCTG CGAGAACATG CCGGCCGGCA ACGCGAACAA GCCGGGCGAC
ATCGTCACGA GCATGAAGGG CCTGACGATC GAGGTGCTCA ACACCGACGC GGAAGGCCGC
CTCATCCTGT GCGACGCGCT CACGTACGCG GAGCGCTTCA AGCCGGCCGC CGTGATCGAC
GTCGCGACGC TGACGGGCGC GTGCATCATC GCGCTCGGCC ACCACAACAC CGGCCTTTTC
TCGAAGGACG ACGCGCTCGC GGGCGAGCTG CTCGACGCGT CGCGCGAAGC GGGCGATCCG
GCGTGGCGCC TGCCGCTCGA CGACGAGTAT CAGGATCAGC TGAAGTCGAA CTTCGCGGAT
CTCGCAAACA TCGGCGGGCG CCCGGCCGGC AGCGTGACGG CCGCGTGCTT CCTGTCGCGC
TTCGCGGAAA ACTATCCGTG GGCGCACCTC GACATCGCGG GCACCGCCTG GAAGAGCGGC
GCGGCGAAGG GGGCGACGGG CCGCCCCGTG CCGCTCCTCG CGCAATTCCT GATCGACCGC
GCCGGCGCGT GA
 
Protein sequence
MDFSIKGCDW SKGTANGFLT GKSDCIVLGV FEAQTLSGAA LDIDEATKGL VSRVIKAGDI 
DGKLGKTLFL HEVSGIGASR VLLVGLGRQD AFSQKAYGDA AKAAWRALLG TKVVQVTFTL
AQLPVPERAS DWGVRAAILA LRNETYKFTQ MKSKPDAGAP ALKRVVFSVD PADDKAAKVA
AKQAVALANG MDLTRDLGNL PGNVCTPTYL ANTAKKIAKD WGLKVDVLGL KQIQALKMGS
FLSVAKGSVE PPQFIVLQYR GAAAKAAPVV LVGKGITFDS GGISLKPGEG MDEMKYDMCG
AGSVLGTMRA VAEMGLKVNV VAIVPTCENM PAGNANKPGD IVTSMKGLTI EVLNTDAEGR
LILCDALTYA ERFKPAAVID VATLTGACII ALGHHNTGLF SKDDALAGEL LDASREAGDP
AWRLPLDDEY QDQLKSNFAD LANIGGRPAG SVTAACFLSR FAENYPWAHL DIAGTAWKSG
AAKGATGRPV PLLAQFLIDR AGA