Gene BURPS668_3358 details

Gene Information

Locus tag: BURPS668_3358
Symbol: pepP
ID: 4883314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3293860
End bp: 3295269
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 72%
IMG OID: 640129286
Product: xaa-pro aminopeptidase
Protein accession: YP_001060369
Protein GI: 126439525
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
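
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3293860, 3295269)
print(length)                     # 1410
print(protein_length_aa(length))  # 469
```

Both values agree with the Gene Length and Protein Length fields of this record.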


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATGAATC AGCCGACCGA ACCCGCCATC GCCCTCGACG TCTACCGCCA GCGCCGCGAC 
CGCGTGCTGG CCTCGCTGCG CGCGCAAGGC GGCGGCGTCG CGATCGTGCC CACCGCACCG
GAAGTCCCGC GCAATCGCGA CAGCGACTAT CCGTACCGGC ACGACAGCTA CTTCTACTAC
CTGACGGGCT TCGCCGAGCC CGACGCGCTG CTCGTCCTCG ACGCGTCGGC GGCCGGCGAC
GCGCCGCGCT CGATCCTGTT CTGCCGCGCG AAGAATCCCG AGCGAGAAAT CTGGGAAGGG
TTCCATTTCG GGCCCGAAGG CGCGCGCGAT GCGTTCGGCT TCGACGCCGC GTTCCCGTAC
GACGCGCTCG ATGCCGAAAT GCCGCGCATC GTCGCCGACG CGCCCGCGCT CCACTACCGC
TTCGGCGTGT CGGCCGCTTT CGACGCGCGC CTGAACGGCT GGCTCGACGC GGTGCGCGCG
CGTGCGCGCG CCGGCGTCGC CGCGCCGGGC GCCGCGTTCG ATCTCGGGCC GCTCCTCGAT
GACATGCGGC TCGTCAAGGA TGCGCACGAG CAGGCAACGA TGCGCCGCGC GGCCGACATC
TCCGCGCTCG CGCACCGCCG CGCGATGGCC GCGTGCCGCC CCGGCATCCG CGAATACGAA
CTCGAGGCCG AGCTGCTCTA CACGTTCCGC CGCCACGGCG CGCAATCGCC CGCATACGGC
TCGATCGTCG CGACGGGCGC GAACGCATGC GTGCTCCACT ATCCGGCCGG CAACGCCGTC
GTCGCCGACG GCGAGCTCGT GCTGATCGAC GCCGCGTGCG AGCTCGACGG CTACGCATCC
GACATCACCC GCACGTTTCC GGCGAACGGC CGCTTCTCGG GCCCGCAACG CGCGCTTTAT
GGCATCGTGC TCGCCGCTCA GGAAGCGGCG ATCGCGGCGA CGCGCGCCGG CACGCCGTTC
GACGCGCCGC ACGACGCGGC GGTGCGCGTG CTCGCGCAGG GCATGCTCGA CACGGGGCTC
GTGCCGAAGA CGCGCTTCGC GAGCGTCGAC GACGTGATCG CCGAGCGTGC GTACACGCGC
TTCTACATGC ACCGCACCGG CCACTGGCTC GGCATGGACG TGCACGACTG CGGCGACTAC
CGCGAGCGCG CCGCGCCGCG CGACGACGAC GGCGCGCTGC CCTCGCGCGT GCTGCATCCG
GGCATGGCGC TCACGATCGA GCCGGGGCTG TACGTGCGCC CGGGCGAAGA CGTGCCGCAG
GCGTTCTGGA ACATCGGCAT CCGCATCGAG GACGACGCGT TCGTCACGCC GACGGGGTGC
GAGCTGATCA CGCGCGGCGT GCCGGTGGCG GCCGACGAGA TCGAGGCATT GATGCGCGAC
GCGCGGCCGG CGCCGCGCCC GCAGCCGTGA
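
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any computation. As a rough sketch of how the GC content field could be recomputed from the sequence, assuming plain A/C/G/T characters and hypothetical helper names:

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same 10-base block layout as the record.
print(gc_fraction("GGCCGGCCAA TTAATTAACC"))  # 0.5
```

Passing the full gene sequence from this record to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.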
 
Protein sequence
MMNQPTEPAI ALDVYRQRRD RVLASLRAQG GGVAIVPTAP EVPRNRDSDY PYRHDSYFYY 
LTGFAEPDAL LVLDASAAGD APRSILFCRA KNPEREIWEG FHFGPEGARD AFGFDAAFPY
DALDAEMPRI VADAPALHYR FGVSAAFDAR LNGWLDAVRA RARAGVAAPG AAFDLGPLLD
DMRLVKDAHE QATMRRAADI SALAHRRAMA ACRPGIREYE LEAELLYTFR RHGAQSPAYG
SIVATGANAC VLHYPAGNAV VADGELVLID AACELDGYAS DITRTFPANG RFSGPQRALY
GIVLAAQEAA IAATRAGTPF DAPHDAAVRV LAQGMLDTGL VPKTRFASVD DVIAERAYTR
FYMHRTGHWL GMDVHDCGDY RERAAPRDDD GALPSRVLHP GMALTIEPGL YVRPGEDVPQ
AFWNIGIRIE DDAFVTPTGC ELITRGVPVA ADEIEALMRD ARPAPRPQP
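
The protein sequence begins with M even though the gene sequence begins with GTG, because under translation table 11 an alternative start codon is read as methionine when it initiates translation. The sketch below illustrates this with a hand-built codon table. It assumes that table 11 assigns amino acids to internal codons exactly as the standard code does, and it treats only ATG, GTG and TTG as start codons, which is a simplification of the full table 11 list. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares for non-initiating codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(raw: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at the first stop."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating GTG/TTG is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGATGAATC AGCCGACCGA ACCCGCCATC"))  # MMNQPTEPAI
```

The output matches the first ten residues of the protein sequence above. Running the function on the complete gene sequence is one way to compare the translation with the full 469-residue record.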