Gene BURPS1106A_3394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3394 
SymbolpepP 
ID4901368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3311703 
End bp3313112 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content72% 
IMG OID640136620 
Productxaa-pro aminopeptidase 
Protein accessionYP_001067631 
Protein GI126451784 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATGAATC AGCCGACCGA ACCCGCCCTC GCCCTCGACG TCTACCGCCA GCGCCGCGAC 
CGCGTGCTGG CCTCGCTGCG CGCGCAAGGC GGCGGCGTCG CGATCGTGCC CACCGCACCG
GAAGTCCCGC GCAATCGCGA CAGCGACTAT CCGTACCGGC ACGACAGCTA CTTCTACTAC
CTGACGGGCT TCGCCGAGCC CGACGCGCTG CTCGTCCTCG ACGCGTCGGC GGCCGGCGAC
GCGCCGCGCT CGATCCTGTT CTGCCGCGCG AAGAATCCCG AGCGAGAAAT CTGGGAAGGG
TTCCATTTCG GGCCCGAAGC CGCGCGCGAT GCGTTCGGCT TCGACGCCGC GTTCCCGTAC
GACGCGCTCG ACGCCGAAAT GCCGCGCATC GTCGCCGACG CGCCCGCGCT CCACTACCGC
TTCGGCGTGT CGGCCGCTTT CGACGCGCGC CTGAACGGCT GGCTCGACGC GGTGCGCGCG
CGTGCGCGCG CCGGCGTCGC CGCGCCGGGC GCCGCGTTCG ATCTCGGGCC GCTCCTCGAT
GACATGCGGC TCGTCAAGGA TGCGCACGAG CAGGCAACGA TGCGCCGCGC GGCCGACATC
TCCGCGCTCG CGCACCGCCG CGCGATGGCC GCGTGCCGCC CCGGCATCCG CGAATACGAA
CTCGAGGCCG AGCTGCTCTA CACGTTCCGC CGCCACGGCG CGCAATCGCC CGCATACGGC
TCGATCGTCG CGACGGGCGC GAACGCATGC GTGCTCCACT ATCCGGCCGG CAACGCCGTC
GTCGCCGACG GCGAGCTCGT GCTGATCGAC GCCGCGTGCG AGCTCGACGG CTACGCATCC
GACATCACCC GCACGTTCCC GGCGAACGGC CGCTTCTCGG GCCCGCAACG CGCGCTTTAT
GACATCGTGC TCGCCGCTCA GGAAGCGGCG ATCGCGGCGA CGCGCGCCGG CACGCAGTTC
GACGCGCCGC ACGACGCGGC GGTGCGCGTG CTCGCGCAGG GCATGCTCGA CACGGGGCTC
GTGCCGAAGA CGCGCTTCGC GAGCGTCGAC GACGTGATCG CCGAGCGTGC GTACACGCGC
TTCTACATGC ACCGCACCGG CCACTGGCTC GGCATGGACG TGCACGACTG CGGCGACTAC
CGCGAGCGCG GCGCGCCGCG CGACGACGAC GGCGCGCTGC CCTCGCGCGT GCTGCATCCG
GGCATGGCGC TCACGATCGA GCCGGGGCTG TACGTGCGCC CGGGCGAAGA CGTGCCGCAG
GCGTTCTGGA ACATCGGCAT CCGCATCGAG GACGACGCGT TCGTCACGCC GACGGGGTGC
GAGCTGATCA CGCGCGGCGT GCCGGTGGCG GCCGACGAGA TCGAGGCATT GATGCGCGAC
GCGCGGCCGG CGCCGCGCCC GCAGCCGTGA
 
Protein sequence
MMNQPTEPAL ALDVYRQRRD RVLASLRAQG GGVAIVPTAP EVPRNRDSDY PYRHDSYFYY 
LTGFAEPDAL LVLDASAAGD APRSILFCRA KNPEREIWEG FHFGPEAARD AFGFDAAFPY
DALDAEMPRI VADAPALHYR FGVSAAFDAR LNGWLDAVRA RARAGVAAPG AAFDLGPLLD
DMRLVKDAHE QATMRRAADI SALAHRRAMA ACRPGIREYE LEAELLYTFR RHGAQSPAYG
SIVATGANAC VLHYPAGNAV VADGELVLID AACELDGYAS DITRTFPANG RFSGPQRALY
DIVLAAQEAA IAATRAGTQF DAPHDAAVRV LAQGMLDTGL VPKTRFASVD DVIAERAYTR
FYMHRTGHWL GMDVHDCGDY RERGAPRDDD GALPSRVLHP GMALTIEPGL YVRPGEDVPQ
AFWNIGIRIE DDAFVTPTGC ELITRGVPVA ADEIEALMRD ARPAPRPQP