Gene BURPS1710b_A2222 details


Gene Information

Locus tag: BURPS1710b_A2222
Symbol: pepX
ID: 3694222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2706335
End bp: 2708299
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 62%
IMG OID: 637732476
Product: x-prolyl-dipeptidyl aminopeptidase
Protein accession: YP_337373
Protein GI: 76819718
COG category: [R] General function prediction only
COG ID: [COG2936] Predicted acyl esterases
TIGRFAM ID: [TIGR00976] putative hydrolase, CocE/NonD family
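
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are invented for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2706335
end_bp = 2708299
gene_length_bp = 1965
protein_length_aa = 654

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final stop codon adds no residue.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under those assumptions all three checks hold for the values listed here.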


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0212036
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAATAC GGCAACGGTC GAAGATTCGT CCATCTTTCA TGCGTGGTGT CCGGCTGCTG 
GTCTTGCTCG CGCCGATAGC AATTCAGTTG GCTGGATGCG GTGGAGATGA TTCCGTGTCG
TCGCGCGCGT CGAATGAGGG AAGTCAGCCG GGCACTCCTT CCCAATCGAT ACCGTCCGGC
ACCGTGACGA CGGCTCCCGC GCCAGCCCCC TTTGCGCCAC CAACGCCGGT GGCAGAAGTC
GGCGCGCAAT TCGAGCGTTC ACCGACCGGG TTGCCGTATC CGAAGCTTGC AACCTTGTAT
CCGGGACACG ACGGCCCAAT CGTCGACAAC GGCATGATCT TGCCTTGGCT GTCGATGCGC
CCGCCATTGA AGTCGAACGT GATGGTTCAG ACCCCGTTCG ACACTGACCA GGATGGCAAG
CTCGACCGGA TCGCGCTGCG CATCGTGCAG CCGGCCGAAG TGGCGGAGGG GCTCAAGACA
CCGGTGATTG TGCGGCCATC GGTGTACTAT GCGGACCCGA CTTACGCGAC GCAAACGCGC
GCGCCGTTTC TCGGCGAGGC GGAATATTTG CGGATGGGCT ACACGATCGT CTACGCCGAT
TCGATCGGCA CCAATCAGTC GGACGGCTGC TGGTCGGTAA TGGATCGTAC CGAGCGCGAG
GCGATGGCGA GCGTCGTGCG CTGGCTGACG AACGATCCCG GCGCACCAGG CTTTGACGCC
CAAGGCAAAC AGGTCGCCGC GTCCTGGTCA ACCGGGCACG TCGCGATGGA AGGGGTTTCC
TACGGCGGCA CGCTGCCCAC GATGGTCGCG GCGACGGGGG TGCCGGGGCT CGAGGCGATC
GTGCCGGTAG AGGGCATCAG CAGCGGGTAC GACTATTTCC GCTACAACGG CGTGATCGCT
GATATCGACA ATACGGTATC GCTCGGCAGC TACATGAAAT CCCAGCAGGC GTTTGCTCGT
TCGTCAATCT GCGAACCCGC GCGCGTCGCG GCCGTCACGG CTTCCGACGA CGCAACCTAT
GCGTACAACG ATTTCTGGAA GGTACGCAAC ACGGTGTCGC TGGTCGACCG GATTCAAGCC
GCTACGCTGA TCGCACAGGG CCAAGCCGAC AACAACGTCA AGACGAAGAA CGCGACACAA
CTGTACGACG CGTTGTATCG CGCGAAGAAA CCTGTGCAGC TCTGGCTGCA CAGTCGAAAT
CACGACGATC CCGCATGGCA AAAGGAATGG CAGAAGCAGA TCGCGATGTG GTACTCACGC
TATCTGTTTG GTGTCAACAA CGGTGTCGAG ACGCAGCCGA CGTACGTGCG GGAGACGCCG
ACGGGTGACA TCCCTGTCGG CGCGACGCTT CCTCCCGATC CGAACGACAC GAGCGACACG
TTGATCGGCC ACTGCCATTC GGGACACAAT CCACGCGACT GCATTCCAAC GGGCGAGCTG
TTCATCAAGG AGGATGCATG GCCGAAGACG GTCGACACGT TCTACCATCT GCGCGGCGAT
GGCCGGGTGG GAGGACTGCT GACGCCGAGC CCGGCGGACG GCACGCAGGC AGCCTCGGTT
GACTTGAGCA ACGCAACTGC CGTGACCTAC GAGACGAAAT CACTCGCGAA CGTGACCCGC
TACGCCGGCG CGATCAGGGT CGCGATGCGT GGCCGCTTTG CTCCGGCCGT CAGCAACATC
AAGGCCACAT TGTCAGTGGA TGGGCACGAC GTTACGTACG GCTGGGCAAA TCCGCGCTTC
TACAAGGGCC TGGATGTCGC ACAACTGATC GTGCCGAACA CGGACTACGA TTTCACGCTG
GAGATGATGC CACGCGATTT CACGGTCCTG CCGGGTAGCA AGGTCATGTT GAAGCTCGAG
GGCTACCAGG GCACGTCACT GGTGACGCTC GATCTGTCGC ATACCGTGCT CGCAATGCCG
GTTGTTCCGA AAGCACACGT GGCGGCAGTC ATGGTGGGAA AGTAG
 
Protein sequence
MEIRQRSKIR PSFMRGVRLL VLLAPIAIQL AGCGGDDSVS SRASNEGSQP GTPSQSIPSG 
TVTTAPAPAP FAPPTPVAEV GAQFERSPTG LPYPKLATLY PGHDGPIVDN GMILPWLSMR
PPLKSNVMVQ TPFDTDQDGK LDRIALRIVQ PAEVAEGLKT PVIVRPSVYY ADPTYATQTR
APFLGEAEYL RMGYTIVYAD SIGTNQSDGC WSVMDRTERE AMASVVRWLT NDPGAPGFDA
QGKQVAASWS TGHVAMEGVS YGGTLPTMVA ATGVPGLEAI VPVEGISSGY DYFRYNGVIA
DIDNTVSLGS YMKSQQAFAR SSICEPARVA AVTASDDATY AYNDFWKVRN TVSLVDRIQA
ATLIAQGQAD NNVKTKNATQ LYDALYRAKK PVQLWLHSRN HDDPAWQKEW QKQIAMWYSR
YLFGVNNGVE TQPTYVRETP TGDIPVGATL PPDPNDTSDT LIGHCHSGHN PRDCIPTGEL
FIKEDAWPKT VDTFYHLRGD GRVGGLLTPS PADGTQAASV DLSNATAVTY ETKSLANVTR
YAGAIRVAMR GRFAPAVSNI KATLSVDGHD VTYGWANPRF YKGLDVAQLI VPNTDYDFTL
EMMPRDFTVL PGSKVMLKLE GYQGTSLVTL DLSHTVLAMP VVPKAHVAAV MVGK
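
Both sequences are printed in space-separated blocks of ten, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers written for this example. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; table 11 also allows alternative start codons, which this sketch does not handle (this gene starts with ATG). For brevity the demonstration uses only the first 60 bases, so the GC fraction it prints is not the whole-gene figure reported in the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGAAATAC GGCAACGGTC GAAGATTCGT CCATCTTTCA TGCGTGGTGT CCGGCTGCTG"
# First two blocks of the protein sequence, copied from the record.
protein_start = clean("MEIRQRSKIR PSFMRGVRLL")

print(len(clean(first_line)))
print(round(gc_fraction(first_line), 3))
print(translate(first_line) == protein_start)
```

Passing the full gene sequence to the same functions would allow the listed protein sequence and GC content to be checked in the same way.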