Gene Information |
Locus tag | BURPS1710b_A2222 |
Symbol | pepX |
ID | 3694222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 2706335 |
End bp | 2708299 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637732476 |
Product | x-prolyl-dipeptidyl aminopeptidase |
Protein accession | YP_337373 |
Protein GI | 76819718 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
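The length fields above follow from the coordinates. The sketch below is a minimal consistency check, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon; the function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2706335, 2708299)
print(length)                     # 1965
print(protein_length_aa(length))  # 654
```

Both values agree with the Gene Length and Protein Length fields of this record.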
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0212036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGAAATAC GGCAACGGTC GAAGATTCGT CCATCTTTCA TGCGTGGTGT CCGGCTGCTG GTCTTGCTCG CGCCGATAGC AATTCAGTTG GCTGGATGCG GTGGAGATGA TTCCGTGTCG TCGCGCGCGT CGAATGAGGG AAGTCAGCCG GGCACTCCTT CCCAATCGAT ACCGTCCGGC ACCGTGACGA CGGCTCCCGC GCCAGCCCCC TTTGCGCCAC CAACGCCGGT GGCAGAAGTC GGCGCGCAAT TCGAGCGTTC ACCGACCGGG TTGCCGTATC CGAAGCTTGC AACCTTGTAT CCGGGACACG ACGGCCCAAT CGTCGACAAC GGCATGATCT TGCCTTGGCT GTCGATGCGC CCGCCATTGA AGTCGAACGT GATGGTTCAG ACCCCGTTCG ACACTGACCA GGATGGCAAG CTCGACCGGA TCGCGCTGCG CATCGTGCAG CCGGCCGAAG TGGCGGAGGG GCTCAAGACA CCGGTGATTG TGCGGCCATC GGTGTACTAT GCGGACCCGA CTTACGCGAC GCAAACGCGC GCGCCGTTTC TCGGCGAGGC GGAATATTTG CGGATGGGCT ACACGATCGT CTACGCCGAT TCGATCGGCA CCAATCAGTC GGACGGCTGC TGGTCGGTAA TGGATCGTAC CGAGCGCGAG GCGATGGCGA GCGTCGTGCG CTGGCTGACG AACGATCCCG GCGCACCAGG CTTTGACGCC CAAGGCAAAC AGGTCGCCGC GTCCTGGTCA ACCGGGCACG TCGCGATGGA AGGGGTTTCC TACGGCGGCA CGCTGCCCAC GATGGTCGCG GCGACGGGGG TGCCGGGGCT CGAGGCGATC GTGCCGGTAG AGGGCATCAG CAGCGGGTAC GACTATTTCC GCTACAACGG CGTGATCGCT GATATCGACA ATACGGTATC GCTCGGCAGC TACATGAAAT CCCAGCAGGC GTTTGCTCGT TCGTCAATCT GCGAACCCGC GCGCGTCGCG GCCGTCACGG CTTCCGACGA CGCAACCTAT GCGTACAACG ATTTCTGGAA GGTACGCAAC ACGGTGTCGC TGGTCGACCG GATTCAAGCC GCTACGCTGA TCGCACAGGG CCAAGCCGAC AACAACGTCA AGACGAAGAA CGCGACACAA CTGTACGACG CGTTGTATCG CGCGAAGAAA CCTGTGCAGC TCTGGCTGCA CAGTCGAAAT CACGACGATC CCGCATGGCA AAAGGAATGG CAGAAGCAGA TCGCGATGTG GTACTCACGC TATCTGTTTG GTGTCAACAA CGGTGTCGAG ACGCAGCCGA CGTACGTGCG GGAGACGCCG ACGGGTGACA TCCCTGTCGG CGCGACGCTT CCTCCCGATC CGAACGACAC GAGCGACACG TTGATCGGCC ACTGCCATTC GGGACACAAT CCACGCGACT GCATTCCAAC GGGCGAGCTG TTCATCAAGG AGGATGCATG GCCGAAGACG GTCGACACGT TCTACCATCT GCGCGGCGAT GGCCGGGTGG GAGGACTGCT GACGCCGAGC CCGGCGGACG GCACGCAGGC AGCCTCGGTT GACTTGAGCA ACGCAACTGC CGTGACCTAC GAGACGAAAT CACTCGCGAA CGTGACCCGC TACGCCGGCG CGATCAGGGT CGCGATGCGT GGCCGCTTTG CTCCGGCCGT CAGCAACATC AAGGCCACAT TGTCAGTGGA TGGGCACGAC GTTACGTACG GCTGGGCAAA TCCGCGCTTC TACAAGGGCC TGGATGTCGC ACAACTGATC GTGCCGAACA CGGACTACGA TTTCACGCTG 
GAGATGATGC CACGCGATTT CACGGTCCTG CCGGGTAGCA AGGTCATGTT GAAGCTCGAG GGCTACCAGG GCACGTCACT GGTGACGCTC GATCTGTCGC ATACCGTGCT CGCAATGCCG GTTGTTCCGA AAGCACACGT GGCGGCAGTC ATGGTGGGAA AGTAG
|
Protein sequence | MEIRQRSKIR PSFMRGVRLL VLLAPIAIQL AGCGGDDSVS SRASNEGSQP GTPSQSIPSG TVTTAPAPAP FAPPTPVAEV GAQFERSPTG LPYPKLATLY PGHDGPIVDN GMILPWLSMR PPLKSNVMVQ TPFDTDQDGK LDRIALRIVQ PAEVAEGLKT PVIVRPSVYY ADPTYATQTR APFLGEAEYL RMGYTIVYAD SIGTNQSDGC WSVMDRTERE AMASVVRWLT NDPGAPGFDA QGKQVAASWS TGHVAMEGVS YGGTLPTMVA ATGVPGLEAI VPVEGISSGY DYFRYNGVIA DIDNTVSLGS YMKSQQAFAR SSICEPARVA AVTASDDATY AYNDFWKVRN TVSLVDRIQA ATLIAQGQAD NNVKTKNATQ LYDALYRAKK PVQLWLHSRN HDDPAWQKEW QKQIAMWYSR YLFGVNNGVE TQPTYVRETP TGDIPVGATL PPDPNDTSDT LIGHCHSGHN PRDCIPTGEL FIKEDAWPKT VDTFYHLRGD GRVGGLLTPS PADGTQAASV DLSNATAVTY ETKSLANVTR YAGAIRVAMR GRFAPAVSNI KATLSVDGHD VTYGWANPRF YKGLDVAQLI VPNTDYDFTL EMMPRDFTVL PGSKVMLKLE GYQGTSLVTL DLSHTVLAMP VVPKAHVAAV MVGK
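The sequences above are printed in space-separated blocks of ten. The following sketch shows one way to remove the grouping, estimate GC content, and translate the first and last codons of the gene so they can be compared with the protein sequence. It assumes the standard codon assignments, which translation table 11 shares for everything except the set of permitted start codons; the helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = ungroup(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = ungroup(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGGAAATAC GGCAACGGTC GAAGATTCGT"  # first three blocks of the gene
tail = "GTGGGAA AGTAG"                      # last four codons of the gene
print(translate(head))  # MEIRQRSKIR
print(translate(tail))  # VGK
```

The two translated fragments match the start (MEIRQRSKIR) and end (VGK) of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.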