Gene Information |
Locus tag | BURPS668_A0974 |
Symbol | |
ID | 4887489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 942855 |
End bp | 944819 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640130914 |
Product | x-prolyl-dipeptidyl aminopeptidase |
Protein accession | YP_001061973 |
Protein GI | 126444858 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
|
|
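The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon, if present, encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(942855, 944819)   # Start bp and End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the result agrees with the listed Gene Length (1965 bp) and Protein Length (654 aa).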
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.455891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATAC GGCAACGGTC GAAGATTCGT CCATCTTTCA TGCGTGGTGT CCGGCTGCTG GCCTTGCTCG CGCCGGTAGC AATTCAGTTG GCTGGATGCG GTGGAGATGA TTCCGTGTCG TCGCGCGCGT CGAATGAGGG AAATCAGCCG GGCACTCCTT CCCAATCGAT ACCGTCCGGC ACCGTGACGA CGGCTCCCGC GCCAGCCCCC TTTGCGCCAC CAACGCCGGT GGCAGAAGTC GGCGCGCAAT TCGAGCGTTC ACCGACCGGG TTGCCGTATC CGAAGCTTGC AACCTTGTAT CCGGGACACG ACGGCCCAAT CGTCGACAAC GGCATGATCT TGCCTTGGCT GTCGATGCGC CCGCCATTGA AGTCGAACGT GATGGTTCAG ACCCCGTTCG ACACTGACCA GGATGGCAAG CTCGACCGGA TCGCGCTGCG CATCGTGCAG CCGGCCGAAG TGGCGGAGGG GCTCAAGACA CCAGTGATTG TGCGGCCATC GGTGTACTAT GCGGACCCGA CTTACGCGAC GCAAACGCGC GCGCCGTTTC TCGGCGAGGC GGAATATTTG CGGATGGGCT ACACGATCGT CTACGCCGAT TCGATCGGCA CCAATCAGTC GGACGGCTGC TGGTCGGTAA TGGATCGCAC CGAGCGCGAG GCGATGGCGA GCGTCGTGCG CTGGCTGACG AACGATCCCG GCGCACCAGG CTTTGACGCC CAAGGCAAAC AGGTCGCCGC GCTCTGGTCA ACCGGGCACG TCGCGATGGA AGGGGTTTCC TACGGCGGCA CGCTGCCCAC GATGGTCGCG GCGACGGGGG TGCCGGGGCT CGAGGCGATC GTGCCGGTAG AGGGCATCAG CAGCGGGTAC GACTATTTCC GCTACAACGG CGTGATCGCT GATATCGACA ATACGGTATC GCTCGGCAGC TACATGAAAT CCCAGCAGGC GTTTGCTCGT TCGTCAATCT GCGAACCCGC GCGCGTCGCG GCCGTCACGG CTTCCGACGA CGCAACCTAT GCGTACAACG ATTTCTGGAA GGTGCGCAAC ACGGTGTCGC TGGTCGACCG GATTCAAGCC GCTACGCTGA TCGCACAGGG CCAAGCCGAC AACAACGTCA AGACGAAGAA CGCGACGCAA CTGTACGACG CGTTGTATCG CGCGAAGAAA CCTGTGCAGC TCTGGCTGCA CAGTCGAAAT CACGACGATC CCGCATGGCA AAAGGAATGG CAGAAGCAGA TCGCGATGTG GTACTCGCGC TATCTGTTTG GTGTCAACAA CGGTGTCGAG ACGCAGCCGA CGTACGTGCG GGAGACGCCG ACGGGTGACA TCCCTGTTGG CGCGACGCTT CCTCCCGATC CGAACGACAC GAGCGACACG TTGATCGGCC ACTGCCATTC GGGACACAAT CCACGCGACT GCATTCCAAC GGGCGAGCTG TTCATCAAGG AGGATGCATG GCCGAAGACG GTCGACACGT TCTACCATCT GCGCGGCGAT GGCCGGGTGG GAGGACTGCT GACGCCGAGC CCGGCGGACG GCACGCAGGC AGCCTCGGTT GACTTGAGCA ACGCAACTGC CGTGACCTAC GAGACGAAAT CACTCGCGAA CGTGACCCGC TACGCCGGCG CAATCAGGGT CGCGATGCGT GGCCGCTTTG CTCCGGCCGT CAGCAACATC AAGGCCACAT TGTCAGTGGA TGGGCACGAC GTTACGTACG GCTGGGCAAA TCCGCGCTTC TACAAGGGGC TGGATGTCGC ACAACTGATC GTGCCGAACA CGGACTACGA TTTCACGCTG 
GAGATGATGC CACGCGATTT CACGGTCCTG CCGGGTAGCA AGGTCACGTT GAAGCTCGAG GGCTACCAGG GCACGTCACT GGTGACGCTC GATCTGTCGC ATACCGTGCT CGCAATGCCG GTTGTTCCGA AAGCACACGT GGCGGCAGTC ATGGTGGGAA AGTAG
|
Protein sequence | MEIRQRSKIR PSFMRGVRLL ALLAPVAIQL AGCGGDDSVS SRASNEGNQP GTPSQSIPSG TVTTAPAPAP FAPPTPVAEV GAQFERSPTG LPYPKLATLY PGHDGPIVDN GMILPWLSMR PPLKSNVMVQ TPFDTDQDGK LDRIALRIVQ PAEVAEGLKT PVIVRPSVYY ADPTYATQTR APFLGEAEYL RMGYTIVYAD SIGTNQSDGC WSVMDRTERE AMASVVRWLT NDPGAPGFDA QGKQVAALWS TGHVAMEGVS YGGTLPTMVA ATGVPGLEAI VPVEGISSGY DYFRYNGVIA DIDNTVSLGS YMKSQQAFAR SSICEPARVA AVTASDDATY AYNDFWKVRN TVSLVDRIQA ATLIAQGQAD NNVKTKNATQ LYDALYRAKK PVQLWLHSRN HDDPAWQKEW QKQIAMWYSR YLFGVNNGVE TQPTYVRETP TGDIPVGATL PPDPNDTSDT LIGHCHSGHN PRDCIPTGEL FIKEDAWPKT VDTFYHLRGD GRVGGLLTPS PADGTQAASV DLSNATAVTY ETKSLANVTR YAGAIRVAMR GRFAPAVSNI KATLSVDGHD VTYGWANPRF YKGLDVAQLI VPNTDYDFTL EMMPRDFTVL PGSKVTLKLE GYQGTSLVTL DLSHTVLAMP VVPKAHVAAV MVGK
|
| |
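The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the pipeline the database used. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for all 64 codons (the two differ only in which codons may act as starts). It assumes the input is the + strand CDS in frame, and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10; strip them.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("ATGGAAATAC GGCAACGGTC GAAGATTCGT"))
```

The first 30 bases translate to the first block of the protein sequence, MEIRQRSKIR, and the final bases of the gene (ATGGTGGGAAAGTAG) give MVGK followed by the stop codon.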