Gene BURPS668_A0974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0974 
Symbol 
ID4887489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp942855 
End bp944819 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content62% 
IMG OID640130914 
Productx-prolyl-dipeptidyl aminopeptidase 
Protein accessionYP_001061973 
Protein GI126444858 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.455891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATAC GGCAACGGTC GAAGATTCGT CCATCTTTCA TGCGTGGTGT CCGGCTGCTG 
GCCTTGCTCG CGCCGGTAGC AATTCAGTTG GCTGGATGCG GTGGAGATGA TTCCGTGTCG
TCGCGCGCGT CGAATGAGGG AAATCAGCCG GGCACTCCTT CCCAATCGAT ACCGTCCGGC
ACCGTGACGA CGGCTCCCGC GCCAGCCCCC TTTGCGCCAC CAACGCCGGT GGCAGAAGTC
GGCGCGCAAT TCGAGCGTTC ACCGACCGGG TTGCCGTATC CGAAGCTTGC AACCTTGTAT
CCGGGACACG ACGGCCCAAT CGTCGACAAC GGCATGATCT TGCCTTGGCT GTCGATGCGC
CCGCCATTGA AGTCGAACGT GATGGTTCAG ACCCCGTTCG ACACTGACCA GGATGGCAAG
CTCGACCGGA TCGCGCTGCG CATCGTGCAG CCGGCCGAAG TGGCGGAGGG GCTCAAGACA
CCAGTGATTG TGCGGCCATC GGTGTACTAT GCGGACCCGA CTTACGCGAC GCAAACGCGC
GCGCCGTTTC TCGGCGAGGC GGAATATTTG CGGATGGGCT ACACGATCGT CTACGCCGAT
TCGATCGGCA CCAATCAGTC GGACGGCTGC TGGTCGGTAA TGGATCGCAC CGAGCGCGAG
GCGATGGCGA GCGTCGTGCG CTGGCTGACG AACGATCCCG GCGCACCAGG CTTTGACGCC
CAAGGCAAAC AGGTCGCCGC GCTCTGGTCA ACCGGGCACG TCGCGATGGA AGGGGTTTCC
TACGGCGGCA CGCTGCCCAC GATGGTCGCG GCGACGGGGG TGCCGGGGCT CGAGGCGATC
GTGCCGGTAG AGGGCATCAG CAGCGGGTAC GACTATTTCC GCTACAACGG CGTGATCGCT
GATATCGACA ATACGGTATC GCTCGGCAGC TACATGAAAT CCCAGCAGGC GTTTGCTCGT
TCGTCAATCT GCGAACCCGC GCGCGTCGCG GCCGTCACGG CTTCCGACGA CGCAACCTAT
GCGTACAACG ATTTCTGGAA GGTGCGCAAC ACGGTGTCGC TGGTCGACCG GATTCAAGCC
GCTACGCTGA TCGCACAGGG CCAAGCCGAC AACAACGTCA AGACGAAGAA CGCGACGCAA
CTGTACGACG CGTTGTATCG CGCGAAGAAA CCTGTGCAGC TCTGGCTGCA CAGTCGAAAT
CACGACGATC CCGCATGGCA AAAGGAATGG CAGAAGCAGA TCGCGATGTG GTACTCGCGC
TATCTGTTTG GTGTCAACAA CGGTGTCGAG ACGCAGCCGA CGTACGTGCG GGAGACGCCG
ACGGGTGACA TCCCTGTTGG CGCGACGCTT CCTCCCGATC CGAACGACAC GAGCGACACG
TTGATCGGCC ACTGCCATTC GGGACACAAT CCACGCGACT GCATTCCAAC GGGCGAGCTG
TTCATCAAGG AGGATGCATG GCCGAAGACG GTCGACACGT TCTACCATCT GCGCGGCGAT
GGCCGGGTGG GAGGACTGCT GACGCCGAGC CCGGCGGACG GCACGCAGGC AGCCTCGGTT
GACTTGAGCA ACGCAACTGC CGTGACCTAC GAGACGAAAT CACTCGCGAA CGTGACCCGC
TACGCCGGCG CAATCAGGGT CGCGATGCGT GGCCGCTTTG CTCCGGCCGT CAGCAACATC
AAGGCCACAT TGTCAGTGGA TGGGCACGAC GTTACGTACG GCTGGGCAAA TCCGCGCTTC
TACAAGGGGC TGGATGTCGC ACAACTGATC GTGCCGAACA CGGACTACGA TTTCACGCTG
GAGATGATGC CACGCGATTT CACGGTCCTG CCGGGTAGCA AGGTCACGTT GAAGCTCGAG
GGCTACCAGG GCACGTCACT GGTGACGCTC GATCTGTCGC ATACCGTGCT CGCAATGCCG
GTTGTTCCGA AAGCACACGT GGCGGCAGTC ATGGTGGGAA AGTAG
 
Protein sequence
MEIRQRSKIR PSFMRGVRLL ALLAPVAIQL AGCGGDDSVS SRASNEGNQP GTPSQSIPSG 
TVTTAPAPAP FAPPTPVAEV GAQFERSPTG LPYPKLATLY PGHDGPIVDN GMILPWLSMR
PPLKSNVMVQ TPFDTDQDGK LDRIALRIVQ PAEVAEGLKT PVIVRPSVYY ADPTYATQTR
APFLGEAEYL RMGYTIVYAD SIGTNQSDGC WSVMDRTERE AMASVVRWLT NDPGAPGFDA
QGKQVAALWS TGHVAMEGVS YGGTLPTMVA ATGVPGLEAI VPVEGISSGY DYFRYNGVIA
DIDNTVSLGS YMKSQQAFAR SSICEPARVA AVTASDDATY AYNDFWKVRN TVSLVDRIQA
ATLIAQGQAD NNVKTKNATQ LYDALYRAKK PVQLWLHSRN HDDPAWQKEW QKQIAMWYSR
YLFGVNNGVE TQPTYVRETP TGDIPVGATL PPDPNDTSDT LIGHCHSGHN PRDCIPTGEL
FIKEDAWPKT VDTFYHLRGD GRVGGLLTPS PADGTQAASV DLSNATAVTY ETKSLANVTR
YAGAIRVAMR GRFAPAVSNI KATLSVDGHD VTYGWANPRF YKGLDVAQLI VPNTDYDFTL
EMMPRDFTVL PGSKVTLKLE GYQGTSLVTL DLSHTVLAMP VVPKAHVAAV MVGK