Gene BURPS1710b_A1728 details


Gene Information

Locus tag: BURPS1710b_A1728
Symbol:
ID: 3694472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2107168
End bp: 2108478
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 70%
IMG OID: 637731981
Product: proline iminopeptidase
Protein accession: YP_336884
Protein GI: 76818974
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
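
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2107168
END_BP = 2108478
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the reported values: 2108478 - 2107168 + 1 = 1311 bp, and 1311 / 3 - 1 = 436 aa.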


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGCGCGG CGTTCGCCAC GCATGCGCGC GCGCGTCGAG CCGGCGCCCG CGCGCGCAAG 
CATTCATGCA TGCATCCATG CGGCGATGCA TTCGTGCGCG CCTTCGGCAG TCGACGCGCC
ACGCCGCGCC GCGACGAACG GCGCGCGAGG CGGCCGGGCG GCGGCTCGGC CACCTCCGTG
CAAGCGCGTC CCCGCGTTTT CCGGCGACTC GGCATAATGA AGCGTCGCTT TCGTCGCCGG
CGCCGCATCG GCGCGAGCCA ACGCGGCCGG CGCATCGCAT GGGGCGCACG CATGCGCCGC
GCGGCCGTTC CATTCATCGC GTTCGGCGAG GCACCCCCAG TCGTCTTCTT CCATTCAACC
GGAGCGTCTC TCTTGTATCC ACCGATCGAA CCTTATGCAC ACGGCTTCCT CGATACCGGC
GACGGCCATC GCGTGTACTG GGAGCTGTGC GGCAACCCCA ACGGCAAGCC GGCCGTCTTC
CTGCACGGCG GCCCCGGCAG CGGCTGCAGC GCCGATCACC GTCGCCTCTT CGATCCCGCG
CGCTACAACG TGCTGCTGTT CGACCAGCGC GGCTGCGGCC GCTCGACGCC GCACGCGAGC
CTCGAGAACA ACACGACATG GCATCTCGTC GACGACATCG AGCGGCTGCG CGCGATGATC
GGCGTCGAGC GCTGGCTCGT GTTCGGCGGC TCGTGGGGCA GCGCGCTCGC GCTCGCATAT
GCGCAAACGC ACCCGGCGCG CGTGGCCGAG CTCGTCGTGC GCGGCATCTT CACGGTGCGC
CGGTCCGAGC TGCTCTGGTA CTACCAGGAA GGCGCGTCGT GGCTGTTCCC GGATCTGTGG
GAAGACTTCA TCGCGCCCAT TCCGAGCGCC GAGCGCGCGG ATCTGATCGC CGCGTATCGC
CGCCGGCTGA CGGGCGACGA CGAGGCGGCC AAGCGCGAGG CCGCGCGCGC GTGGAGCGTC
TGGGAGGGCC GGACGATCGC GCTGCTGCCG AACGCCGCGC ACGAAACGTA TTTCGGCGAC
GCGCATTTCG CGCTCGCGTT CGCCCGCATC GAAAACCACT ACTTCGTTCA TCAAGGCTTC
ATGGAAGACG GGCAGTTGCT GCGCGATGCG CATCGTCTCG CGGACATCCC GGGCGTGATC
GTTCAGGGGC GCTACGACGT CGCGACGCCG GCGCGCACCG CGTGGGAACT CGCGAAGGCG
TGGCCGCGCG CGTCGCTCGA GATCGTGCCC GACGCGGGGC ACGCATACGA CGAGCCGGGC
ATTCTGCGCG CGCTGATCGC GGCGACCGAC CGCTTCGCGC GCGAGCGCTG A
 
Protein sequence
MRAAFATHAR ARRAGARARK HSCMHPCGDA FVRAFGSRRA TPRRDERRAR RPGGGSATSV 
QARPRVFRRL GIMKRRFRRR RRIGASQRGR RIAWGARMRR AAVPFIAFGE APPVVFFHST
GASLLYPPIE PYAHGFLDTG DGHRVYWELC GNPNGKPAVF LHGGPGSGCS ADHRRLFDPA
RYNVLLFDQR GCGRSTPHAS LENNTTWHLV DDIERLRAMI GVERWLVFGG SWGSALALAY
AQTHPARVAE LVVRGIFTVR RSELLWYYQE GASWLFPDLW EDFIAPIPSA ERADLIAAYR
RRLTGDDEAA KREAARAWSV WEGRTIALLP NAAHETYFGD AHFALAFARI ENHYFVHQGF
MEDGQLLRDA HRLADIPGVI VQGRYDVATP ARTAWELAKA WPRASLEIVP DAGHAYDEPG
ILRALIAATD RFARER
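
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the sequence is given in the coding orientation, that the first codon is read as methionine regardless of its identity (here the gene starts with GTG), and that translation stops at the first stop codon. The amino-acid assignments are those of the standard codon table, which translation table 11 shares for non-initiator codons. The names `translate_cds` and `clean` are hypothetical helpers.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order (as used for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon is always read as methionine.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First line of the gene sequence from this record.
FIRST_60_NT = "GTGCGCGCGG CGTTCGCCAC GCATGCGCGC GCGCGTCGAG CCGGCGCCCG CGCGCGCAAG"

if __name__ == "__main__":
    print(translate_cds(FIRST_60_NT))  # MRAAFATHARARRAGARARK
```

The output for the first 60 nucleotides matches the first 20 residues of the protein sequence listed above (MRAAFATHAR ARRAGARARK).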