Gene BURPS1106A_A1824 details

Gene Information

Locus tag: BURPS1106A_A1824
Symbol:
ID: 4905436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1789797
End bp: 1792127
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 68%
IMG OID: 640144930
Product: prolyl oligopeptidase family protein
Protein accession: YP_001075858
Protein GI: 126455706
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
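
The coordinates and lengths listed above are mutually consistent. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither assumption is stated on this page; the variable names are illustrative).

```python
# Values copied from the Gene Information table.
start_bp = 1789797
end_bp = 1792127

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2331, matching "Gene Length"
print(protein_length)  # 776, matching "Protein Length"
```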


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATAGTCG GAATCGCGCA TGGATACGGG CGAAGCGGAA AACGGCACGG CGTCGGCACC 
GGCTTCGGAC AGCCCCGGCA CGACGAAGGC ACGCTTGCGA CGTGTACGCG GCGCGCCACG
CGCGCGCCGC GAATGCCTGC GGCGGACGCG TCGCAATCGA CGAGCCGGCA GCCAAGCGCG
AGCGCCGCTC GTCTTACGCT ACCATTACAC GCCATGCCCC TTGCAAAGAA CGCCATGCCT
CATGCTTCCT GGCCCGAACA GGCCGATCCG CATCAATTCC TCGAAGAACT GGACAGCGCC
GCGAGCGTCG GCTGGGTCGA CGCGCAAAAC GCCCGCACGC ACGATGCGCC CTGGCTCGAC
GAAGCGCACT ATCGCGCGCT GGTCGAGCGC TTCACCCGGG CGCTGCTGCC GCGCGAGCGC
CCGGTGATTC CGCAGCGCTG GCAGGACTGG GCGTACGACG TCTGGCAGGA CGAACAGCAT
CCGAAGGGCC TGTGGCGGCG CACGCGATGG ACGAGCTGGC GCAGCGGCCA CGCGGACTGG
CAGACGTTGA TCGACCTCGA TGCGCTTGGT GAAGCGCAAG GCGTGCAGTG GGTGTTCGAC
GATCAGCTCA TCCTCGAGCC GGACGGCGAT CGTGCGCTGA TCGTGCTGTC CGACGGCGGC
GCCGACGCGG TCGTCGTCCG CGAGTTCGAC ATCGCGCAAT GCCGGTTCGT CGACGACGGC
TTCTCGATCG AAGCGGCCGG CAAGCATTCG GTCGAATGGA TCGATCGCGA CACGATCTAC
GTCGGCTGGG ACGACGGCGG CGCCACCGTC ACGCGCTCCG GCTATCCGCG CGAAGTGCGG
CGCTGGACGC GCGGCACGCC GCTGTCCAGC GCGCCCGTGG TGTTTCGCGG CGCGCGCGGC
GACATCTCGG TCGATGCGCA ATATGATCCG CTCGACCGGC ATCACGCGAT CGAGCAGGCG
ATCAATTTCT ACGACGCGAA CACGTATCGC CTCGCCGAGG ACGGCGCGTG GGCGCGCTAC
GACGTGCCAC CGCACGTCGA AGTCGGTTAC TGGAGCGGGT GGCTGCTGCT CCAGCCGCGG
CTCGACTGGA CTTGCGGCGG CGCGCGCTAC GCGGGCGGCA GCCTGCTCGC GATCCGCGAG
GACGCGTTCG TCGCCGGTGA GCGCGCGTTC GCCGCGCTGT TCGAGCCGAA CGAGCGCACG
TCCGCATGCG GCTGGACGCA CACGCGCCGC TACGTGCTGG TGTCGTGGCT CGACGACGTG
CTCACGCGCA CGATGCTCTG GCTTCCCGAA CGTCAGGATG ACGGAGCATG GCGCTGGCAT
GCTCGTCCGT TCCCCGCGCG AGGGCTCGCG CAAGTGGACG TGTCGCCCGT CGAGCCCACG
TTCGACGACG AGGTGTACGT GAGCGTCGAC GATTACCTGA AGCCGCCCGA GTATTCGCTC
GCGAATCTCG CCAGCGACGA CCTGTCCGCC TGGACGCTGC TCGACCGCTG GCCGACGCAG
TTCGACGCGT CCGAACTGAC GGTGCGGCGC GAACACGCGC GCTCGCGCGA CGGCACGCTC
GTGCCTTATA CGCTGGTCGG GCCGCGCGAC GTGCTGGACA ATGCGGCGCG CGCGCCGCGC
CCCTGCCTGT TGAACGGCTA CGGCGGCTTC GCGATTGCGC TCACGCCCGA TTACGATCCG
TTGCTCGGCA TCGGCTGGCT CGAGAAAGGC GGCATCGCGG TGTTCGCCCA TATTCGCGGC
GGCGGCGAGT TCGGCACGCA GTGGCACGAA TCGGCGCGGC AAACGCAACG GCAGCGATCG
TTCGACGATT TCATCGCGGT CGCCGAAAAA CTCGTCGCGG ACGGCGTGAC GAGCGCCGCG
CAACTGGGTA TTCGCGGCGG CAGCAACGGC GGGCTGCTGG TCGCGGCATG CATGATTCAG
CGCCCGGACC TGTTCGGCGC GGTGGTGAGC GACGTGCCGC TTCTCGACAT GCAGCGCTAT
GCGCTGCTGC ACGCGGGCGC ATCGTGGCTG GACGAATTCG GCGATCCCGA CGATCCGGCG
CATGCGTCGG CGCTCGCGGC CTACTCGCCG TATCACCGGG TCGCGCGCGA CATCGCGTAT
CCGCCCGCGC TGTTCACGAC ATCGACGAGC GACGACCGCG TGCATCCCGC CCATGCGAGA
AAAATGGTCG CGCGCATGCA GGCGCAAGGG CACCGAAACG TATGGCTGAT CGAGAAAACC
GATGGCGGCC ACGGCAGCGC GGACGCGATC GATACCGCCG AGCACGAAGC GATCGGCTAT
GTGTTTCTGT GGACTCACTT GTCCCGCGGC GCGCATGACG CGCGCGAGTG A
 
Protein sequence
MIVGIAHGYG RSGKRHGVGT GFGQPRHDEG TLATCTRRAT RAPRMPAADA SQSTSRQPSA 
SAARLTLPLH AMPLAKNAMP HASWPEQADP HQFLEELDSA ASVGWVDAQN ARTHDAPWLD
EAHYRALVER FTRALLPRER PVIPQRWQDW AYDVWQDEQH PKGLWRRTRW TSWRSGHADW
QTLIDLDALG EAQGVQWVFD DQLILEPDGD RALIVLSDGG ADAVVVREFD IAQCRFVDDG
FSIEAAGKHS VEWIDRDTIY VGWDDGGATV TRSGYPREVR RWTRGTPLSS APVVFRGARG
DISVDAQYDP LDRHHAIEQA INFYDANTYR LAEDGAWARY DVPPHVEVGY WSGWLLLQPR
LDWTCGGARY AGGSLLAIRE DAFVAGERAF AALFEPNERT SACGWTHTRR YVLVSWLDDV
LTRTMLWLPE RQDDGAWRWH ARPFPARGLA QVDVSPVEPT FDDEVYVSVD DYLKPPEYSL
ANLASDDLSA WTLLDRWPTQ FDASELTVRR EHARSRDGTL VPYTLVGPRD VLDNAARAPR
PCLLNGYGGF AIALTPDYDP LLGIGWLEKG GIAVFAHIRG GGEFGTQWHE SARQTQRQRS
FDDFIAVAEK LVADGVTSAA QLGIRGGSNG GLLVAACMIQ RPDLFGAVVS DVPLLDMQRY
ALLHAGASWL DEFGDPDDPA HASALAAYSP YHRVARDIAY PPALFTTSTS DDRVHPAHAR
KMVARMQAQG HRNVWLIEKT DGGHGSADAI DTAEHEAIGY VFLWTHLSRG AHDARE
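
The gene sequence starts with GTG, yet the protein sequence starts with M. Under the bacterial translation table (table 11), an initiator codon at the first position is read as methionine. The sketch below illustrates this on the first ten codons only. It assumes the standard codon assignments, which table 11 shares with the standard code, and a small illustrative set of initiator codons; `translate` and `START_CODONS` are hypothetical names, not part of any database tool.

```python
# Standard codon assignments, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the initiator codons used in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a CDS, reading an initiator codon at position 0 as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed on this page.
first_block = "GTGATAGTCG GAATCGCGCA TGGATACGGG"
print(translate(first_block))  # MIVGIAHGYG
```

The result matches the first ten residues of the protein sequence above. Without the initiator rule, the first residue would be V, since GTG normally encodes valine.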