Gene BURPS1106A_A0758 details

Gene Information

Locus tag: BURPS1106A_A0758
Symbol:
ID: 4905802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 750804
End bp: 752507
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 68%
IMG OID: 640143864
Product: metallopeptidase domain-containing protein
Protein accession: YP_001074794
Protein GI: 126457443
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
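
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 750804
END_BP = 752507
GENE_LENGTH_BP = 1704
PROTEIN_LENGTH_AA = 567


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```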


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGACAT CCCGTAAAGC ACTGCCGCTC GCGCTGGGTC TTGCCATCGG CCTCGGCGCC 
GCGCTGCCCG CGTGGGCCGA TTCGAAGGCA CCGAGCCCGC AGGAGGAAAG CCGGCGCGCG
AGCCTCACGC GCGGCGTCGT GGCGCCTGCG GAACAGGCCG GCAAGACCGG ACAGTTCCGC
CCCGGGGCCG TTGCCGTGAC GCTGGCGAGC CCGGCGTTTC ATGCGAAGAA GGCCGATGCG
GCGGCGATGG CGCGCGAGTA CGTCACCGCG CGCGCAGCGC AGCTCGGCCT CGACAAGGCC
GCGCTCGCGA ATCTCGTCGT CGCGTCCGAA CGCGCCGATA CCGCGTTCAC CGTCGTGCGC
TTCCAGCAGC GCGCCGCGGG GCTGCCCGTC TATGACAGCG ACATCGCGGT CACGGTCGCG
CCGGACGGCC GCGTGCTGTA CGTCGCGAGC AAGGCGGTGA GCGGCGTCGC GGCCGTGTCG
AGCAAGACGC AGGCGGTCGA CGAGCAGCAG GCGCTCGACC GCGCGCGCGC CTACCTCGGC
GTCGGCGGCT TCGTGAACGT GCAGTCGCAG CTCGTCGCAT TCGTCGACGG CGCGGGCACA
CATACCGCGT GGAAGGTGAG CGGCAGGCCG CAGGACAGCC TGCACGGCGA CTGGGAGCTG
ATCATCGACG CGGGCAGCGG CGAAGTGCTG CGCGCGCAAG ACAAGGCATC CTACGCAACG
GACGGCAGCG GGCTCGTGTT CCGGCCGGAT CCGTTGTCCC CGACGAAAAG CAGCTACGGC
AGCCCCGGCT TCAAGGACAA CAACGATGCG GATTCGCCGC AACTGAGCGC CGCGCGCGTG
CGCGTGACGC TCAAGGATCT GACGCAGACG AGCGGCGGCT ACAAGCTGAG CGGCCCGTAT
GCATCGTGCA TCGATTTCGA TGCGCCGCTC GACAAGGCGT GCCCGGTTCA GGCGTCGACG
ACCTTCGATT TCACACGCTC GAACCTCTAT TTCGAGGCGG TGAACGCGTA TTACCACATC
GACACGTTCC TGCGCTACGT GAACCTGACG CTCGGCATCA AGGCGTTGCC GTACCAGTAC
GCGGGCGGCG TCCAGTACGA TCCGCACGGC CAATCCGGCG ACGATAACTC GTCGTACTCG
CCGAGCTCCG GCAGGTTGTC GTTCGGGCAA GGCGGCGTCG ACGACGCGGA AGACGCGGAT
GTCGTGATTC ACGAGCTCGG CCACGGCATC CATGACTGGA TCACCAACGG CGGACTGTCG
CAGGTCGAGG GGCTGTCCGA AGGCACGGGC GACTACCTCG CGGCCGCATA CAGCCGCGAC
TTCAACCAAT GGAGCCCGTC CGACGCGCAG TATCACTGGG TCTTCAACTG GGACGGCCAC
AACGAATTCT GGGCCGGCCG CGTCACCAAT TACAACGTCG GCCGCACGTA CGCGCAGATC
CGCAATGCCG CGATCCACAC CGCCGGCCAG TACTGGGCGT CGTGCAACAT GGTCGCGCGC
GATGCGATCG GCGGCGCGGC GATGGACAAG GCTTTCCTGA AAGGATTGTC GATGACGAAC
GGCTCGACGA ACCAGAAGGC CGCGGCGCAG GCGGTGCTGA CCGCGGCGGC GGCGCTCGGC
TACAGCAGCG CGCAGCTCAA TGCGATCGGC GATGCGTACA ACAAGAGCTG CACATACGGC
GTGACCGTGC CGCAGAAGCT GTAA
 
Protein sequence
MQTSRKALPL ALGLAIGLGA ALPAWADSKA PSPQEESRRA SLTRGVVAPA EQAGKTGQFR 
PGAVAVTLAS PAFHAKKADA AAMAREYVTA RAAQLGLDKA ALANLVVASE RADTAFTVVR
FQQRAAGLPV YDSDIAVTVA PDGRVLYVAS KAVSGVAAVS SKTQAVDEQQ ALDRARAYLG
VGGFVNVQSQ LVAFVDGAGT HTAWKVSGRP QDSLHGDWEL IIDAGSGEVL RAQDKASYAT
DGSGLVFRPD PLSPTKSSYG SPGFKDNNDA DSPQLSAARV RVTLKDLTQT SGGYKLSGPY
ASCIDFDAPL DKACPVQAST TFDFTRSNLY FEAVNAYYHI DTFLRYVNLT LGIKALPYQY
AGGVQYDPHG QSGDDNSSYS PSSGRLSFGQ GGVDDAEDAD VVIHELGHGI HDWITNGGLS
QVEGLSEGTG DYLAAAYSRD FNQWSPSDAQ YHWVFNWDGH NEFWAGRVTN YNVGRTYAQI
RNAAIHTAGQ YWASCNMVAR DAIGGAAMDK AFLKGLSMTN GSTNQKAAAQ AVLTAAAALG
YSSAQLNAIG DAYNKSCTYG VTVPQKL
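
The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. As a minimal sketch, the helpers below strip the 10-base grouping, compute a GC fraction, and translate codons until the first stop. The helper names are hypothetical. The codon assignments used are those of the standard code, which table 11 shares; alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the gene sequence above.
print(translate("ATGCAGACAT CCCGTAAA"))  # MQTSRK
```

Pasting the full gene sequence block into `translate` and `gc_content` gives a way to compare the result against the listed protein sequence and the GC content field.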