Gene BURPS1710b_A0608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0608 
SymbolscpA 
ID3693715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp800185 
End bp801918 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content71% 
IMG OID637730862 
Productkumamolisin 
Protein accessionYP_335767 
Protein GI76817289 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTTACA CTTGCTTTGC CGTACTGTCG CACGCTCCCC ATGGCGGCGT GTGTGCGCCG 
TCCGACATAC CGCCCGGAAC TGCGATCGCG CGACCGTGTC CGAATCCTGG CAATCCGTCA
CCACGGCTTC TGGAGGGTCC GAATATGGCA AGGCATCTTC ACGCCGGCAA CGAATCGCAT
CTCGTCGCCG AATCCACGTG CATCGGGCCG TGCGATCCGG CCGAGACGAT TCATGTAGTG
GTGATGTTGC GGCGACAGCA AGAGCAGCAC CTCGATTCAT TGTTGCAGGG CCTCGCGAGC
GGCGATCCGG GCGTGAAGCC TGTCTCGCGC GAGGCGTTCG CCCAGCGTTT CGGCGCGCAT
CCCGACGACG TCATGAAAGT CGAGGCATTC GCGCAGCAGC GCGGCCTCGC GGTCGCGCGC
GTCGATCCGG TCGAGAGCCT CGTCGTGCTG TCGGGCACGA TCGCGCAGTT CGAGGCGGCC
TTCGGCGTGA AGCTCGAGCG CTTCGAGCAT CGGTCGATCG GCCAGTATCG CGGCCGCACG
GGCGATATCA CGCTGCCCGA CGAGTTGCAC GGCATCGTCA CCGCGGTGCT CGGGCTCGAC
GATCGCCCGC AGGCCCGGCC GCATTTCCGG CTGCGGCCGA CTTTTCTGCC CGCGCGCGCG
CCGGCCGTCA CCTACACGCC GCCGCAGCTC GCGGCCCTCT ACGATTTCCC GCCCGGCGAC
GGCGCGGGCC AGTGCATCGC GATCGTCGAG CTCGGCGGCG GCTATCGGCC GGCCGAGATC
CAGCAGTATT TCGGCGGCCT CGGGCTCGCG CGGCAGCCGA AGCTCGTCGA CGTGAGCGTC
GGCGCGGGGC GCAACGCGCC GACGGGCGAT CCGAGCGGGC CGGACGGCGA AGTCGCGCTC
GATATCGAGA TCGCGGGCGC GATCGCGCCC GGCGCGACGA TTGCCGTCTA TTTCGCGCAG
AACAGCGACG CCGGCTTCAT CCAGGCGGTC AATCAGGCGG TGCACGACAC GACGAACCGG
CCCTCCGTCG TGTCGATCAG TTGGGGCGCG GCGGAGGCGA ACTGGACGTC GCAATCGATC
CAGGCCTTCG ATAGCGTGCT GCAGTCGGCC GCGGCGCTCG GCGTGACCGT GTGCGCGGCG
TCCGGCGATG ACGGCTCGAA CGACGGCCTG CAGGACGGCA CGAATCACGT CGATTTCCCG
GCATCGAGCC CGTACGTGCT CGCGTGCGGC GGCACGCGGC TCGACGCGCT GCCGGGGCAG
GGCATCCGCA GCGAAGTCGT GTGGAACGAC GAGGCGGCGG GCGGCGGCGC GACGGGCGGC
GGCGTCAGCG CCGTGTTCGA CGTGCCGCAG TGGCAGAGCG GCCTGAGCGC GACGCTCGCG
CAGGGTGGCG GCGCGTCGCC GCTCGCGAAG CGCGGCGTGC CGGACGTCGC GGGCGATGCG
TCGCCCGCGA CGGGCTACGA GGTGTTCGTC GCGGGCACGT CGACGGTGAT GGGCGGCACG
AGCGCCGTCG CACCGCTGTG GGCCGCGCTC GTCGCGCGGA TCAATGCGGC GGCGGGCAGC
CCCGCGGGCT GGATCAACCC GAAGCTGTAC CGGAACGCGG GCGCGCTGCA CGACATCTCG
GTGGGCGATA ACGGCGCGTA TGCGGCGACG CCGGGCTGGG ACGCGTGCAC GGGGCTCGGC
AGCCCGGACG GCGCGAAGGT CGCGGCGGCG CTGAAAGGCG GCGCGGCGGG CTGA
 
Protein sequence
MLYTCFAVLS HAPHGGVCAP SDIPPGTAIA RPCPNPGNPS PRLLEGPNMA RHLHAGNESH 
LVAESTCIGP CDPAETIHVV VMLRRQQEQH LDSLLQGLAS GDPGVKPVSR EAFAQRFGAH
PDDVMKVEAF AQQRGLAVAR VDPVESLVVL SGTIAQFEAA FGVKLERFEH RSIGQYRGRT
GDITLPDELH GIVTAVLGLD DRPQARPHFR LRPTFLPARA PAVTYTPPQL AALYDFPPGD
GAGQCIAIVE LGGGYRPAEI QQYFGGLGLA RQPKLVDVSV GAGRNAPTGD PSGPDGEVAL
DIEIAGAIAP GATIAVYFAQ NSDAGFIQAV NQAVHDTTNR PSVVSISWGA AEANWTSQSI
QAFDSVLQSA AALGVTVCAA SGDDGSNDGL QDGTNHVDFP ASSPYVLACG GTRLDALPGQ
GIRSEVVWND EAAGGGATGG GVSAVFDVPQ WQSGLSATLA QGGGASPLAK RGVPDVAGDA
SPATGYEVFV AGTSTVMGGT SAVAPLWAAL VARINAAAGS PAGWINPKLY RNAGALHDIS
VGDNGAYAAT PGWDACTGLG SPDGAKVAAA LKGGAAG