Gene BURPS1710b_3582 details


Gene Information

Locus tag: BURPS1710b_3582
Symbol:
ID: 3689539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 3904666
End bp: 3906504
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 70%
IMG OID: 637730037
Product: peptidase
Protein accession: YP_334947
Protein GI: 76810146
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
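
The coordinate and length fields in this record are arithmetically related, and a short check can make that relationship explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3904666, 3906504)
print(length, protein_length(length))  # 1839 612
```

Under these assumptions the computed values agree with the Gene Length (1839 bp) and Protein Length (612 aa) fields.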


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.955515
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCGA TTCGCTACAC GATCGTTCCG AAAGATCCCG CCGCGCACCT GTTCGAGGTG 
ACGCTCACGC TCGCCGATCC GGACCCGGCG GGCCAGCGCT TCGCGCTGCC CGTATGGATT
CCGGGCAGCT ATATGGTGCG CGAGTTCGCG CGCAATATCG TGACGCTGCG TGCGTTCAAC
GAAGCGGGCC GCAAGCTGCG GATCGGCAAG CTCGACAAGC AGACCTGGCA GGCCGCGCCG
GCGCCCGGGC CGATCACGCT GCGCTACGAC GTCTACGCGT GGGACCTGTC GGTGCGCGCC
GCGCACCTGG ACGACACGGG CGGCTTCTTC AACGGCACGA GCGTGTTTCT CGCGCCGCTC
GGCCGCGAGG ATGCGCCGTG CGAGGTAATG ATCGAGCGGC CGGCGGGCGA CGCGTACCGG
CGCTGGCGCG TCGCGACGGC GCTGCCGGAG GCGCGCGGCA CGAAACGCTA CGGCTTCGGC
GCGTACCGCG CGGAGAATTA CGACGAGCTG ATCGATCATC CGGTCACGCT CGGCGAATTC
GCGCTCGCGT CGTTCGACGC GCACGGCGTG CCGCACGACA TCGCGATCGC GGGCCGCGTG
ACCGGGCTCG ATCTCGAGCG GCTCGCGGCC GACCTGAAGC GGGTGTGCGA GGCGCAGATC
GCGCTGTTCG AGCCGAAGAC CCGGCGCGCG CCGATGTCGC GCTACGTGTT CATGACGCAG
GCGGTCAGCG ACGGCTACGG CGGGCTCGAG CATCGCGCGT CGACGGCGCT CGTCTGCAAT
CGCACCGATC TGCCGGTGAA GGGGCGCCCT GAGAAGACGG ACGGCTATCG GACTTACCTC
GGCCTGTGCA GCCACGAGTA CTTCCATACA TGGAACGTGA AGCGGATCAA GCCGGCCGCG
TTCGCGCCGT ACGATCTGTC GCAGGAGAAT TACACGTCGC TGCTGTGGCT CTTCGAGGGC
TTCACGTCGT ACTACGACGA CCTGATGCTC GCGCGCAGCG GCCTCATCTC GCAGGACGAC
TATTTCGCGC TCGTCGGCCG CACGATCGCC GGCGTGCAGC GCGGCGCCGG CCGGCTCAGG
CAGAGCGTCG CCGAAAGCTC GTTCGATGCG TGGATCAAGT ATTACCGGCA GGACGAAAAC
GCGACGAACG CGATCGTCAG CTATTACACG AAGGGCTCGC TCGTCGCGCT CGCGTTCGAT
CTGGCGATTC GCGCGCGCAG CCGCCACCGC AAATCGCTCG ACGACGTGAT GCGGCTTCTG
TGGCAGCGCT TCGGGCGCGA CTTCTACCAC GGCAAGCCGC AAGGCGTCGG CGAGGACGAC
GTGAAGGCGC TGATCGCCGA AGCGACGGGT GTCGATCTCG GCCGTCTTTT CGACGAAGCG
GTGTCCGGCA CGCGCGATCT GCCGCTCGCC GAACTCTTCG AGCCGTTCGG CGTGACGCTC
GCGCCGGACG GGGGCGCGGG CGGCGCGGGC GGCGCGGCCG ATGCGCCCGC GAAGCCGACG
CTCGGCGCGC GCACGCGCGG CGGCGCGGAA TGCACGCTCG CAGCCGTCTA CGAAGGCGGC
GCCGCGCATC GGGCCGGGCT GTCCGCGGGC GACGCGCTCG TCGCGATCGA CGGGCTGCGC
GTGACGGGCT CGAACCTCGA CGCGCTGCTC GCGCGCTACC GCGTCGGCGA CAAGGTCGAG
ATCCACGCAT TCCGGCGCGA CGAACTGCGC GTGGTGCAGC TCAAGCTCGA CGGCCCGGAC
ATCGCGCGCT ACAAGCTGGC CGCTCAGCCG AAGCCCGCGG CCGCGCGCGC CCGTCGCGAC
GCGTGGCTCG GGCTGCCGGC GGCGCGCGGC GGTCGATAA
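
The sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how a GC fraction could be computed from such a listing; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only (60 bases), used as sample input.
first_line = "ATGAAGCCGA TTCGCTACAC GATCGTTCCG AAAGATCCCG CCGCGCACCT GTTCGAGGTG"
print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 3))  # 60 0.583
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.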
 
Protein sequence
MKPIRYTIVP KDPAAHLFEV TLTLADPDPA GQRFALPVWI PGSYMVREFA RNIVTLRAFN 
EAGRKLRIGK LDKQTWQAAP APGPITLRYD VYAWDLSVRA AHLDDTGGFF NGTSVFLAPL
GREDAPCEVM IERPAGDAYR RWRVATALPE ARGTKRYGFG AYRAENYDEL IDHPVTLGEF
ALASFDAHGV PHDIAIAGRV TGLDLERLAA DLKRVCEAQI ALFEPKTRRA PMSRYVFMTQ
AVSDGYGGLE HRASTALVCN RTDLPVKGRP EKTDGYRTYL GLCSHEYFHT WNVKRIKPAA
FAPYDLSQEN YTSLLWLFEG FTSYYDDLML ARSGLISQDD YFALVGRTIA GVQRGAGRLR
QSVAESSFDA WIKYYRQDEN ATNAIVSYYT KGSLVALAFD LAIRARSRHR KSLDDVMRLL
WQRFGRDFYH GKPQGVGEDD VKALIAEATG VDLGRLFDEA VSGTRDLPLA ELFEPFGVTL
APDGGAGGAG GAADAPAKPT LGARTRGGAE CTLAAVYEGG AAHRAGLSAG DALVAIDGLR
VTGSNLDALL ARYRVGDKVE IHAFRRDELR VVQLKLDGPD IARYKLAAQP KPAAARARRD
AWLGLPAARG GR
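
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below builds a codon table from the compact NCBI ordering (bases in the order T, C, A, G); translation table 11 uses the same amino acid assignments as the standard table and differs only in the permitted start codons, which this sketch does not model. The `translate` function is illustrative, and only short excerpts of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAGCCGA TTCGCTACAC GATCGTTCCG"))  # MKPIRYTIVP
# Last 9 bases of the gene sequence, which end in the stop codon TAA.
print(translate("GGTCGATAA"))  # GR
```

Both excerpts match the corresponding ends of the listed protein sequence, which begins with MKPIRYTIVP and ends with GR.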