Gene BURPS1106A_3590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3590 
Symbol 
ID4899255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3497062 
End bp3498891 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content70% 
IMG OID640136816 
ProductPDZ domain-containing protein 
Protein accessionYP_001067821 
Protein GI126453535 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCGA TTTGCTACAC GATCGTTCCG AAAGATCCCG CCGCGCACCT GTTCGAGGTG 
ACGCTCACGC TCGCCGATCC GGACCCGGCG GGCCAGCGCT TCGCGCTGCC CGTATGGATT
CCGGGCAGCT ATATGGTGCG CGAGTTCGCG CGCAATATCG TGACGCTGCG TGCGTTCAAC
GAAGCGGGCC GCAAGCTGCG GATCGGCAAG CTCGACAAGC AGACCTGGCA GGCCGCGCCG
GCGCCCGGGC CGATCACGCT GCGCTACGAC GTCTACGCGT GGGACCTGTC GGTGCGCGCC
GCGCACCTGG ACGACACGGG CGGCTTCTTC AACGGCACGA GCGTGTTTCT CGCGCCGCTC
GGCCGCGAGG ATGCGCCGTG CGAGGTAATG ATCGAGCGGC CGGCGGGCGA CGCGTACCGG
CGCTGGCGCG TCGCGACGGC GCTGCCGGAG GCGCGCGGCA CGAAACGCTA CGGCTTCGGC
GCGTACCGCG CGGAGAATTA CGACGAGCTG ATCGATCATC CGGTCACGCT CGGCGAATTC
GCGCTCGCGT CGTTCGACGC GCACGGCGTG CCGCACGACA TCGCGATCGC GGGCCGCGTG
ACCGGGCTCG ATCTCGAGCG GCTCGCGGCC GACCTGAAGC GGGTGTGCGA GGCGCAGATC
GCGCTGTTCG AGCCGAAGAC CAGGCGCGCG CCGATGTCGC GCTACGTGTT CATGACGCAG
GCGGTCAGCG ACGGCTACGG CGGGCTCGAG CATCGCGCGT CGACGGCGCT CGTCTGCAAT
CGCACCGATC TGCCGGTGAA GGGGCGCCCT GAGAAGACGG ACGGCTATCG GACTTACCTC
GGCCTGTGCA GCCACGAGTA CTTCCATACA TGGAACGTGA AGCGGATCAA GCCGGCCGCG
TTCGCGCCGT ACGATCTGTC GCAGGAGAAT TACACGTCGC TGCTGTGGCT CTTCGAGGGC
TTCACGTCGT ACTACGACGA CCTGATGCTC GCGCGCAGCG GCCTCATCTC GCAGGACGAC
TATTTCGCGC TCGTCGGCCG CACGATCGCC GGCGTGCAGC GCGGCGCCGG CCGGCTCAGG
CAGAGCGTCG CCGAAAGCTC GTTCGATGCG TGGATCAAGT ATTACCGGCA GGACGAAAAC
GCGACGAACG CGATCGTCAG CTATTACACG AAGGGCTCGC TCGTCGCGCT CGCGTTCGAT
CTGGCGATTC GCGCGCGCAG CCGCCACCGC AAATCGCTCG ACGACGTGAT GCGGCTTTTG
TGGCAGCGCT TCGGGCGCGA CTTCTACCAC GGCAAGCCGC AAGGCGTCGG CGAGGACGAC
GTGAAGGCGC TGATCGCCGA AGCGACGGGT GTCGATCTCG GCCGTCTTTT CGACGAAGCG
GTGTCCGGCA CGCGCGATCT GCCGCTCGCC GAACTCTTCG AGCCGTTCGG CGTGACGCTC
GCGCCGGACG GGGGCGCGGG CGGCGCGGCC GATGCGCCCG CGAAGCCGAC GCTCGGCGCG
CGCACGCGCG GCGGCGCGGA ATGCACGCTC GCGGCCGTCT ACGAAGGCGG CGCCGCGCAT
CGGGCCGGGC TGTCCGCGGG CGACGCGCTC GTCGCGATCG ACGGGCTGCG CGTGACGGGC
TCGAACCTCG ACGCGCTGCT CGCGCGCTAC CGCGTCGGCG ACAAGGTCGA GATCCACGCA
TTCCGGCGCG ACGAACTGCG CGTGGTGCAG CTCAAGCTCG ACGGCCCGGA CATCGCGCGC
TACAAGCTGG CCGCTCAGCC GAAGCCCGCG GCCGCGCGCG CCCGTCGCGA CGCGTGGCTC
GGGCTGCCGG CGGCGCGCGG CGGTCGATAA
 
Protein sequence
MKPICYTIVP KDPAAHLFEV TLTLADPDPA GQRFALPVWI PGSYMVREFA RNIVTLRAFN 
EAGRKLRIGK LDKQTWQAAP APGPITLRYD VYAWDLSVRA AHLDDTGGFF NGTSVFLAPL
GREDAPCEVM IERPAGDAYR RWRVATALPE ARGTKRYGFG AYRAENYDEL IDHPVTLGEF
ALASFDAHGV PHDIAIAGRV TGLDLERLAA DLKRVCEAQI ALFEPKTRRA PMSRYVFMTQ
AVSDGYGGLE HRASTALVCN RTDLPVKGRP EKTDGYRTYL GLCSHEYFHT WNVKRIKPAA
FAPYDLSQEN YTSLLWLFEG FTSYYDDLML ARSGLISQDD YFALVGRTIA GVQRGAGRLR
QSVAESSFDA WIKYYRQDEN ATNAIVSYYT KGSLVALAFD LAIRARSRHR KSLDDVMRLL
WQRFGRDFYH GKPQGVGEDD VKALIAEATG VDLGRLFDEA VSGTRDLPLA ELFEPFGVTL
APDGGAGGAA DAPAKPTLGA RTRGGAECTL AAVYEGGAAH RAGLSAGDAL VAIDGLRVTG
SNLDALLARY RVGDKVEIHA FRRDELRVVQ LKLDGPDIAR YKLAAQPKPA AARARRDAWL
GLPAARGGR