Gene BURPS1106A_0099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0099 
Symbol 
ID4900869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp92566 
End bp95835 
Gene Length3270 bp 
Protein Length1089 aa 
Translation table11 
GC content60% 
IMG OID640133329 
Producttype I site-specific deoxyribonuclease HsdR 
Protein accessionYP_001064384 
Protein GI126453319 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.26483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGTGGG AACTCGAAGA AGTTGAACAG CCTTTCGTGC TCCAGCTTAA ACAACTGGAG 
TGGACGCACA TCCAGGGGGA CATCGATAAT CCGAGCATCA GCGGCCGCAC AAGCTTTGCC
GAGGTCATTC AGGAAGGCGT CCTGCGCGAG CAACTGCACT CCTTGAATCT TGGCCCAAGT
GGCGAGCCGT GGCTTGACGA CGCAAGGGTT TCAGAGGCCG TCGCATCCCT CACGCGCCTC
GGCACGCACC GGCTGATGGA GTCGAATCAA AAGGCCACCG AACTACTGAT CAGAGGCGTG
ACGGTCCCGG GCCTTTCGGG ATGGGACGGC GGGCGCAGCC AGACCATCCA CTACATCGAC
TGGGAACACC CGGGACGCAA TGAATTTACC GTCGTCAGCC AGTTCCGTGT GGACTGCCCT
CCCGGCTACA ACAGCGCCAA AGCGTTCATC GTGCCCGACC TCGTGCTGCT GGTAAACGGT
ATCCCGCTGG TCGTGGTCGA ATGCAAGAGC CCATCGATTC CCGAGCCGTT GGCCGAGGCG
ATCAACCAGT TGCGGCGCTA TACCAACCAA CGCCACGCAG ATCGCGAAGT CGACGACAAT
GAAGGCAACG AGACGCTATT TCACACGAAT CAGCTGTTGA TCGCGACAAG TTTCGACGAG
GCGCGAGTCG GGACTATCGG CGCGGCGTTT CGCCACTACA CCGCATGGAA GACGGTCGTC
CCGCACACCG AAGCGGAAGT CGCTGCTGGG CTTGATAAGC AGCAGTTATC AGCGCAAGAG
CGTCTCATCG CCGGCCTTCT GGACAAGCGG ACCCTGCTCG ATGTGATACG CCACTTCATA
TTGTTCATGG AGGCCGACGG CCAAACCGTA AAGGCCCTAT GTCGTTACCA GCAGTACCGC
GCAGTGACGC ATGCGATCCA CCGTCTTCGA ACAGGTAAAA CACGGCTCGA GGATGGCGAG
CACGACCGGC GCGGCGGCAT CATCTGGCAT ACGCAGGGTT CCGGCAAGAG CCTGACCATG
GTGTTCCTTG TGCGAAAGCT GCGCACCGAC CCGAAGCTTC GTCGGTTTAA GGTCGTGATA
GTCACCGATC GTACCGACCT TGAAAAACAG CTATCGGGTA CCGCGTCGCT AACAGACGAG
ATTGTCGAGA GGGCGACGTC CGCCAATGGC CTGCGGCGGC TGCTAAGCGT CCACGGACCA
GGACTTGTAT TCGGCATGAT CCAGAAGCAG CGCGGCGATG ATACGTCGAG CGAATCTGAT
TCGGCGGATG ACCGGCCCAA CTCCCATGCC CCTCAGATCG CAGAGCCGAT CAACGAGGAC
GACACCATCC TCGTGCTGGT CGATGAAGCG CATCGGAGCC AGGCGGGCGA TCTTCACTCT
GCCCTCCAAG CGGGTCTTCC CAACGCCGCT CGCATCGGTT TTACAGGCAC GCCCATCCTG
ATGGGCGAGA AGAAGCGAAC GCACGAGATT TTTGGCGACT TTATCGATCG CTACACGATT
CGTGAGGCCG AGGCGGACGG CGCCATCGTC CCCATCCTCT ACGAAGGCCG CACTGCGCAT
GGCGCGGTGA AAGATGGGGC GAACCTCGAC GAACTCTTTG AAGACTTATT TCGCGATCAC
ACCGCCGAAG AGCTGGAGAC CATTAGGCAG AAATATGCTA CTAAGGGGCA TATCTTCGAA
GCGCCTGCAC TTATCGCGGA CAAGGCCCGC GACATGCTGC GCCACTATGT GACGAATATC
TTGCCAAACG GGTTCAAGGC GCAAGTGGTC GCCTATAGCC GGTTGGCGGT CGTGCGGTAC
TACGAGGCGT TGCTTGCCGC CCGCGATGAG CTCCTTGCGC AAGCAGAAGC ACTCAGCCCC
GAAGATAAAG CGCTCGACGA GGAATCGCTC TGTAGCCGGC CACGGGACGT TAAAGCGCAA
CTGCAGGCAT GGCGATACCG CGAGACATTG CAGGCGCTTG AGTTCGCGCC GGTATTCTCA
GGCAGCAACA ACGATGACCC GGCGTGGAAG CAGTGGACCG ATAGCTCGGC GCAGGAGCAG
CGCATCGAGC GCTTCAAGAA GCCGCTTTTT CATGCGGACG CCGGCAAAAC CGACCCGCTT
GCGTTCTTAA TCGTCAAATC GATGTTGCTC ACGGGCTTCG ACGCCCCGAT CGAAGGGGTC
ATGTATCTGG ACCGACCCAT CCGCGAAGCT GAGCTGCTGC AAACGATCGC TCGCGTGAAC
CGCACGGGCT ACGGCAAGCG GTTCGGGATC GTTGTCGACT ATTTCGGCCT CGCGCATCAT
CTGAAGCAGG CGCTTGCTGT CTACGCTGCC GGCGATATCG AGGGAGCGCT ACAAAGCTTG
AAAGACGAGC TCCCGGTGCT TCGCGATCGC CACATCCGCG TCGTCGATCT ATTCCGTCAG
CGTGGGATTG AGAACCTCGC GGATCATGAA GCGTGTGTTG AGGCATTGCA GGATGAGCGG
CTTCGCGCCG AGTTCACCGT TAAATTCAAG CAGTTCCTCG AATTGCTTGA TGTGGTATTG
CCGCGTCCAG AAGGCTTACC ATTTTCCCCG GATGCCAAGC TGCTGGCCTT TATCTATGCG
CGCGCCCGCA ATCGCTACCG TGACACGCCT GTGCTGGGCA AAGACATCGG CGCAAAAGTG
CGCAAACTGA TCGATGACCA TGTGCTCTCG ATGGGTGTCG ACCCGAAGAT CCCGCCGGTG
TCGTTGACCG ATGCTGAGTT TGCGACGAAG GTCGCCCGGG AGCCCAACGA TCGCGCCAAA
GCCTCGGAGA TGGAACACGC GATCCGCGCG CACATCCGGG AGCACATGGA CCAAGATCCT
GTGACGTATC GGAAGCTGAG CGAGCGACTT CGTGATCTGC TGGAGCGGCT CGGGGAGCAG
TGGAACGAGC TGGCCGCCGC CTTGCAGGGC TTGATCGATC AGATCCAAAG CGGACGGGTT
GCACATGACG ATCGGCTGCC CGACCTCCCG GAACACTACG GTCCGTTTAT GCGGCTGATG
GTGGACGCGA CAGTCGGGGA AGAGCGCTTG ACTGAGGCCG AGCGTCAGCG TCTCGTGGAT
TTGGCGGTGG AGGTGGTTGA CATGATTGCT GCGGAGTTGA CCCCGAACTT CTGGCGCCCG
ACGCGACGGC CAGCACAAGA CGCGCTGAGC AGTCGTATCT TCGAGCTATT GATGCGCTCG
CGGTTGTTAC CGGCGCCGCA GATCGAAGCG TTGGTCGACA AGCTTATGGA ACTCGCCCGC
GCTAACCACG CCCAGTTGGT AAGTGTATGA
 
Protein sequence
MGWELEEVEQ PFVLQLKQLE WTHIQGDIDN PSISGRTSFA EVIQEGVLRE QLHSLNLGPS 
GEPWLDDARV SEAVASLTRL GTHRLMESNQ KATELLIRGV TVPGLSGWDG GRSQTIHYID
WEHPGRNEFT VVSQFRVDCP PGYNSAKAFI VPDLVLLVNG IPLVVVECKS PSIPEPLAEA
INQLRRYTNQ RHADREVDDN EGNETLFHTN QLLIATSFDE ARVGTIGAAF RHYTAWKTVV
PHTEAEVAAG LDKQQLSAQE RLIAGLLDKR TLLDVIRHFI LFMEADGQTV KALCRYQQYR
AVTHAIHRLR TGKTRLEDGE HDRRGGIIWH TQGSGKSLTM VFLVRKLRTD PKLRRFKVVI
VTDRTDLEKQ LSGTASLTDE IVERATSANG LRRLLSVHGP GLVFGMIQKQ RGDDTSSESD
SADDRPNSHA PQIAEPINED DTILVLVDEA HRSQAGDLHS ALQAGLPNAA RIGFTGTPIL
MGEKKRTHEI FGDFIDRYTI REAEADGAIV PILYEGRTAH GAVKDGANLD ELFEDLFRDH
TAEELETIRQ KYATKGHIFE APALIADKAR DMLRHYVTNI LPNGFKAQVV AYSRLAVVRY
YEALLAARDE LLAQAEALSP EDKALDEESL CSRPRDVKAQ LQAWRYRETL QALEFAPVFS
GSNNDDPAWK QWTDSSAQEQ RIERFKKPLF HADAGKTDPL AFLIVKSMLL TGFDAPIEGV
MYLDRPIREA ELLQTIARVN RTGYGKRFGI VVDYFGLAHH LKQALAVYAA GDIEGALQSL
KDELPVLRDR HIRVVDLFRQ RGIENLADHE ACVEALQDER LRAEFTVKFK QFLELLDVVL
PRPEGLPFSP DAKLLAFIYA RARNRYRDTP VLGKDIGAKV RKLIDDHVLS MGVDPKIPPV
SLTDAEFATK VAREPNDRAK ASEMEHAIRA HIREHMDQDP VTYRKLSERL RDLLERLGEQ
WNELAAALQG LIDQIQSGRV AHDDRLPDLP EHYGPFMRLM VDATVGEERL TEAERQRLVD
LAVEVVDMIA AELTPNFWRP TRRPAQDALS SRIFELLMRS RLLPAPQIEA LVDKLMELAR
ANHAQLVSV