Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0099 |
Symbol | |
ID | 4900869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 92566 |
End bp | 95835 |
Gene Length | 3270 bp |
Protein Length | 1089 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640133329 |
Product | type I site-specific deoxyribonuclease HsdR |
Protein accession | YP_001064384 |
Protein GI | 126453319 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.26483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTGGG AACTCGAAGA AGTTGAACAG CCTTTCGTGC TCCAGCTTAA ACAACTGGAG TGGACGCACA TCCAGGGGGA CATCGATAAT CCGAGCATCA GCGGCCGCAC AAGCTTTGCC GAGGTCATTC AGGAAGGCGT CCTGCGCGAG CAACTGCACT CCTTGAATCT TGGCCCAAGT GGCGAGCCGT GGCTTGACGA CGCAAGGGTT TCAGAGGCCG TCGCATCCCT CACGCGCCTC GGCACGCACC GGCTGATGGA GTCGAATCAA AAGGCCACCG AACTACTGAT CAGAGGCGTG ACGGTCCCGG GCCTTTCGGG ATGGGACGGC GGGCGCAGCC AGACCATCCA CTACATCGAC TGGGAACACC CGGGACGCAA TGAATTTACC GTCGTCAGCC AGTTCCGTGT GGACTGCCCT CCCGGCTACA ACAGCGCCAA AGCGTTCATC GTGCCCGACC TCGTGCTGCT GGTAAACGGT ATCCCGCTGG TCGTGGTCGA ATGCAAGAGC CCATCGATTC CCGAGCCGTT GGCCGAGGCG ATCAACCAGT TGCGGCGCTA TACCAACCAA CGCCACGCAG ATCGCGAAGT CGACGACAAT GAAGGCAACG AGACGCTATT TCACACGAAT CAGCTGTTGA TCGCGACAAG TTTCGACGAG GCGCGAGTCG GGACTATCGG CGCGGCGTTT CGCCACTACA CCGCATGGAA GACGGTCGTC CCGCACACCG AAGCGGAAGT CGCTGCTGGG CTTGATAAGC AGCAGTTATC AGCGCAAGAG CGTCTCATCG CCGGCCTTCT GGACAAGCGG ACCCTGCTCG ATGTGATACG CCACTTCATA TTGTTCATGG AGGCCGACGG CCAAACCGTA AAGGCCCTAT GTCGTTACCA GCAGTACCGC GCAGTGACGC ATGCGATCCA CCGTCTTCGA ACAGGTAAAA CACGGCTCGA GGATGGCGAG CACGACCGGC GCGGCGGCAT CATCTGGCAT ACGCAGGGTT CCGGCAAGAG CCTGACCATG GTGTTCCTTG TGCGAAAGCT GCGCACCGAC CCGAAGCTTC GTCGGTTTAA GGTCGTGATA GTCACCGATC GTACCGACCT TGAAAAACAG CTATCGGGTA CCGCGTCGCT AACAGACGAG ATTGTCGAGA GGGCGACGTC CGCCAATGGC CTGCGGCGGC TGCTAAGCGT CCACGGACCA GGACTTGTAT TCGGCATGAT CCAGAAGCAG CGCGGCGATG ATACGTCGAG CGAATCTGAT TCGGCGGATG ACCGGCCCAA CTCCCATGCC CCTCAGATCG CAGAGCCGAT CAACGAGGAC GACACCATCC TCGTGCTGGT CGATGAAGCG CATCGGAGCC AGGCGGGCGA TCTTCACTCT GCCCTCCAAG CGGGTCTTCC CAACGCCGCT CGCATCGGTT TTACAGGCAC GCCCATCCTG ATGGGCGAGA AGAAGCGAAC GCACGAGATT TTTGGCGACT TTATCGATCG CTACACGATT CGTGAGGCCG AGGCGGACGG CGCCATCGTC CCCATCCTCT ACGAAGGCCG CACTGCGCAT GGCGCGGTGA AAGATGGGGC GAACCTCGAC GAACTCTTTG AAGACTTATT TCGCGATCAC ACCGCCGAAG AGCTGGAGAC CATTAGGCAG AAATATGCTA CTAAGGGGCA TATCTTCGAA GCGCCTGCAC TTATCGCGGA CAAGGCCCGC GACATGCTGC GCCACTATGT GACGAATATC TTGCCAAACG GGTTCAAGGC GCAAGTGGTC GCCTATAGCC GGTTGGCGGT CGTGCGGTAC TACGAGGCGT TGCTTGCCGC CCGCGATGAG CTCCTTGCGC AAGCAGAAGC ACTCAGCCCC GAAGATAAAG CGCTCGACGA GGAATCGCTC TGTAGCCGGC CACGGGACGT TAAAGCGCAA CTGCAGGCAT GGCGATACCG CGAGACATTG CAGGCGCTTG AGTTCGCGCC GGTATTCTCA GGCAGCAACA ACGATGACCC GGCGTGGAAG CAGTGGACCG ATAGCTCGGC GCAGGAGCAG CGCATCGAGC GCTTCAAGAA GCCGCTTTTT CATGCGGACG CCGGCAAAAC CGACCCGCTT GCGTTCTTAA TCGTCAAATC GATGTTGCTC ACGGGCTTCG ACGCCCCGAT CGAAGGGGTC ATGTATCTGG ACCGACCCAT CCGCGAAGCT GAGCTGCTGC AAACGATCGC TCGCGTGAAC CGCACGGGCT ACGGCAAGCG GTTCGGGATC GTTGTCGACT ATTTCGGCCT CGCGCATCAT CTGAAGCAGG CGCTTGCTGT CTACGCTGCC GGCGATATCG AGGGAGCGCT ACAAAGCTTG AAAGACGAGC TCCCGGTGCT TCGCGATCGC CACATCCGCG TCGTCGATCT ATTCCGTCAG CGTGGGATTG AGAACCTCGC GGATCATGAA GCGTGTGTTG AGGCATTGCA GGATGAGCGG CTTCGCGCCG AGTTCACCGT TAAATTCAAG CAGTTCCTCG AATTGCTTGA TGTGGTATTG CCGCGTCCAG AAGGCTTACC ATTTTCCCCG GATGCCAAGC TGCTGGCCTT TATCTATGCG CGCGCCCGCA ATCGCTACCG TGACACGCCT GTGCTGGGCA AAGACATCGG CGCAAAAGTG CGCAAACTGA TCGATGACCA TGTGCTCTCG ATGGGTGTCG ACCCGAAGAT CCCGCCGGTG TCGTTGACCG ATGCTGAGTT TGCGACGAAG GTCGCCCGGG AGCCCAACGA TCGCGCCAAA GCCTCGGAGA TGGAACACGC GATCCGCGCG CACATCCGGG AGCACATGGA CCAAGATCCT GTGACGTATC GGAAGCTGAG CGAGCGACTT CGTGATCTGC TGGAGCGGCT CGGGGAGCAG TGGAACGAGC TGGCCGCCGC CTTGCAGGGC TTGATCGATC AGATCCAAAG CGGACGGGTT GCACATGACG ATCGGCTGCC CGACCTCCCG GAACACTACG GTCCGTTTAT GCGGCTGATG GTGGACGCGA CAGTCGGGGA AGAGCGCTTG ACTGAGGCCG AGCGTCAGCG TCTCGTGGAT TTGGCGGTGG AGGTGGTTGA CATGATTGCT GCGGAGTTGA CCCCGAACTT CTGGCGCCCG ACGCGACGGC CAGCACAAGA CGCGCTGAGC AGTCGTATCT TCGAGCTATT GATGCGCTCG CGGTTGTTAC CGGCGCCGCA GATCGAAGCG TTGGTCGACA AGCTTATGGA ACTCGCCCGC GCTAACCACG CCCAGTTGGT AAGTGTATGA
|
Protein sequence | MGWELEEVEQ PFVLQLKQLE WTHIQGDIDN PSISGRTSFA EVIQEGVLRE QLHSLNLGPS GEPWLDDARV SEAVASLTRL GTHRLMESNQ KATELLIRGV TVPGLSGWDG GRSQTIHYID WEHPGRNEFT VVSQFRVDCP PGYNSAKAFI VPDLVLLVNG IPLVVVECKS PSIPEPLAEA INQLRRYTNQ RHADREVDDN EGNETLFHTN QLLIATSFDE ARVGTIGAAF RHYTAWKTVV PHTEAEVAAG LDKQQLSAQE RLIAGLLDKR TLLDVIRHFI LFMEADGQTV KALCRYQQYR AVTHAIHRLR TGKTRLEDGE HDRRGGIIWH TQGSGKSLTM VFLVRKLRTD PKLRRFKVVI VTDRTDLEKQ LSGTASLTDE IVERATSANG LRRLLSVHGP GLVFGMIQKQ RGDDTSSESD SADDRPNSHA PQIAEPINED DTILVLVDEA HRSQAGDLHS ALQAGLPNAA RIGFTGTPIL MGEKKRTHEI FGDFIDRYTI REAEADGAIV PILYEGRTAH GAVKDGANLD ELFEDLFRDH TAEELETIRQ KYATKGHIFE APALIADKAR DMLRHYVTNI LPNGFKAQVV AYSRLAVVRY YEALLAARDE LLAQAEALSP EDKALDEESL CSRPRDVKAQ LQAWRYRETL QALEFAPVFS GSNNDDPAWK QWTDSSAQEQ RIERFKKPLF HADAGKTDPL AFLIVKSMLL TGFDAPIEGV MYLDRPIREA ELLQTIARVN RTGYGKRFGI VVDYFGLAHH LKQALAVYAA GDIEGALQSL KDELPVLRDR HIRVVDLFRQ RGIENLADHE ACVEALQDER LRAEFTVKFK QFLELLDVVL PRPEGLPFSP DAKLLAFIYA RARNRYRDTP VLGKDIGAKV RKLIDDHVLS MGVDPKIPPV SLTDAEFATK VAREPNDRAK ASEMEHAIRA HIREHMDQDP VTYRKLSERL RDLLERLGEQ WNELAAALQG LIDQIQSGRV AHDDRLPDLP EHYGPFMRLM VDATVGEERL TEAERQRLVD LAVEVVDMIA AELTPNFWRP TRRPAQDALS SRIFELLMRS RLLPAPQIEA LVDKLMELAR ANHAQLVSV
|
| |