Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3644 |
Symbol | |
ID | 4884485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3571227 |
End bp | 3573827 |
Gene Length | 2601 bp |
Protein Length | 866 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640129572 |
Product | type I restriction enzyme R protein N terminus (HSDR_N)/N-6 DNA methylase |
Protein accession | YP_001060649 |
Protein GI | 126440241 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.550519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAGT CTGCCAACGA GACCGAAACA GTCATCAAGC GCATCCTGCC CTATTTACAG CGTCGGGGCT ATGAGATTGA GACAGATCTC CATTTCGAGA CCGCTGCTTC CACTCCGGAA AGATATGAAG CGGGCTTCGT TGACATCTTG GTCTGGCCGA ACGACAAGAC CTATCCGAAG GGCAAACCCG CTTTCCTGAT CGAAGCGAAG CGCATTGCCA AGAAGCTATC GGAGCAAGAT AAAAAGCAAG CGCTTTCTTA CGCTCGTGCC GACGGCTACG ACGTTCCGTT TGTGGTCGTC TGCAATGGCG CAGAGATTCG ATCCTATAAC GCGAAGACCG GCGAGCCGAT TCAGTGGAAC GGCAAACTTT CCTCGAAGAT TCCGGCCAAG AGTCAGTTGA AGTCTGTGCT CAAGGACTTC AAGACAGATC CGCAGGCAGT TCGAATCGAA CTGCCCGGGG AGGATGGTCC CCTTGACGGG TCATCCGCGC TGCCTTTCCG CCCGAGCCTT CCGTTGCGAC AACTCAACGC CCTGTTCTCC AGGTGCCATG ATGCTATTCG GAAGAATGAA AAAGACGAGA ACCACATCTT CGACGATTTT TCCAAACTGC TGTTTCTCAA GCTGCTAGAG GAAAAGGCCG ATACCGAGGA AGGGTTCAAT TTGCCTTACA GCTACACCTT CCACGAACTC GCGGCGTTGC CCGACGCTAA GGCAGATCAG GTCCAGAACG CAATCATGGA CATGATCAAA AAAATTCGGA CCGACAAGTC GTACGGGGAC GTATTGGCGA ATCCGATCCA CCTGAAAGTG GCGAAGACGT TTCTGTACCT GGTTCGTCAG CTCGCAGCGG TTTCGTTCAC TGACAGCACG ACGGACTCCA AGGGGGCGGC ATTCGAGTAC TTCGTTCGGG CGACCCTCAA GGGGAAGAAG CTGGGGCAAT ACTTTACGCC TAGGCCCCTG GTTCGCCTCA TGTCAGCAAT TGTGGGGCAA GAGAAGATCG TCAATGCACT GCTTTCGGGG GCTGCGGCCC CTAAGGTCCT GGACCCCGCG TGCGGCACCG GCGGCTTCTT GGTGTACCTC ATGGGGGATA GCCTCAGGGT CGCGAATCAG AAGCTTGCCG ACCGAGCGAT TAATGCCGCC ACCCATCGCG AACTCGTTCG GAAGATTCGT CAGCAGGTCT TTTTTGGCTC CGACGCAAAT GAGGGCGTTG CATGTGCGGC GAAGATGAAC ATGATCGTCG CCGGTGACGG GCATTCAAAT ATTCAGCCCG AGAACAGCTT GGCACGGACG GCCAAGAACT GGAACATTCA AGATTCAGAT TGCGACTTCA TCTTGACTAA CCCGCCGTTC GGGACGTCTG AGAGCGGGGC CTTGTCTGAT AAAGACATGG GGCAGTTCGA AGTCCAGACG ACGAAGGGAC AACTCCTTTT CCTGCAGAAG ATGGTGTTGT CTGCTCGTCG TGGCGGGGAG ATCTGCACCG TCATTGATGA GGGGGTGCTC AATACTGATA CGGCGGCACC GATACGCAAA TGGCTCCTGA GCAAAGCGAA GCTTTTGGCA GTGGTGCGTT TGCCTGACGA GACATTCCGT CCGAACAAAA TCAATGTTCG ATCGAGCGTG CTTTACCTCC AGCGAATGAC CGAGGAGGAG GAGGAAATCG CCGACGATAT CAAGTATCCG GTTGCATTCT GCGATATCGA GACCTTCGGC ATGGATGGGG CGGGGGATAT TGCCCGCAAC TTTGACTTGG ATACTTTGAT TGATTCGGTC GGAAAGAACA TCTTGCGAAC TGGACGCACA CGGAGCGGAA AACATTGGTC TGTCTTCGAT GTTGCCGTTT CGAGGATTCG GGACGACAAG GCATCCCGAT TCGACGTGAA GTATTGGCGG CCGGGCGTAA CCAATGCGAT GAAAGAGATC GTTTCCGTCG GCGGAAAGAC AATCAAAGAG TTGAACACGA TCCCGACCGC ACGTGGGACG AGTCCCAGCG CCGATTCATA TGTTGATGCG GCCGATGGCT ACGCGTTGGT GATCAAGTCA GGTAGCAACA TCTCGCGTTA TGGCGAACTC ATAGTCGGCG GCGATTACAT CGAGAAGAGC TTGTTTGATG AGTACGTTGA AAAAGCACAT ACGCAAGGCC GCAATTTCAA CTTGGTCAGG CCGGGAGACG TACTCGTATC ATCGACCGGT GACGGAACGC TCGGTAAGTG CTGTGTGTAC CGGCCTTCGA CTGACCCAAT CACTGGCGAG ACCGTGCCCT ACGCAGCAAT AGCGGAAGGC CACGTGGCGA TCATTCGTGT TGATCCTGAC GTTATCTGGC CAGAGTACCT TTGTGACTAC CTGAGAGTGG GTTTCGGGGC ACAACAGATC GAGAGGCTCT ATACCGGCTC AACGGGGATG ATCGAACTTA CCCCCGCCGC GCTGGATGAA GTGGTAGTGA ACTTGCTTTC AGGAATCGAG GAGCAGAAGA AATACTCCGA AGCCCTACGG AAGGGCGAGG CACAAGCGCG CTCTACGTCG GAGCAAGCAG CGAGCGAAAT GAGTGCTGCC CTCGACTCCT TCCGAAGCAG CACCAGCACT TTGTTGGAGC TGACGACCTA A
|
Protein sequence | MSKSANETET VIKRILPYLQ RRGYEIETDL HFETAASTPE RYEAGFVDIL VWPNDKTYPK GKPAFLIEAK RIAKKLSEQD KKQALSYARA DGYDVPFVVV CNGAEIRSYN AKTGEPIQWN GKLSSKIPAK SQLKSVLKDF KTDPQAVRIE LPGEDGPLDG SSALPFRPSL PLRQLNALFS RCHDAIRKNE KDENHIFDDF SKLLFLKLLE EKADTEEGFN LPYSYTFHEL AALPDAKADQ VQNAIMDMIK KIRTDKSYGD VLANPIHLKV AKTFLYLVRQ LAAVSFTDST TDSKGAAFEY FVRATLKGKK LGQYFTPRPL VRLMSAIVGQ EKIVNALLSG AAAPKVLDPA CGTGGFLVYL MGDSLRVANQ KLADRAINAA THRELVRKIR QQVFFGSDAN EGVACAAKMN MIVAGDGHSN IQPENSLART AKNWNIQDSD CDFILTNPPF GTSESGALSD KDMGQFEVQT TKGQLLFLQK MVLSARRGGE ICTVIDEGVL NTDTAAPIRK WLLSKAKLLA VVRLPDETFR PNKINVRSSV LYLQRMTEEE EEIADDIKYP VAFCDIETFG MDGAGDIARN FDLDTLIDSV GKNILRTGRT RSGKHWSVFD VAVSRIRDDK ASRFDVKYWR PGVTNAMKEI VSVGGKTIKE LNTIPTARGT SPSADSYVDA ADGYALVIKS GSNISRYGEL IVGGDYIEKS LFDEYVEKAH TQGRNFNLVR PGDVLVSSTG DGTLGKCCVY RPSTDPITGE TVPYAAIAEG HVAIIRVDPD VIWPEYLCDY LRVGFGAQQI ERLYTGSTGM IELTPAALDE VVVNLLSGIE EQKKYSEALR KGEAQARSTS EQAASEMSAA LDSFRSSTST LLELTT
|
| |