Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I0042 |
Symbol | |
ID | 3848251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 43366 |
End bp | 46395 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637839715 |
Product | type III restriction-modification system, res subunit |
Protein accession | YP_440602 |
Protein GI | 83721135 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTTGC ATTTCGAGTC GGATCTCGAC TATCAGCTCG AAGCGATCGA GGCGGTATGC GGTCTGTTTC GCGGCCAGGA GGCGTGCCGC GCCGAATTCA GCGTGACCGC GCAGGCCGCG CGGCGGCGCG CGAGCCCGCA GATTTCGCTC GGGATGGCCG AATCGGGGCT CGGCGTCGGC AATCGCCTGA CGCTCGACGC GCACACGTTC GCCGAGAATC TCGCGCGCGT GCAGGTGCGC AACGGGCTGC CGCCGTCCGG CGCGCCGCGC TCGAACGATT TCACCGTCGA GATGGAAACG GGCACGGGCA AGACTTACGT ATACCTGCGC ACGATCTTCG AGCTGCACCG CCGCTTCGAC TTCACGAAGT TCGTGATCGT CGTGCCGTCG GTCGCGATCA AGGAAGGCGT GTACAAGACG CTGCAGATCA CCGAGCGGCA TTTCCGGCGC CTGTACGCGG GCGTGCCGTT CGACTATTTC GTCTACGATT CGGCGAAGCT CGGCGAGGTG CGCAGCTTCG CGTCGAAATC GACCGTGCAG ATCATGATCG TCACGGTCGC GGCGATCAAC AAGAAGGACG TCAACACGCT CTACAAGGAC AGCGAACAGA CGGGCGGCGA AAAGCCGATC GACCTGATTC GCGCGGCGCA TCCGATCGTG ATCGTCGACG AGCCGCAGAG CGTGGACGGC GGGCTCGACG GGCGCGGCAA GGAAGCGCTC GACGCGATGC ACCCGCTCTG CACGCTGCGC TATTCGGCGA CGCACGCGGA CAAGTATCAG ATGCTGTACC GGCTCGACGC GATCGACGCG TACGAGCGCA AGCTCGTCAA GCAGGTCGAG ATCGCGTCGG CGACGGTCGA GGATGCGCAC AACAAGCCGT TCATGCGCGT GATGTCGATC GGCAGCCGGC GCGGGGCGAT CGCCGCGCGC GTCGAGCTCG ACGTCGCGAC GGCCGCGGGC GACGTCGAGC GGCAGACGGT TTCCGTCTCC GACGGCGACG ATCTCGAGCG CATCGCGCGC CGCGCCGTCT ACGCGAACTT CAGGATCGGC GAGATTCACG CGGCGCGCGG CGCCGAGTAT CTGGTGCTGC GCTATCCGGG CGGCGATGCG TTCCTGTCGA TCGGCGACAC ATATGGCGAC GTCGATACGC TCGCGATCCA GCGCGAGATG ATCCGCCGCA CGATCCGCGA GCATCTCGAC AAGGAACTGC GGCTCACGCC GCTCGGCGTG AAGGTGCTGT CGCTCTTCTT CGTCGATGCG GTCGACAAGT ATCGCAAGTA CGACCGCCAC GGCCAGCCGT TCAAGGGCGA TTACGCGCGG CTGTTCGAAG AAGAATACCG GCGCGCGGCG AAGCTGCCGG AATATCGCGC GCTGTTTGCC GGCGTCGATT GCGCGATCGC GGCCGAGGCC GTGCACGACG GCTATTTCTC GATCGACAGG AAAGGCGGCT GGACCGACAC GAGCGACAAG AGCGCGGGCA GCCGAGAGAA CGCGGAGCGC GCGTACGGCC TCATCATGAA GGACAAGGAG CGGCTGTTGT CGTTCGACAC GCCGCTCAAG TTCATCTTCT CGCATTCGGC GCTGAAGGAA GGCTGGGACA ATCCGAACGT GTTCCAGATC TGCACGCTGC GCGACATCCG CAGCGAGCGC GAGCGGCGCC AGACGATCGG CCGCGGGCTG CGTCTCGCCG TCAACCAGCG CGGCGAGCGC GTGCGCGGCT TCGACGTCAA CACGCTGACG GTGATCGCGG GCGAGAGCTA CGAGCAGTTC GCCGAGAACC TGCAGAAGGA AATCGAAGCC GATACGGGCA TCCGCTTCGG CATCGTCGAA ACGCATCAGT TCGCCGCGCT GCCCGTGCCG GCGGGCGACG GCAGCGTGCA GCCGCTCGGC GTCGAACGGT CCACCGCGCT ATGGACGTAT CTGCGCGACG CCGGCTATCT CGATGCGCGC GGCCGCGTGC AGGACACGCT GCGCGCGGCG CTCAAGCTGC GCGCGCTGCC GGTGCCCGAT GAATTCGGCT CGCAGCGCGC GCTGATCGTC GACATGCTGC GCAAGCTCGC GGGGCGGCTC GACGTGCGCA ACGCGGACGA GCGCCGCCAC ATCGCGCTGC GGCCCGACGC GCATGGCAAG GCGGTGTACC TGGGCGACGA ATTCCGCGCG CTGTGGGAGC GCATCCAGTA CCGGACGACC TACCGCGTGA ACTTCGACAA CGCACGCCTG ATCGAGCGCT GCGTCGCCGC GCTGAAGGCC GCGCCCGCCG TCGCGCGCGC GCGGCTGCAG TGGCGCAAGG CCGAAATCGC GATCGACGCG GCGGGCGTCG AGGCGATCGA AACCGAGGCG GCGGGCGCGA TCGCGATCGA CGAAGGCGAA ATCGACCTGC CGGATCTGCT GACCGAGCTG CAGGACCGCA CGCAGCTCAC GCGGCGCACG ATCGCGACCG TGCTGATCGA AAGCGGCAGG CTCGACGAGT TCCCGCGCAA TCCGCAGCGC TTCATCGCGC TCGTCGCTGC GGCGCTCGAG CGCTGCAAGC GCGACGCGCT CGTCGACGGG ATCGAATACA AGTTGCTCGG CAAGGAGCAC GTGCATGCGC TGTCGCTGTT CGAGGACGAG CCGCTTACCG GCTATCTGTC GAGCATGCGG CGCGGCGCGG CGAAATCGAT CCATGAGGAC GTGCCGTGCG AAACGCCTGC CGAGCGCGTG TTCGTCGAAT CGCTCGAACG GGACGACGCG GTCAGGCTGT ACGCGAAGCT GCCCGGCTGG TTCAAGATCC CGACGCCGCT CGGCAGCTAC AGCCCCGACT GGGCGGTGCT GATCGCCGAG GGCGACGGGC CGCGCCTCTA TTTCGTCGTC GAATCGAAGA GCGGCATCGC CGACAGCGAT CTGCACGCTC ACGAGCGGCG CAGGATTCAG TGCGGCGCGG CGCATTTCCG CGCGCTCGAG GCGGCCTCGG TCAATCCTGC GCGCTATGTG CGCGCGCGTT GCGCCGACGA TCTGCCGACG GCCCCCGCCA ACGCGCGCGA CGCGGCCTGA
|
Protein sequence | MQLHFESDLD YQLEAIEAVC GLFRGQEACR AEFSVTAQAA RRRASPQISL GMAESGLGVG NRLTLDAHTF AENLARVQVR NGLPPSGAPR SNDFTVEMET GTGKTYVYLR TIFELHRRFD FTKFVIVVPS VAIKEGVYKT LQITERHFRR LYAGVPFDYF VYDSAKLGEV RSFASKSTVQ IMIVTVAAIN KKDVNTLYKD SEQTGGEKPI DLIRAAHPIV IVDEPQSVDG GLDGRGKEAL DAMHPLCTLR YSATHADKYQ MLYRLDAIDA YERKLVKQVE IASATVEDAH NKPFMRVMSI GSRRGAIAAR VELDVATAAG DVERQTVSVS DGDDLERIAR RAVYANFRIG EIHAARGAEY LVLRYPGGDA FLSIGDTYGD VDTLAIQREM IRRTIREHLD KELRLTPLGV KVLSLFFVDA VDKYRKYDRH GQPFKGDYAR LFEEEYRRAA KLPEYRALFA GVDCAIAAEA VHDGYFSIDR KGGWTDTSDK SAGSRENAER AYGLIMKDKE RLLSFDTPLK FIFSHSALKE GWDNPNVFQI CTLRDIRSER ERRQTIGRGL RLAVNQRGER VRGFDVNTLT VIAGESYEQF AENLQKEIEA DTGIRFGIVE THQFAALPVP AGDGSVQPLG VERSTALWTY LRDAGYLDAR GRVQDTLRAA LKLRALPVPD EFGSQRALIV DMLRKLAGRL DVRNADERRH IALRPDAHGK AVYLGDEFRA LWERIQYRTT YRVNFDNARL IERCVAALKA APAVARARLQ WRKAEIAIDA AGVEAIETEA AGAIAIDEGE IDLPDLLTEL QDRTQLTRRT IATVLIESGR LDEFPRNPQR FIALVAAALE RCKRDALVDG IEYKLLGKEH VHALSLFEDE PLTGYLSSMR RGAAKSIHED VPCETPAERV FVESLERDDA VRLYAKLPGW FKIPTPLGSY SPDWAVLIAE GDGPRLYFVV ESKSGIADSD LHAHERRRIQ CGAAHFRALE AASVNPARYV RARCADDLPT APANARDAA
|
| |