Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2740 |
Symbol | |
ID | 3848904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 3147581 |
End bp | 3150736 |
Gene Length | 3156 bp |
Protein Length | 1051 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637842408 |
Product | type I restriction-modification system endonuclease |
Protein accession | YP_443254 |
Protein GI | 83721466 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTTC TGTCGGAGGC CGCTGTAGAG CAAGCGCTAC TTGATCAGCT GCGCGCGCTC GACTACAGCA TCGAACGTGA GGAGGACATC GGCCCCGATG GACACCGCCC GGAACGCGAG AGTCACGATG GGGTTGTGCT CAAGAAGCGA TTCGAGGACG CTGTTGCTCG TCTCAACCCC GGTCTCCCAT TGGAGGCCCG GCAAGACGCC GCGCGCCGTG TGATGCAGTC CGAATTGCCA TCGCTGCTCG AGGAAAACCG CCGCATCCAC AAGCTGATGA TCGAAGGGGT GGACGTTGAG TACTACGCCG ACGACGGCAC TCTGACGGCG GGTAAGGTCG CGCTCATCAA CTTCGAACAA CCGGAGCAGA ACGACTGGCT GGCCGTGAGC CAATTTGTAG TCGTTGCTGG GCATTACAAC CGACGTCCCG ATGTTGTCGT GTTCGTGAAC GGGCTACCGC TCGGCGTGGT CGAGTTAAAG GCACCTGGAA GTGGCAATGC GACCTTGGTC GGGGCTTTCA ACCAATTGCA GACCTACAAG AAGCAGATTC CGGCGCTCTT CAATACCAAC GCACTGCTGG TGACGTCGGA TGGCATTACG GCACGAGTGG GGTCGCTGTC GGCAGACCTC GAGCGCTTCA TGCCATGGCG CACCACCGAC GGTACGGACG TCGCACCAAA AGGTGCGCCG GAGCTCTCGA CGCTGATCGA GGGCGTGTTC GAGCATCGCC GCCTCCTGGA TCTGCTCTGT CATTTCACGG CTTTCGGTGA AACCGGTTCT GGGCTGGCTA AGGTCATTGC GGGCTATCAC CAGTTCCACG CGGTGCGACA TGCGGTCAAC AGTACGGTGG CGGCCTCCTC TCCGGAGGGC AACAAACGAG TCGGCGTGAT CTGGCACACG CAAGGTTCCG GAAAAAGCCT GCTGATGGCG TTTTACGCCG GGCAACTCGT CAAACACCCG GCAATGGCCA ACCCGACGCT AGTGGTGCTG ACCGACCGGA ACGATCTTGA TGACCAGCTG TTCTCAACGT TTTCGATGTG CCGCGACTTG ATCCGGCAGA CGCCGGTGCA GGCCGAAAGT CGCGAAGATC TGCAGAAGGT CTTGAGCCGA GCGTCGGGCG GCGTGATTTT TACTACCTTG CAGAAGTTTG GCGAGCTCGC TGAGCCGCTC ACCACACGAC GTAACGTGGT CGTCATTGCG GATGAAGCGC ACCGCAGTCA ATACGGCTTC AAGGCCAAGG TGGACACAAA GACCGGCGAA ATTTCCTATG GCTTCGCCAA GTACATGCGG GACGCTCTGC CGAACGCATC CTTCATCGGC TTTACAGGCA CGCCCATAGA GGCGGACGAC GTTAACACCC CGGCGGTATT CGGCAACTAC ATCGATGTTT ACGACATCAG TCGCGCGGTC GAAGACGGTG CGACTGTGCC GATCTACTAC GAGTCTCGGC TGGCACGCAT TGAACTCGAT GAGGACGAGA AGCCAAAGAT CGACGCCGAG GTTGAAGATT TGACCGAGGA AGATTCCGAG GCAGACCAAG AGCGTTTTAA GAAGAAGTGG TCAACGGTCG AAACCCTGGT CGGCAGCGAC AAGCGTCTGG CGCTGGTGGC CAAGGACATG GTCGCCCACT TCGACGATCG TGTGGCTGCC CTCGATGGCA AAGCGATGGT GGTGTGTATG AGTCGCCGCA TTTGCGTAAA GTTGTACGAC GAGATTGTCA AGCTGCGCCC GGACTGGCAC AGCGCCGATG ACAACGCTGG TGCGATCAAG ATCGTGATGA CGGGCGCGGC GAGCGATCCG CAAGAGTGGC AGCAGCACAT TGGCAACAAG GCACGGCGCG ACCAGCTGGC CAAGCGTGCC CGTGATCCGA AAGACCCGCT CAAACTGGTG ATCGTGCGGG ATATGTGGCT GACCGGTTTT GACGCACCGT GTATGCACAC GATGTACGTG GACAAGCCGA TGCATGGGCA CGGCTTGATG CAGGCGATAG CACGCGTGAA CCGGGTGTTC CGCGACAAGC CAGCCGGACT GATCGTGGAC TACATCGGCA TTGCGCAAAA CCTGAAGTCG GCCCTGCAGC AGTACTCGAA GAACGATCAG GACAACACCG GCGTCGACGA GGCGCAAGCC ATCGCGGTGA TGATGGAGAA GTACGAGGTC GTCCGGGACA TGTATCACGG CTTCGACTAC GCCTCGGCGC TGACCGGGAC ACCACAGGAG CGTCTGGCGA TGATGGCGGG AGCGATCGAG TGGCTTCTCG ACTTGCAGCA GAGGCTGGCA GCGAAGGAGA AAACAGAGGG TGGCAAAAAG AACGCTCATC GCCGCTATCA AGATGCAGTG CTCGCGCTGT CCAAGGCGTA CTCACTGGCA TCGGCATCCG ACGAGGCCCG CGAAATTCGG GAGGAAGTTG GCTTCTTCCA GGCTATTCGA GCCGCGTTGG TCAAGAGCGC GACGGGCTCA GGTGTCACCG AGCAAGAGCG CGAGTTGGCC ATCCAACAGA TCGTGAGCCG CGCAGTGGTC TCGACCGAGA TCGTCGACAT CCTGGCCGCA GCGGGAATCA AGAGCCCGGA CATCTCCATC CTGTCCGACG AGTTTCTCGC GGAGGTTCAG CGGATGGAGC GAAAGAATCT CGCTCTGGAG GCGTTGCGCA AGTTGATCAA TGACGGCATC CGCTCGCGGA GCAAAGCTAA CGTCGTACAG ACCAGGGCAT TCTCAGAGCG CTTGGAAGAC GCAGTGGCGC GCTATCATGC CAATGCGATC ACGACGGCCG AAGTGCTGCA GGAGCTGATC AAATTGGCCA AGGACATCCG GGCTGCCCGT AAGCGGGGCG AGGAGTCAGG GCTATCTGAC GAAGAGATCG CTTTCTATGA TGCTCTGGCC GAGAATGACA GTGCAGTGCA GATGATGGGC GACGACAAGC TTCGGTTGAT CGCTCACGAG CTCTTGGTGA GCCTTAGAGA AAACGTGTCA GTGGATTGGG CTCATCGCGA GTCAGCGCGA GCGCGGATGC GGGTGTTGGT GAAGCGGATT TTGCGAAAAT ATGGCTATCC TCCTGACTTG CAGTATTCCG CCGTGCAGAC GGTACTACAA CAGGCCGAGG CGCTGTCGTC AGGGTGGGTG TTTTCGCGTG GCGGGGCGAC ATCACATGGT GACTGA
|
Protein sequence | MAFLSEAAVE QALLDQLRAL DYSIEREEDI GPDGHRPERE SHDGVVLKKR FEDAVARLNP GLPLEARQDA ARRVMQSELP SLLEENRRIH KLMIEGVDVE YYADDGTLTA GKVALINFEQ PEQNDWLAVS QFVVVAGHYN RRPDVVVFVN GLPLGVVELK APGSGNATLV GAFNQLQTYK KQIPALFNTN ALLVTSDGIT ARVGSLSADL ERFMPWRTTD GTDVAPKGAP ELSTLIEGVF EHRRLLDLLC HFTAFGETGS GLAKVIAGYH QFHAVRHAVN STVAASSPEG NKRVGVIWHT QGSGKSLLMA FYAGQLVKHP AMANPTLVVL TDRNDLDDQL FSTFSMCRDL IRQTPVQAES REDLQKVLSR ASGGVIFTTL QKFGELAEPL TTRRNVVVIA DEAHRSQYGF KAKVDTKTGE ISYGFAKYMR DALPNASFIG FTGTPIEADD VNTPAVFGNY IDVYDISRAV EDGATVPIYY ESRLARIELD EDEKPKIDAE VEDLTEEDSE ADQERFKKKW STVETLVGSD KRLALVAKDM VAHFDDRVAA LDGKAMVVCM SRRICVKLYD EIVKLRPDWH SADDNAGAIK IVMTGAASDP QEWQQHIGNK ARRDQLAKRA RDPKDPLKLV IVRDMWLTGF DAPCMHTMYV DKPMHGHGLM QAIARVNRVF RDKPAGLIVD YIGIAQNLKS ALQQYSKNDQ DNTGVDEAQA IAVMMEKYEV VRDMYHGFDY ASALTGTPQE RLAMMAGAIE WLLDLQQRLA AKEKTEGGKK NAHRRYQDAV LALSKAYSLA SASDEAREIR EEVGFFQAIR AALVKSATGS GVTEQERELA IQQIVSRAVV STEIVDILAA AGIKSPDISI LSDEFLAEVQ RMERKNLALE ALRKLINDGI RSRSKANVVQ TRAFSERLED AVARYHANAI TTAEVLQELI KLAKDIRAAR KRGEESGLSD EEIAFYDALA ENDSAVQMMG DDKLRLIAHE LLVSLRENVS VDWAHRESAR ARMRVLVKRI LRKYGYPPDL QYSAVQTVLQ QAEALSSGWV FSRGGATSHG D
|
| |