Gene BTH_I2740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I2740 
Symbol 
ID3848904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp3147581 
End bp3150736 
Gene Length3156 bp 
Protein Length1051 aa 
Translation table11 
GC content59% 
IMG OID637842408 
Producttype I restriction-modification system endonuclease 
Protein accessionYP_443254 
Protein GI83721466 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATTTC TGTCGGAGGC CGCTGTAGAG CAAGCGCTAC TTGATCAGCT GCGCGCGCTC 
GACTACAGCA TCGAACGTGA GGAGGACATC GGCCCCGATG GACACCGCCC GGAACGCGAG
AGTCACGATG GGGTTGTGCT CAAGAAGCGA TTCGAGGACG CTGTTGCTCG TCTCAACCCC
GGTCTCCCAT TGGAGGCCCG GCAAGACGCC GCGCGCCGTG TGATGCAGTC CGAATTGCCA
TCGCTGCTCG AGGAAAACCG CCGCATCCAC AAGCTGATGA TCGAAGGGGT GGACGTTGAG
TACTACGCCG ACGACGGCAC TCTGACGGCG GGTAAGGTCG CGCTCATCAA CTTCGAACAA
CCGGAGCAGA ACGACTGGCT GGCCGTGAGC CAATTTGTAG TCGTTGCTGG GCATTACAAC
CGACGTCCCG ATGTTGTCGT GTTCGTGAAC GGGCTACCGC TCGGCGTGGT CGAGTTAAAG
GCACCTGGAA GTGGCAATGC GACCTTGGTC GGGGCTTTCA ACCAATTGCA GACCTACAAG
AAGCAGATTC CGGCGCTCTT CAATACCAAC GCACTGCTGG TGACGTCGGA TGGCATTACG
GCACGAGTGG GGTCGCTGTC GGCAGACCTC GAGCGCTTCA TGCCATGGCG CACCACCGAC
GGTACGGACG TCGCACCAAA AGGTGCGCCG GAGCTCTCGA CGCTGATCGA GGGCGTGTTC
GAGCATCGCC GCCTCCTGGA TCTGCTCTGT CATTTCACGG CTTTCGGTGA AACCGGTTCT
GGGCTGGCTA AGGTCATTGC GGGCTATCAC CAGTTCCACG CGGTGCGACA TGCGGTCAAC
AGTACGGTGG CGGCCTCCTC TCCGGAGGGC AACAAACGAG TCGGCGTGAT CTGGCACACG
CAAGGTTCCG GAAAAAGCCT GCTGATGGCG TTTTACGCCG GGCAACTCGT CAAACACCCG
GCAATGGCCA ACCCGACGCT AGTGGTGCTG ACCGACCGGA ACGATCTTGA TGACCAGCTG
TTCTCAACGT TTTCGATGTG CCGCGACTTG ATCCGGCAGA CGCCGGTGCA GGCCGAAAGT
CGCGAAGATC TGCAGAAGGT CTTGAGCCGA GCGTCGGGCG GCGTGATTTT TACTACCTTG
CAGAAGTTTG GCGAGCTCGC TGAGCCGCTC ACCACACGAC GTAACGTGGT CGTCATTGCG
GATGAAGCGC ACCGCAGTCA ATACGGCTTC AAGGCCAAGG TGGACACAAA GACCGGCGAA
ATTTCCTATG GCTTCGCCAA GTACATGCGG GACGCTCTGC CGAACGCATC CTTCATCGGC
TTTACAGGCA CGCCCATAGA GGCGGACGAC GTTAACACCC CGGCGGTATT CGGCAACTAC
ATCGATGTTT ACGACATCAG TCGCGCGGTC GAAGACGGTG CGACTGTGCC GATCTACTAC
GAGTCTCGGC TGGCACGCAT TGAACTCGAT GAGGACGAGA AGCCAAAGAT CGACGCCGAG
GTTGAAGATT TGACCGAGGA AGATTCCGAG GCAGACCAAG AGCGTTTTAA GAAGAAGTGG
TCAACGGTCG AAACCCTGGT CGGCAGCGAC AAGCGTCTGG CGCTGGTGGC CAAGGACATG
GTCGCCCACT TCGACGATCG TGTGGCTGCC CTCGATGGCA AAGCGATGGT GGTGTGTATG
AGTCGCCGCA TTTGCGTAAA GTTGTACGAC GAGATTGTCA AGCTGCGCCC GGACTGGCAC
AGCGCCGATG ACAACGCTGG TGCGATCAAG ATCGTGATGA CGGGCGCGGC GAGCGATCCG
CAAGAGTGGC AGCAGCACAT TGGCAACAAG GCACGGCGCG ACCAGCTGGC CAAGCGTGCC
CGTGATCCGA AAGACCCGCT CAAACTGGTG ATCGTGCGGG ATATGTGGCT GACCGGTTTT
GACGCACCGT GTATGCACAC GATGTACGTG GACAAGCCGA TGCATGGGCA CGGCTTGATG
CAGGCGATAG CACGCGTGAA CCGGGTGTTC CGCGACAAGC CAGCCGGACT GATCGTGGAC
TACATCGGCA TTGCGCAAAA CCTGAAGTCG GCCCTGCAGC AGTACTCGAA GAACGATCAG
GACAACACCG GCGTCGACGA GGCGCAAGCC ATCGCGGTGA TGATGGAGAA GTACGAGGTC
GTCCGGGACA TGTATCACGG CTTCGACTAC GCCTCGGCGC TGACCGGGAC ACCACAGGAG
CGTCTGGCGA TGATGGCGGG AGCGATCGAG TGGCTTCTCG ACTTGCAGCA GAGGCTGGCA
GCGAAGGAGA AAACAGAGGG TGGCAAAAAG AACGCTCATC GCCGCTATCA AGATGCAGTG
CTCGCGCTGT CCAAGGCGTA CTCACTGGCA TCGGCATCCG ACGAGGCCCG CGAAATTCGG
GAGGAAGTTG GCTTCTTCCA GGCTATTCGA GCCGCGTTGG TCAAGAGCGC GACGGGCTCA
GGTGTCACCG AGCAAGAGCG CGAGTTGGCC ATCCAACAGA TCGTGAGCCG CGCAGTGGTC
TCGACCGAGA TCGTCGACAT CCTGGCCGCA GCGGGAATCA AGAGCCCGGA CATCTCCATC
CTGTCCGACG AGTTTCTCGC GGAGGTTCAG CGGATGGAGC GAAAGAATCT CGCTCTGGAG
GCGTTGCGCA AGTTGATCAA TGACGGCATC CGCTCGCGGA GCAAAGCTAA CGTCGTACAG
ACCAGGGCAT TCTCAGAGCG CTTGGAAGAC GCAGTGGCGC GCTATCATGC CAATGCGATC
ACGACGGCCG AAGTGCTGCA GGAGCTGATC AAATTGGCCA AGGACATCCG GGCTGCCCGT
AAGCGGGGCG AGGAGTCAGG GCTATCTGAC GAAGAGATCG CTTTCTATGA TGCTCTGGCC
GAGAATGACA GTGCAGTGCA GATGATGGGC GACGACAAGC TTCGGTTGAT CGCTCACGAG
CTCTTGGTGA GCCTTAGAGA AAACGTGTCA GTGGATTGGG CTCATCGCGA GTCAGCGCGA
GCGCGGATGC GGGTGTTGGT GAAGCGGATT TTGCGAAAAT ATGGCTATCC TCCTGACTTG
CAGTATTCCG CCGTGCAGAC GGTACTACAA CAGGCCGAGG CGCTGTCGTC AGGGTGGGTG
TTTTCGCGTG GCGGGGCGAC ATCACATGGT GACTGA
 
Protein sequence
MAFLSEAAVE QALLDQLRAL DYSIEREEDI GPDGHRPERE SHDGVVLKKR FEDAVARLNP 
GLPLEARQDA ARRVMQSELP SLLEENRRIH KLMIEGVDVE YYADDGTLTA GKVALINFEQ
PEQNDWLAVS QFVVVAGHYN RRPDVVVFVN GLPLGVVELK APGSGNATLV GAFNQLQTYK
KQIPALFNTN ALLVTSDGIT ARVGSLSADL ERFMPWRTTD GTDVAPKGAP ELSTLIEGVF
EHRRLLDLLC HFTAFGETGS GLAKVIAGYH QFHAVRHAVN STVAASSPEG NKRVGVIWHT
QGSGKSLLMA FYAGQLVKHP AMANPTLVVL TDRNDLDDQL FSTFSMCRDL IRQTPVQAES
REDLQKVLSR ASGGVIFTTL QKFGELAEPL TTRRNVVVIA DEAHRSQYGF KAKVDTKTGE
ISYGFAKYMR DALPNASFIG FTGTPIEADD VNTPAVFGNY IDVYDISRAV EDGATVPIYY
ESRLARIELD EDEKPKIDAE VEDLTEEDSE ADQERFKKKW STVETLVGSD KRLALVAKDM
VAHFDDRVAA LDGKAMVVCM SRRICVKLYD EIVKLRPDWH SADDNAGAIK IVMTGAASDP
QEWQQHIGNK ARRDQLAKRA RDPKDPLKLV IVRDMWLTGF DAPCMHTMYV DKPMHGHGLM
QAIARVNRVF RDKPAGLIVD YIGIAQNLKS ALQQYSKNDQ DNTGVDEAQA IAVMMEKYEV
VRDMYHGFDY ASALTGTPQE RLAMMAGAIE WLLDLQQRLA AKEKTEGGKK NAHRRYQDAV
LALSKAYSLA SASDEAREIR EEVGFFQAIR AALVKSATGS GVTEQERELA IQQIVSRAVV
STEIVDILAA AGIKSPDISI LSDEFLAEVQ RMERKNLALE ALRKLINDGI RSRSKANVVQ
TRAFSERLED AVARYHANAI TTAEVLQELI KLAKDIRAAR KRGEESGLSD EEIAFYDALA
ENDSAVQMMG DDKLRLIAHE LLVSLRENVS VDWAHRESAR ARMRVLVKRI LRKYGYPPDL
QYSAVQTVLQ QAEALSSGWV FSRGGATSHG D