Gene BTH_I2742 details


Gene Information

Locus tag: BTH_I2742
Symbol:
ID: 3848469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007651
Strand:
Start bp: 3151479
End bp: 3152675
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 55%
IMG OID: 637842410
Product: type I restriction-modification system specificity determinant
Protein accession: YP_443256
Protein GI: 83721596
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
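
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3151479, 3152675)
print(length_bp, expected_protein_length(length_bp))
```

Under these assumptions the record is internally consistent: the coordinates give 1197 bp, and 1197 bp corresponds to 398 residues plus a stop codon.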


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.971087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGCTT TGCCTGTTCA CGCAAGGTCC GTTGAGCGAA TTGAAACTCG TGAATTCACT 
GGATCAGGCA CGCGTTTTCA GAACGGCGAC ACCTTGATCG CGCGTATAAC TCCATGCCTT
GAGAATGGCA AGACGGCTTA TATCTCCGAG CTTCCGGAAG GTGTCGTGGC TCACGGGTCT
ACGGAATATA TCGTGCTCAG TGGAAAGGTA AATCAGAGTG ACAGCTTGTT TGGCTATTAC
CTCGTCCGAT CTCCCGATTT TCGACGTCAT GCGATCGGTC ACATGGAAGG TACGTCGGGA
AGGCAGCGTG TCCCTTCATC CGCAGTAGAG AGATACTCCA CCCGTTTGCC CCCGCTTGCT
GAACAGCGCG CCATTGCCAA GATCCTTGGC AGCTTGGACG ACAAGATTGA ACTCAACCGC
GAGAGGAGTG AGACTCTGGA GGCAATGGGC CGCGCCTTGT TCAAGGACTG GTTCGTCGAT
TTTGGTCCCG TGCGCGCGAA GCAGGAAGGC CGTAGTCCTT ATCTGCCGCG CGAAATTTGG
GACTTGTTCC CAGAACGGCT GGACACCAAC GAATTGCCGG AAGGCTGGAA GCTTTTGAAG
GCGAGCGAAC TCATTGAGTT TAATCCTACC GAGTCCTTGC GTAAGGGCGA AGTCGCGCCT
TACCTCGACA TGGCTTCGCT CCCAACTCAA GGAAGCTGGC CTGATCCCTA TGTCATGCGC
CCTTTCGGGA GTGGCATGCG CTTCCGCAAT GGCGACACGT TGTTGGCGCG AATTACACCT
TGTCTGGAGA ACGGAAAAAC AGCATTTATT CAATGTCTTC CCGATGACGT CGTCGGTTGG
GGATCGACGG AATACATTGT GATGCGGCCA AAGGGGCCTG TGCCTGCGGC GTTTGCTTAC
TTGTTAGCAA GGAATGATGC CTTCCGAGAA CATGCTATCC GGAGCATGAC TGGTACGTCC
GGACGCCAGC GCGCTCAGGG CGACGCGGTT GCCGCCTACC AGCTTGCTGC CCCGTTGTGG
GACGACAAAT TGTGGGCCGT GCTTGCGAGC ATTGTTTCGT TGTTGTTCGA TGGAATCAGA
TCCAATTCCG AGACGTCGGT AAATCTTGCA AAAATGCGCG ATAACTTGCT TCCCATGTTG
ATCGCCGGCG CGCTTCGGGT GAAGAACGCC GAGCGAATCC TTGGAGCCGC GACGTGA
 
Protein sequence
MDALPVHARS VERIETREFT GSGTRFQNGD TLIARITPCL ENGKTAYISE LPEGVVAHGS 
TEYIVLSGKV NQSDSLFGYY LVRSPDFRRH AIGHMEGTSG RQRVPSSAVE RYSTRLPPLA
EQRAIAKILG SLDDKIELNR ERSETLEAMG RALFKDWFVD FGPVRAKQEG RSPYLPREIW
DLFPERLDTN ELPEGWKLLK ASELIEFNPT ESLRKGEVAP YLDMASLPTQ GSWPDPYVMR
PFGSGMRFRN GDTLLARITP CLENGKTAFI QCLPDDVVGW GSTEYIVMRP KGPVPAAFAY
LLARNDAFRE HAIRSMTGTS GRQRAQGDAV AAYQLAAPLW DDKLWAVLAS IVSLLFDGIR
SNSETSVNLA KMRDNLLPML IAGALRVKNA ERILGAAT
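
The gene and protein sequences are printed in space-separated blocks of ten characters. As a rough sketch of how they relate, the code below joins the blocks, computes a GC fraction, and translates codons up to the first stop. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, and this gene begins with ATG). All function names are hypothetical helpers, not part of any existing tool.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, order T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between ten-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


first_blocks = join_blocks("ATGGACGCTT TGCCTGTTCA CGCAAGGTCC")
print(translate(first_blocks))
```

Applied to the first three blocks of the gene sequence, this yields MDALPVHARS, which matches the first block of the protein sequence above.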