Gene Information |
Locus tag | BTH_I1890 |
Symbol | |
ID | 3847073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2136274 |
End bp | 2138811 |
Gene Length | 2538 bp |
Protein Length | 845 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637841559 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_442420 |
Protein GI | 83718537 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein; [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGCGA GGTGCGGCGC GTTCGCGCTC GGCGTCGTCG CGCTGCAGCA GCAGGCGGCG TTGCCGGGCG CGGCGGCATG GGCGGGCGGC GCGCTCGCGT TCGGCTTGTG CGTGTGGCTC GCGCTCGCGT GGCGCGACGG CGTGCGTGCG CGCACGCGAT CGATCGGGTT CTGCGCATGC TGTTGCGCGG CCGCGCTCGC GGGCTTCGGC TACGCGGCGG CGCGCGCGCA GTGGCGGCTT GCCGATGCGC TGCCCGCGCA GTGGGAGGGG CGCGACATCG TCGTGACGGG CGCCGTGCGC GGCTTGCCGT CGCGCGACGC GAACGGCACG CGTTTCCTGT TCGACGTCGA TGAAAACGAC GCGCGCATCG CGCGGTTTCC GGCGACGCTG TCGCTTGCGT GGTACACGTT CGGCCGCTCG GCCGCGTCGC CGCCCGAGCT CGTGCCGGGC GACCGATGGC GGCTGCGCGT GCGTCTGAAA CGCCCGCACG GCAATGCGAA TTTCGGGGTG CGCGATGCGG AAGCGGCATG GCTCGCGCGC GGCATCCGGG CGCTCGGCTA CGTATCGGCG GCGCACGATG CGCAACGGCT CGCGGGGCGC GCGTCCGGCG TCGCGGCGAT GGTCGACCGG TTGCGTGCGC GCCTGCGCGG GCGCATCGCC GATGCGCTCG GCGACGCCGC GCATCGCGGG ATCGTCGTCG CGCTCGCGAT CGGCGCGCAG GACGACATCG TCGACGGCGA CCGGCGCATC CTGCGCGATA CCGGCACGAG CCACCTCGTC GCGATTTCCG GGCTGCACGT CGGGATGGTC GGCGGCCTGT GCGCGTGGCT CGCAGGCGGG TTCTGGCGGC GCTCGGGCTA CGTCGGGCGC AACTGGCCGC TTGTCGTGCC CGCGCAGAAG GTCGCGGCGC TCGGCGCGAT CGTCGGCGGC GCCGGCTATG CGGCGCTCGC GGGCTTCAAC GTGCCCGCGC AGCGCGCGTG GTGGATGCTC GCCGCCGCGG GCGTCGCATA TCTGAGCGGG CGCTCGCTCG CGCCGTCTTC GGTGCTGGCG GCGGCGCTCG GCTGCGTGCT GATCGTCGAT CCGTGGGCAG TGACGTCGCC GGGGTTCTGG CTGTCGTTCT GCGCGGTCGC TGCGATCCTG TTCGCGTCGT CGGGGCGAAG CGCCGCGCGC GAAGCGCGCG ACCTCGACGA GGCGCGCGGC TCGATCGACG GCGCATGTCG CGAACGCGCG TCGCCGCCGG CGTGTCCCGC GCGGTGGCGT GCCGCGTGTG CGCGGGCACG GATGAGGGCG AGGCGCGCGA TCGGGCGGCT CGTTCGGCGC GTGCGCGATG CCGCGCGGGC GCAGTTCGCG GTGACGATCG CGCTCGCGCC GCTCACCGCG CTGTGGTTCG CGCAGATTCC GCTCACCGGC CCGCTCGCAA ATGCGTTCGC GATTCCGTGG GTCGGCTCGC TCGTCACGCC GATCGTGCTC GCGGGCGTCG TGCTGCCCGC GCCGCTCGAC GCACCGGCGT ACGTGCTCGG CGAAGCGCTC GTCGCGGCGC TGATGCGGTT CCTCGAAGCG GCAGCCGGCG CGGGGCGCAC GGTCTGGATG CTGCCGGTGC CGGGCGGCTT CGCGCTCGCC ACGGCGGCGG CGGGTGTCGT GTGGGCGCTG ACGCCGCGCG GCTGGCCGCT GCGCGGTGCG GCGCCGCTCG CGTGGCTGCC GCTTGTCGTG CCGGCTCCGC TCGCCCTGCC CGACGGCACG TTCCGGTTGA CGGCGCTCGA CGTCGGGCAG GGCTCGGCGG TGTTGATCGA AACCGCGCGG 
CATGCGCTGC TGTTCGACGC GGGCCCGGGG CCGGAGGCGT CGAATGCGGG CGAGCGGGTC GTCGTGCCGT TCTTGCGCGC GCGGGGCGTG CGCATGCTCG ACACGCTCGT CGTGAGCCAC GCGGATTCGG ACCACGCGGG CGGCGCGCCC GCCGTCCTCG AAGCGATCAC GGTCGCGCAG GTGACGGGCG GGCTGCCGCC GTCGAACCGC CTGTGGCGCG TCGCGCACGC AGCCGGCGTG GCCGACGCGC TGCCGTGCGC GGCGGGGCAG CGCTGGCGCT GGGACGGCGT CGAGTTCGCG ACGCTCTGGC CCGCCGGCGG GCCGCGCGCG GGCGGCGCGA CGAACGCTCA GTCGTGCGTG CTGCGCGTGT CGGCGGGCGA GCGTGCGGCG CTCCTGACGG GCGATGTCGA CGCACGCTCC GAGCGCGCGC TCGTCGCCGG GGCGCGCCGA GCGCTCGCGG CGCAGGTGCT CGTCGTCCCG CACCATGGCA GCCGCACGTC GTCGACCGAG CCTTTCCTCG ATTCGGTCGA GCCGCGCATT GCAATATTTC AGGTAGGCTA CGCCAACAGG TTTCACCATC CGCATCCGAC CGTCTGGGCG CGCTATGCCG GGCGCGGCAT CGAGTTGCCG CGTACCGACC GCGACGGCGC CGTGCGCGTC GACATGACGT CGAGCGGCGC GCTCCCCGCG CCGGTACGGT ATCGGGACGC GCACCGGCGC TACTGGATGG ACCGTTGA
|
Protein sequence | MRARCGAFAL GVVALQQQAA LPGAAAWAGG ALAFGLCVWL ALAWRDGVRA RTRSIGFCAC CCAAALAGFG YAAARAQWRL ADALPAQWEG RDIVVTGAVR GLPSRDANGT RFLFDVDEND ARIARFPATL SLAWYTFGRS AASPPELVPG DRWRLRVRLK RPHGNANFGV RDAEAAWLAR GIRALGYVSA AHDAQRLAGR ASGVAAMVDR LRARLRGRIA DALGDAAHRG IVVALAIGAQ DDIVDGDRRI LRDTGTSHLV AISGLHVGMV GGLCAWLAGG FWRRSGYVGR NWPLVVPAQK VAALGAIVGG AGYAALAGFN VPAQRAWWML AAAGVAYLSG RSLAPSSVLA AALGCVLIVD PWAVTSPGFW LSFCAVAAIL FASSGRSAAR EARDLDEARG SIDGACRERA SPPACPARWR AACARARMRA RRAIGRLVRR VRDAARAQFA VTIALAPLTA LWFAQIPLTG PLANAFAIPW VGSLVTPIVL AGVVLPAPLD APAYVLGEAL VAALMRFLEA AAGAGRTVWM LPVPGGFALA TAAAGVVWAL TPRGWPLRGA APLAWLPLVV PAPLALPDGT FRLTALDVGQ GSAVLIETAR HALLFDAGPG PEASNAGERV VVPFLRARGV RMLDTLVVSH ADSDHAGGAP AVLEAITVAQ VTGGLPPSNR LWRVAHAAGV ADALPCAAGQ RWRWDGVEFA TLWPAGGPRA GGATNAQSCV LRVSAGERAA LLTGDVDARS ERALVAGARR ALAAQVLVVP HHGSRTSSTE PFLDSVEPRI AIFQVGYANR FHHPHPTVWA RYAGRGIELP RTDRDGAVRV DMTSSGALPA PVRYRDAHRR YWMDR
|
| |
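The fields above are related by simple arithmetic: gene length follows from the inclusive Start bp and End bp coordinates, and protein length is the codon count minus the stop codon. The Python sketch below checks those relationships and translates a whitespace-grouped sequence. It is an illustration only, not the method the database uses. The helper names (`clean`, `gene_length`, `gc_fraction`, `translate`) are hypothetical, and the codon table assumes that translation table 11 assigns amino acids exactly as the standard genetic code does (the two differ in permitted start codons, which this sketch ignores).

```python
from itertools import product

# Codon table in NCBI's TCAG order. Assumption: translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n_bp = gene_length(2136274, 2138811)
    print(n_bp)           # 2538, matches the Gene Length field
    print(n_bp // 3 - 1)  # 845, matches the Protein Length field
    # First 30 bases of the Gene sequence field
    print(translate("ATGCGGGCGA GGTGCGGCGC GTTCGCGCTC"))  # MRARCGAFAL
```

Passing the complete Gene sequence field to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be cross-checked in the same way.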