Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2693 |
Symbol | |
ID | 3847017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 3081345 |
End bp | 3084173 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637842362 |
Product | Rhs element Vgr protein, putative |
Protein accession | YP_443208 |
Protein GI | 83718482 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATCC CAACTGACGT TCTGCAAGCA CTCTTCGGCG GCTGGTCGCA ACACGACCGC TTTCTCTGGA TCACGACGCC GCTCGGCGCG AACGCGCTCG TCGCGGAAAG CCTGCATGGC TGGGAGGCAC TCGATCAGGG GGGCTTTCGT TTTCAGCTCA CCGCGCTCGC CGAGAATCCG TCGCTGCCGC TCGCGCAGTT GATCGGCGCG CCGATCCTGA TCGAATGGCA GGCGCAGGAA GGGCGCGACG CGCGCCGGCC GTTCCACGGC CACGTGATCG CCGCCGAGCT CGTCGGCTAC AACGGCGGCC TCGCACGCGT GCGGCTCGTC GTCGAGCCGT GGCTCACGCT GCTGCGGCAG CGCGTCGACA GCTACAACTA TCTGAACGCG AGCGTCGTCG AGATCAGCGA ACAGGTGTTC CGCCGTTACG CGCGCGGCGC GATCGCGCCC GCATGGCGAT GGGCGCTCGC GGATGCGGCG AAGTACCCGA AGCGCAGCCT GACCGCGCAA GCCGGCGAAT CCGATTTCGA CTTCCTCGAG CGCCTCTGGG CGGAGGAAGG CATCTTCTAC TGGTTCGAGC ACGAAGGCGA CGCGCGTGCG TCGAGCCTCG GCAAGCACAC GCTCGTGCTC GCCGATTCGA ATCAGCGCTT CGCGCCCGAC GAACCCGAGC TCGTCGGCTT CCATCAGACG AGCGACGACG ATCCGCAAGG CTGCATCCAG CACTTCATGC ACGCGCGGCG CTGGCGGATC GGCAGCGTCG CGCGCGCGAG CTGGGATCAC CGCAGCCTGT CGACGCGGCC GACGGGCGCG CGCGCGAACG GCGCGGTCGC GCCGGGCGAG GACCGCGACG TCGCCGGCCC GTACGCATAC CAGACTGGCG CGATCGGAGA CCGGCGCGCG CAGCAGCAAC TCGATGCGCA GCGCGTGGCC GCGCTGCAGA GCGAAGGCCG CGGCACGCGC CGCGATCTGC GGCCTGGGCT GCGTTTTGCG ATTGCGCATC ATCCGACGCT CGGCGCATCG GATGCGTTCA TCTGTCTGCG CGTCGAGCAT TCGGCGCGCG CGAACGTCGA TGCGACCGTG CGCAGCGCGA TCGAACAGCG CCTCGGCGCG ATTGCGTCGA TCGCCGATGC CGCGCCGGCG CCGTATGGTC CCGCGAGCGC GCTGAATGCG GCGCTCGGCG CCGACACGCA TCACGGCGGA TCGCTGATGC AGGACGACGC TGTCTATCGC AATCGCTTCG TCGCGTTGCC CGCCGAGCAG GCGTATCGGC CGCTCGCCGC GTCCGGCCAT GGCGCGCGCA TGCACCCGGT CGCCGTGATG CCGGGGGCGC AGACGGCGAT CGCCGTGGGC GCGGGCGATC CCGTTCACAC CGACCGGGAT CACCGGATTC GGATTCAGCA TCACGCGCAG CGCGGCCGGA ATGCCGCGAG CCGCGAAGAT CATCCGCATG CGGCGAATGC GCCGGCGGAC CGAGGCGCCG GCACGTGGAC GCGCATGCTG ACGCCCGTGG GCGGCGACAA CTGGGGCGGC GTGAGCGTGC CGCGCGTCGG GCAGGAAGTG TGGACTGAAT GGCTCGAAGG CCAGCCCGAC CGGCCGGTTG CGGTGGCCGC GCTCTACAAC GGCCGGGGCA ATGCCGACGC GCAGCACAAC GCGCAGGCGG GAGGCCCGAG CGGCAGTACC GGCAATGCGG CCGCATGGTT TGCCGGCAAT ACGCACGCGG CGGTGCTCAC GGGCTTCAAG ACGCAGGACA TGAGCATGAG CCAGCAGGGC ACGGGCGGCT ATCGGCAATT CATGCTCGAC GATACGGCCG GCCAGTCGAG CGCGCGTCTG TACACGACGG ACCGCAACAG CGGGCTCACG CTCGGGCACA TGAAGCAGAC GCAGGACAAC CAGCGCCAGG CCGATCGCGG CTATGGCGCG GAACTGGCGA CGGATGGCGC CGGCGCACTG CGCGGCGGCG CGGGGCTCTT GATCAGCACG GCGCCCGGCG TGAGTCAGAT GGATGCGAGC GCGCCGAGCC AGGTGCTCGC GCAGCATCGC CAGACGTTGC AAAGCCTCGC GGAGCTCGCG CAAAAGCAAG GCGCGGAGCC GGGCGGCGCG GTGCCTGAGG CGGCGAGCGG AGCGGGCGCG ACGTCGACGG GGGCGGCGGC GGGCAAGCCT TTGCCTGCAG TCGACGGCAT CGAGCAGAGC CGCGAAGCGA TCGGCGCGAC GCGAGAAGGA GGGGGCGGCG ACACGGCGGG CGGCGGGAGC GGCAGCGCGG TCGCGTGGAG CAAGCCGCAT CTCGTCGCGC ACGGCGAGGC GGGGCTGGCC GCGATGTCGG CGAAGAGCCA CGTGTGGGTG TCGGGTACCG AGACGGTGCT GAGCGCCGGA CAGGACGTGC AATTGACGGC GAAGGGCAAG ACGAGCGTCG TCGCGAATCA CGGCGTCTCG CTGTATACGC AGGGCGCGGC GGGCGATGGG CGGCCGGTTG CCGGCCAGGG CATCGCGTTG CACGCGGCGT CCGGTTCGGT GAGCGTGCAG GCGCAGAACG CCGGCAAGCT GAGCGCGAGC GCGCAGAAGG CGGTGACGCT CGCGAGCGCG CAGGGCAGCG CGTCCGTGCA GGCGCAACAG CGCGTCCTGC TGAGCGCGGC GAAGGCGTAT CTGAAGATGG AAGGCAACGA CATCGTCGTC GGCGCGCCGG GGCGCGCGGA TTTCAAGGCG GCGGCGCATC AGTTGACGGG GCCGAAGAGC GCGGGGGCGC AGAACGCGTT GGGCAAGGGT GCTTCGAAGG ATTGTCCGCA GACGATGGGC GACATGATTT CGTCCAGCGC CGCATTCGCC GATCTATAA
|
Protein sequence | MSIPTDVLQA LFGGWSQHDR FLWITTPLGA NALVAESLHG WEALDQGGFR FQLTALAENP SLPLAQLIGA PILIEWQAQE GRDARRPFHG HVIAAELVGY NGGLARVRLV VEPWLTLLRQ RVDSYNYLNA SVVEISEQVF RRYARGAIAP AWRWALADAA KYPKRSLTAQ AGESDFDFLE RLWAEEGIFY WFEHEGDARA SSLGKHTLVL ADSNQRFAPD EPELVGFHQT SDDDPQGCIQ HFMHARRWRI GSVARASWDH RSLSTRPTGA RANGAVAPGE DRDVAGPYAY QTGAIGDRRA QQQLDAQRVA ALQSEGRGTR RDLRPGLRFA IAHHPTLGAS DAFICLRVEH SARANVDATV RSAIEQRLGA IASIADAAPA PYGPASALNA ALGADTHHGG SLMQDDAVYR NRFVALPAEQ AYRPLAASGH GARMHPVAVM PGAQTAIAVG AGDPVHTDRD HRIRIQHHAQ RGRNAASRED HPHAANAPAD RGAGTWTRML TPVGGDNWGG VSVPRVGQEV WTEWLEGQPD RPVAVAALYN GRGNADAQHN AQAGGPSGST GNAAAWFAGN THAAVLTGFK TQDMSMSQQG TGGYRQFMLD DTAGQSSARL YTTDRNSGLT LGHMKQTQDN QRQADRGYGA ELATDGAGAL RGGAGLLIST APGVSQMDAS APSQVLAQHR QTLQSLAELA QKQGAEPGGA VPEAASGAGA TSTGAAAGKP LPAVDGIEQS REAIGATREG GGGDTAGGGS GSAVAWSKPH LVAHGEAGLA AMSAKSHVWV SGTETVLSAG QDVQLTAKGK TSVVANHGVS LYTQGAAGDG RPVAGQGIAL HAASGSVSVQ AQNAGKLSAS AQKAVTLASA QGSASVQAQQ RVLLSAAKAY LKMEGNDIVV GAPGRADFKA AAHQLTGPKS AGAQNALGKG ASKDCPQTMG DMISSSAAFA DL
|
| |