Gene Information |
Locus tag | BTH_I2097 |
Symbol | |
ID | 3848970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2374892 |
End bp | 2376154 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637841766 |
Product | twin-arginine translocation pathway signal sequence domain-containing protein |
Protein accession | YP_442621 |
Protein GI | 83720073 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
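The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for BTH_I2097.
length = gene_length_bp(2374892, 2376154)
print(length, protein_length_aa(length))  # prints "1263 420"
```

Both results match the Gene Length (1263 bp) and Protein Length (420 aa) fields, which supports the assumed conventions for this record.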
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAC GCGATTTTCT GGCCTTGGCG AGTCTTGCCG GCGCGGCGGG CGTTTCGCTG TCGATGCCGC ACGCGTTCGC GGCGGCATCC GCCGCTTCGG GCGCGAAGGG GGCAATCGGC GCGAATGGCG CGATTGACAT GGCGGGTGCC GCGCGCTACT CGAACCTGCT CATCCTGGTC GAGCTGAAGG GCGGCAACGA CGGGCTCAAC ACGGTGATTC CGTACGCGGA CCCGCTGTAC CGCACGCTGC GCCCGACGAT CGGCGTCAAG CGCGAGCAGG TCGTGCAGCT CGACGAGCGC GCCGCGCTGC ATCCGGCGCT CGAGCCGCTC GTGCCGATCT GGCGTGATGG GCGGCTCGCG ATCGTCGATG GCGTCGGCTA TCCGCAGCCG AATCTGTCGC ATTTTCGCTC GATCGAGATC TGGGATACCG CGTCGCGCGC GGACGAGTAT CTGCGTGAAG GGTGGCTCAC GCGCGCATTC GCGCAGGCCG GCGTGCCGCC CGGCTTCGCG GCGGACGGCA TCGTGCTCGG CAGCGCGGAA ATGGGGCCGC TCGCGAACGG CGCGCGCGCG ATCGCACTCG TCAATCCCGC ACAATTCGCG CGTGCGGCGC GGCTCGTGCA GCCCGTATCG CTGCGCGAGC AGAATCCCGC GCTCGCGCAC GTGATCGACA TCGAGAACGA CATCGTCAAG GCCGCCGATC GGCTGCGTCC GCACGCGGGC ACGCCCGCGC TCGCGACCGC GTTTCCGGGC GGGCCGTTCG GCGCGTCGGT GAAGACCGCG ATGCAGGTGC TCGCCGCGTG CGACACGCCA CAGCGTACGC CGGCGCCGGG GCAGGGCGTC GCGGCGCTGC GTCTCACGCT GAACGGCTTC GACACGCATC AGAACCAGCC CGGCCAGCAG GCGGGATTGC TCAAGCAACT GGCGCTGGGG TTCGTCGCGA TGCGTTCGGC GTTGATCGAA CTCGGGCGCT GGAACGACAC GCTCGTGATG ACGTATGCGG AATTCGGCCG GCGCGCGCGA GAGAACCAGA GCAACGGGAC GGATCACGGC ACGGCCGCTC CGCATTTCGT GATGGGCGGG CGCGTGCGCG GCGGGCTGTA CGGCGCGCCG CCTGCGCTCA CCGCGCTCGA CGGCAACGGC AACCTGCCCG TCGCCGTCGA TTTCCGGCAG CTCTATGCGA CCGTGCTCGG CCCGTGGTGG GGGCTTGACG CGACGAGCGT GCTCAAGCGG CGCTTCGAGC CGTTGCCGCT GCTGCGTGCC TGA
|
Protein sequence | MKRRDFLALA SLAGAAGVSL SMPHAFAAAS AASGAKGAIG ANGAIDMAGA ARYSNLLILV ELKGGNDGLN TVIPYADPLY RTLRPTIGVK REQVVQLDER AALHPALEPL VPIWRDGRLA IVDGVGYPQP NLSHFRSIEI WDTASRADEY LREGWLTRAF AQAGVPPGFA ADGIVLGSAE MGPLANGARA IALVNPAQFA RAARLVQPVS LREQNPALAH VIDIENDIVK AADRLRPHAG TPALATAFPG GPFGASVKTA MQVLAACDTP QRTPAPGQGV AALRLTLNGF DTHQNQPGQQ AGLLKQLALG FVAMRSALIE LGRWNDTLVM TYAEFGRRAR ENQSNGTDHG TAAPHFVMGG RVRGGLYGAP PALTALDGNG NLPVAVDFRQ LYATVLGPWW GLDATSVLKR RFEPLPLLRA
|
| |
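The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup and of a GC-fraction calculation like the one behind the GC content field; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as shown in the record.
sample = clean_sequence("ATGAAACGAC GCGATTTTCT")
print(len(sample), sample.startswith("ATG"), gc_fraction(sample))
```

Applying the same two helpers to the full Gene sequence field should give a value that can be compared with the reported length of 1263 bp and GC content of 70%.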