Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0878 |
Symbol | |
ID | 3844736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 1030900 |
End bp | 1032477 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637838181 |
Product | hemagglutinin domain-containing protein |
Protein accession | YP_439075 |
Protein GI | 83717789 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGCGC CGACAGAACG GCTTCGGCAG CGGTGGCGCC GATCGTTCGG TCGATCATTC AGGAGCGCGA TTATTTCAAG CAGCCGGCAT TCGGGCGTCG GGGAAACAAC CGATATCGGG CGATTCGAGT GCATCGATTT GATCGAGGTC GGACTGCGCG CGAGCGCGGG GAACCCGAGC AAGCAGGATC GTGCCGGAGA CTGCGCTCGC GGTGTTCGCG GCGATGGCGA ACGATGGCGG GCAGGGGCGT GCGACGCGTG CGAAGCGGCG GCCGCCCGCG CCGAGGCATC GGGCATCCTC CCGATTCAGT CGCGGCCGTT CGGGGAAAGA ATAGACGGCG CGCGCGTCAT GCGCAGACGC GCCGATCGTT CGGCGGCCGG CGGGCGTGAG CGCGAGGCGG TCGCGCGGCA GTCAATTTTT TTCGAGGTGA TTGGAATGAA AGTAATGGGA CGGCGCCGGA AGCCTGTCGG GCGGCTCGCG GTGTTTGGCT TGCGGCATGA TCGCACCGGC GCCTGCTCGC CGACGATGCG CTCAGCCGAC GTCGCGCGCA TGGCTGGAAT GCCGCGCGGC GTTCGGTCGC TGGTCGCTGG TGTCGTCGTG AGCGTGTTGG CCGGTCTCTC GCATGCGACG TCGGTCACGG GCGATCTGTC GGCCAACGCG GGCGAGAATA CGTTCGATGG TCGAGGCGCT CCGTTCGCGG GCTCGTTATC GAACTCGACG GATCATTCGG ATATCGAATC GATCTGGAAT TGGGAAAACT ATTGGACGCC GTTCACGTTC GGGACGCCGT CCGAGCGCTC GCGCGCCGCA TCGATCGCCG AATGGCTCGA TCGTTCCGAT GGCGGGACGA GCGGCGAATG GGACAGCCGA TCGTCAATCC CGGAGAGCGA ATCCGATGCG CTCGAATGGT TCCACTATTG GCCACGTCAT GAAGACGCAT CGTCGCTCGT CGGGGCAAGC CGGTCGGGCG TCGCGGCCGG AACCGGCGGC GGCGTGCAGG CTTCGTCGGA TTCGGCTCGC TTCGCACGTC CCCCGTGCGC GGCGGGCGAC GGCGCAACGG CGCTGGGGCG CAACGCGCGC GCGACAGGCG CGCGCGCGAT CGCGCTGGGC ACGGATGCGG CGGCCACGGG TGTCGATTCG GTCGCGCTGG GCGCGGGCTC GGTCGCGGCG CGGGACAACG TGGTGTCGGT CGGTCAGGCC GGTCGCGAGC GCCGGATCGT GCACGTCGCG CCCGGCACCG AAGGCACGGA TGCGGTAAAC AGGAATCAGC TCGACTCGGC ACTGAAGGCG GCGACGTTGC ATGCGGATCA CCGCTTCGCC AATCTGCAAC GGCAGATCGA CGCAACCGCG CGAAGCGCAT ATTCGGGCAT CGCGGCCGTG ACCGCGTTGT CGATGATTCC GGACGTCGAC AGCGGCAGGA CGCTGGCGAT CGGCATCGGT TTGGGGTCCT ACAAGGGCTG CCACGCGATG GCGCTCGGCG GCACCGCATG GCTTGCCCGC AACCTGAAAG TGCGCACGGG AATCGGGATG GGCGCGGAGG GCAAGACGAT CGGCGTCGGC GCAAGTTGGC AGCACTGA
|
Protein sequence | MRAPTERLRQ RWRRSFGRSF RSAIISSSRH SGVGETTDIG RFECIDLIEV GLRASAGNPS KQDRAGDCAR GVRGDGERWR AGACDACEAA AARAEASGIL PIQSRPFGER IDGARVMRRR ADRSAAGGRE REAVARQSIF FEVIGMKVMG RRRKPVGRLA VFGLRHDRTG ACSPTMRSAD VARMAGMPRG VRSLVAGVVV SVLAGLSHAT SVTGDLSANA GENTFDGRGA PFAGSLSNST DHSDIESIWN WENYWTPFTF GTPSERSRAA SIAEWLDRSD GGTSGEWDSR SSIPESESDA LEWFHYWPRH EDASSLVGAS RSGVAAGTGG GVQASSDSAR FARPPCAAGD GATALGRNAR ATGARAIALG TDAAATGVDS VALGAGSVAA RDNVVSVGQA GRERRIVHVA PGTEGTDAVN RNQLDSALKA ATLHADHRFA NLQRQIDATA RSAYSGIAAV TALSMIPDVD SGRTLAIGIG LGSYKGCHAM ALGGTAWLAR NLKVRTGIGM GAEGKTIGVG ASWQH
|
| |