Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0875 |
Symbol | |
ID | 3845515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1026584 |
End bp | 1028185 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637838178 |
Product | Hep_Hag family protein |
Protein accession | YP_439072 |
Protein GI | 83716561 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.190664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTGTTCAT CCATCGCACC GCTCGCGCTC GGATTTTCCG CGGATGCATT CGCCGCTGAC GAGACGATGG CTTCTCCATT CAATCGAGGC GCGCCGAACG ACGCTCACGG AAATCTGCTC GATGAGATCA GAAGAGGCGT GCCGCTCAGA CATGTTCCGG CATCCGAACG AAACACGCGA GGCGCGGGCG GCTCGACCCT CGCGGACGCG ATGAGGCGGG TAATCGACTC ACGGCGCACG GCGTTCGATT CGCCTCCGGC GACTCCCGCA TCGCCCTCGC CGTCATGGTC GGACGACGAA TCCCCTCCGC CGACGCCGAT CGCAACCCGA CCGGCATCGC GCCCGGAGTC TGCGGCGCGC TCCCCGCGTC ATTCGTCTCC TCCGCATTCG CCGCCAGCTT CCGCCGAGTC GCCCTCGCCG CGATCGCCCG ATGCGTCGCC TTCGCGAACG CCTTCGCCTA CGTTTTCGTT CCCGTCTCCG TCGCGCACGT CGACGCCGAG GACGCAGCCG CCATCGCCGC TGCGCGAGCG CCCGGAACGC TCGCCGGCCG CATCGCCGCG CGTCGCGTCG CCGCGGTCCG CGCATTCGCG CGGCTCGACG CAGCCGCCTT CGAACCTCTC CACGCCGCGG TACGAACCGC CCACTCCGCT GCAGGAGGAC CCGGAACGAA CGCCGGTCGC ATCGCCGCGC GTTGCGTCGC CGCGGTCCGC GCATTCGCGC GGCTCGACGC AGCCGCCTTC GAACCTCTCC ACGCCGCGGT ACGAACCGCC CACTCCGCTG CAGGAGGACC CGGAACGAAC GCCGGTCGCA TCGCCTCACG TCACGCCCGC GGAGCACGCG CAGCGAAGGC CGTTTCTGCT GCAAAAGCCG CCGCAGGTAC CCTCGTGGCG AAAGAAGGCT CCCTCCGCAA CCCTGCCCGA TTCGCACGCA CCCGCTCGGC CTGGGGGGGG GCAGTTCACC ACGCCGGCTT CCGGGGCCGC GAAATACGTT GCGGTCAACT CCGGGGCGAG CGACGCGTTC GCGGCAGGCG TCAACGCAGT GGCGATTGGA GCCGACGCGC GAGCGCAAGG TCAGGAATCG CTCGCGACCG GCTGGCGTGC GCAAGCAGAC GGCCATCGCG CGGTCGCGAC CGGCGCACGC GCAATCGCAT CCGGCCGCGA CGCCGTCGCG CTCGGCGCAG GATCGATAGC GGACCGCGAC AACACCGTAT CTGTCGGTCA GCGCGGCAGC GAACGCCAGA TCGTGCATGT CGCCCCCGGC GCCCAAGGCA CCGATGCGGT GAACGTCGAT CAGTTGAATC TCGCAATATC GAACTCGAAC GCGTACACGA ACCAGCGCAT CGGCGACCTT CAGCAAAGCA TCACCGAAAC CGCGCGCGAC GCGTATTCCG GCGTCGCCGC GGCGACCGCG CTCACGATGA TTCCCGATGT CGACCGCGAC AAGATGTTGT CGATCGGCGT AGGCGGCGCG GTCTACAAGG GCCATCGCGC CGTCGCGCTC GGCGGCACCG CGCGCATCGG CGAAAACCTC AAGGTGCGCG CGGGCGTCGC GATGAGCGCG GGCGGCAATA CGGTCGGCGT AGGCATGAGC TGGCAATGGT GA
|
Protein sequence | MCSSIAPLAL GFSADAFAAD ETMASPFNRG APNDAHGNLL DEIRRGVPLR HVPASERNTR GAGGSTLADA MRRVIDSRRT AFDSPPATPA SPSPSWSDDE SPPPTPIATR PASRPESAAR SPRHSSPPHS PPASAESPSP RSPDASPSRT PSPTFSFPSP SRTSTPRTQP PSPLRERPER SPAASPRVAS PRSAHSRGST QPPSNLSTPR YEPPTPLQED PERTPVASPR VASPRSAHSR GSTQPPSNLS TPRYEPPTPL QEDPERTPVA SPHVTPAEHA QRRPFLLQKP PQVPSWRKKA PSATLPDSHA PARPGGGQFT TPASGAAKYV AVNSGASDAF AAGVNAVAIG ADARAQGQES LATGWRAQAD GHRAVATGAR AIASGRDAVA LGAGSIADRD NTVSVGQRGS ERQIVHVAPG AQGTDAVNVD QLNLAISNSN AYTNQRIGDL QQSITETARD AYSGVAAATA LTMIPDVDRD KMLSIGVGGA VYKGHRAVAL GGTARIGENL KVRAGVAMSA GGNTVGVGMS WQW
|
| |