Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2118 |
Symbol | |
ID | 4888655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2051578 |
End bp | 2052714 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640132055 |
Product | hemagglutinin domain-containing protein |
Protein accession | YP_001063112 |
Protein GI | 126445307 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0929674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTTCTAT ACATCCGTAT GAAATATCAC CGTTTTCCCC GCTCTCATGC TCAACAAGAC ACCGGGCGAG CCGCATCGAC CGTTCCATTT CAGCGCTTCG CGCATCTACT ATGTTCGTCC ATCGCTCCGC TGGCCCTCGG CTTTTCCACG GATGCGCTCG CTATCGAACA GGCTGAAAGT ACGGCGTTTA ACGCGGTGAT CGATCAGATA AAAAAAGGTG ACTTTAAGTT GAAACCAGTT GGGGACCGCA CGCTACCAAA CAAAGTCCCG CCACCGCCAC CGCCACCGCC GTCGACGACG ACGCCACCGC CGCCACCGCC ACCGCCGCCG CCGCCGCCGT CGACGACGCC ACCACCGCCA CCGCCGCCGT CGACGACGCC ATCGCCACCG CCACCGACGA CGACGCCACC GACGAGGACG ACGCCATCGA CGACGACGCC GACACCATCG ATGCACCCGA TACAGCCGAC ACAACTGCCG TCGATTCCTA ACGCGACACC AACCTCAGGA TCCGCGACAA ACGTCACCAT CAACTTCAAT TCGACCGGTG CCTCAGCAAT GGGCACGAAC TCTATCGCCC TTGACTTCCA TGCACGCGCT AAGGACAGCG ATTCGCTCGC GAGCGGACGG CTCGCTCATG CGAGCGGCCC CCGGTCAACC GCGATCGGTG CCGAAGCAAA TGCGTCCGGT CAAAACACTG TCGCGCTCGG CGCTGGCTCC ATAGCGGATC GTAACAACAC GGTATCCGTC GGTCGTCACG GTGACGAACG ACAAATAGTG CACGTCGCAG CCGGCACGCA AGCCACCGAT GCCGTGAATG TCGGTCAGTT GAACCTCGCA ATGTCGAACG CCAACGCGTA CACGAACCAG CGCATCGGCG ATCTTCAGCA GAGCATCACC GACACCGCGC GCGACGCATA TTCCGGCGTC GCCGCCGCGA CCGCGCTGAC GATGATTCCC GATGTCGACC GCGACAAGAG GGTGTCGATC GGCGTCGGCG GCGCGGTCTA CAAGGGCCAT CGCGCCGTCG CGCTCGGCGG CACCGCGCGC ATCAACGAAA ACCTCAAGGT GCGGGCGGGC GTCGCGATGA GCGCGGGCGG CAATGCCGTG GGCATCGGCA TGAGCTGGCA ATGGTAA
|
Protein sequence | MVLYIRMKYH RFPRSHAQQD TGRAASTVPF QRFAHLLCSS IAPLALGFST DALAIEQAES TAFNAVIDQI KKGDFKLKPV GDRTLPNKVP PPPPPPPSTT TPPPPPPPPP PPPSTTPPPP PPPSTTPSPP PPTTTPPTRT TPSTTTPTPS MHPIQPTQLP SIPNATPTSG SATNVTINFN STGASAMGTN SIALDFHARA KDSDSLASGR LAHASGPRST AIGAEANASG QNTVALGAGS IADRNNTVSV GRHGDERQIV HVAAGTQATD AVNVGQLNLA MSNANAYTNQ RIGDLQQSIT DTARDAYSGV AAATALTMIP DVDRDKRVSI GVGGAVYKGH RAVALGGTAR INENLKVRAG VAMSAGGNAV GIGMSWQW
|
| |