Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_1590 |
Symbol | |
ID | 4900623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1538477 |
End bp | 1541845 |
Gene Length | 3369 bp |
Protein Length | 1122 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640134820 |
Product | haemagluttinin family protein |
Protein accession | YP_001065861 |
Protein GI | 126453767 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTTA GGAATGTTCA GATTCGGATT GCGCCTCGGC AAATAAATTT CAATTGCAGC ACAAAACTAT TCGAAGCTTA TTTGCGTGAG GATGAAATGA ACAAGACTTA TCGGGTTAGC TGGAGCGCGT CGCGGGGTGC GTGGATGGTG GCGCCGGAGA CGGCGCGTCG CAAAGGGAAA GGACATTCGC TGACGATCGT GTGCGCGATC GCCTCAGGCC TGCTGCTTGC GGCGCCTGCG TGGGCGGACA CGGTGTCGCC CTCGGGCACG GATAACGTCT ACGGCGTCGA CGCGACCGAT CCCGGCGTGT CGACGAACCA GGGCAATACG GCCTACGGGG CGCAGGCGGG CGCGAAGGTC ACGGGTTCGT ACAACACCGC GATCGGGTAT CAAGCAGGGC AGAACGTGAA CGCCATCGAT ACCGTATCGA TCGGCAAGCA GGCCACCGCG AGCGCGAATG ACGCGATCGC GATCGGCACG AACACGAAGG CGAGCGGGCC GGCCGACATC TACATGGGGC TGAACGCAGG CGCCGGCGCC GGCTCGACGA CGAGCCCGGA CGGCACCGTC ACGCTCGGCA TTCGCAACAT GGGCCTCGGG GAATCCGCGG GCTCGTACGT GACGGGCCAG AACAACACGG GGATCGGCTA TCAGTCGGGC ATGAACGTGA CGGGCGACCA GAACGTCGGC CTCGGGCAGC AGGCGGGACA ATTCGTGACC GGGACCGGCA ACTCGGCGAT GGGGCATCTG GCGGGGTCGA CGGTGTCGGG CAGCTACAAC GCCGCGTTCG GCGAGTATGC GGGGACCAAC ACGAGCGGCG GCGCCAATGC CGCGTTCGGC TTCTATGCGG GGCGCTACAT CAACGGCACG AACAACACGG CGCTCGGCGC GTACGATCTG CCGGTCGTCA ATGGCACCTG GTACGGTTCG TACGTGACGG GCAGCAACAA CCTCGGCGCC GGCCATAATT CGGGCGCCTA CGTGAGCGGC GCGAGCAACG TCGGGCTCGG CGACGGCGCG GGCACGTTCG TGACCGGCAG CAACAACGTC GCCATCGGCA CGGCAGCGGG CTCGGGCGCG TATACCAGCG GTCCGAGCGG CGCGACGCTC AATGCGGCGC TCGTCGCGAG CAACACCGTG AGCATCGGTA CCCGCGCCAC GGCGAGCCAG AGCGACGCGA TCGCGATCGG CAAGGGCGCG ACCGCGAGCG GCGCGCAATC GATCAGCATC GGCACCGGCA ACGTCGTGAG CGGCAAGGGA AGCGGTGCGA TCGGCGATCC GAGCACCGTC AGCGGCGCGG GGTCCTATTC GATCGGCAAC AACAATACCG TCGCGAACAG CAACACGTTC GTGCTCGGCA ACGGCGTGAC GACGACGCAG GACAACAGCG TCGTGCTCGG CAATCAGAGC ACCGACCGCG CGGCCGTCGC GGTTTCGAGC GAAACCATCA ATGGCACGAC GTACAACTAC GCGGGCGTCG CGAGCCCGGC CAACGGCGTC GTCAGCATCG GCGGCGTGGG CACGGAACGC CAGCTCATCA ACGTGGCGGC GGGCCAGGTG AGCGCGACCA GCACGGACGC GATCAACGGC AGCCAGCTGT ACGCGACGAA CCAGGCGGTG ATCGCGGAGG ACGCGAAAGT GAATTCGCTC GGCGGCGGCG TGGCGAGCGC GCTCGGCGGC AACGCGGCGT ACAACGCGAC GACCGGCGCG ATCACCGCGC CGAGCTACGC GGTCTACGGG ACCACGCAAA ACTCCGTGGG CGGCGCGATC GATGCGCTGC AGGCCCTCGC GCCGCTGCAG TACACGTCCG GCCCGGGCGT GACCACGCCG AACGCGCCGG GATCGGCGCC GACGAACACG GTGACGCTCG TCGGCGCCGG CGGGCCGGGA GCCAACACCA CGCCGGTGAC GCTCACGAAC GTCGCGCCGG GCAAACTCTC CGCGACCAGC ACGGACGCGG TCAACGGCTC GCAGCTCTAC GCGACCAACC AGCAGGTCGC GAACCTCGTG AGCTCGGTGA ACAACGGCGG CGTCGGCCCG GTGCAGTACA GCGATCCTAG CGCGCCGACG ACGCCCAACG GCGGCAAGCC CTCGCAGGAC CTGACGCTCG TCGGCGCGGC AAGCGGCCCT GTCGCGCTGC ATAACGTCGC GCCGGGCACG GCGTCCACCG ATGCGGTCAA CGTCGGGCAG CTCGGCGCGG TGACGACCGG CCTGGGCGGC GGCGCGGCGA TCGATCCGAA GACGGGCGCC GTGACCGCGC CGTCGTACAC GGTCTACAAC GCCGACGGCA CGACGTCGAA CGTCGGCAAC GTCGGCGCGG CGATCGATGC GATCAACTCG ACCGGCATCA AGTATTTCCA CGCGAACAGC ACGAAGCCGG ACAGCCAGGC GCTCGGCGCG GACAGCGTCG CGATCGGCCC GAACGCCGTC GCGAACAACG CGGGCGACGT CGCGCTCGGT TCGGGAGCGG TCACGTCGCA AGCGGGCGGC ACGCTGAGCG AAACGATCAA CGGCGTGACC TACTCGTTCG CCGGCACGAC GCCGATCGGC ACGGTGAGCG TCGGCGCGCC GGGCGTCGAG CGCACGATCA CCAACGTTGC CGCGGGGCGC ATCGGGCAGT CGAGCACGGA CGCGATCAAC GGCTCGCAAC TGTACGGCAC CAACCAGTCG ATCGAGGCGT TGACGGACAA GATGAACAGC CTCGGCAACA CCGTGGCGAA CACGCTCGGC AGCGGCGCGT CGTACAACCC GCAAACAGGC GCGGTGAACG GCCCGGCCAA CTCGGGCGGC GTGGTCACGC CCACGGTGAT CCAGGAGGCG GCGAACAAAT GGGTGAGCGC CAATCCGTCG ACCTACGTGG CGCCCGTCGC GACGGGCACG AACGGCATGG CGGTCGGCAG CGGCGCGGTT TCGACGGGCC AGAACTCGGT CGCGCTCGGC ACGAACGCGT CGGACGGCGG CCGCTCGAAC GTCGTGAGCG TCGGGGCGCC GGGCGCGGAG CGCCAGGTGA CGAACGTGGC GGCCGGCACG CAGGCGACCG ATGCGGTCAA CCTCGGGCAG ATGAACGGCG CGCTCGCGCA GCAAACCGAC AGCTTCAATC AGCGGCTGGG CGCGGTTCAG CAGGACGTCG ACAACGTCGC GCGCGCCGCC TACGGCGGCA TCGCGGCCGC GACCGCGCTC ACGATGATCC CCGAGGTCGA CAAGGACAAG ACGATCGCGG TGGGCATCGG CGGCGGCACG TATCGCGGCT ACCAGGCGGT GGCGCTCGGC GCGACGGCGC GCATCACCGA GAACATCAAG GTTCGTGCGG GCGTCGGCAT GAGCTCGGGC GGGACGACGG CCGGCATCGG CGCATCGATG CAGTGGTAA
|
Protein sequence | MRFRNVQIRI APRQINFNCS TKLFEAYLRE DEMNKTYRVS WSASRGAWMV APETARRKGK GHSLTIVCAI ASGLLLAAPA WADTVSPSGT DNVYGVDATD PGVSTNQGNT AYGAQAGAKV TGSYNTAIGY QAGQNVNAID TVSIGKQATA SANDAIAIGT NTKASGPADI YMGLNAGAGA GSTTSPDGTV TLGIRNMGLG ESAGSYVTGQ NNTGIGYQSG MNVTGDQNVG LGQQAGQFVT GTGNSAMGHL AGSTVSGSYN AAFGEYAGTN TSGGANAAFG FYAGRYINGT NNTALGAYDL PVVNGTWYGS YVTGSNNLGA GHNSGAYVSG ASNVGLGDGA GTFVTGSNNV AIGTAAGSGA YTSGPSGATL NAALVASNTV SIGTRATASQ SDAIAIGKGA TASGAQSISI GTGNVVSGKG SGAIGDPSTV SGAGSYSIGN NNTVANSNTF VLGNGVTTTQ DNSVVLGNQS TDRAAVAVSS ETINGTTYNY AGVASPANGV VSIGGVGTER QLINVAAGQV SATSTDAING SQLYATNQAV IAEDAKVNSL GGGVASALGG NAAYNATTGA ITAPSYAVYG TTQNSVGGAI DALQALAPLQ YTSGPGVTTP NAPGSAPTNT VTLVGAGGPG ANTTPVTLTN VAPGKLSATS TDAVNGSQLY ATNQQVANLV SSVNNGGVGP VQYSDPSAPT TPNGGKPSQD LTLVGAASGP VALHNVAPGT ASTDAVNVGQ LGAVTTGLGG GAAIDPKTGA VTAPSYTVYN ADGTTSNVGN VGAAIDAINS TGIKYFHANS TKPDSQALGA DSVAIGPNAV ANNAGDVALG SGAVTSQAGG TLSETINGVT YSFAGTTPIG TVSVGAPGVE RTITNVAAGR IGQSSTDAIN GSQLYGTNQS IEALTDKMNS LGNTVANTLG SGASYNPQTG AVNGPANSGG VVTPTVIQEA ANKWVSANPS TYVAPVATGT NGMAVGSGAV STGQNSVALG TNASDGGRSN VVSVGAPGAE RQVTNVAAGT QATDAVNLGQ MNGALAQQTD SFNQRLGAVQ QDVDNVARAA YGGIAAATAL TMIPEVDKDK TIAVGIGGGT YRGYQAVALG ATARITENIK VRAGVGMSSG GTTAGIGASM QW
|
| |