Gene BURPS1106A_1590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1590 
Symbol 
ID4900623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1538477 
End bp1541845 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content69% 
IMG OID640134820 
Producthaemagluttinin family protein 
Protein accessionYP_001065861 
Protein GI126453767 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTTA GGAATGTTCA GATTCGGATT GCGCCTCGGC AAATAAATTT CAATTGCAGC 
ACAAAACTAT TCGAAGCTTA TTTGCGTGAG GATGAAATGA ACAAGACTTA TCGGGTTAGC
TGGAGCGCGT CGCGGGGTGC GTGGATGGTG GCGCCGGAGA CGGCGCGTCG CAAAGGGAAA
GGACATTCGC TGACGATCGT GTGCGCGATC GCCTCAGGCC TGCTGCTTGC GGCGCCTGCG
TGGGCGGACA CGGTGTCGCC CTCGGGCACG GATAACGTCT ACGGCGTCGA CGCGACCGAT
CCCGGCGTGT CGACGAACCA GGGCAATACG GCCTACGGGG CGCAGGCGGG CGCGAAGGTC
ACGGGTTCGT ACAACACCGC GATCGGGTAT CAAGCAGGGC AGAACGTGAA CGCCATCGAT
ACCGTATCGA TCGGCAAGCA GGCCACCGCG AGCGCGAATG ACGCGATCGC GATCGGCACG
AACACGAAGG CGAGCGGGCC GGCCGACATC TACATGGGGC TGAACGCAGG CGCCGGCGCC
GGCTCGACGA CGAGCCCGGA CGGCACCGTC ACGCTCGGCA TTCGCAACAT GGGCCTCGGG
GAATCCGCGG GCTCGTACGT GACGGGCCAG AACAACACGG GGATCGGCTA TCAGTCGGGC
ATGAACGTGA CGGGCGACCA GAACGTCGGC CTCGGGCAGC AGGCGGGACA ATTCGTGACC
GGGACCGGCA ACTCGGCGAT GGGGCATCTG GCGGGGTCGA CGGTGTCGGG CAGCTACAAC
GCCGCGTTCG GCGAGTATGC GGGGACCAAC ACGAGCGGCG GCGCCAATGC CGCGTTCGGC
TTCTATGCGG GGCGCTACAT CAACGGCACG AACAACACGG CGCTCGGCGC GTACGATCTG
CCGGTCGTCA ATGGCACCTG GTACGGTTCG TACGTGACGG GCAGCAACAA CCTCGGCGCC
GGCCATAATT CGGGCGCCTA CGTGAGCGGC GCGAGCAACG TCGGGCTCGG CGACGGCGCG
GGCACGTTCG TGACCGGCAG CAACAACGTC GCCATCGGCA CGGCAGCGGG CTCGGGCGCG
TATACCAGCG GTCCGAGCGG CGCGACGCTC AATGCGGCGC TCGTCGCGAG CAACACCGTG
AGCATCGGTA CCCGCGCCAC GGCGAGCCAG AGCGACGCGA TCGCGATCGG CAAGGGCGCG
ACCGCGAGCG GCGCGCAATC GATCAGCATC GGCACCGGCA ACGTCGTGAG CGGCAAGGGA
AGCGGTGCGA TCGGCGATCC GAGCACCGTC AGCGGCGCGG GGTCCTATTC GATCGGCAAC
AACAATACCG TCGCGAACAG CAACACGTTC GTGCTCGGCA ACGGCGTGAC GACGACGCAG
GACAACAGCG TCGTGCTCGG CAATCAGAGC ACCGACCGCG CGGCCGTCGC GGTTTCGAGC
GAAACCATCA ATGGCACGAC GTACAACTAC GCGGGCGTCG CGAGCCCGGC CAACGGCGTC
GTCAGCATCG GCGGCGTGGG CACGGAACGC CAGCTCATCA ACGTGGCGGC GGGCCAGGTG
AGCGCGACCA GCACGGACGC GATCAACGGC AGCCAGCTGT ACGCGACGAA CCAGGCGGTG
ATCGCGGAGG ACGCGAAAGT GAATTCGCTC GGCGGCGGCG TGGCGAGCGC GCTCGGCGGC
AACGCGGCGT ACAACGCGAC GACCGGCGCG ATCACCGCGC CGAGCTACGC GGTCTACGGG
ACCACGCAAA ACTCCGTGGG CGGCGCGATC GATGCGCTGC AGGCCCTCGC GCCGCTGCAG
TACACGTCCG GCCCGGGCGT GACCACGCCG AACGCGCCGG GATCGGCGCC GACGAACACG
GTGACGCTCG TCGGCGCCGG CGGGCCGGGA GCCAACACCA CGCCGGTGAC GCTCACGAAC
GTCGCGCCGG GCAAACTCTC CGCGACCAGC ACGGACGCGG TCAACGGCTC GCAGCTCTAC
GCGACCAACC AGCAGGTCGC GAACCTCGTG AGCTCGGTGA ACAACGGCGG CGTCGGCCCG
GTGCAGTACA GCGATCCTAG CGCGCCGACG ACGCCCAACG GCGGCAAGCC CTCGCAGGAC
CTGACGCTCG TCGGCGCGGC AAGCGGCCCT GTCGCGCTGC ATAACGTCGC GCCGGGCACG
GCGTCCACCG ATGCGGTCAA CGTCGGGCAG CTCGGCGCGG TGACGACCGG CCTGGGCGGC
GGCGCGGCGA TCGATCCGAA GACGGGCGCC GTGACCGCGC CGTCGTACAC GGTCTACAAC
GCCGACGGCA CGACGTCGAA CGTCGGCAAC GTCGGCGCGG CGATCGATGC GATCAACTCG
ACCGGCATCA AGTATTTCCA CGCGAACAGC ACGAAGCCGG ACAGCCAGGC GCTCGGCGCG
GACAGCGTCG CGATCGGCCC GAACGCCGTC GCGAACAACG CGGGCGACGT CGCGCTCGGT
TCGGGAGCGG TCACGTCGCA AGCGGGCGGC ACGCTGAGCG AAACGATCAA CGGCGTGACC
TACTCGTTCG CCGGCACGAC GCCGATCGGC ACGGTGAGCG TCGGCGCGCC GGGCGTCGAG
CGCACGATCA CCAACGTTGC CGCGGGGCGC ATCGGGCAGT CGAGCACGGA CGCGATCAAC
GGCTCGCAAC TGTACGGCAC CAACCAGTCG ATCGAGGCGT TGACGGACAA GATGAACAGC
CTCGGCAACA CCGTGGCGAA CACGCTCGGC AGCGGCGCGT CGTACAACCC GCAAACAGGC
GCGGTGAACG GCCCGGCCAA CTCGGGCGGC GTGGTCACGC CCACGGTGAT CCAGGAGGCG
GCGAACAAAT GGGTGAGCGC CAATCCGTCG ACCTACGTGG CGCCCGTCGC GACGGGCACG
AACGGCATGG CGGTCGGCAG CGGCGCGGTT TCGACGGGCC AGAACTCGGT CGCGCTCGGC
ACGAACGCGT CGGACGGCGG CCGCTCGAAC GTCGTGAGCG TCGGGGCGCC GGGCGCGGAG
CGCCAGGTGA CGAACGTGGC GGCCGGCACG CAGGCGACCG ATGCGGTCAA CCTCGGGCAG
ATGAACGGCG CGCTCGCGCA GCAAACCGAC AGCTTCAATC AGCGGCTGGG CGCGGTTCAG
CAGGACGTCG ACAACGTCGC GCGCGCCGCC TACGGCGGCA TCGCGGCCGC GACCGCGCTC
ACGATGATCC CCGAGGTCGA CAAGGACAAG ACGATCGCGG TGGGCATCGG CGGCGGCACG
TATCGCGGCT ACCAGGCGGT GGCGCTCGGC GCGACGGCGC GCATCACCGA GAACATCAAG
GTTCGTGCGG GCGTCGGCAT GAGCTCGGGC GGGACGACGG CCGGCATCGG CGCATCGATG
CAGTGGTAA
 
Protein sequence
MRFRNVQIRI APRQINFNCS TKLFEAYLRE DEMNKTYRVS WSASRGAWMV APETARRKGK 
GHSLTIVCAI ASGLLLAAPA WADTVSPSGT DNVYGVDATD PGVSTNQGNT AYGAQAGAKV
TGSYNTAIGY QAGQNVNAID TVSIGKQATA SANDAIAIGT NTKASGPADI YMGLNAGAGA
GSTTSPDGTV TLGIRNMGLG ESAGSYVTGQ NNTGIGYQSG MNVTGDQNVG LGQQAGQFVT
GTGNSAMGHL AGSTVSGSYN AAFGEYAGTN TSGGANAAFG FYAGRYINGT NNTALGAYDL
PVVNGTWYGS YVTGSNNLGA GHNSGAYVSG ASNVGLGDGA GTFVTGSNNV AIGTAAGSGA
YTSGPSGATL NAALVASNTV SIGTRATASQ SDAIAIGKGA TASGAQSISI GTGNVVSGKG
SGAIGDPSTV SGAGSYSIGN NNTVANSNTF VLGNGVTTTQ DNSVVLGNQS TDRAAVAVSS
ETINGTTYNY AGVASPANGV VSIGGVGTER QLINVAAGQV SATSTDAING SQLYATNQAV
IAEDAKVNSL GGGVASALGG NAAYNATTGA ITAPSYAVYG TTQNSVGGAI DALQALAPLQ
YTSGPGVTTP NAPGSAPTNT VTLVGAGGPG ANTTPVTLTN VAPGKLSATS TDAVNGSQLY
ATNQQVANLV SSVNNGGVGP VQYSDPSAPT TPNGGKPSQD LTLVGAASGP VALHNVAPGT
ASTDAVNVGQ LGAVTTGLGG GAAIDPKTGA VTAPSYTVYN ADGTTSNVGN VGAAIDAINS
TGIKYFHANS TKPDSQALGA DSVAIGPNAV ANNAGDVALG SGAVTSQAGG TLSETINGVT
YSFAGTTPIG TVSVGAPGVE RTITNVAAGR IGQSSTDAIN GSQLYGTNQS IEALTDKMNS
LGNTVANTLG SGASYNPQTG AVNGPANSGG VVTPTVIQEA ANKWVSANPS TYVAPVATGT
NGMAVGSGAV STGQNSVALG TNASDGGRSN VVSVGAPGAE RQVTNVAAGT QATDAVNLGQ
MNGALAQQTD SFNQRLGAVQ QDVDNVARAA YGGIAAATAL TMIPEVDKDK TIAVGIGGGT
YRGYQAVALG ATARITENIK VRAGVGMSSG GTTAGIGASM QW