Gene BURPS1710b_A0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0801 
Symbol 
ID3692060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1050523 
End bp1053204 
Gene Length2682 bp 
Protein Length893 aa 
Translation table11 
GC content68% 
IMG OID637731055 
Producthemagglutinin, homlog 
Protein accessionYP_335960 
Protein GI76819161 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGTGC CGATGCCGGC GGCGGAGGTG TCGCGCGGGC GCGGCAAGCT CGGCTGCGGC 
GGCGTGCGGG CGCAACGTCG CGGCGGTGCG GCGTGTGCGG CGCTGCTTGG GGTGGCCGGG
CCGTCCTTGG CGTTCGCGGC GGTGGTGGCG GACCCGAACG GGGGCGCGCA GCGGCCCGGC
ATGGCGACGA CGGCGAACGG GACGGACTTG GTCAATATCG TCGCGCCGGA CGCGACGGGG
TTGTCGCACA ACAAGTTCAA CGAGTTCAGC CCGGTTGGAC GCGGCGTGGT GTTGAACAAC
AGCGTGCGGC CCGGGGAATC GCAGATCGGC GGCATGGCGG CGCAGAACCC GAACTTGATG
CAACCGGCCA CCCGGGCATT GCTCGAGGTG ACGCAGCAAC GCAGCGTGCT GCAGGGCACG
CTGGAGGCGT TCGGCGGCAA GCTCGACGTG CTGGTGGCGA ACCAGCATGG AGTGACGATC
AACGGCTTGA CGACGCTGAA CGTGGGCCGG CTCGGCGTGA CGACGGGGCA GGTGCTGCCG
CAAGCAGCCG GGCAGTTGCG TTTGGGCGTG ACGCAAGGCG ACGTGCTGAT CGACCATGGG
GGCATCGATA CCCAGGGCCT GGATATGTTC GACGTGGTGA GCCGCAGCAT CGCCGTGCGC
GGGCCGATCC ACGATTCGAG CCGCGCCGCG GGCGCCGACG TGCGCCTCGT GGCGGGCGCG
ACGGCCTACG ATCCGCAGAC CGGTCATTAT GAGGCGATCG CGGCGGACGA ATCGAAGGCG
CCGGTGCAGG AGGGAATCAG CGGCGAACTG CTGGGAGCGA TGCACGGCCG TCACATTGTG
CTGGTGAGCA CGGAATCGGG CGTGGGCGTG CGGCACGACG GACCGATCAA GTCGGCGAAC
GACATTCGGG TGAGCGCGAA CGGCGAGGTG ACGCTGGGCG GGCCGCAGCG GGCGGCCCAG
GAGGCGGTTG CAGGAGCGCA GGCGGTAGGC GGCGCCGGCA TGCAGAACGT GATCGCGGGC
GGCACGGTGA GCGTCTGCGC GCGTGGGCAC GTCGCGATCC GGGGCGCGGT GATCGCGGGA
CAGGATGTGG ATCTGCAGGG GAAAAGCGTG AAGGCCGGCC GGATGAGCGC GCAGCGCGAC
GCGCTGGTGA CGGTGGCGGA TGGCGTGACG CTCGATGGTC CGGTGGACGC GAAGCGTCAC
GTGTGGATCG GAGCCCACGA TGATGTGGTG ATCCGTGAAG CGGCGGCGGG GCAGAACGTG
GTGCTGCTGG GGCGCAGCGT AACGGCCGGC CGGTTGGACG CGCAGCGCGA CGTATTGGCG
GCGGCCCGCG ACGGCGTGAC GATCCATGAA GCGGCGGCCG CGGGGCAGGA TGTGGTGCTG
CAGGGAAGCA GCGCGAGGGT CGGCCAGATG AGCGCGCAGC GCGATGTGCT GGTGATGGCG
GCAGATGGCG TGACGCTCGA TGGGCCGGTG AGCGCGCAGC GCGCCGTATG GGTCGAGACC
CAAGGTGACG TGGCGGGCAG TGAGTGGATC AAGGCCGGAC GGGACGTGCA AATCGGCGCG
GCGGCGGATC TGGCGGGCGC GGTAACGGCC GAAGAGATGC AGCAACTCAA GGCCCATGGT
GACGCGGCGA ACAGGCGGCG CGTCAAAGCC GGGCGGAACG AGCCAGCCGG CGCGGCGGCT
GAACGTCCGG CCGCGGCGGA GCAGACGGTG GCCGTCGCTG ACGCGATGCG CGAGATCGGC
GTGGGCGGCG ATCGGCTGTC CGGATTGGAT GCCGCGCCGG GTACGCCGGG TACGCCCTTC
GGCGCACACC CGCAAGCGAT GTTCGACGAT CCGGCGGCGC AGATTGCGCG ATCGGCTCGA
TCCACGGCAA CGGCGGGCGG ACATGCGGGT TCGTTCATGC GCGTCGGAGA CGGTCACATC
GCCAAAATGA CCACGTCCAG AGAGGCGGAG ATATACGAGA ATTACCGCTT GGCTCTTGCC
GGCGTCATCC CCGACACCGT GCCGCCTGAA GAGGTGGATT GGCGGGTCGG TGTCACGGCC
AGGCAGAGGC AGGCCATGGC GACTTTCAAA GGGTGGGCGG AGATGAAAGG CCAGCGGGTT
GTCGTCATGC AGGCGCTGGG CGCGAAGATC GCGCCGGAGG ACAAGATCGA GCTGGACGTC
AAGATCGGCG CCAGTACGGT GTCGCGCACC GAGTTGATCG GCGCCGGCAG GACTCGCTGG
CAGGCCTTGA GCAAGAAGGT GAGATTGACG GCGGCGGACC TGCTGCGGGG CTCGCGTTCG
TTGGTGGGCG ACGATCGCGG CTATACGCTC GCCGGCCGCA CGAGCGGGGG GATTGCCCTG
GACGCGAGGA ATTCACGCAA CTCCGTCGGC CGATCCAGCG AATCGCTGAT TCGCGAGGCG
CTGGATCGCT CGCCCGATAC GCGCTGGCGG AACGCGCAGC ACTTGCTCGG GCAGTTGCAG
ACCATTCGAG AGAAGATGCA CGCGTTGCCG CTCACCTTCG TCGCCTCCAG CGTCCTCATT
GCAATCGACA AACGGAAACC GGAAAACTCG GTCGCCCGGC TGATCGATCT CGCGCACCCG
GTGCAGCCTT TCGAAAACGA AGCGGACTAT GAGAAAGTCA ATCACCGCTT CGAGGATGGT
CTTGACAAGC TGATCAGACT CTTCCAGCAG GTGGAAAAAT AG
 
Protein sequence
MPVPMPAAEV SRGRGKLGCG GVRAQRRGGA ACAALLGVAG PSLAFAAVVA DPNGGAQRPG 
MATTANGTDL VNIVAPDATG LSHNKFNEFS PVGRGVVLNN SVRPGESQIG GMAAQNPNLM
QPATRALLEV TQQRSVLQGT LEAFGGKLDV LVANQHGVTI NGLTTLNVGR LGVTTGQVLP
QAAGQLRLGV TQGDVLIDHG GIDTQGLDMF DVVSRSIAVR GPIHDSSRAA GADVRLVAGA
TAYDPQTGHY EAIAADESKA PVQEGISGEL LGAMHGRHIV LVSTESGVGV RHDGPIKSAN
DIRVSANGEV TLGGPQRAAQ EAVAGAQAVG GAGMQNVIAG GTVSVCARGH VAIRGAVIAG
QDVDLQGKSV KAGRMSAQRD ALVTVADGVT LDGPVDAKRH VWIGAHDDVV IREAAAGQNV
VLLGRSVTAG RLDAQRDVLA AARDGVTIHE AAAAGQDVVL QGSSARVGQM SAQRDVLVMA
ADGVTLDGPV SAQRAVWVET QGDVAGSEWI KAGRDVQIGA AADLAGAVTA EEMQQLKAHG
DAANRRRVKA GRNEPAGAAA ERPAAAEQTV AVADAMREIG VGGDRLSGLD AAPGTPGTPF
GAHPQAMFDD PAAQIARSAR STATAGGHAG SFMRVGDGHI AKMTTSREAE IYENYRLALA
GVIPDTVPPE EVDWRVGVTA RQRQAMATFK GWAEMKGQRV VVMQALGAKI APEDKIELDV
KIGASTVSRT ELIGAGRTRW QALSKKVRLT AADLLRGSRS LVGDDRGYTL AGRTSGGIAL
DARNSRNSVG RSSESLIREA LDRSPDTRWR NAQHLLGQLQ TIREKMHALP LTFVASSVLI
AIDKRKPENS VARLIDLAHP VQPFENEADY EKVNHRFEDG LDKLIRLFQQ VEK