Gene BURPS1106A_A2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2341 
Symbol 
ID4905635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2320691 
End bp2323396 
Gene Length2706 bp 
Protein Length901 aa 
Translation table11 
GC content68% 
IMG OID640145446 
Producthemagglutinin-like protein 
Protein accessionYP_001076374 
Protein GI126455972 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTGCTGA ACAGGGTGGC GGGCATGCCG GTGCCGATGC CGGCGGCGGA GGTGTCGCGC 
GGGCGCGGCA AGCTCGGCTG CGGCGGCGTG CGGGCGCAAC GTCGCGGCGG TGCGGCGTGC
GCGGCGCTGC TTGGGGTGGC CGGGCCGTCC TTGGCGTTCG CGGCGGTGGT GGCGGACCCG
AACGGGGGCG CGCAGCGGCC CGGCATGGCG ACGACGGCGA ACGGGACGGA CCTGGTCAAT
ATCGTCGCGC CGGACGCGAC GGGGTTGTCG CACAACAAGT TCAACGAGTT CAGCCCGGTT
GGACGCGGCG TGGTGTTGAA CAACAGCGTG CGGCCCGGGA AATCGCAGAT CGGCGGCATG
GCGGCGCAGA ACCCGAACTT GATGCAACCG GCCACCCGGG CATTGCTCGA GGTGACGCAG
CAACGCAGCG TGCTGCAGGG CACGCTGGAG GCGTTCGGCG GCAAGCTCGA CGTGCTGGTG
GCGAACCAGC ATGGAGTGAC GATCAACGGC TTGACGACGC TGAACGTGGG CCGGCTCGGC
GTGACGACGG GGCAGGTGCT GCCGCAAGCG GCCGGGCAGT TGCGTTTGGG CGTGACGCAA
GGCGACGTGC TGATCGACCA TGGGGGCATC GATACCCAGG GCCTGGACAT GTTCGACGTG
GTGAGCCGCA GCATCGCCGT GCGCGGGCCG ATCCACGATT CGAGCCGCGC CGCGGGCGCC
GACGTGCGCC TCGTGGCGGG CGCGACGGCC TACGATCCGC AGACCGGTCA TTATGAGGCG
ATCGCGGCGG ACGAATCGAA GGCGCCGGTG CAGGAGGGAA TCAGCGGCGA ACTGCTGGGA
GCGATGCACG GCCGTCACAT TGTGCTGGTG AGCACGGAAT CGGGCGTGGG CGTGCGGCAC
GACGGACCGA TCAAGTCGGC GAACGACATT CGGGTGAGCG CGAACGGCGA GGTGACGCTG
GGCGGGCCGC AGCAGGCGGC TCAGGAGGCG GTTGCAGGAG CGCAGGCGGT AGGCGGCGCC
GGCATGCAGA ACGTGATCGC GGGCGGCACG GTGAGCGTCT GCGCGCGTGG GCACGTCGCG
ATCCGGGGCG CGGTGATCGC GGGACAGGAT GTGGATCTGC AGGGGAAAAG CGTGAAGGCC
GGCCGGATGA GCGCGCAGCG CGACGCGCTG GTGACGGCGG CGGATGGCGT GACGCTCGAT
GGTCCGGTGG ACGCGAAGCG TCACGTGTGG ATCGGAGCCC ACGATGATGT GGTGATCCGT
GAAGCGGCGG CGGGGCAGAA CGTGGTGCTG CTGGGGCGCA GCGTAACGGC CGGCCGGTTG
GACGCGCAGC GCGACGTATT GGCGGCGGCC CGCGACGGCG TGACGATCCA TGAAGCGGCG
GCCGCGGGGC AGGATGTGGT GCTGCAGGGA AGCAGCGCGA GGGTCGGCCA GATGAGCGCG
CAGCGCGATG TGCTGGTGAT GGCGGCAGAT GGCGTGACGC TCGATGGGCC GGTGAGCGCG
CAGCGCGCCG TATGGGTCGA GACCCAAGGT GACGTGGCGG GCAGTGAGTG GATCAAGGCC
GGACGGGACG TGCAAATCGG CGCGGCGGCG AATCTGGCGG GCGCGGTAAC GGCCGAAGAG
ATGCAGCAAC TCAAGGCCCA TGGTGACGCG GCGAACAGGC GGCGCGTCAA AGCCGGGCGG
AACGAGCCAG CCGGCACGGC GGCTGAACGT CCGGCCGCGG CGGAGCAGAC GGTGGCCGTC
GCTGACGCGA TGCGCGAGAT CGGCGTAGGC GGCGATCGGC TGTCCGGATT GGATGCCGCG
CCGGGTACGC CGGGTACGCC CTTCGGCGCA CACCCGCAAG CGATGTTCGA CGATCCGGCG
GCGCAGATTG CGCGATCGGC TCGATCCACG GCAACGGCGG GCGGACATGC GGGTTCGTTC
ATGCGCGTCG GAGACGGTCA CATCGCCAAA ATGACCACGT CCAGAGAGGC GGAGATATAC
GAGAATTACC GCTTGGCTCT TGCCGGCGTC ATCCCCGACA CCGTGCCGCC TGAAGAGGTG
GATTCGCGGG TCGGTGTCAC GGCCAGGCAG AGGCAGGCCA TGGCGACTTT CAAAGGGTGG
GCGGAGATGA AAGGCCAGCG GGTTGTCGTC ATGCAGGCGC TGGGCGCGGA GATCGCGCCG
GAGGACAAGA TCGAGCTGGA CGTCAAGATC GGCGCCAGTA CGGTGTCGCG CACCGAGTTG
ATCGGCGCCG GCAGGACTCG CTGGCAGGCC TTGAGCAAGA AGGTGAGATT GACGGCGGCG
GACCTGCTGC GGGGCTCGCG TTCGTTGGTG GGCGACGATC GCGGCTATAC GCTCGCCGGC
CGCACGAGCG GGGGGATTGC CCTGGACGCG AGGAATTCAC GCAACTCCGT CGGCCGATCC
AGCGAATCGC TGATTCGCGA GGCGCTGGAT CGCTCGCCCG ATACGCGCTG GCGGAACGCG
CAGCACTTGC TCGGGCAGTT GCAGACCATT CGAGAGAAGA TGCACGCGTT GCCGCTCACC
TTCGTCGCCT CCAGCGTCCT CATTGCAATC GACAAACGGA AACCGGAAAA CTCGGTCGCC
CGGCTGATCG ATCTCGCGCA CCCGGTGCAG CCTTTCGAAA ACGAAGCGGA CTATGAGAAA
GTCAATCACC GCTTCGAGGA TGGTCTTGAC AAGCTGATCA GACTCTTTCA GCAGGTGGAA
AAATAG
 
Protein sequence
MVLNRVAGMP VPMPAAEVSR GRGKLGCGGV RAQRRGGAAC AALLGVAGPS LAFAAVVADP 
NGGAQRPGMA TTANGTDLVN IVAPDATGLS HNKFNEFSPV GRGVVLNNSV RPGKSQIGGM
AAQNPNLMQP ATRALLEVTQ QRSVLQGTLE AFGGKLDVLV ANQHGVTING LTTLNVGRLG
VTTGQVLPQA AGQLRLGVTQ GDVLIDHGGI DTQGLDMFDV VSRSIAVRGP IHDSSRAAGA
DVRLVAGATA YDPQTGHYEA IAADESKAPV QEGISGELLG AMHGRHIVLV STESGVGVRH
DGPIKSANDI RVSANGEVTL GGPQQAAQEA VAGAQAVGGA GMQNVIAGGT VSVCARGHVA
IRGAVIAGQD VDLQGKSVKA GRMSAQRDAL VTAADGVTLD GPVDAKRHVW IGAHDDVVIR
EAAAGQNVVL LGRSVTAGRL DAQRDVLAAA RDGVTIHEAA AAGQDVVLQG SSARVGQMSA
QRDVLVMAAD GVTLDGPVSA QRAVWVETQG DVAGSEWIKA GRDVQIGAAA NLAGAVTAEE
MQQLKAHGDA ANRRRVKAGR NEPAGTAAER PAAAEQTVAV ADAMREIGVG GDRLSGLDAA
PGTPGTPFGA HPQAMFDDPA AQIARSARST ATAGGHAGSF MRVGDGHIAK MTTSREAEIY
ENYRLALAGV IPDTVPPEEV DSRVGVTARQ RQAMATFKGW AEMKGQRVVV MQALGAEIAP
EDKIELDVKI GASTVSRTEL IGAGRTRWQA LSKKVRLTAA DLLRGSRSLV GDDRGYTLAG
RTSGGIALDA RNSRNSVGRS SESLIREALD RSPDTRWRNA QHLLGQLQTI REKMHALPLT
FVASSVLIAI DKRKPENSVA RLIDLAHPVQ PFENEADYEK VNHRFEDGLD KLIRLFQQVE
K