Gene BURPS1710b_A2506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2506 
Symbol 
ID3693439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp3021493 
End bp3023988 
Gene Length2496 bp 
Protein Length831 aa 
Translation table11 
GC content70% 
IMG OID637732760 
Producthaemagluttinin family protein 
Protein accessionYP_337656 
Protein GI76818882 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACAAAA TCTACAATGT GGTTTGGAGT CGGGTGCGAG GCCAATTGAT TGCGGTTTCT 
GAATTTTCCC GGTCGAATGG CAAGTGTTCG ACGACGCAAG TCGTCACGGC GGCGCCGGGC
GTTGCCGGTC GTACCGCGGC TTCTGGCCGA TCGCGCCCGT CGTGGACGAA GCTCGGGCTG
ATGTCGCTGG CGGTGAGCGC GGCGATGGGC TGCATGGCGA CCGACGCCGC GGCGCAGGTC
AGCTATGCGG CGGGCGAGAA CGCCTATGCC GGCCCCGGCG GCAATACCGG CCCGTGGGCA
TTCTACAACC CGGCCTTCAG CGCGGGCACG CTGCTGTACG GCACCGCGGT CGGCAACTAC
GCCTATGCGA ACGGCGAGGG CAGCTCGGCC TACGGCGATC ACGCGACGGT GAAGGGGCGC
ATCGGCTCCG CGTTCGGCGC GTATTCGGAA GCGGCGGGCG ACGGCAGCAC TGCAATCGGC
GCCAGCGCGC GGGCGCTGCC GGACTTCAGC ATCGCGATCG GCACGAACGC GCAGGCGCTG
AAGGACACGG GCCAATCGAT TCCCGGCCGC GAAGACATCG GCACGATCGC GATCGGCGCG
GGCGCGCTCG CGCAGGGCGA CAACAGCGAT CCGCTGCACG TGTCCGCGCC GAACGCGTTC
GGCGGCTATT CGAGCGCGAC GGCAAGCGGC GCGGTGGCGC TGGGCGAAGG CGCCGCATCG
TCCGGCTATT ACGCGAACGC GCTCGGCTCG TATTCGAAGG CGTCGGGTGC GGGTGCGGTC
GCGGTGGGCG GCGGCGCGCA AGCGAGCGCG CAAGGCGCGG TGGCGATCGG CGGCGCGACG
AGCGTCGACA ACGCAACCGC GCTGTCCGGC TACGCGAGCG CAAGCGGCGT CAACGCGATC
GCGATCGGTT CCGGCGCGCA GGCGACGGGC GCCCGGTCGA TCAGCATCGG CACGGGCAAC
GTCGTGTCGG GGGCGAGCTC GGGCGCCTTC GGCGATCCGT CGACGGTCAC GGGCACGGGC
TCGTATTCGT TCGGCAACAA CAACACGATC AATTCGAACA ACGCGTTCGT GCTCGGCAAC
AACGTGACGA TCGGCCCAGG GTTCGACGGC TCGGTCGCGC TCGGTAGCGG CACGACGCTC
GCCGCGGCGA ACCCCACCGG CAGCGCGACG ATCACGACGA GCTCGGGCGG CCAGTTGACG
CTGTCCGGCT TCGCCGGCGC GAATCCGACG AGCGTCGTCA GCGTCGGCGC GCCCGGCGCC
GAGCGCCAGA TCACGAACGT CGCGGCGGGG CGCATCACGC CGACGTCGAC GGATGCCGTC
AACGGCAGCC AGCTGTATGC GGTCGCGAGC ACGATCGACA ATGCGGTGAA CGGCGGCGGG
ATCAAGTACT TCCACGCGAA TTCGACCCTG GCCGATTCGA CGGCGGCGGG CACGGACAGC
GTCGCGGTCG GGCCGGCCGC GCTCGCCTAC GGCAACGATT CGATCGCCGA AGGCACGAAC
GCGACGGCGG GCGTGAGCGG CAATCCGGCG GTGGCGGGCG ATGTCGCGCT CGGCAGCGGC
GCGCAGGCGA CGGGCGGCCG CTCGCTCGCG CTCGGCGCGA ACGCGTCGGT CAACACGGCG
GGCGGCGTGG CGCTCGGCGC CGGCTCGGTC GCGAACCGCG CGGCCGGCAC GTACACCGAT
CCGATCACGG GCAGCAGCTT CACGACCGCA TTCGGCGCGG TGTCGGTCGG CCTCGAGGGT
TCGCTGCGCC AGATCACCAA CGTCGCGGCG GGCACGCAGG CAACGGATGC GGTAAACGTC
GGTCAGTTGC AAGGCGCGAT TGCGCAGTTG AATCAGACGA TCCAGAACAT CACGAACGGC
TCCAACTCGG GCAACACCGG CAATAACGGC AACAACACCG GGCAGACCGT GTCGGGCCAG
TGGATCACGG GCAACCCGTC GACCTATACG CCGCCCGTGG CGAGCGGCAT CGGCTCGACC
GCCGCGGGCA GCGGCAGCGT GGCGTCCGGC GCGAACAGCG TCGCGATCGG CGACGGCGCG
TCGGCCTCCG GCAACAACTC GGTGGCGCTC GGCGCCCATT CGGTCGCGAG CGCGCCGAAC
ACGGTGTCGG TCGGCTCGGT CGGCAACGAG CGGACGATCT CGAACGTCGC GCCGGGCGTG
AACGGCACCG ATGCGGTGAA CGTGAACCAG TTGAACAGCG GTATCGGCAA TGCGGTCGGC
CAGGCGAATC AGTACACGGA TCAGAAGGTC GACCATCTGC GGCGCGAGAT GAACGGCGGC
GTGGCCGCGG CGATGGCCGT GGCGGGCTTG CCGCAGCCGA CCGCGCCCGG CAAGAGCATG
GTCGCGATCG CCGGCTCGAC GTGGCAGGGG CAGCAGGGCT TCGCGCTTGG CGTATCGACG
ATTTCCGAGA ACGGCAAGTG GCTGTACAAG GGCTCGCTCA CGACCAGCAC GCGCGGCGGC
ACGGGCGCGG TGCTCGGGGC CGGTTATCAG TGGTGA
 
Protein sequence
MNKIYNVVWS RVRGQLIAVS EFSRSNGKCS TTQVVTAAPG VAGRTAASGR SRPSWTKLGL 
MSLAVSAAMG CMATDAAAQV SYAAGENAYA GPGGNTGPWA FYNPAFSAGT LLYGTAVGNY
AYANGEGSSA YGDHATVKGR IGSAFGAYSE AAGDGSTAIG ASARALPDFS IAIGTNAQAL
KDTGQSIPGR EDIGTIAIGA GALAQGDNSD PLHVSAPNAF GGYSSATASG AVALGEGAAS
SGYYANALGS YSKASGAGAV AVGGGAQASA QGAVAIGGAT SVDNATALSG YASASGVNAI
AIGSGAQATG ARSISIGTGN VVSGASSGAF GDPSTVTGTG SYSFGNNNTI NSNNAFVLGN
NVTIGPGFDG SVALGSGTTL AAANPTGSAT ITTSSGGQLT LSGFAGANPT SVVSVGAPGA
ERQITNVAAG RITPTSTDAV NGSQLYAVAS TIDNAVNGGG IKYFHANSTL ADSTAAGTDS
VAVGPAALAY GNDSIAEGTN ATAGVSGNPA VAGDVALGSG AQATGGRSLA LGANASVNTA
GGVALGAGSV ANRAAGTYTD PITGSSFTTA FGAVSVGLEG SLRQITNVAA GTQATDAVNV
GQLQGAIAQL NQTIQNITNG SNSGNTGNNG NNTGQTVSGQ WITGNPSTYT PPVASGIGST
AAGSGSVASG ANSVAIGDGA SASGNNSVAL GAHSVASAPN TVSVGSVGNE RTISNVAPGV
NGTDAVNVNQ LNSGIGNAVG QANQYTDQKV DHLRREMNGG VAAAMAVAGL PQPTAPGKSM
VAIAGSTWQG QQGFALGVST ISENGKWLYK GSLTTSTRGG TGAVLGAGYQ W