Gene BURPS668_A2480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2480 
Symbol 
ID4887868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2397897 
End bp2400641 
Gene Length2745 bp 
Protein Length914 aa 
Translation table11 
GC content68% 
IMG OID640132417 
Producthemagglutinin-like protein 
Protein accessionYP_001063474 
Protein GI126444573 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.168613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAGGA ATGAGGTCGT GAACAGGAAC GTGTTTCGTT TGGTGCTGAA CAGGGTGGCG 
GGCATGCCGG TGCCGATGCC GGCGGCGGAG GTGTCGCGCG GGCGCGGCAA GCTCGGCTGC
GGCGGCGTGC GTGCGCAACG TCGCGGCGGT GCGGCGTGCG CGGCGCTGCT TGGGGTGGCC
GGGCCGTCCT TGGCGTTCGC GGCGGTGGTG GCGGACCCGA ACGGGGGCGC GCAGCGGCCC
GGCATGGCGA CGACGGCGAA CGGGACGGAC CTGGTCAATA TCGTCGCGCC GGACGCGACG
GGGTTGTCGC ACAACAAGTT CAACGAGTTC AGCCCGGTTG GACGCGGCGT GGTGTTGAAC
AACAGCGTGC GGCCCGGGGA ATCGCAGATC GGCGGCATGG CGGCGCAGAA CCCGAACTTG
ATGCAACCGG CCACCCGGGC ATTGCTCGAG GTGACGCAGC AACGCAGCGT GCTGCAGGGC
ACGCTGGAGG CGTTCGGCGG CAAGCTCGAC GTGCTGGTGG CGAACCAGCA TGGAGTGACG
ATCAACGGCT TGACGACGCT GAACGTGGGC CGGCTCGGCG TGACGACGGG GCAGGTGCTG
CCGCAAGCGG CCGGGCAGTT GCGTTTGGGC GTGACGCAAG GCGACGTGCT GATCGACCAT
GGGGGCATCG ATACCCAGGG CCTGGACATG TTCGACGTGG TGAGCCGCAG CATCGCCGTG
CGCGGGCCGA TCCACGATTC GAGCCGCGCC GCGGGCGCCG ACGTGCGCCT CGTGGCGGGC
GCGACGGCCT ACGATCCGCA GACCGGTCAT TATGAGGCGA TCGCGGCGGA CGAATCGAAG
GCGCCGGTGC AGGAGGGAAT CAGCGGCGAA CTGCTGGGAG CGATGCACGG CCGTCACATT
GTGCTGGTGA GCACGGAATC GGGCGTGGGC GTGCGGCACG ACGGACCGAT CAAGTCGGCG
AACGACATTC GGGTGAGCGC GAACGGCGAG GTGACGCTGG GCGGGCCGCA GCGGGCGGCC
CAGGAGACGG TTGCAGGAGC GCAGGCGGTA GGCGGCGCCG GCATGCAGAA CGTGATCGCG
GGCGGCACGG TGAGCGTCTG CGCGCGTGGG CACGTCGCGA TCCAGGGCGC GATGATCGCG
GGACAGGATG TGGATCTGCA GGGGAAAAGC GTGAAGGCTG GCCGGATGAG CGCGCAGCGC
GACGCGCTGG TGACGGCGGC GGATGGCGTG ACGCTCGATG GTCCGGTGGA CGCGAAGCGT
CACGTGTGGA TCGGAGCCCA CGGTGATGTG GTGATCCGTG AAGCGGCGGC GGGGCAGAAC
GTGGTGCTGC TGGGGCGCAG CGTAACGGCC GGCCGGTTGG ACGCGCAGCG CGACGTATTG
GCGGCGGCCC GCGACGGCGT GACGATCCAT GAAGCGGCGG CCGCGGGGCA GGACGTGGTG
CTGCAGGGAA GCAGCGCGAG GGTCGGCCAG ATGAGCGCGC AGCGCGATGT GCTGGTGATG
GCGGCAGATG GCGTGACGCT CGATGGGCCG GTGAGCGCGC AGCGCGCCGT ATGGGTCGAG
ACCCAAGGTG ACGTGACGGG CAGTGAGTGG ATCAAGGCCG GACGGGACGT GCAAATCGGC
GCGGCGGCGG ATCTGGCGGG CGCGGTAACG GCCGAAGAGA TGCAGCAACT CAAGGCCCAT
GGTGACGCGG CGAACAGGCG GCGCGTCAAA GCCGGACGGA ACGAGCCAGC CGGCACGGCG
GCTGAACGTC CGGCCGCGGC GGAGCAGACG GTGGCCGTCG CTGACGCGAT GCGCGAGATC
GGCGTGGGCG GCGATCGGCT GTCCGGATTG GATGCCGCGC CGGGTACGCC GGGTACGCCC
TTCGGCGCAC ACCCGCAAGC GATGTTCGAC GATCCGGCGG CGCAGATTGC GCGATCGGCT
CGATCCACGG CAACGGCGGG CGGACATGCG GGTTCGTTCA TGCGCGTCGG AGACGGTTAC
ATCGCCAAAA TGACCACGTC CAGAGAGGCG GAGATATACG AGAATTATCG CTTGGCTCTT
GCCGGCGTCA TCCCCGACAC CGTGCCGCCT GAACAGATGG ATTCGCTGGC CGGTGTCACG
GCCAGGCAGA GGCAGGCCAT GGCGAGTTTC AAGGAGTGGG CGGAGGTGAA CGACCAGCGG
GTTGTCGTCA TGCAGGCGCT GGGCGCGGAG ATCGCGCCGG AGGACAAGAT CGAGCTGGAC
GTCAAGATCG GCGCCAGTAC GGTGTCGCGC ACCGAGTTGA TCGGCGCCGG CAGGACTCGC
TGGCAGGCCT TGAGCAAGAA GGTGAGATTG ACGGCGGCGG ACCTGCTGCG GGGCTCGCGT
TCGCTGGTGG GCGACGATCG CGGCTATACG CTCGCCGGCC GCACGAGCGG GGGGATTGCC
CTGGACGCGA GGAATTCACG CAACTCCGTC GGCCGATCCA GCGAATCGCT GATTCGCGAG
GCGCTGGATC GCTCGCCCGA TACGCGCTGG CGGAACGCGC AGCACTTGCT CGGGCAGTTG
CAGACCATTC GAGAGAAGAT GCACGCGTTG CCGCTCACCT TCGTCGCCTC CAGCGTCCTC
ATTGCAATCG ACAAACGGAA ACCGGAAAAC TCGGTCGCCC GGCTGATCGA TCTCGCGCAC
CCGGTGCAGC CTTTCGAAAA CGAAGCGGAC TATGAGAAAG TCAATCACCG CTTCGAGGAT
GGTCTTGACA AGCTGATCAG ACTCTTCCAG CAGGTGGAAA AATAG
 
Protein sequence
MQRNEVVNRN VFRLVLNRVA GMPVPMPAAE VSRGRGKLGC GGVRAQRRGG AACAALLGVA 
GPSLAFAAVV ADPNGGAQRP GMATTANGTD LVNIVAPDAT GLSHNKFNEF SPVGRGVVLN
NSVRPGESQI GGMAAQNPNL MQPATRALLE VTQQRSVLQG TLEAFGGKLD VLVANQHGVT
INGLTTLNVG RLGVTTGQVL PQAAGQLRLG VTQGDVLIDH GGIDTQGLDM FDVVSRSIAV
RGPIHDSSRA AGADVRLVAG ATAYDPQTGH YEAIAADESK APVQEGISGE LLGAMHGRHI
VLVSTESGVG VRHDGPIKSA NDIRVSANGE VTLGGPQRAA QETVAGAQAV GGAGMQNVIA
GGTVSVCARG HVAIQGAMIA GQDVDLQGKS VKAGRMSAQR DALVTAADGV TLDGPVDAKR
HVWIGAHGDV VIREAAAGQN VVLLGRSVTA GRLDAQRDVL AAARDGVTIH EAAAAGQDVV
LQGSSARVGQ MSAQRDVLVM AADGVTLDGP VSAQRAVWVE TQGDVTGSEW IKAGRDVQIG
AAADLAGAVT AEEMQQLKAH GDAANRRRVK AGRNEPAGTA AERPAAAEQT VAVADAMREI
GVGGDRLSGL DAAPGTPGTP FGAHPQAMFD DPAAQIARSA RSTATAGGHA GSFMRVGDGY
IAKMTTSREA EIYENYRLAL AGVIPDTVPP EQMDSLAGVT ARQRQAMASF KEWAEVNDQR
VVVMQALGAE IAPEDKIELD VKIGASTVSR TELIGAGRTR WQALSKKVRL TAADLLRGSR
SLVGDDRGYT LAGRTSGGIA LDARNSRNSV GRSSESLIRE ALDRSPDTRW RNAQHLLGQL
QTIREKMHAL PLTFVASSVL IAIDKRKPEN SVARLIDLAH PVQPFENEAD YEKVNHRFED
GLDKLIRLFQ QVEK