Gene BURPS668_A1320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1320 
Symbol 
ID4888050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1244826 
End bp1247276 
Gene Length2451 bp 
Protein Length816 aa 
Translation table11 
GC content70% 
IMG OID640131259 
Producthaemagluttinin family protein 
Protein accessionYP_001062317 
Protein GI126444421 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTGCGG TTTCTGAATT TTCCCGGTCG AATGGCAAGT GTTCGACGAC GCAAGTCGTC 
ACGGCGGCGC CGGGCGTTGC CGGTCGTACC GCGGCTTCTG GCCGATCGCG CCCGTCGTGG
ACGAAGCTCG GGCTGATGTC GCTGGCGGTG AGCGCGGCGA TGGGCTGCAT GGCGACCGAC
GCCGCGGCGC AGGTCAGCTA TGCGGCGGGC GAGAACGCCT ATGCCGGCCC CGGCGGCAAT
ACCGGCCCGT GGGCGTTCTA CAACCCGGCC TTCAGCGCGG GCACGCTGCT GTACGGCACC
GCGGTCGGCA ACTACGCCTA TGCGAACGGC GAGGGCAGCT CGGCCTACGG CGATCACGCG
ACGGTGAAGG GGCGCATCGG CTCCGCGTTC GGCGCGTATT CGGAAGCGGC GGGCGACGGC
AGCACTGCAA TCGGCGCCAG CGCGCGGGCG CTGCCGGACT TCAGCATCGC GATCGGCACG
AACGCGCAGG CGCTGAAGGA CACGGGCCAA TCGATTCCCG GCCGCGAAGA CATCGGCACG
ATCGCGATCG GCGCGGGCGC GCTCGCGCAG GGCGACAACA GCGATCCGCT GCACGTGTCC
GCGCCGAACG CGTTCGGCGG CTATTCGAGC GCGACGGCAA GCGGCGCGGT GGCGCTGGGC
GAAGGCGCCG CATCGTCCGG CTATTACGCG AACGCGCTCG GCTCGTATTC GAAGGCGTCG
GGTGCGGGCG CGGTCGCGGT GGGCGGCGGC GCGCAAGCGA GCGCGCAAGG CGCGGTGGCG
ATCGGCGGCG CGACGAGCGT CGACAACGCA ACCGCGCTGT CCGGCTACGC GAGCGCAAGC
GGCGTCAATG CGATCGCGAT CGGTTCCGGC GCGCAGGCGA CGGGCGCCCG GTCGATCAGC
ATCGGCACGG GCAACGTCGT GTCGGGGGCG AGCTCGGGCG CCTTCGGCGA TCCGTCGACG
GTCACGGGCA CGGGCTCGTA TTCGTTCGGC AACAACAACA CGATCAATTC GAACAACGCG
TTCGTGCTCG GCAACAACGT GACGATCGGC CCAGGGTTCG ACGGCTCGGT CGCGCTCGGT
AGCGGCACGA CGGTCGCTGC GGCGAACCCC ACCGGCAGCG CGACGATCAC GACGAGCTCG
GGCGGCCAGT TGACGCTGTC CGGCTTTGCC GGCGCGAATC CGACGAGCGT CGTCAGCGTC
GGCGCGCCCG GCGCCGAGCG CCAGATCACG AACGTCGCGG CGGGGCGCAT CACGCCGACG
TCGACGGATG CCGTCAACGG CAGCCAGCTG TATGCGGTCG CGAGCACGAT CGACAATGCG
GTGAACGGCG GCGGGATCAA GTACTTCCAC GCGAATTCGA CTCTGGCCGA TTCGACGGCG
GCGGGCACGG ACAGCGTCGC GGTCGGGCCG GCCGCGCTCG CCTACGGCAA CGATTCGATC
GCCGAAGGCA CGAACGCGAC GGCGGGCGTG AGCGGCAATC CGGCGGTGGC GGGCGATGTC
GCGCTCGGCA GCGGCGCGCA GGCGACGGGC GGCCGCTCGC TCGCGCTCGG CGCGAACGCG
TCGGTCAACA CGGCGGGCGG CGTGGCGCTC GGCGCCGGCT CGGTCGCGAA CCGCGCGGCC
GGCACGTACA CCGATCCGAT CACGGGCAGC AGCTTCACGA CCGCATTCGG CGCGGTGTCG
GTCGGCCTCG AGGGTTCGCT GCGCCAGATC ACCAACGTCG CGGCGGGTAC GCAGGCAACG
GATGCGGTAA ACGTCGGTCA GTTGCAAGGC GCGATTGCGC AGTTGAATCA GACGATCCAG
AACATCACGA ACGGCTCCAA CTCGGGCAAC ACCGGCAATA ACGGCAACAA CACCGGGCAG
ACCGTGTCGG GCCAGTGGAT CACGGGCAAC CCGTCGACCT ATACGCCGCC CGTGGCGAGC
GGCATCGGCT CGACCGCCGC GGGCAGCGGC AGCGTGGCGT CCGGCGCGAA CAGCGTCGCG
ATCGGCGACG GCGCGTCGGC CTCCGGCAAC AACTCGGTGG CGCTCGGCGC CCATTCGGTC
GCGAGCGCGC CGAACACGGT GTCGGTCGGC TCGGTCGGCA ACGAGCGGAC GATCTCGAAC
GTCGCGCCGG GCGTGAACGG CACCGATGCG GTGAACGTGA ACCAGTTGAA CAGCGGTATC
GGCAATGCGG TCGGCCAGGC GAATCAGTAC ACGGATCAGA AGGTCGACCA TCTGCGGCGC
GAGATGAACG GCGGCGTGGC CGCGGCGATG GCCGTGGCGG GCTTGCCGCA GCCGACCGCG
CCCGGCAAGA GCATGGTCGC GATCGCCGGC TCGACGTGGC AGGGGCAGCA GGGCTTCGCG
CTTGGCGTAT CGACGATTTC CGAGAACGGC AAGTGGCTGT ACAAGGGCTC GCTCACGACC
AGCACGCGCG GCGGCACGGG CGCGGTGCTC GGGGCCGGTT ATCAGTGGTG A
 
Protein sequence
MIAVSEFSRS NGKCSTTQVV TAAPGVAGRT AASGRSRPSW TKLGLMSLAV SAAMGCMATD 
AAAQVSYAAG ENAYAGPGGN TGPWAFYNPA FSAGTLLYGT AVGNYAYANG EGSSAYGDHA
TVKGRIGSAF GAYSEAAGDG STAIGASARA LPDFSIAIGT NAQALKDTGQ SIPGREDIGT
IAIGAGALAQ GDNSDPLHVS APNAFGGYSS ATASGAVALG EGAASSGYYA NALGSYSKAS
GAGAVAVGGG AQASAQGAVA IGGATSVDNA TALSGYASAS GVNAIAIGSG AQATGARSIS
IGTGNVVSGA SSGAFGDPST VTGTGSYSFG NNNTINSNNA FVLGNNVTIG PGFDGSVALG
SGTTVAAANP TGSATITTSS GGQLTLSGFA GANPTSVVSV GAPGAERQIT NVAAGRITPT
STDAVNGSQL YAVASTIDNA VNGGGIKYFH ANSTLADSTA AGTDSVAVGP AALAYGNDSI
AEGTNATAGV SGNPAVAGDV ALGSGAQATG GRSLALGANA SVNTAGGVAL GAGSVANRAA
GTYTDPITGS SFTTAFGAVS VGLEGSLRQI TNVAAGTQAT DAVNVGQLQG AIAQLNQTIQ
NITNGSNSGN TGNNGNNTGQ TVSGQWITGN PSTYTPPVAS GIGSTAAGSG SVASGANSVA
IGDGASASGN NSVALGAHSV ASAPNTVSVG SVGNERTISN VAPGVNGTDA VNVNQLNSGI
GNAVGQANQY TDQKVDHLRR EMNGGVAAAM AVAGLPQPTA PGKSMVAIAG STWQGQQGFA
LGVSTISENG KWLYKGSLTT STRGGTGAVL GAGYQW