Gene BMASAVP1_A1361 details


Gene Information

Locus tag: BMASAVP1_A1361
Symbol:
ID: 4680224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei SAVP1
Kingdom: Bacteria
Replicon accession: NC_008785
Strand:
Start bp: 1331802
End bp: 1335074
Gene Length: 3273 bp
Protein Length: 1090 aa
Translation table: 11
GC content: 70%
IMG OID: 639845632
Product: haemagluttinin family protein
Protein accession: YP_992693
Protein GI: 121598713
COG category:
COG ID:
TIGRFAM ID:
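
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon. Both are assumptions for illustration, not statements taken from the record.

```python
# Values copied from the record above.
start_bp = 1331802
end_bp = 1335074

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.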


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA CTTATCGGGT TAGCTGGAGC GCGTCGCGGG GTGCGTGGAT GGTGGCGCCG 
GAGACGGCGC GTCGCAAAGG GAAAGGACAT TCGCTGACGA TCGTGTGCGC GATCGCCTCA
GGCCTGCTGC TTGCGGCGCC TGCGTGGGCG GACACGGTGT CGCCGTCGGG CACGGATAAC
GTCTACGGCG TCGACGCGAC CGATCCCGGC GTGTCGACGA ACCAGGGCAA TACGGCCTAC
GGGGCGCAGG CGGGCGCGAA GGTCACGGGT TCGTACAACA CCGCGATCGG GTATCAAGCA
GGGCAGAACG TGAACGTCAT CGATACCGTA TCGATCGGCA AGCAGGCCAC CGCGAGCGCG
AATGACGCGA TCGCGATCGG CACGAACACG AAGGCGAGCG GGCCGGCCGA CATCTACATG
GGGCTGAACG CAGGCGCCGG CGCCGGCTCG ACGACGAGCC CGGACGGCAC CGTCACGCTC
GGCATTCGCA ACATGGGCCT CGGGGAATCC GCGGGCTCGT ACGTGACGGG CCAGAACAAC
ACGGGGATCG GCTATCAGTC GGGCATGAAC GTGACGGGCG ACCAGAACGT CGGCCTCGGG
CAGCAGGCGG GACAATTCGT GACCGGGACC GGCAACTCGG CGATGGGGCA TCTGGCGGGG
TCGACGGTGT CGGGCAGCTA CAACGCCGCG TTCGGCGAGT ATGCGGGGAC CAACACGAGC
GGCGGCGCCA ATGCCGCGTT CGGCTTCTAT GCGGGGCGCT ACATCAACGG CACGAACAAC
ACGGCGCTCG GCGCGTACGA TCTGCCGGTC GTCAATGGCA CCTGGTACGG TTCGTACGTG
ACGGGCAGCA ACAACCTCGG CGCCGGCCAT AATTCGGGCG CCTACGTGAG CGGCGCGAGC
AACGTCGGGC TCGGCGACGG CGCGGGCACG TTCGTGACCG GCAGCAACAA CGTCGCCATC
GGCACGGCAG CGGGCTCGGG CGCGTATACC AGCGGTCCGA GCGGCGCGAC GCTCAACGCG
GCGCTCGTCG CGAGCAACAC CGTGAGCATC GGTACCCGCG CCACGGCGAG CCAGAGCGAC
GCGATCGCGA TCGGCAAGGG CGCGACCGCG AGCGGCGCGC AATCGATCAG CATCGGCACC
GGCAACGTCG TGAGCGGCAA GGGAAGCGGC GCGATCGGCG ATCCGAGCAC CGTCAGCGGC
GCGGGGTCCT ATTCGATCGG CAACAACAAT ACCGTCGCGA ACAGCAACAC GTTCGTGCTC
GGCAACGGCG TGACGACGAC GCAGGACAAC AGCGTCGTGC TCGGCAATCA GAGCACCGAC
CGCGCGGCCG TCGCGGTTTC GAGCGAAACC ATCAATGGCA CGACGTACAA CTACGCGGGC
GTCGCGAGCC CGGCCAACGG CGTCGTCAGC ATCGGCGGCG TGGGCACGGA ACGCCAGCTC
ATCAACGTGG CGGCGGGCCA GGTGAGCGCG ACCAGCACGG ACGCGATCAA CGGCAGCCAG
CTGTACGCGA CGAACCAGGC GGTGATCGCC GAGGACGCGA AAGTGAATTC GCTCGGCGGC
GGCGTGGCGA GCGCGCTCGG CGGCAACGCG GCGTACAACG CGACGACCGG CGCGATCACC
GCGCCGAGCT ACGCGGTCTA CGGGACCACG CAAAACTCCG TGGGCGGCGC GATCGATGCG
CTGCAGGCCC TCGCGCCGCT GCAGTACACG TCCGGCCCGG GCGTGACCAC GCCGAACGCG
CCGGGATCGG CGCCGACGAA CACGGTGACG CTCGTCGGCG CCGGCGGGCC GGGAGCCAAC
ACCACGCCGG TGACGCTCAC GAACGTCGCG CCGGGCAAAC TCTCCGCGAC CAGCACGGAC
GCGGTCAACG GCTCGCAGCT CTACGCGACC AACCAGCAGG TCGCGAACCT CGTGAGCTCG
GTGAACAACG GCGGCGTCGG CCCGGTGCAG TACAGCGATC CTAGCGCGCC GACGACGCCC
AACGGCGGCA AGCCCTCGCA GGACCTGACG CTCGTCGGCG CGGCAAGCGG CCCTGTCGCG
CTGCATAACG TCGCGCCGGG CACGGCGTCC ACCGATGCGG TCAACGTCGG GCAGCTCGGC
GCGGTGACGA CCGGCCTGGG CGGCGGCGCG GCGATCGATC CGAAGACGGG CGCCGTGACC
GCGCCGTCGT ACACGGTCTA CAACGCCGAC GGCACGACGT CGAACGTCAG CAACGTCGGC
GCGGCGATCG ATGCGATCAA CTCGACCGGC ATCAAGTATT TCCACGCGAA CAGCACGAAG
CCGGACAGCC AGGCGCTCGG CGCGGACAGC GTCGCGATCG GCCCGAACGC CGTCGCGAAC
AACGCGGGCG ACGTCGCGCT CGGTTCGGGA GCGGTCACGT CGCAAGCGGG CGGCACGCTG
AGCGAAACGA TCAACGGCGT GACCTACTCG TTCGCCGGCA CGACGCCGAT CGGCACGGTG
AGCGTCGGCG CGCCGGGCGT CGAGCGCACG ATCACCAACG TTGCCGCGGG GCGCATCGGG
CAGTCGAGCA CGGACGCGAT CAACGGCTCG CAACTGTACG GCACCAACCA GTCGATCGAG
GCGTTGACGG ACAAGATGAA CAGCCTCGGC AACACCGTGG CGAACACGCT CGGCAGCGGC
GCGTCGTACA ACCCGCAAAC AGGCGCGGTG AACGGCCCGG CCAACTCGGG CGGCGTGGTC
ACGCCCACGG TGATCCAGGA GGCGGCGAAC AAATGGGTGA GCGCCAATCC GTCGACCTAC
GTGGCGCCCG TCGCGACGGG CACGAACGGC ATGGCGGTCG GCAGCGGCGC GGTTTCGACG
GGCCAGAACT CGGTCGCGCT CGGCACGAAC GCGTCGGACG GCGGCCGCTC GAACGTCGTG
AGCGTCGGGG CGCCGGGCGC GGAGCGCCAG GTGACGAACG TGGCGGCCGG CACGCAGGCG
ACCGATGCGG TCAACCTCGG GCAGATGAAC GGCGCGCTCG CGCAGCAAAC CGACAGCTTC
AATCAGCGGC TGGGCGCGGT TCAGCAGGAC GTCGACAACG TCGCGCGCGC CGCCTACGGC
GGCATCGCGG CCGCGACCGC GCTCACGATG ATCCCCGAGG TCGACAAGGA CAAGACGATC
GCGGTGGGCA TCGGCGGCGG CACGTATCGC GGCTACCAGG CGGTGGCGCT CGGCGCGACG
GCGCGCATCA CCGAGAACAT CAAGGTTCGT GCGGGCGTCG GCATGAGCTC GGGCGGGACG
ACGGCCGGCA TCGGCGCATC GATGCAGTGG TAA
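
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAACAAGA CTTATCGGGT TAGCTGGAGC GCGTCGCGGG GTGCGTGGAT GGTGGCGCCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence through the same two functions gives a value that can be compared with the GC content field of the record.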
 
Protein sequence
MNKTYRVSWS ASRGAWMVAP ETARRKGKGH SLTIVCAIAS GLLLAAPAWA DTVSPSGTDN 
VYGVDATDPG VSTNQGNTAY GAQAGAKVTG SYNTAIGYQA GQNVNVIDTV SIGKQATASA
NDAIAIGTNT KASGPADIYM GLNAGAGAGS TTSPDGTVTL GIRNMGLGES AGSYVTGQNN
TGIGYQSGMN VTGDQNVGLG QQAGQFVTGT GNSAMGHLAG STVSGSYNAA FGEYAGTNTS
GGANAAFGFY AGRYINGTNN TALGAYDLPV VNGTWYGSYV TGSNNLGAGH NSGAYVSGAS
NVGLGDGAGT FVTGSNNVAI GTAAGSGAYT SGPSGATLNA ALVASNTVSI GTRATASQSD
AIAIGKGATA SGAQSISIGT GNVVSGKGSG AIGDPSTVSG AGSYSIGNNN TVANSNTFVL
GNGVTTTQDN SVVLGNQSTD RAAVAVSSET INGTTYNYAG VASPANGVVS IGGVGTERQL
INVAAGQVSA TSTDAINGSQ LYATNQAVIA EDAKVNSLGG GVASALGGNA AYNATTGAIT
APSYAVYGTT QNSVGGAIDA LQALAPLQYT SGPGVTTPNA PGSAPTNTVT LVGAGGPGAN
TTPVTLTNVA PGKLSATSTD AVNGSQLYAT NQQVANLVSS VNNGGVGPVQ YSDPSAPTTP
NGGKPSQDLT LVGAASGPVA LHNVAPGTAS TDAVNVGQLG AVTTGLGGGA AIDPKTGAVT
APSYTVYNAD GTTSNVSNVG AAIDAINSTG IKYFHANSTK PDSQALGADS VAIGPNAVAN
NAGDVALGSG AVTSQAGGTL SETINGVTYS FAGTTPIGTV SVGAPGVERT ITNVAAGRIG
QSSTDAINGS QLYGTNQSIE ALTDKMNSLG NTVANTLGSG ASYNPQTGAV NGPANSGGVV
TPTVIQEAAN KWVSANPSTY VAPVATGTNG MAVGSGAVST GQNSVALGTN ASDGGRSNVV
SVGAPGAERQ VTNVAAGTQA TDAVNLGQMN GALAQQTDSF NQRLGAVQQD VDNVARAAYG
GIAAATALTM IPEVDKDKTI AVGIGGGTYR GYQAVALGAT ARITENIKVR AGVGMSSGGT
TAGIGASMQW
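
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch, not the method used to produce the record. It builds the codon table from the NCBI-ordered amino acid string, which translation table 11 shares with the standard code for codon-to-amino-acid assignments (the two tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, copied verbatim.
first_30 = "ATGAACAAGA CTTATCGGGT TAGCTGGAGC"
print(translate(first_30))  # MNKTYRVSWS
```

The result matches the first ten residues of the protein sequence, and translating the final twelve bases (`ATGCAGTGG TAA`) yields `MQW`, the last three residues, followed by the stop codon.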