Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_A1361 |
Symbol | |
ID | 4680224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | + |
Start bp | 1331802 |
End bp | 1335074 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639845632 |
Product | haemagluttinin family protein |
Protein accession | YP_992693 |
Protein GI | 121598713 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA CTTATCGGGT TAGCTGGAGC GCGTCGCGGG GTGCGTGGAT GGTGGCGCCG GAGACGGCGC GTCGCAAAGG GAAAGGACAT TCGCTGACGA TCGTGTGCGC GATCGCCTCA GGCCTGCTGC TTGCGGCGCC TGCGTGGGCG GACACGGTGT CGCCGTCGGG CACGGATAAC GTCTACGGCG TCGACGCGAC CGATCCCGGC GTGTCGACGA ACCAGGGCAA TACGGCCTAC GGGGCGCAGG CGGGCGCGAA GGTCACGGGT TCGTACAACA CCGCGATCGG GTATCAAGCA GGGCAGAACG TGAACGTCAT CGATACCGTA TCGATCGGCA AGCAGGCCAC CGCGAGCGCG AATGACGCGA TCGCGATCGG CACGAACACG AAGGCGAGCG GGCCGGCCGA CATCTACATG GGGCTGAACG CAGGCGCCGG CGCCGGCTCG ACGACGAGCC CGGACGGCAC CGTCACGCTC GGCATTCGCA ACATGGGCCT CGGGGAATCC GCGGGCTCGT ACGTGACGGG CCAGAACAAC ACGGGGATCG GCTATCAGTC GGGCATGAAC GTGACGGGCG ACCAGAACGT CGGCCTCGGG CAGCAGGCGG GACAATTCGT GACCGGGACC GGCAACTCGG CGATGGGGCA TCTGGCGGGG TCGACGGTGT CGGGCAGCTA CAACGCCGCG TTCGGCGAGT ATGCGGGGAC CAACACGAGC GGCGGCGCCA ATGCCGCGTT CGGCTTCTAT GCGGGGCGCT ACATCAACGG CACGAACAAC ACGGCGCTCG GCGCGTACGA TCTGCCGGTC GTCAATGGCA CCTGGTACGG TTCGTACGTG ACGGGCAGCA ACAACCTCGG CGCCGGCCAT AATTCGGGCG CCTACGTGAG CGGCGCGAGC AACGTCGGGC TCGGCGACGG CGCGGGCACG TTCGTGACCG GCAGCAACAA CGTCGCCATC GGCACGGCAG CGGGCTCGGG CGCGTATACC AGCGGTCCGA GCGGCGCGAC GCTCAACGCG GCGCTCGTCG CGAGCAACAC CGTGAGCATC GGTACCCGCG CCACGGCGAG CCAGAGCGAC GCGATCGCGA TCGGCAAGGG CGCGACCGCG AGCGGCGCGC AATCGATCAG CATCGGCACC GGCAACGTCG TGAGCGGCAA GGGAAGCGGC GCGATCGGCG ATCCGAGCAC CGTCAGCGGC GCGGGGTCCT ATTCGATCGG CAACAACAAT ACCGTCGCGA ACAGCAACAC GTTCGTGCTC GGCAACGGCG TGACGACGAC GCAGGACAAC AGCGTCGTGC TCGGCAATCA GAGCACCGAC CGCGCGGCCG TCGCGGTTTC GAGCGAAACC ATCAATGGCA CGACGTACAA CTACGCGGGC GTCGCGAGCC CGGCCAACGG CGTCGTCAGC ATCGGCGGCG TGGGCACGGA ACGCCAGCTC ATCAACGTGG CGGCGGGCCA GGTGAGCGCG ACCAGCACGG ACGCGATCAA CGGCAGCCAG CTGTACGCGA CGAACCAGGC GGTGATCGCC GAGGACGCGA AAGTGAATTC GCTCGGCGGC GGCGTGGCGA GCGCGCTCGG CGGCAACGCG GCGTACAACG CGACGACCGG CGCGATCACC GCGCCGAGCT ACGCGGTCTA CGGGACCACG CAAAACTCCG TGGGCGGCGC GATCGATGCG CTGCAGGCCC TCGCGCCGCT GCAGTACACG TCCGGCCCGG GCGTGACCAC GCCGAACGCG CCGGGATCGG CGCCGACGAA CACGGTGACG CTCGTCGGCG CCGGCGGGCC GGGAGCCAAC ACCACGCCGG TGACGCTCAC GAACGTCGCG CCGGGCAAAC TCTCCGCGAC CAGCACGGAC GCGGTCAACG GCTCGCAGCT CTACGCGACC AACCAGCAGG TCGCGAACCT CGTGAGCTCG GTGAACAACG GCGGCGTCGG CCCGGTGCAG TACAGCGATC CTAGCGCGCC GACGACGCCC AACGGCGGCA AGCCCTCGCA GGACCTGACG CTCGTCGGCG CGGCAAGCGG CCCTGTCGCG CTGCATAACG TCGCGCCGGG CACGGCGTCC ACCGATGCGG TCAACGTCGG GCAGCTCGGC GCGGTGACGA CCGGCCTGGG CGGCGGCGCG GCGATCGATC CGAAGACGGG CGCCGTGACC GCGCCGTCGT ACACGGTCTA CAACGCCGAC GGCACGACGT CGAACGTCAG CAACGTCGGC GCGGCGATCG ATGCGATCAA CTCGACCGGC ATCAAGTATT TCCACGCGAA CAGCACGAAG CCGGACAGCC AGGCGCTCGG CGCGGACAGC GTCGCGATCG GCCCGAACGC CGTCGCGAAC AACGCGGGCG ACGTCGCGCT CGGTTCGGGA GCGGTCACGT CGCAAGCGGG CGGCACGCTG AGCGAAACGA TCAACGGCGT GACCTACTCG TTCGCCGGCA CGACGCCGAT CGGCACGGTG AGCGTCGGCG CGCCGGGCGT CGAGCGCACG ATCACCAACG TTGCCGCGGG GCGCATCGGG CAGTCGAGCA CGGACGCGAT CAACGGCTCG CAACTGTACG GCACCAACCA GTCGATCGAG GCGTTGACGG ACAAGATGAA CAGCCTCGGC AACACCGTGG CGAACACGCT CGGCAGCGGC GCGTCGTACA ACCCGCAAAC AGGCGCGGTG AACGGCCCGG CCAACTCGGG CGGCGTGGTC ACGCCCACGG TGATCCAGGA GGCGGCGAAC AAATGGGTGA GCGCCAATCC GTCGACCTAC GTGGCGCCCG TCGCGACGGG CACGAACGGC ATGGCGGTCG GCAGCGGCGC GGTTTCGACG GGCCAGAACT CGGTCGCGCT CGGCACGAAC GCGTCGGACG GCGGCCGCTC GAACGTCGTG AGCGTCGGGG CGCCGGGCGC GGAGCGCCAG GTGACGAACG TGGCGGCCGG CACGCAGGCG ACCGATGCGG TCAACCTCGG GCAGATGAAC GGCGCGCTCG CGCAGCAAAC CGACAGCTTC AATCAGCGGC TGGGCGCGGT TCAGCAGGAC GTCGACAACG TCGCGCGCGC CGCCTACGGC GGCATCGCGG CCGCGACCGC GCTCACGATG ATCCCCGAGG TCGACAAGGA CAAGACGATC GCGGTGGGCA TCGGCGGCGG CACGTATCGC GGCTACCAGG CGGTGGCGCT CGGCGCGACG GCGCGCATCA CCGAGAACAT CAAGGTTCGT GCGGGCGTCG GCATGAGCTC GGGCGGGACG ACGGCCGGCA TCGGCGCATC GATGCAGTGG TAA
|
Protein sequence | MNKTYRVSWS ASRGAWMVAP ETARRKGKGH SLTIVCAIAS GLLLAAPAWA DTVSPSGTDN VYGVDATDPG VSTNQGNTAY GAQAGAKVTG SYNTAIGYQA GQNVNVIDTV SIGKQATASA NDAIAIGTNT KASGPADIYM GLNAGAGAGS TTSPDGTVTL GIRNMGLGES AGSYVTGQNN TGIGYQSGMN VTGDQNVGLG QQAGQFVTGT GNSAMGHLAG STVSGSYNAA FGEYAGTNTS GGANAAFGFY AGRYINGTNN TALGAYDLPV VNGTWYGSYV TGSNNLGAGH NSGAYVSGAS NVGLGDGAGT FVTGSNNVAI GTAAGSGAYT SGPSGATLNA ALVASNTVSI GTRATASQSD AIAIGKGATA SGAQSISIGT GNVVSGKGSG AIGDPSTVSG AGSYSIGNNN TVANSNTFVL GNGVTTTQDN SVVLGNQSTD RAAVAVSSET INGTTYNYAG VASPANGVVS IGGVGTERQL INVAAGQVSA TSTDAINGSQ LYATNQAVIA EDAKVNSLGG GVASALGGNA AYNATTGAIT APSYAVYGTT QNSVGGAIDA LQALAPLQYT SGPGVTTPNA PGSAPTNTVT LVGAGGPGAN TTPVTLTNVA PGKLSATSTD AVNGSQLYAT NQQVANLVSS VNNGGVGPVQ YSDPSAPTTP NGGKPSQDLT LVGAASGPVA LHNVAPGTAS TDAVNVGQLG AVTTGLGGGA AIDPKTGAVT APSYTVYNAD GTTSNVSNVG AAIDAINSTG IKYFHANSTK PDSQALGADS VAIGPNAVAN NAGDVALGSG AVTSQAGGTL SETINGVTYS FAGTTPIGTV SVGAPGVERT ITNVAAGRIG QSSTDAINGS QLYGTNQSIE ALTDKMNSLG NTVANTLGSG ASYNPQTGAV NGPANSGGVV TPTVIQEAAN KWVSANPSTY VAPVATGTNG MAVGSGAVST GQNSVALGTN ASDGGRSNVV SVGAPGAERQ VTNVAAGTQA TDAVNLGQMN GALAQQTDSF NQRLGAVQQD VDNVARAAYG GIAAATALTM IPEVDKDKTI AVGIGGGTYR GYQAVALGAT ARITENIKVR AGVGMSSGGT TAGIGASMQW
|
| |