Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_1874 |
Symbol | |
ID | 4788684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | + |
Start bp | 1921623 |
End bp | 1924085 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | glycosyl transferase, group 1 family protein |
Protein accession | YP_001025671 |
Protein GI | 124383299 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGCG ATCTTGCAGA ACATCCGTTG CAGCGCGCCG CGACGTTTCA GGACGTCGAG CGCGCCGCGC GCGACGCGCG CAGCGCCGAT GCGCTGCCGC CCGCGCATCG CGCGCGCGAT GCGGCGCGGC CGCTGCGCGT GGCGATCGTG CACGACTGGC TCGTCACCTA CGCGGGCGCC GAGCGCGTGC TCGAGCAGAT CGTCGCGTGT TTTCCGGATG CGGATCTGTT CGGCCTCGTC GATTTCCTCG ACGACCGCAC GTTCCTGCGC GGCAAGCCGG TCACCACGTC GTTCATCCAG AGGCTGCCGT ACGCGCGCAC GAAGTACCGC AGCTACCTGC CGCTGATGCC GCTCGCGATC GAGCAGCTCG ACGTGTCCGG GTACGATCTC GTGATCTCGA GCAGCCACGC GGTCGCGAAG GGCATCCTGA CCGGGCCCGA CCAGGTGCAT GTGAGCTACG TGCATTCGCC GATCCGCTAC GCGTGGGATC TCCAGCATCA GTACCTCGAG CAGTCGCAGT TGACGCGCGG CGTCAAATCG GCGCTCGCGC GGCTGATCCT GCATTACATC CGCAACTGGG ACGTGCGCAC GTCGAATTCG GTCGACCGCT TCGTCGCGAA CTCGGCGTTC ATCGCGCGGC GCATCCGCAA GGTCTATCAG CGCGACGCGG CGGTCGTGTT TCCGCCCGTC GACGTCGACG CGTTCGCGCT GTCGACGCGG AAGGAGGACT TCTATCTGAC CGCCTCGCGG ATGGTGCCGT ACAAGAAGAT CGATCTGATC GTCGACGCGT TCGCGCAGAT GCCCGGGCGC CGGCTCGTCG TGATCGGCGA CGGGCCGGAC ATGCGCAAGA TCCGCGCAAA GGCCGCGCCG AACGTCGAGA TCATGGGCTA TCAGCCGTTC GCGGTGTTGC AGGACCGGAT GCGCCGCGCC AAGGCGTTCG TGTTCGCCGC CGAGGAGGAT TTCGGGATCT CGGTCGTCGA GGCGCAGGCC TGCGGCACGC CCGTCATTGC GTTCGGCAAG GGCGGCGCGC TCGAGACGGT GCGCGACGCG GCATCGCACG AGCGGCCGAC GGGCGTGTTC TTCGATGAGC AGAGCGTGCG GGCGATCGTC GCCGCGGTCG ACGATTTCGA GCGCGCGCCC GCGCGCTTCA AGCCGGAAGA TTGCCGCGCG AACGCCGAGC GGTTTTCCGC CGCGCATTTC CGGCGGCGCT TCGTCGCGCA GATCGACGCG CTGCTGCCGA ACGCGCAGGC GCGCGCGAAG CTCGCGGGCG CGATGCGGCC GGCCGGCCGC GCGTCGCTCG CGGCGAGCGG CCTGAGGGCG CTCGTGCTCG ACCAGAGTGG CGTGCTGGGC GGCGCCGAGC TGTCGCTGCT CGAGATCATG AAGCACCTGC GCGAGTCGGC CGACGTCGTG CTGTTCGACG ACGGGCCGTT TCGCGCGGCA CTGGACGACG CGGGCGTGCA CGTGGACGTG GTCGGCCAGC GCGCGCTCGC GGGCGTGCAC AAGCAGGGCG GCGTGTCGCT GCGCGCGGCG GGCAGCCTGC TCGCGCTCGT GCGCGAGGTC GCGCGGCGCG CGCGCGACGC CGACGTGATC TACGCGAACA CGCAGCGCGC GATGGTGGTC GGCGCGCTCG CGGGGCGGCT CGCGCGCAAG CCGGTGGTCT GGCATCTGCG CGACATCGTG AGCGACGCGC ACTTCGGCCC GAAGCAGCGG CTCGCGATCA AGCAGTGCGC GCGGCTCGGC GTGACGCGCG TGATCGCGAA TTCGGACGCG TCCGCGCGCG CGTTTCTCGA ATTGACGGGC TTCGAGCGGC GCGCGGTGCA GGTGGTCTTC AACGGCATCT CGGCCGAACC GTTCGTCGCG CTGGAACCGG TCCGCCCGGC CGCGCTGCGC GTGCGCTTCG GGCTGCCCGC CGATGCGTGG ATCGTCGGCT CGTTCAGTCG GCTCGCGCAC TGGAAAGGGC AGCACGTGCT GCTCGAGGCG GCGCGGCTCT ATCCGGACAT GCACGTCGCG CTCGTCGGCG CGCCGCTTTT CGGCGAGGAC GAGTACGCGG CCGAGCTGCG CGGCTTCGTC GCGCTGCACG GGCTCGGCGA GCGCGTGCAT TTCCTCGGCT TTCAGCGCGA CGTCGCCGCC TGCATGAAGG CGGTCGACGT CGTCGCGCAC ACGTCGATCA CGCCGGAGCC GTTCGGCCGC GTGATCGTCG AGGGGATGCT CGCGAAGCGG CCCGTCGTCG CGGCGCGCGC GGGCGGTGTC GTCGAGATCG TCGACGACGA CGTGAACGGC CTGCTCTGCG AGCCCGGCGA CGCGCATGCG CTTGCCGACG CGCTCGCCGC GCTGCGCACC GATGCGGTGC TTTGCGGGCG GCTCGTCGCG AACGGCTACG ACACCGCGGT GAACCGGTTC GGCACGCAGA TCTATGTCGA GCAGGTCGAG CGCATTCTCG TCGAGACCGC GCGGCGGCGG TGA
|
Protein sequence | MNRDLAEHPL QRAATFQDVE RAARDARSAD ALPPAHRARD AARPLRVAIV HDWLVTYAGA ERVLEQIVAC FPDADLFGLV DFLDDRTFLR GKPVTTSFIQ RLPYARTKYR SYLPLMPLAI EQLDVSGYDL VISSSHAVAK GILTGPDQVH VSYVHSPIRY AWDLQHQYLE QSQLTRGVKS ALARLILHYI RNWDVRTSNS VDRFVANSAF IARRIRKVYQ RDAAVVFPPV DVDAFALSTR KEDFYLTASR MVPYKKIDLI VDAFAQMPGR RLVVIGDGPD MRKIRAKAAP NVEIMGYQPF AVLQDRMRRA KAFVFAAEED FGISVVEAQA CGTPVIAFGK GGALETVRDA ASHERPTGVF FDEQSVRAIV AAVDDFERAP ARFKPEDCRA NAERFSAAHF RRRFVAQIDA LLPNAQARAK LAGAMRPAGR ASLAASGLRA LVLDQSGVLG GAELSLLEIM KHLRESADVV LFDDGPFRAA LDDAGVHVDV VGQRALAGVH KQGGVSLRAA GSLLALVREV ARRARDADVI YANTQRAMVV GALAGRLARK PVVWHLRDIV SDAHFGPKQR LAIKQCARLG VTRVIANSDA SARAFLELTG FERRAVQVVF NGISAEPFVA LEPVRPAALR VRFGLPADAW IVGSFSRLAH WKGQHVLLEA ARLYPDMHVA LVGAPLFGED EYAAELRGFV ALHGLGERVH FLGFQRDVAA CMKAVDVVAH TSITPEPFGR VIVEGMLAKR PVVAARAGGV VEIVDDDVNG LLCEPGDAHA LADALAALRT DAVLCGRLVA NGYDTAVNRF GTQIYVEQVE RILVETARRR
|
| |