Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_A1284 |
Symbol | |
ID | 4890131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | + |
Start bp | 1245121 |
End bp | 1248177 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640147552 |
Product | hypothetical protein |
Protein accession | YP_001078470 |
Protein GI | 126446951 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.878781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGAT TTCACGTTCA TCTCGTCGTC TTTTTCGCCG GCCTCGCCGC GGTCTGCTGG ATCGGCGCCG GCTATGCCGT CTCGAATCCC GTCGCGCTCG CCGTCACGCT GGTGATCGCC GCCTGCTATC TCGCCGGCGC GCTCGAATTG CGCCGCTACC GGCAAGCCAC GTCGACGCTC GCGCAGGCGG TCGCGGCGCT GTCCGAGCCG CCCGCGGCGC TCGGTGCGTG GCTCGAACGG CTGCATCCGA GCCTGCGCCA CGCGGTGCGT GTGCGCGTCG AGGGCGAGCG CGTCGCGCTG CCGGGCCCCG CGCTGACGCC TTATCTGGTC GGCCTGCTCG TGCTGCTCGG CATGCTCGGC ACGCTCATCG GGATGGTGAT GACGTTAAGG GGCACCGGCG CCGCGCTCGA AAGCTCGACC GATCTGCAGG CGATCCGCGC GTCGCTCGCC GCGCCCGTGA AGGGGCTCGG CTTCGCGTTC GGCACGTCGA TCGCCGGGGT GGCGACGTCG GCGATGCTCG GCTTGCTGTC GGCGCTTTGC CGGCGCGAGC GGCTCGACGC CGCGCAGGCG CTCGACGCGA AGATCGCGAC GACGCTGCGC GTGCATTCGC ACGCGCATCA GCGCGACGAA ACCTTCCGGC TGCTGCAGCG GCAAGCCGAC CTGATGCCGA CGCTCGTCGA GCGGCTGCAG GCGATGATGC ATTCGCTCGA GCAGCAGAGC GCCGCCTCGG CCGAGCGGCA GATCGCCGGC CAGCAGGCAT TTCTCGGCAA GGCCGAGGAG ACGTACGCGC GTCTCGCTTC GTCGGTCGGC CAATCGCTGA CGGACAGCGT CGCGGAAAGC GCGCGGGTGG CCGGCTCTGC GCTGCAGCCC GTGATGGAGA CGACGATGGC CGGGCTCGCG CGCGAGACGG CCGCGCTGCA CGATGCGCTC ACGCAGGCCG TGCAGCGCCA GCTCGACGGG CTGTCGGCCG GATTCGAGAC GACGGCGGCC CAGGTCGCGG ACGTCTGGCG TCACGCGCTC GCCGATCATC AGCGCTCGAG CGACGCGCTC GCGCAGCGCC TGCACGGCTC GATCGACCGG ATCGTCGAAT CGTTCGACCG GCGCTCCGCG GATTTGCTCG ACGGCGTGCG CGCGCGCCTC GACGCGACGG CAAGCAGCGT GTCCGACGCG TGGCGCGGCG CGCTCGCGCA ACAGGAGCAG GCGAACGAAG CACACGCCGA GCGCAACCGG CAGGCGCTGG AAACGGCCGC CGCGACGTTC GAGCGGCATT CCGCGGCGCT GCTGCGCACG ATGAGCGAAT CGCATTCGGC TCTGCAAGCA ACGCTCGAAT CGCGCGACGA GCAGCGTCTG GCCACATGGA CCGATTCGCT CGGCTCGATC GCCGCGAAGC TCGGCACCGA GTGGGCACAA ACCAGCGCGC AAGCCGCAAA CCGCCAGCAA ACGATCTGCG ATGCGCTCGC GCACACCGCG CGCGACCTCT CCGCTCAAGC CACGGCGTTC GAGCAGCACA CCGCTGCGCT GCTGCGCGCG ATGAGCGAAT CGCATTCGGC TCTGCAAGCA ACGCTCGAAT CGCGCGACGA GCAGCGTCTG GCCACGTGGA CCGATTCGCT CGGCTCGATC GCCGCGAAGC TCGGCACCGA GTGGGCGCAA ACCAGCGCGC AAGCGGCAAA CCGCCAGCAG GCGATCTGCG ATGCGCTCGC GCACACCGCG CGCGACCTCT CCGCTCAAGC CACGGCGTTC GAGCAGCACA CCGCTGCGCT GCTGCGCGCG ATGAGCGAAT CGCATTCGGC TCTGCAAGCA ACGCTCGAAT CGCGCGACGA GCAGCGTCTG GCCACATGGA CCGATTCGCT CGGCTCGATC GCCGCGAAGC TCGGCACCGA ATGGGAGCAA ACCAGCGCGC AAGCGGCAAA CCGCCAGCAG GCGATCTACG ATGCGCTCGC GCACACCGCG CGCGACCTCT CCGCTCAAGC CACGGCGTTC GAGCAGCACA CCGCTGCGCT GCTGCGCGCG ATGAGCGAAT CGCATTCGGC TCTGCAAGCA ACGCTCGAAT CGCGCGACGA GCAGCGTCTG GCCACATGGA CCGATTCGCT CGGCTCGATC GCCGCGAAGC TCGGCACCGA ATGGGAGCAA ACCAGCGCGC AAGCGGCAAA CCGCCAGCAG GCGATCTACG ATGCGCTCGC GCACACCGCG CGCGACCTCT CCGCTCAAGC CACGGCGTTC GAGCAGCACA CCGCTGCGCT GCTGCGCGCG ATGAGCGAAT CGCATTCGGC TCTGCAAGCA ACGCTCGAAT CGCGCGACGA GCAGCGTCTG GCCACATGGA CCGATTCGCT CGGCTCGATC GCCGCGAAGC TCGGCACCGA ATGGGAGCAA ACCAGCGCGC AAGCGGCAAA CCGCCAGCAG GCGATCTACG ATGCGCTCGC GCACACCGCG CGCGACCTCT CCGCGCACAC GCAAGCGCAC GCGAGCGCCA CGATCGCCGA GATCTCGCAG CTCGTGCAGG CCGCGTCGGA AGCGCCGAGG GTCGCGGCCG AAGTCGTCGC CGAGCTGCGC CAGAAGCTCT CCGACAGCAT GGTCCGCGAC ACCGCGATGC TCGAAGAGCG CAACCGGATG CTCGCGCCGC TCGAAACCCT GCTCGATGCG GTCAATCACG CATCGAGCGA ACAACGCGCG GCCGTCGACG CGCTCGTCGC GACGTCCTCG GCGCTGCTGC AGCGCGTCGG CACGCAGTTC ACCGATGAGG TCGGCACGCA AACCGACAGG CTCGGCGGGG TCGCCGCGCA GATCACGGGC AGCGCGGTCG AGATCGCGAG CCTCGGCGAC GCGCTCGGCG CGGCCGTCCA GTCGTTCGGC GAATCGAACG ACAAGCTCGT CGCGCATCTG CAACGCATCG AAGCCGCGCT CGACAAGTCG CTCGCCCGCA GCGACGAGCA ACTCGCGTAT TACGTCGCGC AGGCGCGCGA GGTCATCGAC CTGAGCATGA TGTCGCAGAA GCAGATCATC GAAGAACTGC AGCGCGTCGG CGGTGAACGC GCATCCGCCG GAGCCGCCGC AGCATGA
|
Protein sequence | MSRFHVHLVV FFAGLAAVCW IGAGYAVSNP VALAVTLVIA ACYLAGALEL RRYRQATSTL AQAVAALSEP PAALGAWLER LHPSLRHAVR VRVEGERVAL PGPALTPYLV GLLVLLGMLG TLIGMVMTLR GTGAALESST DLQAIRASLA APVKGLGFAF GTSIAGVATS AMLGLLSALC RRERLDAAQA LDAKIATTLR VHSHAHQRDE TFRLLQRQAD LMPTLVERLQ AMMHSLEQQS AASAERQIAG QQAFLGKAEE TYARLASSVG QSLTDSVAES ARVAGSALQP VMETTMAGLA RETAALHDAL TQAVQRQLDG LSAGFETTAA QVADVWRHAL ADHQRSSDAL AQRLHGSIDR IVESFDRRSA DLLDGVRARL DATASSVSDA WRGALAQQEQ ANEAHAERNR QALETAAATF ERHSAALLRT MSESHSALQA TLESRDEQRL ATWTDSLGSI AAKLGTEWAQ TSAQAANRQQ TICDALAHTA RDLSAQATAF EQHTAALLRA MSESHSALQA TLESRDEQRL ATWTDSLGSI AAKLGTEWAQ TSAQAANRQQ AICDALAHTA RDLSAQATAF EQHTAALLRA MSESHSALQA TLESRDEQRL ATWTDSLGSI AAKLGTEWEQ TSAQAANRQQ AIYDALAHTA RDLSAQATAF EQHTAALLRA MSESHSALQA TLESRDEQRL ATWTDSLGSI AAKLGTEWEQ TSAQAANRQQ AIYDALAHTA RDLSAQATAF EQHTAALLRA MSESHSALQA TLESRDEQRL ATWTDSLGSI AAKLGTEWEQ TSAQAANRQQ AIYDALAHTA RDLSAHTQAH ASATIAEISQ LVQAASEAPR VAAEVVAELR QKLSDSMVRD TAMLEERNRM LAPLETLLDA VNHASSEQRA AVDALVATSS ALLQRVGTQF TDEVGTQTDR LGGVAAQITG SAVEIASLGD ALGAAVQSFG ESNDKLVAHL QRIEAALDKS LARSDEQLAY YVAQAREVID LSMMSQKQII EELQRVGGER ASAGAAAA
|
| |