Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_A2891 |
Symbol | |
ID | 4793315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | - |
Start bp | 2913416 |
End bp | 2914612 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_001028837 |
Protein GI | 124383792 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00927369 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGATCC CGCTCGCGAC CCCACAACCG TCTCGGCAGC CCACCGGCCT GCCGGTGCTA CGGCTCGGCT TCAGACCGTT CTATCTCGGC GGCGCCTATT TCGGGATCGT CTCGATCGCG CTTTGGCTCG CATCGCTGCG CGGCCATGCG GTGGCCGGCC TGTCGCCCGC GATAAGCGGG CTCGCCTGGC ACGTTCATGA AATGGTGTTC GGGTTCTCCG CCGCGATCAT CGTCGGCTTC CTGCTCACGG CGATTCGCGC ATGGACGTCG CGCGAGACAC TGCACGGCGC ACCGCTCGCC GCGCTATGGC TGCCGTGGGC GGCCGGCCGC CTGCTCGTCT GGGCGGGACC GGAGCCGCTC GCCGCCGTGG TCGATTCCGC GTTCTTGCCG ATCACCGCGA TCCTCCTGCT GCGCGTGCTG CTCGCCGCGC GCAACCACCG CAATGTGTTC CTGACCGTCG CGCTCTTCCT GTTCGGTGCG CTCAACGCGC TCTTTCACGG GTGGGCCGCG CATGGGCGCC TCGACCTCGC GCTGCAGGCC GCCTACGCGG CGGTCGGCTT CGTCATGCTG TTCGTCGTCG TGATCGCGGG CCGCATCGTG CCAACCTTCA CGATGAACGC GATTCCCGGC TTCACGGTCA AGCGCTGGAA ATGGGTCGAG ACGCTCGCCG CCCCGGCGAC GGTGCTCGCG CTCTGCGCGG ACGCGGCCCG ACTGCCGGGC GCGATCGTCG CGGCCGTCGC GTTCGCCGCG GCTGCGCTGC ACGCAACGCG CATCGTCGGC TGGCGTTCGT GGCGCGTCGG CGCACGGCCG ATCCTGTGGA TCCTGCATGT CGCGTACGCC TGGGTGCCGG TGGGCTTCGC GATGCTCGCG CTCGCCGCGC TCGACGTCGC GCCCCATTCG CTCGCGATCC ATGCGCTGAC GGTGGGCGTG ATCGGCGGCG CGATCGTCGC GATGATCACG CGCACCGCGC TCGGCCATAC GGGCCGGCCG CTGCGCGCGG GGCCCGCCGA AATCGCGTGT TACTGGCTGC TGATCGCGGC CGCGCTCGTG CGCGTGTTCG CACCGTGGAT CGCGCCCGAT GCGACGCGCG TCTGGATCGA CGTCGCGGGC GCGTGCTGGG TGGCCGCGTT CGCGGTGTAT GCATTGCGTT ACACCGGCTA TCTGACCGCG CCGCGTATCG ACGGCAAGGC CGGTTGA
|
Protein sequence | MKIPLATPQP SRQPTGLPVL RLGFRPFYLG GAYFGIVSIA LWLASLRGHA VAGLSPAISG LAWHVHEMVF GFSAAIIVGF LLTAIRAWTS RETLHGAPLA ALWLPWAAGR LLVWAGPEPL AAVVDSAFLP ITAILLLRVL LAARNHRNVF LTVALFLFGA LNALFHGWAA HGRLDLALQA AYAAVGFVML FVVVIAGRIV PTFTMNAIPG FTVKRWKWVE TLAAPATVLA LCADAARLPG AIVAAVAFAA AALHATRIVG WRSWRVGARP ILWILHVAYA WVPVGFAMLA LAALDVAPHS LAIHALTVGV IGGAIVAMIT RTALGHTGRP LRAGPAEIAC YWLLIAAALV RVFAPWIAPD ATRVWIDVAG ACWVAAFAVY ALRYTGYLTA PRIDGKAG
|
| |