Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_1841 |
Symbol | |
ID | 4788986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | + |
Start bp | 1886507 |
End bp | 1887751 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | putative dioxygenase |
Protein accession | YP_001025638 |
Protein GI | 124382436 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0000734173 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTTCC CCATCGACGA CCTGCCGGCG GCGATCCGCA CGACCAAAGC GGTGCTGCGC GCCGCGATGC CGAATCGCGC GCCGGTGTTC CGCGCGCTCG AGGGCGACAT CGCGCGGCAG GCCGATGCGA TCCTGACCGA TCGCGCGCAG GGCCGCGACA CGATTCCCGT GCTGCGCTTC GCGGACATCG CCGCCGATCG CGTCGATCCG GCGTCGCTCG CCGCATTGAG GACGCGCGGC GCGTGCGTGA TCCGCGGCGT GTTCGACGCG CGGCAGGCGA GCGACTGGAA CGACGAGATC GCCGCATACG TCGAGGCGAA CCGGCTCGAC GACAAGCTCG CCCGTCGCGC CGAAGACCGC TATTTCGGCA CGCTCGCGTC GAGCCGGCCG CAGATCTACG GCATCTACTG GTCGAAGCCG CAGATCGCCG CGCGCCAGTC GCCCGCGCTC ACGCGGGCGC GCGTGTTCCT GAACCGGCTG TGGCGTGCGC AAAGCGAGGG GCGCGTGCAT TTCGACCCGG CGCGCGCGCC GGCCTATGCG GACCGCATTC GCCGCCGGCC GCCGGGCTCG TCGTCGCTCG GGCTGTCGCC GCACGTCGAC GGGGGCTCCG TCGAGCGCTG GCTCGAGCCG AACTACCGGC GCGTGTATCG GCACGTGCTG GCGGGCGACT GGCGCGCATA CGATCCGTTC GACGCGGCGT TCCGGCCCGA CGTCGAGGAG ATTCCGTCAC CCGCCGTCTG CTCGATGTTC CGCACGTTCC AGGGCTGGAC CGCGCTCACG CCGCAAGGGC CGGGCGACGG CACGCTGCAA CTGATCCCGG TGGCGAACGC GATGGCGTAC GTCGTGCTGC GCGCGCTGCA GGACGACGTG CCCGACGACG ACCTGTGCGG CGCGCAGCCG GGGCGCGCGC TGTCGGTGTT GCCGGCGTGG CACGCGCCGC TGCTCGCCGC GCTCGTGCCG ATTCCGCCGA TGGAGCCGGG CGACGCGGTG TTCTGGCACG GCGATGTCGT GCACGCGGTC GAGGATGCGC ATCGCGGAAC CGGTTACAGC AACGTGATGT ACATCGCGTC GGCGCCGGGG TGCGCGAAGA ACGACGCATA CCTGAAGCGT CAACTGCCGA GCTTCCTGCG CGGCGAGAGT CCGCCGGATT TTCCGGCGGA CCACTTCGAG ACGGATTTCA CCGGACGCGC GCGGGCGGAC GATCTGAGCG CGCTGGGGCG CGAGCAGATG GGCTTCGAGC CGTGA
|
Protein sequence | MPFPIDDLPA AIRTTKAVLR AAMPNRAPVF RALEGDIARQ ADAILTDRAQ GRDTIPVLRF ADIAADRVDP ASLAALRTRG ACVIRGVFDA RQASDWNDEI AAYVEANRLD DKLARRAEDR YFGTLASSRP QIYGIYWSKP QIAARQSPAL TRARVFLNRL WRAQSEGRVH FDPARAPAYA DRIRRRPPGS SSLGLSPHVD GGSVERWLEP NYRRVYRHVL AGDWRAYDPF DAAFRPDVEE IPSPAVCSMF RTFQGWTALT PQGPGDGTLQ LIPVANAMAY VVLRALQDDV PDDDLCGAQP GRALSVLPAW HAPLLAALVP IPPMEPGDAV FWHGDVVHAV EDAHRGTGYS NVMYIASAPG CAKNDAYLKR QLPSFLRGES PPDFPADHFE TDFTGRARAD DLSALGREQM GFEP
|
| |