Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMAA0447 |
Symbol | |
ID | 3087153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006349 |
Strand | + |
Start bp | 448268 |
End bp | 449899 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637564367 |
Product | hypothetical protein |
Protein accession | YP_105224 |
Protein GI | 53716548 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.414119 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTGGC CCGCGGACGT CGACCTGCGC TTCTTCCAGC AGGCCGCGCC CGACCAATGG GCACGCGGCG AATGCTGGAC GCCCGGCGCG CGTTTCGAGC TGAGCGGCTT CGGGCCGCGG GGCGAGGGCT TCGCGGGCGA ACTGCCGCGT CTCGCGCCGG TCGCGCTCGT GACGCGCAAC GGCCGCCCGG GTATCGAGCG GCTGTCGTTC AAGCAGCAGA CGGCGTGGTT CCTGCCCGAT CGCGGCATCG GCGTGCTGTG GTGGAACGGC GCGGTCGCGC TCGATTTCCT GCTCGACGAC AGCCCGACGA TGCTCGTCAC CGCATTCAAG GACGAAGCCG AGCGGATCGA CATCGACGCG CTGATGAAGT TCGCCGATCA GCGTGCCGAC CTGAACTGCA CCGATCCGCT GCAGCAGGCG GATCACGAAC TGATGCCCGC GATTACGAGG GGCTGGACCT GGGAGATGAT CCTCGACACG GAAGACCACC CGCGTTTCGC TCCGGCGCCG CGCGGCTATG AAGAAGTCCG TGCGCGGGTC GAGCAGAATC GCCGCGAGTT GGTCGAGGCG CGCGATGCGA GCGAGCGGCT GTCGGCGTTC GAGGAAGCGA ACCGCAACGC GAAGCTGCCG GGCGCGCCGC GCGGCGGCGA GAACTGGCGC ACGCGGCTGC GTCAGGCGAA GACGCCCGAG CTCGCGAACG TGACGATTCG CGACGCCGAT CTGTCGTCGC TGCGCTTTGA CGGCTGGAAG TTCGACGACG TGCGCTTCGA GCGCTGCACG CTCGATCGCA GCGAATGGAC GAACTGCCGG CTCAATCAGG TGCATGCGGT CGACTGCTCG TTCGCCGACG TCAAGATGAG CGACGGCTGG TGGAAGGGCG GCAAGATCCA GCGCTGCAAT CTCGAACGCA GCGCGTGGTT GAACGTCGAG ATCGAGCGGA TCTCGCTCGA CGAATGCCGG CTCGACGATC TGAAGGTGGC GGGCGGATCG TGGTCGATGC TGTCGGTGCA GGGCCGCGGC GGCGTGCGCG GCGACGTTCA GGATGTCCAA TGGAATTCGG TGTCGTGGTC CGAGGTGAGC GCGCCCGGCT GGACCTGGAC CCGCGTGCGC GCCGACGATC TCGCGATCGT CGAATGCGCA ATGGCGGGCC TCGCGGTATC GCAGTGCACG CTCGCGAAGC CGAGCATCCT GCTCACCGAC CTGTCCGCGA GCGTCTGGCA GCGCAGCATG CTGACGTTCG CGGTGCTGTC GCACGGCACG TCGATCAACG GCGCGCGGCT CACCGATTGC GTGTTCAAGT CGTCGAGCCT GCAGGAGCTG CGTGCGGATC GGGTTCAGGT CGATCACTGC TCGTTCATGC AATTGAACGC GCAGCATCTG CACGCGCAGC AGTCGCATTG GAGCCGCACG GTGCTCGACG GCGCGAACGT GATGCATGCG CAACTGACGG GCACGTCGTT CGACCGCTGC TCGCTGAAGG AGGCGATGTT CTATGGCGCC GACATGCGGC AGACGCGCAT GCGCGACTGC AATCTCGTCA GGGTCCGCAC GTCGTGGATC CATCCGCCGG AAGCGGGCGC GTGGCGCGGC AATCTGAGCG CCGGCCAGCT CGACGTGCCG AGGAGGGTGT GA
|
Protein sequence | MGWPADVDLR FFQQAAPDQW ARGECWTPGA RFELSGFGPR GEGFAGELPR LAPVALVTRN GRPGIERLSF KQQTAWFLPD RGIGVLWWNG AVALDFLLDD SPTMLVTAFK DEAERIDIDA LMKFADQRAD LNCTDPLQQA DHELMPAITR GWTWEMILDT EDHPRFAPAP RGYEEVRARV EQNRRELVEA RDASERLSAF EEANRNAKLP GAPRGGENWR TRLRQAKTPE LANVTIRDAD LSSLRFDGWK FDDVRFERCT LDRSEWTNCR LNQVHAVDCS FADVKMSDGW WKGGKIQRCN LERSAWLNVE IERISLDECR LDDLKVAGGS WSMLSVQGRG GVRGDVQDVQ WNSVSWSEVS APGWTWTRVR ADDLAIVECA MAGLAVSQCT LAKPSILLTD LSASVWQRSM LTFAVLSHGT SINGARLTDC VFKSSSLQEL RADRVQVDHC SFMQLNAQHL HAQQSHWSRT VLDGANVMHA QLTGTSFDRC SLKEAMFYGA DMRQTRMRDC NLVRVRTSWI HPPEAGAWRG NLSAGQLDVP RRV
|
| |