Gene Information |
Locus tag | BMA10229_A2687 |
Symbol | |
ID | 4791050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | + |
Start bp | 2714034 |
End bp | 2715389 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | putative 4-hydroxyphenylacetate transporter |
Protein accession | YP_001028639 |
Protein GI | 124385969 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
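The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(2714034, 2715389)
print(length, protein_length_aa(length))  # 1356 451
```

Under these assumptions the computed values agree with the Gene Length (1356 bp) and Protein Length (451 aa) fields.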
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGG CGGCGAATCC CGCGCGCGTG CTCGAGATCG AGCGCGTGAT CGACGACACG CACCGGCCCG CGTTTCACGC GATGCTGCTC GCGCTTTGCG GGCTGTGCCT CGTGATCGAC GGTTTCGACG CGCAGGCGAT GGGCTACGTC GCACCGAGCG TGATCGCCGA ATGGGGTGTG AAGAAGCAGG CGCTCGGGCC CGTCTTCAGC GCGAGCCTGT TCGGCATGCT GCTCGGCGCG CTCGGCCTGT CGGTGCTCGC CGATCGGATC GGCCGGCGGC CCGTGCTGAT CGGCGCGACG CTGTTCTTCG CGCTCGCGAT GCTCGCGACG CCGTTCGCGA CGTCGATCCC GATATTGATC GCGCTGCGCT TCGTCACGGG CCTGGGGCTC GGCTGCATCA TGCCGAACGC GATGGCGCTC GTCGGCGAAT GCAGCCCGGG CGCGCACCGC GTGAAGCGGA TGATGATCGT GTCGTGCGGC TTCACGCTCG GTGCGGCGCT GGGCGGGTTC GTCAGCGCCG CGCTGATTCC CGCGTTCGGC TGGCGCGCGG TGTTCTTCGT CGGCGGCGCG GTGCCGCTCG CGCTCGCGGC CGCGATGGCC GCGAGCCTGC CCGAATCGCC GCAGTTGCTC GTGCTGCGCG GCCGGCACGA CGCGGCGCGC GCGTGGCTCG CGAAGTTCGC GCCGCGGCTC GCGGTCCCGC CCGATACGCG GCTTGTCGTG CGCGAAGCGG GACCCCGGGG CGCGCCCGTC GCCGAGCTGT TCCGCTCGGG ACGCGCGCGC GTCACGCTGC TGTTGTGGGC GATCAACTTC ATGAACCTGA TCGACCTGTA CTTCCTGTCG AACTGGCTGC CGACCGTGAT GCGCGACGCG GGCTACGCGA GCGGCACGGC CGTCATCGTC GGCACGGTGC TGCAGACGGG CGGCGTGATC GGCACGCTGT CGCTCGGCTG GTTCATCGAA CGGCATGGTT TCGCGCGCGT GCTGTTCGCG TGCTTCGCGT GCGCGACGAT CGCGATCGGC CTGATCGGCT CGGTCGCGCA CGCGTTCGTC TGGCTGCTCG CAGCCGTGTT CGTCGGCGGC TTTTGCGTCG TCGGCGGACA GCCCGCGGTC AATGCGCTCG CGGGCCATTA TTACCCGACG TCGCTGCGCT CGACGGGCAT CGGCTGGAGT CTCGGCGTGG GCCGCGTCGG CTCCGTGCTC GGGCCGCTCG TCGGCGGGCA ACTGATCGCG CTCGGCTGGT CGAACGACGC GCTGTTTCAC GCGGCGGCCG TGCCGGTGCT GTGCTCGGCC GTCTTCGTGA TCGGCCTCGC GAGCGTGACG CGGCGGCGCG GCATGGCCGC GCCGAACGTC GCTTGA
|
Protein sequence | MSAAANPARV LEIERVIDDT HRPAFHAMLL ALCGLCLVID GFDAQAMGYV APSVIAEWGV KKQALGPVFS ASLFGMLLGA LGLSVLADRI GRRPVLIGAT LFFALAMLAT PFATSIPILI ALRFVTGLGL GCIMPNAMAL VGECSPGAHR VKRMMIVSCG FTLGAALGGF VSAALIPAFG WRAVFFVGGA VPLALAAAMA ASLPESPQLL VLRGRHDAAR AWLAKFAPRL AVPPDTRLVV REAGPRGAPV AELFRSGRAR VTLLLWAINF MNLIDLYFLS NWLPTVMRDA GYASGTAVIV GTVLQTGGVI GTLSLGWFIE RHGFARVLFA CFACATIAIG LIGSVAHAFV WLLAAVFVGG FCVVGGQPAV NALAGHYYPT SLRSTGIGWS LGVGRVGSVL GPLVGGQLIA LGWSNDALFH AAAVPVLCSA VFVIGLASVT RRRGMAAPNV A
|
| |
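The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, along with a GC percentage calculation of the kind summarized in the GC content field. The helper names are hypothetical, and only the first three blocks of the gene sequence are used for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAGCGCGG CGGCGAATCC CGCGCGCGTG"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full Gene sequence field through the same two functions should give a length and GC percentage comparable to the Gene Length and GC content fields, assuming the GC content field was computed over the gene sequence alone.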