Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA3123 |
Symbol | hisC-2 |
ID | 3090215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006348 |
Strand | + |
Start bp | 3217264 |
End bp | 3218334 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637563687 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_104607 |
Protein GI | 53724087 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCGTT ACTGGAGCGA CATCGTCCGT CAACTCGAGC CGTATGTGCC GGGCGAGCAG CCGGCGCTCG CGCATCCCGT CAAGCTGAAC ACGAACGAGA ATCCGTATCC GCCGTCGCCG CGCGCGCTCG ACGCGATCCG GCGCGAGCTC GGCGATACGG GCGAAGCGCT GCGCCGCTAT CCGGACCCGG TCGCGCGCAG GCTGCGCGAG ACGGTGGCGG CCTATCACGG CATCGCGCCC GAGCAGGTGT TCGCCGGCAA CGGCTCCGAC GAAGTGCTCG CGCACGCGTT CCAGGCGCTC CTGCAACACG ACAGGCCGTT GCGCTTCCCG GACATCACGT ACAGCTTCTA CCCGACCTAT GCGCGGCTCT ATCGCGTCGC ATACGAGACG GTACCGCTCG CCGGCGATTT CTCGATCGTC GTCGACGACT ATCTCGACGA CGCCGGCTGC GTGCTGTTCC CGAACCCGAA CGCGCCGACG GGCCGCGCGC TGCCGCTTGC CGACATCGAG CGGATCGTCG CCGCCAACCC GAGCTCGGTT GTCGTGATCG ACGAGGCCTA TGTCGATTTC GGCGCGGAAT CGGCCGTCTC GCTGATCGCG CGCTATCCGA ATCTGCTCGT CGTGCATACC GTGTCGAAGG CGCGCTCGCT CGCCGGCATG CGCGTCGGCT TCGCGTTCGG CGACGCCGCG CTGATCGACG CGCTCACGCG CGTGAAGGAC AGCTTCAACT CGTATCCGCT CGACCGTCTC GCGCAAGTCG CGACGCAAGC GTCGTACGAG GACGAGGCGT GGTTCCAGGC GACGCGCAAG CAGGTGATCG CGAGCCGCGA GCGGCTCGTC GGCGCGCTGG CGGCGCTCGG CTTCGACGTC GTGCCGTCGG CGGCGAATTT CGTGTTCGCG CGCCCTCGTA GCCACGATGC GGCGACGCTC GCCGCGCAAC TGAAACAGCG GGAAATTTTC GTGCGGCACT TCAAGCTGCC GCGGATCGAC CAGCACTTGC GCATCACGGT CGGCTCGGAC GCCGAGTGCG ACGCGCTCGT CGCGGCGCTG CGGGAGCTGC TCGCCGCTTA A
|
Protein sequence | MSRYWSDIVR QLEPYVPGEQ PALAHPVKLN TNENPYPPSP RALDAIRREL GDTGEALRRY PDPVARRLRE TVAAYHGIAP EQVFAGNGSD EVLAHAFQAL LQHDRPLRFP DITYSFYPTY ARLYRVAYET VPLAGDFSIV VDDYLDDAGC VLFPNPNAPT GRALPLADIE RIVAANPSSV VVIDEAYVDF GAESAVSLIA RYPNLLVVHT VSKARSLAGM RVGFAFGDAA LIDALTRVKD SFNSYPLDRL AQVATQASYE DEAWFQATRK QVIASRERLV GALAALGFDV VPSAANFVFA RPRSHDAATL AAQLKQREIF VRHFKLPRID QHLRITVGSD AECDALVAAL RELLAA
|
| |