Gene Information |
Locus tag | BURPS1106A_2013 |
Symbol | |
ID | 4901097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1975906 |
End bp | 1976916 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640135243 |
Product | putative histidinol-phosphate aminotransferase HisC |
Protein accession | YP_001066278 |
Protein GI | 126452208 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
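The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is a minimal consistency check and not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1975906
end_bp = 1976916
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons. Assumption: the last codon is a
# stop codon, which is not counted in the protein length.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(f"span = {span} bp, codons = {codons}, remainder = {remainder}")
print(f"expected protein length = {expected_protein_length} aa")
```

Under these assumptions the span matches the listed Gene Length (1011 bp) and the derived protein length matches the listed Protein Length (336 aa).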
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.160507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGTTG GCGAGGCAAT GGATACCGAA GTGCGGGCGG CGGCGCAAGC CGTCTGCCTG GCGTTCAATG AAAACCCGGA AGCGGTGGAG CCGCGCGTGC AGGCCGCGAT TGCTGCCGCG GCCGCGCGGA TCAATCGCTA CCCGTTTGAC GCCGAACCGC GCGTCATGCG CAAGCTCGCC GAGCATTTCA GCTGTCCCGA GGACAACCTG ATGCTGGTGC GCGGCATCGA CGAATGCTTC GATCGAATCA GCGCCGAATT TTCGTCGATG CGCTTCGTTA CCGCATGGCC GGGCTTCGAC GGCTATCGCG CACGCATCGC CGTCAGCGGG CTGAGACACT TCGAAATCGG CCTGACCGAC GATCTGCTGC TCGATCCGAA CGATCTCGCC CAAGTCTCGC GTGACGATTG CGTCGTGCTC GCCAATCCTT CGAATCCGAC CGGCCAGGCG CTGAGCGCGG GCGAGCTCGA GCAATTGAGG CAGCGCGCGG GCAAGTTGCT GATCGACGAA ACCTACGTCG ATTATTCGTC GTTTCGCGCC CGCGGCCTGG CTTACGGCGA GAACGAACTG GTGTTTCGTT CGTTCTCGAA ATCCTACGGC CTCGCCGGCT TGCGGCTCGG CGCGCTGTTC GGGCCGAGCG AGCTGATTGC CGCGATGAAG CGCAAGCAGT GGTTCTGCAA CGTCGGCACG CTCGATCTGC ATGCGCTCGA AGCCGCGCTC GACAACGATC GCGCACGTGA GGCGCACATC GCGAAGACGC TCGCGCAGCG CCGCCGCGTC GCCGACGCGC TGCGCGGGCT CGGCTACCGC GTCGCGTCGT CCGAGGCCAA TTTCGTGCTC GTCGAAAACG CCGCCGGCGA GCGCACGCTG CGCTTCCTGC GCGAACGGGG CATTCAGGTG AAGGACGCCG GCCAGTTCGG ACTTCACCAC CACATCAGAA TCAGCATCGG CCGTGAAGAG GACAACGATC GGTTGCTCGC GGCGCTGGCC GAATATTCCG ACCACTCATA A
|
Protein sequence | MSVGEAMDTE VRAAAQAVCL AFNENPEAVE PRVQAAIAAA AARINRYPFD AEPRVMRKLA EHFSCPEDNL MLVRGIDECF DRISAEFSSM RFVTAWPGFD GYRARIAVSG LRHFEIGLTD DLLLDPNDLA QVSRDDCVVL ANPSNPTGQA LSAGELEQLR QRAGKLLIDE TYVDYSSFRA RGLAYGENEL VFRSFSKSYG LAGLRLGALF GPSELIAAMK RKQWFCNVGT LDLHALEAAL DNDRAREAHI AKTLAQRRRV ADALRGLGYR VASSEANFVL VENAAGERTL RFLRERGIQV KDAGQFGLHH HIRISIGREE DNDRLLAALA EYSDHS
|
| |
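The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 nucleotides of the gene sequence only. It is a simplified, standard-library illustration, not the pipeline that produced this record: the helper `translate_cds` and the small `START_CODONS` set are hypothetical names introduced here, and the codon table is the standard amino-acid assignment, which table 11 shares.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments; it differs in which
# codons are accepted as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Illustrative subset of initiation codons (an assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment that begins at the start codon (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An initiation codon is read as methionine regardless of its usual meaning.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First three 10-nt blocks of the Gene sequence field above.
first_30_nt = "GTGAGCGTTG GCGAGGCAAT GGATACCGAA"
peptide = translate_cds(first_30_nt)
print(peptide)
```

The result can be compared with the first ten residues of the Protein sequence field (MSVGEAMDTE).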