Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0164 |
Symbol | |
ID | 4883794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 155517 |
End bp | 156476 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640126092 |
Product | HpcH/HpaI aldolase family protein |
Protein accession | YP_001057217 |
Protein GI | 126440820 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.223586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTTCCG AATCCGCGCG CGGGCGCGAT CACGGTATGA TCGAATCGTG GATCGTCTGC CGCGACGAAC GCGTCGCGCG CGCCTGTTAC CGCGCAGCGC TCCCGCCCGG CGGTCTCGCC GCCCCTCTTT CCGAATCGCA GTCCGACCGT GCGTCGCACG GCAGGAGGCC GCCGATGAGC ACGCTCACCC ATTCCCTGAA AGCACGTTTG CGCGACGGCG ACGAGCCGCT GTTCGGCCTA TGGCTGACGC TCGCGAGCGA GGCGGCGACC GAGGCGCTCG CGCACGCCGG TTTCGACTGG CTGTGCATCG ACATGGAGCA CGCGCCGAAC GACAGCCGCG ACGTCGCCGC GCAACTGCGC GCGCTCGCGG CCGCGCATCT GCCGAGCGAG CCCGTCGTGC GGGTGCCGGC GCGCGAGCCG TGGTTCGTCA AGCGGGCGCT CGACGCGGGT GCGCGCACGC TGATGTTCCC GGGCGTCGAG ACGGCCGACG AGGCCGCGCA TGCGGTGCGG CTCACGCGCT TTCAGGCGCC CGATGCGCCG GACGGGCTGC GCGGCGTTGC GGGCATCGTG CGCGCGGCCG CTTATGGGAT GCGGCGCGAC TACGTGCAGA CGGCGAACGC GCAGATCGCG ACGATCGTGC AGATCGAATC GGCGCGCGGC GTCGACGAAG CCGAGCGGAT CGCGGCGACG CCGGGCGTCG ATTGCGTATT CGTCGGGCCC GCCGACCTGT CCGCGAGCCT CGGGCATCTC GGCGACACGA AGCATCCGGA CGTCGCGGCC GCGCTCGAGC ACGTGCTCGC GGCCGGGCGG CGCGCCGGCG TGCCGGTCGG CATCTTCGCC GCGGATACGG CCGGCGCGCG CCAGTCTCTC GAAGCCGGAT TCCGCGTGGT CGCGTTGTCC GCGGACGTCG TGTGGCTGCT GCGCGCGACG CGACAGGCGC TGCAGGAGGT GCGGGGATGA
|
Protein sequence | MFSESARGRD HGMIESWIVC RDERVARACY RAALPPGGLA APLSESQSDR ASHGRRPPMS TLTHSLKARL RDGDEPLFGL WLTLASEAAT EALAHAGFDW LCIDMEHAPN DSRDVAAQLR ALAAAHLPSE PVVRVPAREP WFVKRALDAG ARTLMFPGVE TADEAAHAVR LTRFQAPDAP DGLRGVAGIV RAAAYGMRRD YVQTANAQIA TIVQIESARG VDEAERIAAT PGVDCVFVGP ADLSASLGHL GDTKHPDVAA ALEHVLAAGR RAGVPVGIFA ADTAGARQSL EAGFRVVALS ADVVWLLRAT RQALQEVRG
|
| |