Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0051 |
Symbol | |
ID | 4884065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 49177 |
End bp | 50136 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640125979 |
Product | putative transcriptional regulator |
Protein accession | YP_001057106 |
Protein GI | 126439683 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.870155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAACG GCAGATTCTT CACGACGGCG GGCGAATCGC CCGCGTTTCG CGGCCGCGCA TGGGGCCGCG TCGTCACGCA ATACTTCGGC GGGCTCGACG CGTGTTGCGA CGGCGACGAC GCGTTCGACG CGCAGCTGAG CCAGTACGAG ATCGGCCCGA TGCGCGTGTT CACGATCGCC GCGCCCGCGC ACCGCATCGT GCGGCCCGTT GCGGCGCTGC ACGATCACGG TTCCGACTTC TTCAAGCTGA TCCTGCAACT GAGCGGCGTG AGCGAGATCG AGCAGCGCGG CAAGACGTTC CGGCTGCACT CGGGCGACTG GAGCCTGTAC GATCCGCGCG TGCCGTACAG CATCGCGAAC CTGACGCACG TCGAGCAGCT CGCGATCCAG ATTCCGCGCA AGCAGCTCGG CGGCTTCGCG GTGCCGGATC TGCACACGTC GGACGTGCGC GAGTTCGAGC TCAAGGGGCT GTTCTCGCTG CTGTCTTCGT TTCTCGTGTC GTTGTCCGAA CAATTGCCGT CGCTGCCCGG CACGACGGGC ACGGCGCTAT CGGAGACGAT CCTCGGCCTG ATCGTATCGA CGCTGACCGC GCAGCGCGAC GCGCAAGGCG AGCACGTCGC GCTGCCCGCC GTGCTGCGGA TGCGCGTCAA GCAGTACATC CACGGCCACC TTGCCGACGC CGACCTGTCG ATCGACCGGA TCGCGCGCGA GCTACGCTGC TCGAAGCGCT ATCTGCACCG GATCTTCGAG GAGGAAGGCG TGACGATCGA CCGTTACATC TGGTCGAGCC GGCTCGAGCG CTGCAAGGAT GCGCTCGACA ACGCGCGCGC GGCGAAGCCG GCGATTTCCG AGATCGCGTT CAGCTGGGGG TTCAGCAGCA GCGCGCATTT CTGCCGCAGC TTCAAGCAGC GCTATGGCAT GACGCCGCGC GAATTCGTGC GCCGGCGTGC GTGGTCCTGA
|
Protein sequence | MVNGRFFTTA GESPAFRGRA WGRVVTQYFG GLDACCDGDD AFDAQLSQYE IGPMRVFTIA APAHRIVRPV AALHDHGSDF FKLILQLSGV SEIEQRGKTF RLHSGDWSLY DPRVPYSIAN LTHVEQLAIQ IPRKQLGGFA VPDLHTSDVR EFELKGLFSL LSSFLVSLSE QLPSLPGTTG TALSETILGL IVSTLTAQRD AQGEHVALPA VLRMRVKQYI HGHLADADLS IDRIARELRC SKRYLHRIFE EEGVTIDRYI WSSRLERCKD ALDNARAAKP AISEIAFSWG FSSSAHFCRS FKQRYGMTPR EFVRRRAWS
|
| |