Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0050 |
Symbol | |
ID | 4902739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 49256 |
End bp | 50215 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640133280 |
Product | putative transcriptional regulator |
Protein accession | YP_001064335 |
Protein GI | 126454299 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAACG GCAGATTCTT CACGACGGCG GGCGAATCGC CCGCGTTTCG CGGCCGCGCA TGGGGCCGCG TCGTCACGCA ATACTTCGGC GGGCTCGATG CGTGTTGCGA CGGCGACGAC GCGTTCGACG CGCAGCTGAG CCAGTACGAG ATCGGCCCGA TGCGCGTGTT CACGATCACC GCGCCCGCGC ACCGCATCGT GCGGCCCGCC GCGGCGCTGC ACGATCACGG TTCCGACTTC TTCAAGCTGA TCCTGCAACT GAGCGGCGTG AGCGAGATCG AGCAGCGCGG CAAGACGTTC CGGCTGCACT CGGGCGACTG GAGCCTGTAC GATCCGCGCG TGCCGTACAG CATCGCGAAC CTGACGCACG TCGAGCAGCT CGCGATCCAG ATTCCGCGCA AGCAGCTCGG CGGCTTCGCG GTGCCGGATC TGCATACGTC GGACGTGCGC GAGTTCGAGC TCAAGGGGCT GTTCTCGCTG CTGTCTTCGT TTCTCGTGTC GTTGTCCGAA CAATTGCCGT CGCTGCCCGG CACGACAGGC ACCGCGCTAT CGGAGACGAT CCTCGGCCTG ATCGTATCGA CGCTGACCGC GCAGCGCGAC GCGCAAGGCG AGCACGTCGC GCTGCCCGCC GTGCTGCGGA TGCGCGTCAA GCAGTACATC CACGGCCACC TTGCCGACGC CGACCTGTCG ATCGACCGGA TCGCGCGCGA GCTACGCTGC TCGAAGCGCT ATCTGCACCG GATCTTCGAG GAGGAAGGCG TGACGATCGA CCGTTACATC TGGTCGAGCC GGCTCGAGCG CTGCAAGGAT GCGCTCGACA ACGCGCGCGC GGCGAAGCCG GCGATTTCCG AGATCGCGTT CAGCTGGGGG TTCAGCAGCA GCGCGCATTT CTGCCGCAGC TTCAAGCAGC GCTATGGCAT GACGCCGCGC GAATTCGTGC GCCGGCGTGC GTGGCCCTGA
|
Protein sequence | MVNGRFFTTA GESPAFRGRA WGRVVTQYFG GLDACCDGDD AFDAQLSQYE IGPMRVFTIT APAHRIVRPA AALHDHGSDF FKLILQLSGV SEIEQRGKTF RLHSGDWSLY DPRVPYSIAN LTHVEQLAIQ IPRKQLGGFA VPDLHTSDVR EFELKGLFSL LSSFLVSLSE QLPSLPGTTG TALSETILGL IVSTLTAQRD AQGEHVALPA VLRMRVKQYI HGHLADADLS IDRIARELRC SKRYLHRIFE EEGVTIDRYI WSSRLERCKD ALDNARAAKP AISEIAFSWG FSSSAHFCRS FKQRYGMTPR EFVRRRAWP
|
| |