Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I0044 |
Symbol | |
ID | 3848504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 48240 |
End bp | 49199 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637839717 |
Product | AraC family transcriptional regulator |
Protein accession | YP_440604 |
Protein GI | 83719725 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAACG GCAGATTCTT CACGACGGCG GGCGAATCGC CCGCGTTTCG CGGCCACGCA TGGGGCCGCG TCGTCACGCA ATACTTCGGC GGACTCGACG CGTGCTGCGA CAGCGACGAC GTGTTCGACG CGCAGCTCAG CCAGTACGAG ATCGGCCCGA TGCGCGTGTT CACGATCGCC GCGCCCGCAC ACCGGATCGT GCGGCCCGTC GCGGCGCTGC ACGATCACGG CTCCGATTTC TTCAAGCTGA TCCTGCAACT GAGCGGCGTG AGCGAGATCG AGCAGCGCGG CAAGGTGTTC CGGCTGCGCA CCGGCGACTG GAGCCTGTAC GACCCGCGCG TGCCGTACAG CATCGCGAAC CTGACGCGCG TCGAGCAACT GGCGATCCAG ATTCCGCGCA GGCAGCTCGG CGGCTTCGCG GTGCCGGACC TGCACACGTC CGACGTCCGC GAGTTCGAGC TCAAGGGGCT GTTCTCGCTG CTGTCGTCGT TTCTCATGTC GTTATCCGAA CAATTGCCGT CGCTGCCCGG CACGACGGGC GCCGCGCTGT CTGAGACGAT TCTGGGCCTC ATCGTATCGA CGCTGACCGC GCAGCGCGAC GCGCAAGGCG CGCACGTCGC GCTGCCCGCC GTGCTGCGGA TGCGCGTGAA GCAATACATC CGCGGCCATC TCGCCGACGC CGACCTGTCG ATCGACCGGA TCGCGCGCGA GCTGCGCTGC TCGAAGCGCT ATCTGCACCG GATCTTCGAG GAGGAAGGCG TGACGATCGA CCGCTACATC TGGTCGAGCC GGCTCGAGCG CTGCAAGGAT GCGCTCGACA ACGCGCGCGC GGCGAAGCCC GCGATTTCCG AGATCGCGTT CAGTTGGGGA TTCAGCAGCA GCGCGCATTT CTGCCGCAGC TTCAAGCAGC GCTACGGCAT GACGCCGCGC GAGTTCGTGC GGCGGCGCAC CTCGGCCTGA
|
Protein sequence | MVNGRFFTTA GESPAFRGHA WGRVVTQYFG GLDACCDSDD VFDAQLSQYE IGPMRVFTIA APAHRIVRPV AALHDHGSDF FKLILQLSGV SEIEQRGKVF RLRTGDWSLY DPRVPYSIAN LTRVEQLAIQ IPRRQLGGFA VPDLHTSDVR EFELKGLFSL LSSFLMSLSE QLPSLPGTTG AALSETILGL IVSTLTAQRD AQGAHVALPA VLRMRVKQYI RGHLADADLS IDRIARELRC SKRYLHRIFE EEGVTIDRYI WSSRLERCKD ALDNARAAKP AISEIAFSWG FSSSAHFCRS FKQRYGMTPR EFVRRRTSA
|
| |