Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_4006 |
Symbol | eutR |
ID | 4901183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3910469 |
End bp | 3911497 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640137232 |
Product | ethanolamine operon transcriptional activator EutR |
Protein accession | YP_001068225 |
Protein GI | 126452825 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCACG AGCACACCGG CGCCGAGACG ACGGCGGAGG ACACCGGGCG CAGCGCGGAC GGCGACGGCG GCGCGGCCGG GCGCGCGCTC GTGAGCGTCG CGCACGACGC CGACGAGCAG GCGCGCAACC TGATCGGCTG GCGCCAGACC TACGACCAGC TCGCGGCGGG CCGCTTCGTC GGCACGTTGA CCGAGCTGCC GCTCGACACG ATGAAGGTGT TCCGGGAGAC GACGAGCCAT ACGCTGCGGC AGGCGTGCGA GGTGCGCGGC GATGCGTACT GGTTCGGCAT TCCGCTCGCG CGCGACGGCG CGGCGCGCAT CGACGCGCGG CCGATCGCCG CCGACGCGCT CGCGTTCCGG CCCGGCAACG TCGAGTTCGA GCTGTTGACG CCCGCGCAAT TCTCGATCTA CGGGGTGGTC GTGCGCGGCG CGGTGTTGCG CCGTTACGCG CAGGAGGTCG AGCGCTGCGG GCTCGACGAG CGGTTGCCGC TCGTGCCCGT CGTGCGCGTC GGCGAGGCGC GGCTCACGCG GCTGTGCGCG TTGCTCGCGC AGCGTCTGGA CGACGCCGAC GCGATGAGCG CGGCGGGCGA GCCGCTATCC GACTGCGCGC GCAACGACCT GCAGGCGGAA GTGCTCGCGG CGCTGTTGGA CCTGTGCGCG TCGCCCGCGG CCGACGCGAG CGTCGAGCAC TCGTCGCGGC GCCGCAAGAT CGTCGCGGCC GCGCGCGACT ACGTGCTCGC GCATCGCTCG CGGCCTGTCG GTGTGCCGGA GCTGTGCGAG CAACTGCACG TGAGCCGGCG CACGTTGCAG TATTGCTTCC AGGATGTGCT CGGGATGGCG CCCGCGACCT ACCTGCGCGC GCTGCGGCTC AACGGCGTGC GGCGCGATCT GCGCGGCCGC GCGGCCGCCT CGGTGCAGGA CGCCGCGGCT GCATGGGGGT TTTGGCATCT GAGCCAGTTC GCGACCGATT ATCGGCGGAT GTTCGGCGCG CGGCCGTCGG AGACGCTGCG CGACGCGCTC GCCTGTTGA
|
Protein sequence | MDHEHTGAET TAEDTGRSAD GDGGAAGRAL VSVAHDADEQ ARNLIGWRQT YDQLAAGRFV GTLTELPLDT MKVFRETTSH TLRQACEVRG DAYWFGIPLA RDGAARIDAR PIAADALAFR PGNVEFELLT PAQFSIYGVV VRGAVLRRYA QEVERCGLDE RLPLVPVVRV GEARLTRLCA LLAQRLDDAD AMSAAGEPLS DCARNDLQAE VLAALLDLCA SPAADASVEH SSRRRKIVAA ARDYVLAHRS RPVGVPELCE QLHVSRRTLQ YCFQDVLGMA PATYLRALRL NGVRRDLRGR AAASVQDAAA AWGFWHLSQF ATDYRRMFGA RPSETLRDAL AC
|
| |