Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0923 |
Symbol | |
ID | 4905907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 902754 |
End bp | 903737 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640144029 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001074959 |
Protein GI | 126455612 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.139126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATGG ATCAGCGCCA CACCCGACTC GAATCCGCCG CGCACGCGCC GCCCCGGCCC GATGCGCAAA CGCTTGCGCC GCGCGAGGCC GCGCGCCGCG AGCTCGCCGC GCTGATCGAG CGCTTCGCGC CCGCCGACGG CGCGCACCCG AGCGCGATTC CCGCGCTGTC GTTCTTTCGC TGCTCGTCGC CCGTCGATCT CGGCTGCAGC GTCACGCGCG CCGCGTTCGT GTTCGCCGCG CAGGGCGCGA AGCGGGTAAC GGTCGCGGGG CAGGCGTACG AATACGATCA TCAGCAGTGC CTCGTCACGT CGGTCGATCT GCCGATGCTG TCGCAGGTCA CGCGCGCGTC GGCCGGCGCG CCGTATCTGT GCGTGAAGAT CGCGCTCGAC GTGCAGCGCA TCGCCGAGCT CTCGGCCGAG ATGCGGATGC CGCCGCCGGA GGCGGTGCCC ACGGGCGAGG GAATCGTCGT CGGCGCGCTG TCCGAGCCGC TTTTCGACGC GGCGCTGCGG CTCGTGCGAT TGCTCGATAC TCCAGCCGAC ATCCCGATCC TCGCGCCGCT GATCGAAAAG GAGCTGCTGT ACCGGCTGCT GACGAGCGGG CTGGGCGCGC GGCTGCGGCA CATCGCGGTC GCGGGCAGCC AGACGTACCG GATCGCGCGT GCGATCGAAT GGCTTCGTCA TCACTACACG GAGCCGCTCA GGGTCGAGAC GCTCGCGCAG CAGGTCAATA TGAGCGTGTC GTCGCTGCAT CATCACTTCA AGCACGTGAC GACGCTCAGC CCGCTCCAGT ATCAGAAGCA ACTGCGGCTG CACGAGGCGC GCCGGCTGCT GCTCGGCCAG CACGGCGACG TCGGTTCGGT CGCGCTCAGG GTCGGATACG ACAGCCCGTC GCAGTTCAGC CGCGAATACA GCCGGCTGTT CGGCGCGCCG CCGTTGCGCG ACGTCGTGCA ACGGCGGCGC AACGGGACGG GCGTTCAGGA GTGA
|
Protein sequence | MPMDQRHTRL ESAAHAPPRP DAQTLAPREA ARRELAALIE RFAPADGAHP SAIPALSFFR CSSPVDLGCS VTRAAFVFAA QGAKRVTVAG QAYEYDHQQC LVTSVDLPML SQVTRASAGA PYLCVKIALD VQRIAELSAE MRMPPPEAVP TGEGIVVGAL SEPLFDAALR LVRLLDTPAD IPILAPLIEK ELLYRLLTSG LGARLRHIAV AGSQTYRIAR AIEWLRHHYT EPLRVETLAQ QVNMSVSSLH HHFKHVTTLS PLQYQKQLRL HEARRLLLGQ HGDVGSVALR VGYDSPSQFS REYSRLFGAP PLRDVVQRRR NGTGVQE
|
| |