Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0739 |
Symbol | |
ID | 4903942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 727562 |
End bp | 728659 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640143845 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001074775 |
Protein GI | 126456465 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGCTCGA TTTCACAATA CGCGAGCCGA GCGCCCGGCC GCATCAAGAT CAGATTCAAG GAACGCTTCC CCATGTCGCC CGACCGCACC GCGTCGCTGT CTCATTTCGC GTTCATGCCG CTGCCCAACT TCACGATGAT CGCGTTCACG AACGCGATCG AAGTGCTCAG GATGGCCAAC TACCTGAGCG GGCAGCCGCT GTACCGCTGG TCGATCATCA GCCCGCAGGG CGGCATGGTC ACGGCGAGCA ACGGGCTCGC GGTCGACACC GGCCCGGCCG AATGCGCGGG GCAGCCGGAC ATCGTGTTCG TCTGCGGCGG CGTGGACGTG CAGCGCGCGA CGCAACCCGA GCATCTCGCG GCGCTGCGCC GCTTCGCGCG CGCGGGCGTC GCGCTCGGCA GCCTGTGCAC CGGCACCTAT GCGCTCGCGA AGGCGGGGCT CCTCGCCGGC TACGCCTGTG CGATCCATTG GGAAAATCTG TCGGCGCTGA AGGAAGAATT TCCCGATACG CGCTTTCTCA AGGAACTGTT CGTGATCGAC CGCGATCGCG TGACGTGCAC GGGCGGCGTC GCGCCGCTCG ACATGATGCT GAACCTGATC GCGTCGCGCA TCGGCACCGC GCGCGTCACG CAGATCGCCG AGCAGTTCAT CGTCGAGCAC GTGCGCGACA CGAGCGCGCA GCAGCGCATG CCGCTCGTCG CCCGGCTCGG CTCCGCGAAC AAATCGCTGT TCGAAGTGAT CGCGCTGATG GAGAACAACA TCGAGGAGCC GCTGTCGCGC GAAGAACTCG CGCGGCTCGC GAACATGTCG CAGCGGCAGT TGCAGCGCCT CTTTCGCGAG CATCTCGGCA TGACGCCGAC GCATTACTAC CTGACGCTGC GCCTGCGCCG CGCGCGCGAG CTGCTGCTGC AAACCGACAT GTCGATCATG CACATCACGA TGGCGTGCGG CTTCCAATCC GCGTGCCACT TCAGCAAGAG CTACCGCGAC GCGTTCGGCA CCGCGCCGAC GCGCGAGCGC CGCAAGCAGG TCGCGCCGCT CGCGCAGCCG TCGATGCCGG GCGGCGCGCC CGCGCCGTCG ATGATGCTGC ACGCGTGA
|
Protein sequence | MRSISQYASR APGRIKIRFK ERFPMSPDRT ASLSHFAFMP LPNFTMIAFT NAIEVLRMAN YLSGQPLYRW SIISPQGGMV TASNGLAVDT GPAECAGQPD IVFVCGGVDV QRATQPEHLA ALRRFARAGV ALGSLCTGTY ALAKAGLLAG YACAIHWENL SALKEEFPDT RFLKELFVID RDRVTCTGGV APLDMMLNLI ASRIGTARVT QIAEQFIVEH VRDTSAQQRM PLVARLGSAN KSLFEVIALM ENNIEEPLSR EELARLANMS QRQLQRLFRE HLGMTPTHYY LTLRLRRARE LLLQTDMSIM HITMACGFQS ACHFSKSYRD AFGTAPTRER RKQVAPLAQP SMPGGAPAPS MMLHA
|
| |