Gene Information |
Locus tag | BURPS1106A_3262 |
Symbol | |
ID | 4899473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3171032 |
End bp | 3172012 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640136488 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001067499 |
Protein GI | 126454175 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
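The length fields above follow from the coordinates by simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the 981 bp listed) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(3171032, 3172012)
print(length)                     # 981
print(protein_length_aa(length))  # 326
```

Both results match the Gene Length and Protein Length rows of the record.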
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGCTACC GAAAAGGGAT CGCTGTTGTC TTGTTTGGAG GCTTTTCGTT GATGCAGGCC GGGCGCATGG CGGAGATCTT CAATCTCGCC AATCGCTTTC GACACGCGGG CGAGGATTCC GGCGGCTACG AAATGCTGCT GTTGTCGGCG ACGGGCGGCG CGGTGTATTC GTCGTCCGGC ATTCCCGTCT GGACCCAGCC GCTGCACGAG CGTGCGATCG AGCGCGTGCA TGCGATCTTC GCGATCGGCG AACCGGACGC GGCCGATCGC GACGACGAGA AGGTGTCCGA ATGGTTGCGC AGCGCCCGCG CGCATGTTCG CGCATCGGGT GGCGCGCGGC GCCTGATGGG GCTTGCCGCG CTCGAACGCG GCGATGCGGC GGCGGACGAC CTGTCGATCG ACGCGATGCT CTCGGGCGCC GCATCCGCCG AGAAAGTGCG CCCGACAGCG GCGGAGCGCG CGGCGTTCTC GGCCGCGCTC GCGATCATTC GGCACGATTG CGGCGATAGT GCCGCGCACG AGGTTGCCGA GCGCTTGTGC CCGGCGCTGA GCGCACGGCC CGCCGAATCG GCTTTCGCCG CGCGCGAGGC GCGGGCGAGC AAGCTGATTC GCGCGTCGGT TCAGCAACTG CGCGACAACA GCGCCAACCG CATTTCGATC GCCGATACCG CGCATGCGGC GGCGATGAGC GAGCGCAATT TCCTGCGCCG CTTCAAGCAG GAAATCGGCG TGACGCCGTC CGAATTCGTG CAGAAAGTCC GGCTCGAGCA TGCGTGCCAC ATGCTCGTGC ATACCGATCT GCCGGTCGAC AAGATCGCGC GGCGCACCGG CCTCGGCAGC GGCGACCGCC TCGCGAAGCT GTTCCGCCAG CATCTGTCGA TGTCGCCGAC CGAGTATCGC GCGATCGAGC GCAGCCGAGG CGCGGACGCG GATCTCGCGT GCGGCGATTT CGTTTCGCGG CTGAGCGGAT CGGTTTCATA A
|
Protein sequence | MCYRKGIAVV LFGGFSLMQA GRMAEIFNLA NRFRHAGEDS GGYEMLLLSA TGGAVYSSSG IPVWTQPLHE RAIERVHAIF AIGEPDAADR DDEKVSEWLR SARAHVRASG GARRLMGLAA LERGDAAADD LSIDAMLSGA ASAEKVRPTA AERAAFSAAL AIIRHDCGDS AAHEVAERLC PALSARPAES AFAAREARAS KLIRASVQQL RDNSANRISI ADTAHAAAMS ERNFLRRFKQ EIGVTPSEFV QKVRLEHACH MLVHTDLPVD KIARRTGLGS GDRLAKLFRQ HLSMSPTEYR AIERSRGADA DLACGDFVSR LSGSVS
|
| |
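The sequences above are printed in space-separated blocks of 10, and the gene begins with GTG although the protein begins with M. The sketch below shows one way to strip the block formatting, compute a GC fraction, and translate with the standard codon assignments that translation table 11 shares with the standard code. The helper names are hypothetical, and the start-codon set is an assumption covering only the commonly used bacterial initiators (table 11 lists a few more). Only the first 30 and last 15 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiators; each is read as Met at a CDS start.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in ASSUMED_START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


head = clean_sequence("GTGTGCTACC GAAAAGGGAT CGCTGTTGTC")  # first 30 bases
tail = clean_sequence("GGATCGGTTT CATAA")                   # last 15 bases
print(translate(head))                      # MCYRKGIAVV
print(translate(tail, is_cds_start=False))  # GSVS
```

The two translated fragments agree with the first ten and last four residues of the protein sequence listed above.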