Gene Information |
Locus tag | BURPS1710b_A0886 |
Symbol | |
ID | 3694219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 1146431 |
End bp | 1147381 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637731140 |
Product | AraC family transcriptional regulator |
Protein accession | YP_336044 |
Protein GI | 76817453 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
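
The coordinate and length fields above are related by simple arithmetic, which the following Python sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    length = gene_length(1146431, 1147381)
    print(length)                  # 951
    print(protein_length(length))  # 316
```

Both results agree with the Gene Length (951 bp) and Protein Length (316 aa) fields of this record.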
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCCGA AACGAGGTCG ACTTGGTTTA CCATCGAAGC TCCAGCGCGC CATCTTTTTT GAATGGGTAG CTTATGGAAG CGTCAGGGAG GCTGTCTTGA ACGATTGTGC ACGCGACATC GCGCCCGACC GAGATCAAAT TCGGATCGGA TCCGGCGCCC CAGGCATCGA ACGCGTGGAG GCGCACTTCC TCGATCATGC GTTCACGCCG CATCGTCACG ACACGTACGC GATCGGCATC ACGCTCTCGG GCTTGCAAAC CTTCGGCTAT CTCGGCGAAA TCCACCACTG CCTGCCGGGA CAGTGCCATA TCCTGCACCC CGACGAGTTG CACGACGGCC GCGCGGGAAC CGACGAAGGC TTCGGTTACC GAATCATCTA CGTCGATCCC GCGCTCGTCC AGGAAGCGCT CGGCGGCCGC ATGCTGCCGT TCGTTCGCTC ACCGATCTTC CAGGCACCCG CCGTTTCCGA GGCGCTCGCG GCCGGCATCT GGAACCTGGA CGAGGAAATC GACAACGTAT CGCGCATCGA CATCGCCGTC GCCGTCGCGA ACCTGCTGAC GGCCGCCGCC GCGAAGGGCA GCGCGCCGAA AGCCGGCCCG CTCGCGATGG TCGAATTGAC ACGTATCCGC GATATCATCG CCTCCTGTCC GCGCGAGCCG ATCTCGATGG ACGCGCTCGA GCACGCATCG GGACTCGACC GCTGGACGAT CGCCCGCCAG TTCCGACGGC TGTTCGGCAC GAGCCCGAGC CGCTTTCGCA CGCAGCGACA GCTCAATCTC GTGCGTCGGC TATTGATGGA AGGCGAATCG CTGTCGACCG CCTCCACCGA CGCGGGCTTT TCCGATCAGA GCCACATGTC GCGGCACTTC AAGAGTACTT ACGGCATCAC GCCGGGTGCG TGGATTTCCG CCGTGCGCAC CCGGCACGCG CAGCACCCGT CGGACCACTA A
|
Protein sequence | MSPKRGRLGL PSKLQRAIFF EWVAYGSVRE AVLNDCARDI APDRDQIRIG SGAPGIERVE AHFLDHAFTP HRHDTYAIGI TLSGLQTFGY LGEIHHCLPG QCHILHPDEL HDGRAGTDEG FGYRIIYVDP ALVQEALGGR MLPFVRSPIF QAPAVSEALA AGIWNLDEEI DNVSRIDIAV AVANLLTAAA AKGSAPKAGP LAMVELTRIR DIIASCPREP ISMDALEHAS GLDRWTIARQ FRRLFGTSPS RFRTQRQLNL VRRLLMEGES LSTASTDAGF SDQSHMSRHF KSTYGITPGA WISAVRTRHA QHPSDH
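
The relationship between the gene sequence and the protein sequence can be sketched in Python as below. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given in the 10-base groups shown above, so spaces are stripped first. The helper names `clean`, `translate` and `gc_content` are hypothetical. How the record's GC content value was rounded is not stated, so `gc_content` simply returns a percentage.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by
# translation tables 1 and 11.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace used for grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    print(translate("ATGTCTCCGA AACGAGGTCG ACTTGGTTTA"))  # MSPKRGRLGL
```

The printed residues match the first group of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_content` in the same way allows the Protein sequence and GC content fields to be compared against the nucleotide data.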