Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A0236 |
Symbol | |
ID | 3694216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 350827 |
End bp | 351846 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637730490 |
Product | AraC family transcriptional regulator |
Protein accession | YP_335395 |
Protein GI | 76817450 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCAG ACCTCGAGAT CGTCCCCACC CGCCGCGACG AATCGTTTCG CGCATGGTCG CACGACTATC CGCACACGGT CGCGAAATGG CATTTTCATC CGGAGTACGA AATCCACCTG ATTCAGGGTT CGCGCGGCAA GTTCTTCGTC GGCGACCATA TCGGCGATTT CGCGCCCGGC AACCTCGTCG TCACCGGGCC GAACCTGCCG CACAACTGGA TCAGCGAGCT CGGCCCCGGC GAGCGCGTGC CGTCGCGCGA CGTCGTGCTG CAGTTCTCGC GCGACGCGGC CGAGAAGATG GTGGCCGCGT TCGCCGAGCT GCAGCCGGTG CTCGACCTGA TCGACGAAGC GTCGCGCGGC GTGCAGTTTC CGGACGAGAT CGGGCTCGCC GTCGCGCCGC TGATGCTCGA GCTCGCGAGC GCGCACGGCT GCCGGCGCGT CGAGGTGCTG ATGGCGCTGT TCGACCGGCT GGCGTCGTGC GCCGCGCGTC GCACGCTCGC CGGCCCCGGC TACCGGATCG ACGCGCAGCA CTACATGTCG TCGACGATCA ACCAGGTGCT CGCGTACCTG CGGCAGAACC TGCCGGGCGC GCTACGCGAG GCGGACGTCG CCGAATTCGC CGGCATGAGC GTGAGCACGT TCACGCGCTT CTTCCGCCGG CACACGGGCT CGACGTTCGT CCAGTATCTG AACCGGCTGC GGATCAACGA AGCGTGCGAG CTGCTGATGT GCTCGGCGCT CAGCGTCACC GACATCTGCT ACCGCATCGG CTTCAACAAC CTGTCGAACT TCAACCGGCA ATTCCTCGCG ATGAAGGGGA TGCCGCCGTC GCGCTTTCGC GCGCTGCATC GGTTGAACGA GCCGCATGAC GCGCCCGAAC CGCACGAGCC GCACGAGCCG CACGCGTCGC TCGCGCCGGC CGCCGCGCCC GCGGCCCCGG GCGCGGCGGC CCGGCCCCCC GAGCGCGCCG CACCCACCGC GCGCGCCGTC ATCCATTCGC ACCGGAGCCT CCACCCGTGA
|
Protein sequence | MNPDLEIVPT RRDESFRAWS HDYPHTVAKW HFHPEYEIHL IQGSRGKFFV GDHIGDFAPG NLVVTGPNLP HNWISELGPG ERVPSRDVVL QFSRDAAEKM VAAFAELQPV LDLIDEASRG VQFPDEIGLA VAPLMLELAS AHGCRRVEVL MALFDRLASC AARRTLAGPG YRIDAQHYMS STINQVLAYL RQNLPGALRE ADVAEFAGMS VSTFTRFFRR HTGSTFVQYL NRLRINEACE LLMCSALSVT DICYRIGFNN LSNFNRQFLA MKGMPPSRFR ALHRLNEPHD APEPHEPHEP HASLAPAAAP AAPGAAARPP ERAAPTARAV IHSHRSLHP
|
| |