Gene Information |
Locus tag | BMAA1078 |
Symbol | |
ID | 3087503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006349 |
Strand | + |
Start bp | 1120440 |
End bp | 1121396 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637564979 |
Product | AraC family transcriptional regulator |
Protein accession | YP_105741 |
Protein GI | 53716685 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
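The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the reported gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for BMAA1078.
length_bp = gene_length(1120440, 1121396)
print(length_bp)                  # 957
print(protein_length(length_bp))  # 318
```

Both results agree with the Gene Length (957 bp) and Protein Length (318 aa) fields.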
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.441231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCAG ACCTCGAGAT CGTCCCCACC CGCCGCGACG AATCGTTTCG CGCATGGTCG CACGACTATC CGCACACGGT CGCGAAATGG CATTTTCATC CGGAGTACGA AATCCACCTG ATTCAGGGTT CGCGCGGCAA GTTCTTCGTC GGCGACCATA TCGGCGATTT CGCGCCCGGC AACCTCGTCG TCACCGGGCC GAACCTGCCG CACAACTGGA TCAGCGAGCT CGGCCCCGGC GAGCGCGTGC CGTCGCGCGA TGTCGTGCTG CAGTTCTCGC GCGACGCGGC CGAGAAGATG GTGGCCGCGT TCGCCGAGCT GCAGCCGGTG CTCGACCTGA TAGACGAAGC GTCGCGCGGC GTGCAGTTTC CGGACGAGAT CGGGCTCGCC GTCGCGCCGC TGATGCTCGA GCTCGCGAGC GCGCACGGCT GCCGGCGCGT CGAGGTGCTG ATGGCGCTGT TCGACCGGCT GGCGTCGTGC GCCGCGCGTC GCACGCTCGC CGGCCCCGGC TACCGGATCG ACGCGCAGCA CTACATGTCG TCGACGATCA ACCAGGTGCT CGCGTACCTG CGGCAGAACC TGCCGGGCGC GCTACGCGAG GCGGACGTCG CCGAATTCGC CGGCATGAGC GTGAGCACGT TCACGCGCTT CTTCCGCCGG CACACGGGCT CGACGTTCGT CCAGTACCTG AACCGGCTGC GGATCAACGA AGCGTGCGAG CTGCTGATGT GCTCGGCGCT CAGCGTCACC GACATCTGCT ACCGCATCGG CTTCAACAAC CTGTCGAACT TCAACCGGCA ATTCCTCGCG ATGAAGGGGA TGCCGCCGTC GCGCTTTCGC GCGCTGCATC GGTTGAACGA GCCGCATGAC GCGCCCGAAC CGCACGAGCC GCACGCGTCG CTCGCGCCGG CCACCGCGCG CGCCGTCATC CATTCGCACC GGAGCCTCCA CCCGTGA
|
Protein sequence | MNPDLEIVPT RRDESFRAWS HDYPHTVAKW HFHPEYEIHL IQGSRGKFFV GDHIGDFAPG NLVVTGPNLP HNWISELGPG ERVPSRDVVL QFSRDAAEKM VAAFAELQPV LDLIDEASRG VQFPDEIGLA VAPLMLELAS AHGCRRVEVL MALFDRLASC AARRTLAGPG YRIDAQHYMS STINQVLAYL RQNLPGALRE ADVAEFAGMS VSTFTRFFRR HTGSTFVQYL NRLRINEACE LLMCSALSVT DICYRIGFNN LSNFNRQFLA MKGMPPSRFR ALHRLNEPHD APEPHEPHAS LAPATARAVI HSHRSLHP
|
| |
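The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any analysis. The following is a minimal sketch of how the GC content and the protein translation could be recomputed from it. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ in permitted start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the record are used as input here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw):
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
dna = clean_sequence("ATGAACCCAG ACCTCGAGAT CGTCCCCACC")
print(translate(dna))             # MNPDLEIVPT
print(gc_fraction("ATGAACCCAG"))  # 0.5
```

The translated fragment matches the first 10 residues of the protein sequence listed above. Passing the full gene sequence through the same functions would allow the reported GC content and protein sequence to be checked in the same way.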