Gene Information |
Locus tag | BMA10247_A1237 |
Symbol | |
ID | 4889918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | + |
Start bp | 1203214 |
End bp | 1204212 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640147505 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001078423 |
Protein GI | 126445681 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
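The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1203214
end_bp = 1204212

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 999
print(protein_length)  # 332
```

Both results match the Gene Length (999 bp) and Protein Length (332 aa) fields.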
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.89016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACGGCG GGCGCCCGGC CGTGCACGCG CAGGAGCGAC AGATGAACCC AGACCTCGAG ATCGTCCCCA CCCGCCGCGA CGAATCGTTT CGCGCATGGT CGCACGACTA TCCGCACACG GTCGCGAAAT GGCATTTTCA TCCGGAGTAC GAAATCCACC TGATTCAGGG TTCGCGCGGC AAGTTCTTCG TCGGCGACCA TATCGGCGAT TTCGCGCCCG GCAACCTCGT CGTCACCGGG CCGAACCTGC CGCACAACTG GATCAGCGAG CTCGGCCCCG GCGAGCGCGT GCCGTCGCGC GATGTCGTGC TGCAGTTCTC GCGCGACGCG GCCGAGAAGA TGGTGGCCGC GTTCGCCGAG CTGCAGCCGG TGCTCGACCT GATAGACGAA GCGTCGCGCG GCGTGCAGTT TCCGGACGAG ATCGGGCTCG CCGTCGCGCC GCTGATGCTC GAGCTCGCGA GCGCGCACGG CTGCCGGCGC GTCGAGGTGC TGATGGCGCT GTTCGACCGG CTGGCGTCGT GCGCCGCGCG TCGCACGCTC GCCGGCCCCG GCTACCGGAT CGACGCGCAG CACTACATGT CGTCGACGAT CAACCAGGTG CTCGCGTACC TGCGGCAGAA CCTGCCGGGC GCGCTACGCG AGGCGGACGT CGCCGAATTC GCCGGCATGA GCGTGAGCAC GTTCACGCGC TTCTTCCGCC GGCACACGGG CTCGACGTTC GTCCAGTACC TGAACCGGCT GCGGATCAAC GAAGCGTGCG AGCTGCTGAT GTGCTCGGCG CTCAGCGTCA CCGACATCTG CTACCGCATC GGCTTCAACA ACCTGTCGAA CTTCAACCGG CAATTCCTCG CGATGAAGGG GATGCCGCCG TCGCGCTTTC GCGCGCTGCA TCGGTTGAAC GAGCCGCATG ACGCGCCCGA ACCGCACGAG CCGCACGCGT CGCTCGCGCC GGCCACCGCG CGCGCCGTCA TCCATTCGCA CCGGAGCCTC CACCCGTGA
|
Protein sequence | MNGGRPAVHA QERQMNPDLE IVPTRRDESF RAWSHDYPHT VAKWHFHPEY EIHLIQGSRG KFFVGDHIGD FAPGNLVVTG PNLPHNWISE LGPGERVPSR DVVLQFSRDA AEKMVAAFAE LQPVLDLIDE ASRGVQFPDE IGLAVAPLML ELASAHGCRR VEVLMALFDR LASCAARRTL AGPGYRIDAQ HYMSSTINQV LAYLRQNLPG ALREADVAEF AGMSVSTFTR FFRRHTGSTF VQYLNRLRIN EACELLMCSA LSVTDICYRI GFNNLSNFNR QFLAMKGMPP SRFRALHRLN EPHDAPEPHE PHASLAPATA RAVIHSHRSL HP
|
| |
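The gene sequence above is printed in blocks of ten bases, so the whitespace has to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling, of how the sequence could be cleaned, its GC fraction computed, and its reading frame translated with translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the example input is only the first three blocks of the gene sequence.

```python
from itertools import product

# Codon table for NCBI translation table 11, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for table 11; an initiating codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # e.g. the initiating GTG of this gene
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "GTGAACGGCG GGCGCCCGGC CGTGCACGCG"
print(translate(prefix))  # MNGGRPAVHA
```

The output matches the first block of the protein sequence, which illustrates why the record begins with M even though the gene starts with GTG rather than ATG. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.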