Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bmul_3957 |
Symbol | |
ID | 5768255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia multivorans ATCC 17616 |
Kingdom | Bacteria |
Replicon accession | NC_010086 |
Strand | + |
Start bp | 931764 |
End bp | 932732 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641318260 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001583932 |
Protein GI | 161520505 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0264307 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACA TCGACTGGCT GAGCCATCTT CTGCAAATGA TCACCGTCAC CGGCCGGGTG GAGGTCCGCT GCGTGTACGG TGCGCCGTGG CAGGTCGCGT GGGATCGATC GGCCGCACAC GTGATTCCGT ACCACGTCGT GCTCGAGGGG CGTGCCGTGC TCGAGGATAC CGAGTCCGGG ACGGTCAGGG AGCTGGCGAG CGGCGACATC GTGCTGCTGC CGCACGGTGC CGCGCACGTG CTGCACGATG GCAGCGGCCA TACGCCGGGC CGCACGCATA ATCGCGCGGG TTTCGCCGGA TGGATGCTCA GCGAGAACGA CGGTCGCGGC GAACGCCTGG ACATGCTGTG CGGACGCTTC TTCATCCGGC CGCCGCATGA TCGGCTGATT CGCGACTACC TGCCGACGAC ACTGACCGTG CGCTCCGCGG ATGGCGGCGA CGACGACGGC AACGGTAGCG GGAACGGCTC TGCGTCGAAC CAGCTCGCCA GTCTCGTCGC GTTGATGCGC ATGGAGTCGG CCGGCGAGAA GCCGGGCGGA TACGCGATCC TCAATGCCCT GACTTCGGCG CTGTTCACGC TCGTGCTGCG CGCGGCGAGC GAATCGGGGC AGGCGCCGGC GGGATTGCTC GCGCTGGCCG GTCATCCGCG GCTGGCGCCG GCGATTTCGG CAATGTTCGC CGACCCCGCG CGACCGTGGA GCCTGCCCGA GCTGGCCGCC CTCTGCAACA TGTCGCGCGC AACCTTCATG CGGCACTTTC AGGACAAACT CGGCCGCTCG GCCACGGACC TGCTCACGGA CATCCGGATG ACACTGGCCG CAAACGAGCT GAAGAAGCCG ACGATGAGCA CCGAGGCCGT GGCCGAGGCG ATCGGCTATC GGTCGGTCGC GGCATTCCGG CGTGTGTTCA CCGACAAGAT GGGGATGACG CCCGGACAAT GGCGCCGTCT CGCGAACGAA GGCGACTAG
|
Protein sequence | MSDIDWLSHL LQMITVTGRV EVRCVYGAPW QVAWDRSAAH VIPYHVVLEG RAVLEDTESG TVRELASGDI VLLPHGAAHV LHDGSGHTPG RTHNRAGFAG WMLSENDGRG ERLDMLCGRF FIRPPHDRLI RDYLPTTLTV RSADGGDDDG NGSGNGSASN QLASLVALMR MESAGEKPGG YAILNALTSA LFTLVLRAAS ESGQAPAGLL ALAGHPRLAP AISAMFADPA RPWSLPELAA LCNMSRATFM RHFQDKLGRS ATDLLTDIRM TLAANELKKP TMSTEAVAEA IGYRSVAAFR RVFTDKMGMT PGQWRRLANE GD
|
| |