Gene Information |
Locus tag | BURPS668_A1071 |
Symbol | |
ID | 4887691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1029561 |
End bp | 1031096 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640131011 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001062070 |
Protein GI | 126444581 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
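The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1029561
END_BP = 1031096


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1536 511
```

Under these assumptions the results match the listed Gene Length (1536 bp) and Protein Length (511 aa).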
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.030835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTAC TGAATTTTGG AGCCACGATG GACTACGGTG TGTTGCTGTC GAACTTCGAA CGGGACACCG CGCGCGACGC GCTCGCGCGC GCGTCGCAGC AGCACCGGCT CTACGCGTGC CTGCGCGCGG CGATCCTGAA CGGCACGCTC GAAGCCGGCA CGTATCTGAT GTCGTCGCGC GCGCTCGCCG AGACGCTGCG GATCGCGCGC AACACGGTGC TCTATGCCTA CGAGCGGCTC GCCGCCGAGG GCTTCGTCGT CGCGCGGCGG CAAGGCACGA TGGTCGCGCG TGTGGGGCTG CCCGCGGCGA GCGCGACCGC CTCGCCCACG CACGCGCGGC CCTCGCTCGC GCGGCGCGTC ACCGGGCTGC CGGACATCGA CGCCGACGAC GAGCGCGAGC CGCTGCCGTT CCTGCCGGGC ATGCCCGCGC TCGACCAGTT TCCGCTCGCG CCGTGGCGGC GCGCGGTCGA GCGCGCATGG CGGCGAATCG GTCCGGCGCA GCTCGGTCAC GCGCCGCTCG GCGGCAATCT GCGGCTGCGG CAGGCGATCG CCGAATATCT GCGCGTGTCG CGCGGCATCG GCTGCGATGC GCAGCAGGTG TTCATCACCG ACGGCACGCA GCACGGCCTC GATCTGTGCG CGCGCACGCT CGCCGACGCG GGCGATACCG TCTGGATCGA GCATCCCGGC TACGCCGGCG CGCGCGCCGC GTTCGAGGCG GCCGACCTGC GGCTCGTGCC GATCCCCGTC GATGCGGACG GCCTCGCGCC GAGCGCCGAG CACTGGCGTG CGCATCCGCC GCGGCTTGTC TACATCACGC CGTCGCACCA GTATCCGCTC GGCGCGGTGA TGAGCGTGGA GCGGCGCGTC GCGCTCGTCG CGAACGCGCG CGCGGCGGGC GCGTGGATCG TGGAGGACGA TTACGACAGC GAGTTCCGCC ACTTCGGCGC GCCGCTCGCC GCGCTGCAAA GTCTCGGCGA CGACGCGCCC GTCGTCTATC TCGGCACGTT CAGCAAGACG ATGTTTCCGA CGCTGCGCAT CGGCTTCGTC GTCGCGTCGG CGGCGCTCGC GCCGCAACTG CGTCACACGA TCGGCGCGCT CGCGCCGCGC GGGCGCCTTG CCGAGCAGCT CGCGCTCGCC GACTTCATCG AAGCGGGCCA TTTCACCCGG CATCTGCGCC GGATGCGCCG GCTCTACGAA GAGCGGCGCG ACGCACTGCA GGACGCGCTC GCGCGTCATC TCGGCGGCGC GCTGACGGTG TCGGGCGGCG CGGGCGGCAT GCATCTGTCC GCGCGGCTCG ATGCGCCTGT CGCCGACGTC GACGTCGCGC GCGCGGCGCT CGCCCGCGCG ATCACCGTGC GGCCGCTGTC GCGCTTCTGC CTGCCGGGCA CCGATCGCGC CGCATACAAC GGCCTCGTGC TCGGCTACGG CGCGGTGCCG ACCGAGCAGA TCGACGCTTG CGTGCGGCGG CTCGGCGCCG CGATCGACGA TGCGCTGCGC GAGGTGACGC GGGCGCCGCG CGACGCCGCG AGATGA
|
Protein sequence | MRLLNFGATM DYGVLLSNFE RDTARDALAR ASQQHRLYAC LRAAILNGTL EAGTYLMSSR ALAETLRIAR NTVLYAYERL AAEGFVVARR QGTMVARVGL PAASATASPT HARPSLARRV TGLPDIDADD EREPLPFLPG MPALDQFPLA PWRRAVERAW RRIGPAQLGH APLGGNLRLR QAIAEYLRVS RGIGCDAQQV FITDGTQHGL DLCARTLADA GDTVWIEHPG YAGARAAFEA ADLRLVPIPV DADGLAPSAE HWRAHPPRLV YITPSHQYPL GAVMSVERRV ALVANARAAG AWIVEDDYDS EFRHFGAPLA ALQSLGDDAP VVYLGTFSKT MFPTLRIGFV VASAALAPQL RHTIGALAPR GRLAEQLALA DFIEAGHFTR HLRRMRRLYE ERRDALQDAL ARHLGGALTV SGGAGGMHLS ARLDAPVADV DVARAALARA ITVRPLSRFC LPGTDRAAYN GLVLGYGAVP TEQIDACVRR LGAAIDDALR EVTRAPRDAA R
|
| |