Gene BURPS668_A1071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1071 
Symbol 
ID4887691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1029561 
End bp1031096 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content73% 
IMG OID640131011 
ProductGntR family transcriptional regulator 
Protein accessionYP_001062070 
Protein GI126444581 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.030835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTAC TGAATTTTGG AGCCACGATG GACTACGGTG TGTTGCTGTC GAACTTCGAA 
CGGGACACCG CGCGCGACGC GCTCGCGCGC GCGTCGCAGC AGCACCGGCT CTACGCGTGC
CTGCGCGCGG CGATCCTGAA CGGCACGCTC GAAGCCGGCA CGTATCTGAT GTCGTCGCGC
GCGCTCGCCG AGACGCTGCG GATCGCGCGC AACACGGTGC TCTATGCCTA CGAGCGGCTC
GCCGCCGAGG GCTTCGTCGT CGCGCGGCGG CAAGGCACGA TGGTCGCGCG TGTGGGGCTG
CCCGCGGCGA GCGCGACCGC CTCGCCCACG CACGCGCGGC CCTCGCTCGC GCGGCGCGTC
ACCGGGCTGC CGGACATCGA CGCCGACGAC GAGCGCGAGC CGCTGCCGTT CCTGCCGGGC
ATGCCCGCGC TCGACCAGTT TCCGCTCGCG CCGTGGCGGC GCGCGGTCGA GCGCGCATGG
CGGCGAATCG GTCCGGCGCA GCTCGGTCAC GCGCCGCTCG GCGGCAATCT GCGGCTGCGG
CAGGCGATCG CCGAATATCT GCGCGTGTCG CGCGGCATCG GCTGCGATGC GCAGCAGGTG
TTCATCACCG ACGGCACGCA GCACGGCCTC GATCTGTGCG CGCGCACGCT CGCCGACGCG
GGCGATACCG TCTGGATCGA GCATCCCGGC TACGCCGGCG CGCGCGCCGC GTTCGAGGCG
GCCGACCTGC GGCTCGTGCC GATCCCCGTC GATGCGGACG GCCTCGCGCC GAGCGCCGAG
CACTGGCGTG CGCATCCGCC GCGGCTTGTC TACATCACGC CGTCGCACCA GTATCCGCTC
GGCGCGGTGA TGAGCGTGGA GCGGCGCGTC GCGCTCGTCG CGAACGCGCG CGCGGCGGGC
GCGTGGATCG TGGAGGACGA TTACGACAGC GAGTTCCGCC ACTTCGGCGC GCCGCTCGCC
GCGCTGCAAA GTCTCGGCGA CGACGCGCCC GTCGTCTATC TCGGCACGTT CAGCAAGACG
ATGTTTCCGA CGCTGCGCAT CGGCTTCGTC GTCGCGTCGG CGGCGCTCGC GCCGCAACTG
CGTCACACGA TCGGCGCGCT CGCGCCGCGC GGGCGCCTTG CCGAGCAGCT CGCGCTCGCC
GACTTCATCG AAGCGGGCCA TTTCACCCGG CATCTGCGCC GGATGCGCCG GCTCTACGAA
GAGCGGCGCG ACGCACTGCA GGACGCGCTC GCGCGTCATC TCGGCGGCGC GCTGACGGTG
TCGGGCGGCG CGGGCGGCAT GCATCTGTCC GCGCGGCTCG ATGCGCCTGT CGCCGACGTC
GACGTCGCGC GCGCGGCGCT CGCCCGCGCG ATCACCGTGC GGCCGCTGTC GCGCTTCTGC
CTGCCGGGCA CCGATCGCGC CGCATACAAC GGCCTCGTGC TCGGCTACGG CGCGGTGCCG
ACCGAGCAGA TCGACGCTTG CGTGCGGCGG CTCGGCGCCG CGATCGACGA TGCGCTGCGC
GAGGTGACGC GGGCGCCGCG CGACGCCGCG AGATGA
 
Protein sequence
MRLLNFGATM DYGVLLSNFE RDTARDALAR ASQQHRLYAC LRAAILNGTL EAGTYLMSSR 
ALAETLRIAR NTVLYAYERL AAEGFVVARR QGTMVARVGL PAASATASPT HARPSLARRV
TGLPDIDADD EREPLPFLPG MPALDQFPLA PWRRAVERAW RRIGPAQLGH APLGGNLRLR
QAIAEYLRVS RGIGCDAQQV FITDGTQHGL DLCARTLADA GDTVWIEHPG YAGARAAFEA
ADLRLVPIPV DADGLAPSAE HWRAHPPRLV YITPSHQYPL GAVMSVERRV ALVANARAAG
AWIVEDDYDS EFRHFGAPLA ALQSLGDDAP VVYLGTFSKT MFPTLRIGFV VASAALAPQL
RHTIGALAPR GRLAEQLALA DFIEAGHFTR HLRRMRRLYE ERRDALQDAL ARHLGGALTV
SGGAGGMHLS ARLDAPVADV DVARAALARA ITVRPLSRFC LPGTDRAAYN GLVLGYGAVP
TEQIDACVRR LGAAIDDALR EVTRAPRDAA R