Gene BURPS668_3935 details


Gene Information

Locus tag: BURPS668_3935
Symbol: eutR
ID: 4882492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3835047
End bp: 3836075
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 73%
IMG OID: 640129863
Product: ethanolamine operon transcriptional activator EutR
Protein accession: YP_001060928
Protein GI: 126438810
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
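
The coordinates and lengths in the record above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue to the protein. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 3835047
end_bp = 3836075
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons. The last codon is assumed
# to be the stop codon, so it does not contribute an amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 1029 343 342
```

Under these assumptions the computed span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.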


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCACG AGCACACCGG CGCCGAGACG ACGGCGGAGG ACACCGGGCG CAGCGCGGAC 
GGCGACGGCG GCGCGGCCGG GCGCGCGCTC GTGAGCGTCG CGCACGACGC CGACGAGCAG
GCGCGCAACC TGATCGGCTG GCGCCAGACC TACGACCAGC TCGCGGCGGG CCGCTTCGTC
GGCACGTTGA CCGAACTGCC GCTCGACACG ATGAAGGTGT TCCGGGAGAC GACGAGCCAT
ACGCTGCGGC AGGCGTGCGA GGTGCGCGGC GATGCGTACT GGTTCGGCAT TCCGCTCGCG
CGCGACGGCG CGGCGCGCAT CGACGCGCGG CCGATCGCCG CCGACGCGCT CGCGTTCCGG
CCCGGCAACG TCGAGTTCGA GCTGTTGACG CCCGCGCAAT TCTCGATCTA CGGAGTGGTC
GTGCGCGGCG CGGTGTTGCG CCGTTACGCG CAGGAGGTCG AGCGCTGCGG GCTCGACGAG
CGGTTGCCGC TCGTGCCCGT CGTGCGCGTC GGCGAGGCGC GGCTCACGCG GCTGTGCGCG
TTGCTCGCGC AGCGTCTGAA CGACGCCGAC GCGATGAGCG CGGCGGGCGA GCCGCTATCC
GACTGCGCGC GCAACGACCT GCAGGCGGAA GTGCTCGCGG CGCTGTTCGA CCTGTGCGCG
TCGCCCGCGG CCGACGCGAG CGTCGAGCAC TCGTCGCGGC GCCGCAAGAT CGTCGCGGCC
GCGCGCGACT ACGTGCTCGC GCATCGCTCG CGGCCTGTCG GCGTGCCGGA GCTGTGCGAG
CAACTGCACG TGAGCCGGCG CACGCTGCAG TATTGCTTCC AGGATGTGCT CGGGATGGCG
CCCGCGACCT ACCTGCGCGC GCTGCGGCTC AACGGCGTGC GGCGCGATCT GCGCGGCCGC
GCGGCCGCCT CGGTGCAGGA CGCCGCGGCT GCATGGGGGT TTTGGCATCT GAGCCAGTTC
GCGACCGATT ATCGGCGGAT GTTCGGCGCG CGGCCGTCGG AGACGCTGCG CGACGCGCTC
GCCTGTTGA
 
Protein sequence
MDHEHTGAET TAEDTGRSAD GDGGAAGRAL VSVAHDADEQ ARNLIGWRQT YDQLAAGRFV 
GTLTELPLDT MKVFRETTSH TLRQACEVRG DAYWFGIPLA RDGAARIDAR PIAADALAFR
PGNVEFELLT PAQFSIYGVV VRGAVLRRYA QEVERCGLDE RLPLVPVVRV GEARLTRLCA
LLAQRLNDAD AMSAAGEPLS DCARNDLQAE VLAALFDLCA SPAADASVEH SSRRRKIVAA
ARDYVLAHRS RPVGVPELCE QLHVSRRTLQ YCFQDVLGMA PATYLRALRL NGVRRDLRGR
AAASVQDAAA AWGFWHLSQF ATDYRRMFGA RPSETLRDAL AC
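
As a consistency check between the two sequences, the following Python sketch translates the first and last lines of the gene sequence and compares them with the corresponding ends of the protein sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. The sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon assignments in TCAG order. Table 11 shares these assignments with
# the standard code; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; whitespace is ignored, '*' marks a stop."""
    clean = "".join(dna.split()).upper()
    usable = len(clean) - len(clean) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[clean[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGATCACG AGCACACCGG CGCCGAGACG ACGGCGGAGG ACACCGGGCG CAGCGCGGAC"
last_line = "GCCTGTTGA"

print(translate(first_line))  # prints: MDHEHTGAETTAEDTGRSAD
print(translate(last_line))   # prints: AC*
```

The first result matches the first 20 residues of the protein sequence, and the second matches its final two residues followed by the stop codon.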