Gene BURPS1106A_4006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_4006 
SymboleutR 
ID4901183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3910469 
End bp3911497 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content73% 
IMG OID640137232 
Productethanolamine operon transcriptional activator EutR 
Protein accessionYP_001068225 
Protein GI126452825 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCACG AGCACACCGG CGCCGAGACG ACGGCGGAGG ACACCGGGCG CAGCGCGGAC 
GGCGACGGCG GCGCGGCCGG GCGCGCGCTC GTGAGCGTCG CGCACGACGC CGACGAGCAG
GCGCGCAACC TGATCGGCTG GCGCCAGACC TACGACCAGC TCGCGGCGGG CCGCTTCGTC
GGCACGTTGA CCGAGCTGCC GCTCGACACG ATGAAGGTGT TCCGGGAGAC GACGAGCCAT
ACGCTGCGGC AGGCGTGCGA GGTGCGCGGC GATGCGTACT GGTTCGGCAT TCCGCTCGCG
CGCGACGGCG CGGCGCGCAT CGACGCGCGG CCGATCGCCG CCGACGCGCT CGCGTTCCGG
CCCGGCAACG TCGAGTTCGA GCTGTTGACG CCCGCGCAAT TCTCGATCTA CGGGGTGGTC
GTGCGCGGCG CGGTGTTGCG CCGTTACGCG CAGGAGGTCG AGCGCTGCGG GCTCGACGAG
CGGTTGCCGC TCGTGCCCGT CGTGCGCGTC GGCGAGGCGC GGCTCACGCG GCTGTGCGCG
TTGCTCGCGC AGCGTCTGGA CGACGCCGAC GCGATGAGCG CGGCGGGCGA GCCGCTATCC
GACTGCGCGC GCAACGACCT GCAGGCGGAA GTGCTCGCGG CGCTGTTGGA CCTGTGCGCG
TCGCCCGCGG CCGACGCGAG CGTCGAGCAC TCGTCGCGGC GCCGCAAGAT CGTCGCGGCC
GCGCGCGACT ACGTGCTCGC GCATCGCTCG CGGCCTGTCG GTGTGCCGGA GCTGTGCGAG
CAACTGCACG TGAGCCGGCG CACGTTGCAG TATTGCTTCC AGGATGTGCT CGGGATGGCG
CCCGCGACCT ACCTGCGCGC GCTGCGGCTC AACGGCGTGC GGCGCGATCT GCGCGGCCGC
GCGGCCGCCT CGGTGCAGGA CGCCGCGGCT GCATGGGGGT TTTGGCATCT GAGCCAGTTC
GCGACCGATT ATCGGCGGAT GTTCGGCGCG CGGCCGTCGG AGACGCTGCG CGACGCGCTC
GCCTGTTGA
 
Protein sequence
MDHEHTGAET TAEDTGRSAD GDGGAAGRAL VSVAHDADEQ ARNLIGWRQT YDQLAAGRFV 
GTLTELPLDT MKVFRETTSH TLRQACEVRG DAYWFGIPLA RDGAARIDAR PIAADALAFR
PGNVEFELLT PAQFSIYGVV VRGAVLRRYA QEVERCGLDE RLPLVPVVRV GEARLTRLCA
LLAQRLDDAD AMSAAGEPLS DCARNDLQAE VLAALLDLCA SPAADASVEH SSRRRKIVAA
ARDYVLAHRS RPVGVPELCE QLHVSRRTLQ YCFQDVLGMA PATYLRALRL NGVRRDLRGR
AAASVQDAAA AWGFWHLSQF ATDYRRMFGA RPSETLRDAL AC