Gene BURPS1106A_A1655 details

Gene Information

Locus tag: BURPS1106A_A1655
Symbol:
ID: 4904472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1623605
End bp: 1624657
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 69%
IMG OID: 640144761
Product: AraC family transcriptional regulator
Protein accession: YP_001075689
Protein GI: 126458411
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
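
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1623605, 1624657)
print(length)                     # 1053
print(protein_length_aa(length))  # 350
```

Both results agree with the Gene Length and Protein Length fields of the record.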


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.365127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGGCG GGCGCCCGGC CGTGCACGCG CAGGAGCGAC AGATGAACCC AGACCTCGAG 
ATCGTCCCCA CCCGCCGCGA CGAATCGTTT CGCGCATGGT CGCACGACTA TCCGCACACG
GTCGCGAAAT GGCATTTTCA TCCGGAGTAC GAAATCCACC TGATTCAGGG TTCGCGCGGC
AAGTTCTTCG TCGGCGACCA TATCGGCGAT TTCGCGCCCG GCAACCTCGT CGTCACCGGG
CCGAACCTGC CGCACAACTG GATCAGCGAG CTCGGCCCCG GCGAGCGCGT GCCGTCGCGC
GACGTCGTGC TGCAGTTCTC GCGCGACGCG GCCGAGAAGA TGGTGGCCGC GTTCGCCGAG
CTGCAGCCGG TGCTCGACCT GATCGACGAA GCGTCGCGCG GCGTGCAGTT TCCGGACGAG
ATCGGGCTCG CCGTCGCGCC GCTGATGCTC GAGCTCGCGA GCGCGCACGG CTGCCGGCGC
GTCGAGGTGC TGATGGCGCT GTTCGACCGG CTGGCGTCGT GCGCCGCGCG TCGCACGCTC
GCCGGCCCCG GCTACCGGAT CGACGCGCAG CACTACATGT CGTCGACGAT CAACCAGGTG
CTCGCGTACC TGCGGCAGAA CCTGCCGGGC GCGCTACGCG AGGCGGACGT CGCCGAATTC
GCCGGCATGA GCGTGAGCAC GTTCACGCGC TTCTTCCGCC GGCACACGGG CTCGACGTTC
GTCCAGTATC TGAACCGGCT GCGGATCAAC GAAGCGTGCG AGCTGCTGAT GTGCTCGGCG
CTCAGCGTCA CCGACATCTG CTACCGCATC GGCTTCAACA ACCTGTCGAA CTTCAACCGG
CAATTCCTCG CGATGAAGGG GATGCCGCCG TCGCGCTTTC GCGCGCTGCA TCGGTTGAAC
GAGCCGCATG ACGCGCCCGA ACCGCACGAG CCGCACGCGT CGCTCGCGCC GGCCGCCGCG
CCCGCGGCCC CGGGCGCGGC GGCCCGCCCT CCCGAGCGCG CCGCGCCCAC CGCGCGCGCC
GTCATTCATT CGCACCGGAG CCTCCACCCG TGA
 
Protein sequence
MNGGRPAVHA QERQMNPDLE IVPTRRDESF RAWSHDYPHT VAKWHFHPEY EIHLIQGSRG 
KFFVGDHIGD FAPGNLVVTG PNLPHNWISE LGPGERVPSR DVVLQFSRDA AEKMVAAFAE
LQPVLDLIDE ASRGVQFPDE IGLAVAPLML ELASAHGCRR VEVLMALFDR LASCAARRTL
AGPGYRIDAQ HYMSSTINQV LAYLRQNLPG ALREADVAEF AGMSVSTFTR FFRRHTGSTF
VQYLNRLRIN EACELLMCSA LSVTDICYRI GFNNLSNFNR QFLAMKGMPP SRFRALHRLN
EPHDAPEPHE PHASLAPAAA PAAPGAAARP PERAAPTARA VIHSHRSLHP
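
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates a space-separated nucleotide block of the kind shown above and computes its GC percentage. It is not the database's own pipeline. It builds the standard codon table (which translation table 11 shares for elongation codons) and assumes that an initial ATG, GTG, or TTG codon is read as methionine, as is common for bacterial start codons. The function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; an initial start codon becomes M, a final stop is dropped."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACGGCG GGCGCCCGGC CGTGCACGCG"))  # MNGGRPAVHA
```

The output matches the first ten residues of the protein sequence; the leading GTG codon, which would encode valine internally, is read as methionine at the start position.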