Gene BURPS668_A1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1744 
Symbol 
ID4887167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1690411 
End bp1691463 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content70% 
IMG OID640131682 
ProductAraC family transcription regulator 
Protein accessionYP_001062739 
Protein GI126442860 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.109239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGGCG GGCGCCCGGC CGTGCACGCG CAGGAGCGAC AGATGAACCC AGACCTCGAG 
ATCGTCCCCA CCCGCCGCGA CGAATCGTTT CGCGCATGGT CGCACGACTA TCCGCACACG
GTCGCGAAAT GGCATTTTCA TCCGGAGTAC GAAATCCACC TGATTCAGGG TTCGCGCGGC
AAGTTCTTCG TCGGCGACCA TATCGGCGAT TTCGCGCCCG GCAACCTCGT CGTCACCGGG
CCGAACCTGC CGCACAACTG GATCAGCGAG CTCGGCCCCG GCGAGCGCGT GCCGTCGCGC
GACGTCGTGC TGCAGTTCTC GCGCGACGCG GCCGAGAAGA TGGTGGCCGC GTTCGCCGAG
CTGCAGCCGG TGCTCGACCT GATCGACGAA GCGTCGCGCG GCGTGCAGTT TCCGGACGAG
ATCGGGCTCG CCGTCGCGCC GCTGATGCTC GAGCTCGCGA GCGCGCACGG CTGCCGGCGC
GTCGAGGTGC TGATGGCGCT GTTCGACCGG CTGGCGTCGT GCGCCGCGCG TCGCACGCTC
GCCGGCCCCG GCTACCGGAT CGACGCGCAG CACTACATGT CGTCGACGAT CAACCAGGTG
CTCGCGTACC TGCGGCAGAA CCTGCCGGGC GCGCTACGCG AGGCGGACGT CGCCGAATTC
GCCGGCATGA GCGTGAGCAC GTTCACGCGC TTCTTCCGCC GGCACACGGG CTCGACGTTC
GTCCAGTACC TGAACCGGCT GCGCATCAAC GAAGCGTGCG AGCTGCTGAT GTGCTCGGCG
CTCAGCGTCA CCGACATCTG CTACCGCATC GGCTTCAACA ACCTGTCGAA CTTCAACCGG
CAATTCCTCG CGATGAAGGG GATGCCGCCG TCGCGCTTTC GCGCGCTGCA TCGGTTGAAC
GAGCCGCATG ACGCGCCCGA ACCGCACGAG CCGCACGCGT CGCTCGCGCC GGCCGCCGCG
CCCGCGGCCC CGGGCGCGGC GGCCCGCCCT CCCGAGCGCG CCGCGCCCAC CGCGCGCGCC
GTCATTCATT CGCACCGGAG CCTCCACCCG TGA
 
Protein sequence
MNGGRPAVHA QERQMNPDLE IVPTRRDESF RAWSHDYPHT VAKWHFHPEY EIHLIQGSRG 
KFFVGDHIGD FAPGNLVVTG PNLPHNWISE LGPGERVPSR DVVLQFSRDA AEKMVAAFAE
LQPVLDLIDE ASRGVQFPDE IGLAVAPLML ELASAHGCRR VEVLMALFDR LASCAARRTL
AGPGYRIDAQ HYMSSTINQV LAYLRQNLPG ALREADVAEF AGMSVSTFTR FFRRHTGSTF
VQYLNRLRIN EACELLMCSA LSVTDICYRI GFNNLSNFNR QFLAMKGMPP SRFRALHRLN
EPHDAPEPHE PHASLAPAAA PAAPGAAARP PERAAPTARA VIHSHRSLHP