Gene BURPS668_A1119 details

Gene Information

Locus tag: BURPS668_A1119
Symbol:
ID: 4887653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1074493
End bp: 1075725
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 68%
IMG OID: 640131059
Product: putative purine catabolism transcriptional regulator
Protein accession: YP_001062118
Protein GI: 126444990
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism; [T] Signal transduction mechanisms
COG ID: [COG2508] Regulator of polyketide synthase expression
TIGRFAM ID:
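The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is illustrative only and rests on two assumptions the record does not state: that the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1074493
end_bp = 1075725

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1233 410
```

Under these assumptions the computed values agree with the Gene Length (1233 bp) and Protein Length (410 aa) fields.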


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCTGA CCATCAGCGA GATCCTGCAA CTGCCCGGTC TCGAAGAGCT CCAGCTGCGC 
GCGGGCGAGC GCAGCGTGCA GCGGCCGGTG CGCTGGTACT ACGTCGCGGA GAACGAAGGC
ATCGCCGATT GGGTGATGGG CGGCGAGCTC GTGTTCGTCA CCGGGATCAA TCATCCGCGC
GACGAGGCGA ACCTGCTGCA GCTGATTCGC GAGGGCGCGA AGAGCCGCAT CGCCGGGATG
GTGATCCTGA CGGGCGAGGC GTTCATCCGC CGCATCCCCG ATTCGGTCGT CGCGCTCGCC
GAGCAGCTCG AGATCGTGCT GATCGAGCAG CCGTATCTGC TGAAGATGGT GATCGTCACG
CAGTTGATCG GCACCGCGCT CGCGCGGCAC GAGAACACGC TGCGCTCGCA GCGCGACATC
GTGAACCAGC TGCTGACGGG CGACTACCCG AGCATCGACA TCGCCGCCCA TCGCGCGCGC
AATCTGCAGC TCGCGCTCGA TCGGCCGCGC CGCGTCGTCG CGCTGCGGCT CGCGGGCGTG
CCCGCGCTTT TCGAAGGGCG CGATCCGGCC GCGGCGGAGG CGCTGCTGCA GGATGCACGG
CAGACGGTTC AGCGCGGCCT CGACGACTGG CTGCGCGACG AGGAAGGCGC ACTGCCCGTC
GTCGAGCAGG GCGAGCTGTT CGTGCTGCTG CTGCCGTGCG ACGATCCGCG CTTCAGGAAG
CAAAAGCTCG CGCTCGGCGC GCTGCGCGAC GCGTTGAACC GGCAAACCGG GCCGCTCGCG
CTGTTCGTCG GGATTTCGTC GACGGTCGGC GCCGCGCGCC ATTATTGCCG CGGGCTCGCC
GAGGCGCGGC AGGCGCTCGG CGTTGCCGAG GGCATGCGCG CGGGGCAGGG CCTGTGCGAC
TACAGCGAGC TCGGCGTGCT GAAGCTGCTC GCCGCGATTC CCGATCCGAC GCTGATCGAC
GGCTTCGTGA AGGAAACGCT CGGCAATCTG CTCGACAGCA ACCGCAAGCA TCCGACGATG
CTGATCGAGA CGCTCGAGGC GCTGCTTCAG GAAAACGGCA ACGCGATCAA GGCGGCCGAG
CAGTTGTCGA TCCACCGCAA CACGCTCAAT CACCGGCTGC GCAGGATCGA GACGCAGTCG
GGGCAATCGC TCGCCGATCC GTATTTTCGG CTGAACGCAT CCGTCGCGCT GCTCGCGTGG
CGGATGTCGG ATACGCAACG ACAGGAGTTC TGA
 
Protein sequence
MSLTISEILQ LPGLEELQLR AGERSVQRPV RWYYVAENEG IADWVMGGEL VFVTGINHPR 
DEANLLQLIR EGAKSRIAGM VILTGEAFIR RIPDSVVALA EQLEIVLIEQ PYLLKMVIVT
QLIGTALARH ENTLRSQRDI VNQLLTGDYP SIDIAAHRAR NLQLALDRPR RVVALRLAGV
PALFEGRDPA AAEALLQDAR QTVQRGLDDW LRDEEGALPV VEQGELFVLL LPCDDPRFRK
QKLALGALRD ALNRQTGPLA LFVGISSTVG AARHYCRGLA EARQALGVAE GMRAGQGLCD
YSELGVLKLL AAIPDPTLID GFVKETLGNL LDSNRKHPTM LIETLEALLQ ENGNAIKAAE
QLSIHRNTLN HRLRRIETQS GQSLADPYFR LNASVALLAW RMSDTQRQEF
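
The GC content field could in principle be recomputed from the gene sequence listed above. The following is a minimal sketch, assuming the sequence is pasted as a string in the same layout as this record (blocks of ten bases separated by spaces and line breaks). The function name `gc_content` is hypothetical, and how the source database rounds its percentage is not known.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is ignored, so the block layout used
    in this record can be pasted in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy usage on the first two ten-base blocks of the gene sequence.
print(gc_content("ATGAGTCTGA CCATCAGCGA"))  # prints: 50.0
```

Applying the same function to the full 1233 bp sequence would give a value to compare against the reported 68%.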