Gene BURPS1106A_A1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1033 
Symbol 
ID4903897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp999414 
End bp1000646 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content67% 
IMG OID640144139 
Productputative purine catabolism transcriptional regulator 
Protein accessionYP_001075069 
Protein GI126458435 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGA CCATCAGCGA GATCCTGCAA CTGCCCGGTC TCGAAGAGCT CCAGCTGCGC 
GCGGGCGAGC GCAGCGTGCA GCGGCCGGTG CGCTGGTACT ACGTTGCGGA GAACGAAGGC
ATCGCCGATT GGGTGATGGG CGGCGAGCTC GTGTTCGTCA CCGGGATCAA TCATCCGCGC
GACGAGGCGA ACCTGCTGCA GCTGATTCGC GAGGGCGCGA AGAGCCGCAT CGCCGGGATG
GTGATCCTGA CGGGCGAGGC GTTCATCCGC CGCATCCCCG ATTCGGTCGT CGCGCTTGCC
GAGCAGCTCG AGATCGTGCT GATCGAGCAG CCGTATCTGC TGAAGATGGT GATCGTCACG
CAGTTGATCG GCACCGCGCT TGCGCGGCAC GAGAACACGC TGCGCTCGCA GCGCGACATC
GTGAACCAGC TGCTGACGGG CGACTACCCG AGCATCGACA TCGCCGCCCA TCGCGCGCGC
AATCTGCAGC TCGCGCTCGA TCGGCCGCGC CGCGTCGTCG CGCTGCGGCT CGCGGGCGTG
CCCGCGCTCT TCGAAGGGCG CGATCCGGCT GCGGCGGAGG CGCTGCTGCA GGATGCACGG
CAGACGGTTC AGCGCGGCCT CGACGACTGG CTGCGCGATG AGGATGGCGC ACTGCCCGTC
GTCGAGCAGG GCGAGCTGTT CGTGCTGCTG CTGCCGTGCG ACGATCCGCG CTTCAGGAAG
CAAAAGCTCG CGCTCGGCGC GCTGCGCGAC GCATTGAACC GGCAAACCGG GCCGCTCGCG
CTGTTCGTCG GGATTTCGTC GACGGTCGGC GCCGCGCGCC ATTACTGCCG CGGGCTCGCC
GAGGCGCGGC AGGCGCTCGG CGTCGCCGAG GGCATGCGCG CGGGGCAGGG CCTGTGCGAC
TACAGCGAGC TCGGCGTGCT GAAGCTGCTC GCCGCGATTC CCGATCCGAC GCTGATCGAC
GGCTTCGTGA AGGAAACGCT CGGCAATCTG CTCGACAGCA ACCGCAAGCA TCCGACGATG
CTGATCGAGA CGCTCGAGGC GCTGCTTCAG GAAAACGGCA ACGCGATCAA GGCGGCCGAG
CAGTTGTCGA TCCACCGCAA CACGCTCAAT CACCGGCTGC GCAGGATCGA GACGCAGTCG
GGGCAATCGC TCGCCGATCC GTATTTTCGG CTGAACGCAT CCGTCGCGCT GCTCGCGTGG
CGGATGTCGG ATACGCAACG ACAGGAGTTC TGA
 
Protein sequence
MSLTISEILQ LPGLEELQLR AGERSVQRPV RWYYVAENEG IADWVMGGEL VFVTGINHPR 
DEANLLQLIR EGAKSRIAGM VILTGEAFIR RIPDSVVALA EQLEIVLIEQ PYLLKMVIVT
QLIGTALARH ENTLRSQRDI VNQLLTGDYP SIDIAAHRAR NLQLALDRPR RVVALRLAGV
PALFEGRDPA AAEALLQDAR QTVQRGLDDW LRDEDGALPV VEQGELFVLL LPCDDPRFRK
QKLALGALRD ALNRQTGPLA LFVGISSTVG AARHYCRGLA EARQALGVAE GMRAGQGLCD
YSELGVLKLL AAIPDPTLID GFVKETLGNL LDSNRKHPTM LIETLEALLQ ENGNAIKAAE
QLSIHRNTLN HRLRRIETQS GQSLADPYFR LNASVALLAW RMSDTQRQEF