Gene BURPS1106A_A2526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2526 
Symbol 
ID4906343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2485970 
End bp2487079 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content73% 
IMG OID640145629 
ProductDJ-1/PfpI family protein/transcriptional regulator, AraC family 
Protein accessionYP_001076556 
Protein GI126456454 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0374888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCG CGCCGGCTCG CGCCGACCGC ACCGGCGCGA TGGACAAACC GACTGTACGC 
AATTACCGTA TGGCGGATAT GCCAAAGTCC CCACAGTTCC CGCCAACGCC ATCGGCGGCC
GCGCCCGCCG CCGCGCGCCG CGCGGTGCAC GTGCTCGCGT TCGACGATGT GCAGTTGCTC
GACGTCACCG GGCCGCTGCA AGTGTTCGCG AGCGCGAACG ATTTCGCCGC GCGCCGCGGG
CTCGCGATTC CGTACGCGCC GCGCGTCGTC GCCGCCCACG CGCCTTCGGT GATGTCGTCG
GCCGGGCTCG CGTTCGCCGC CGCGCCGCTG CCCGCCGCGC GCGAGCCGTC CGATACGCTG
ATCGTCGCGG GCGGCTGCGG CGTCCACGGC GCGGCGCGCG ATCCGCGGCT CGTCGACTGG
GTGCGCCGGC GCGCGGCGCA CGCGCGGCGC ATCGCGTCGG TGTGCTCGGG CGCGTTCGTG
CTCGCGGCGG CGGGGCTGCT GGGCGGACGC CGCGTCGTCA CGCACTGGTC GCGCTGCGAC
GAGCTCGCGC AACGCTATCC CGACGTGCGC GTCGAGCCCG ATCCCATTTT CATCCGCGAC
GGCAACGTCT GGACGTCGGC AGGCGTCACG GCCGGCATCG ATCTCGCGCT CGCGCTCGTC
GAGGACGACC TCGGCCGCGC GCTGGCGCTC GACGTCGCGC GGTATCTCGT CGTGTTTCTG
AAGCGCCCGG GCGGCCAGGC GCAATTCAGC GCCGCGCTGT CGCTGCAGCA CGAGGGCGGC
TGCTTCGACG AACTGCACGC ATGGGCGGCC GCGAATCTCG GCGCGGACTT GTCGGTCGCG
GCGCTCGCCG CGCGCGCCGG CATGAGCGAG CGCAGTTTCA TGCGCCGCTA CCGCGAAGCG
ACCGGCAGGA CGCCCGCGCG GGCGATCGAG CAGATGCGCG TCGAAGCCGC GCGCAACCTG
CTCGCCGACG CGCCGCTGCC GATCAAGCGG ATCGCCGCGC GCTGCGGATT CGGCAGCGAG
GAAACGATGC GCCGCAGTTT CCTGCGCATG CTCGGCGTGG CACCGCAGGC CTATCGCGAG
CGGTTCGCGA CGAATCGGCG AGGCGTCTGA
 
Protein sequence
MSVAPARADR TGAMDKPTVR NYRMADMPKS PQFPPTPSAA APAAARRAVH VLAFDDVQLL 
DVTGPLQVFA SANDFAARRG LAIPYAPRVV AAHAPSVMSS AGLAFAAAPL PAAREPSDTL
IVAGGCGVHG AARDPRLVDW VRRRAAHARR IASVCSGAFV LAAAGLLGGR RVVTHWSRCD
ELAQRYPDVR VEPDPIFIRD GNVWTSAGVT AGIDLALALV EDDLGRALAL DVARYLVVFL
KRPGGQAQFS AALSLQHEGG CFDELHAWAA ANLGADLSVA ALAARAGMSE RSFMRRYREA
TGRTPARAIE QMRVEAARNL LADAPLPIKR IAARCGFGSE ETMRRSFLRM LGVAPQAYRE
RFATNRRGV