Gene BURPS1106A_A0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0234 
Symbol 
ID4903631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp218984 
End bp220330 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content68% 
IMG OID640143341 
ProductLysR family transcriptional regulator 
Protein accessionYP_001074277 
Protein GI126457796 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGCCG CCGCATCGCA TTGCGTTGCC GATGAACCGA TTGCCGCGCA TGAAGAACCA 
GAGCCGCGAC GCGTACCGGC CCGCCCGGGC CCGCTTGCGC GGCGCGCGCA TGCGAAGCGA
AACGAAGCCG GCGGCCGCCG CATCGGCGAC GCCGCACGCC CAATCCGCGC GCCGGCGGCA
GCGAGCCGCG CCGCGCGCGT TCGCGCCCGC CCGAGGTGGC GCCCGCCCAC GATACGCTGC
GCCGCCCCTC CCGCATGCGG CGCAACAGCC ATCGTCCCGA TCCATATCCA GCACCGGGCT
CATGTCGCCG AATCGCGATG CAGCGCCCGC GCGACATCGG CGCCTCGCCC GCGAACGGGC
GAGCGCCGCT CGGTATTTCG AAACATGACC GTTCTTGAGG TGGGAAATAT GTCCGAGGTG
GGGATAAGAA ATTTGAATCA CCTGCGCGTT TTCATGGCGA TCGTCGAGAA GGGCAGCTTC
ACTGCGGCGG CCGAATGCCT GAGCATGTCG AAATCACTGG TCAGCGAATA CCTGAGCCGC
CTCGAAGCCG AGATCGACAC GCAGCTCGTG ATGCGCAGCA CCCGCCGGAT CGCGCCGACC
GACGCCGGCA ACAAGCTGTA TTGCGCGTCG CAGGCGTTCG TGAGCGGCCT GTACGACGTG
ATCGGCAGCA TCCGCTGCCT GCGCCACGAA TCGACCGGCC TGTTGCGCGT CGCGGCGCCG
AGCGGCTTTT CCACCACGCA TCTGAGCTCG ATCGCCGCGA CCTTCATTCA CCAGCATCCG
CAGATCGAGC TCGAGATCGT CTGCAACGAC GACGAGATCG ATCTGGTGGG CGAGCGCGTC
GATCTCGCGT TCGAAACCGG ATGGCCCAAG AAGAAGGGCT TCCGGATGAA GATGCTCGGC
GCGTTCGATC AGGTGCTCGT CGCGTCGCCC GAGTATTCGC GCAGGCACGC GGTGCCGCGG
CATCCGGACG ATCTGCCCGG CTCGCACTGG ATCGGGCACG GCGGGCTCGC CAATCTCAGC
TATTCGGTGT TCGGCAACGA GGGGCGATCG GTCCGGATTC AGACCCCCGG GCGCCTGAAG
GTGAAGAGCG TGCTGCTCGC GCATCAGATG GCGCTCGCCG GCGCGGGCAT CAGTGCGTTT
CCCGATTATC TGGTCGCCGA GGATCTGCGC GAAGGGCGCC TGCATCGGCT GCTGCCGACG
TGGACGATGC CGAAGGGCGG CATCTACGCG TTTCGCACGG CGCCGCGGCA GGCGTCGGTT
CGCGAGCGCC TGTTTCTCGC CGCGGTCCAG GCGTATCTGG CCGGCCTGTG CGGCGAGCAC
GCGCGCGCGG GCGCGGTCCC GACCTAG
 
Protein sequence
MIAAASHCVA DEPIAAHEEP EPRRVPARPG PLARRAHAKR NEAGGRRIGD AARPIRAPAA 
ASRAARVRAR PRWRPPTIRC AAPPACGATA IVPIHIQHRA HVAESRCSAR ATSAPRPRTG
ERRSVFRNMT VLEVGNMSEV GIRNLNHLRV FMAIVEKGSF TAAAECLSMS KSLVSEYLSR
LEAEIDTQLV MRSTRRIAPT DAGNKLYCAS QAFVSGLYDV IGSIRCLRHE STGLLRVAAP
SGFSTTHLSS IAATFIHQHP QIELEIVCND DEIDLVGERV DLAFETGWPK KKGFRMKMLG
AFDQVLVASP EYSRRHAVPR HPDDLPGSHW IGHGGLANLS YSVFGNEGRS VRIQTPGRLK
VKSVLLAHQM ALAGAGISAF PDYLVAEDLR EGRLHRLLPT WTMPKGGIYA FRTAPRQASV
RERLFLAAVQ AYLAGLCGEH ARAGAVPT