Gene BURPS1710b_A0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0419 
SymbolhrpB 
ID3692189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp585770 
End bp587197 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content61% 
IMG OID637730673 
Producthypothetical protein 
Protein accessionYP_335578 
Protein GI76819581 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.151395 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCGG CTCTGAGCTT GCTCGGTGCC GACGCGCCGA CGCACTTGCA TTCGCACTTG 
AAGTTGATTC TGGGCGGCGA GTTCAATGCG GCGCTCGAAA GATCGAGCGA ATGGGCCGAA
ACGACGGTGG CATCGGAGCG GACATCATGG GATCTGCAAT TGCACGCGGA TCTGCAGCTG
GTGCTTGGCT TCGAAGTCGA AGCCGAAGAA AACTATCGGC GCGCCCAGCG AAAAATTCGC
GGCTCAAACA GTAAGATTCG CATCGCGACC TGCCGGAACG CCGCGTGGCA AGCCCTGTTC
CGCTACCGGG TCACGACCGC GCTCGCGTGT TTTTCCCGAA TCTGCGACGA GCCCGGCATC
GAGGCCGGCG GATTGGTGGA GGCGCGCTTT GGGATCGCCT GCGCGCTCTA TGAAATGGGG
CGGATAGACG ATGCGTTTGA TGCGATCGAT TCGATGGAGA AGATCGCCGA ACAGCAATCG
GACGAGATGC GCGCGCACTG GAAAGACTTG ATCGCCGTGT TGCGTTTCGA TCTCGTCGTG
CAAAGCGAAT TGCGCCGGGC TGCGGCGTTC GTCGATCATG TGTATTGGCA ATCTGCGCAG
TCGATGAGCC GGGTGGACCG CGCGCACGGT GTGTCGGAGG CCGCCGTATC CGTCGAGACG
CCGCTGCTGC GCGGCCGGGT GGCCTATCTG CTGCAGTTGC GATGCGCGGC CGCGGGCAAT
CGGGACGCCG TCGCCGAGTT GGCGCGTTGC CTCGATGCGG CGGGCGAGCA GGGATTCGTC
GACTTTCGAT ACACGCTGCG CCTCGAGATT GCGCTCGCCC TGCTCGCGGG CGACGCGCCC
AATTTGGCGC AATTCGTGTT GGAGCCGATT TCCGACACAT TGCATGGCGC AGAGTCGAGC
CGCCGCTATC GGGAATATTT CTATTGCGCC GCGAAGGTGC ATCTGGCGCA GGACCACACG
CAGGAATCGC TGGCCTTATA TCGACGCTAC GCGCTGATCG CGATGAGATG TCTGCGCGAG
GACGCGCTGA TCGGCAGGCA GTTCCTGGTC GGGCAGGAAC TGAAGCAGCT TCCCCAGTCC
GACGATGTGA CCGTGCGCTT GCCGTTGAAA TATCGGCGCG CCTATCACTA TATTCTCCAG
AATCTCAACC GTAGCGACCT TTCGGTTCGG GAGATCGCGG CGGAGATCGG CGTCACGGAG
CGCGCGCTGC AGAACGCATT CAAGATCTAC CTCGGGCTTT CCCCGCGTGA ACTGATCCGC
TCGCGGAGAA TGGAGCGTAT CCGCACGGAA CTCGTCGATT TCACGTTGAC GGGTGAGCGC
AACGTCAAGG AGGCGGCTCG AAAATGGGGT GTCCAGAATG GTTCGACACT CGTGATCGCC
TATCGGAAGG AGTACGACGA AACCCCTTCG GAAACGCTCG CGCGCTGA
 
Protein sequence
MVSALSLLGA DAPTHLHSHL KLILGGEFNA ALERSSEWAE TTVASERTSW DLQLHADLQL 
VLGFEVEAEE NYRRAQRKIR GSNSKIRIAT CRNAAWQALF RYRVTTALAC FSRICDEPGI
EAGGLVEARF GIACALYEMG RIDDAFDAID SMEKIAEQQS DEMRAHWKDL IAVLRFDLVV
QSELRRAAAF VDHVYWQSAQ SMSRVDRAHG VSEAAVSVET PLLRGRVAYL LQLRCAAAGN
RDAVAELARC LDAAGEQGFV DFRYTLRLEI ALALLAGDAP NLAQFVLEPI SDTLHGAESS
RRYREYFYCA AKVHLAQDHT QESLALYRRY ALIAMRCLRE DALIGRQFLV GQELKQLPQS
DDVTVRLPLK YRRAYHYILQ NLNRSDLSVR EIAAEIGVTE RALQNAFKIY LGLSPRELIR
SRRMERIRTE LVDFTLTGER NVKEAARKWG VQNGSTLVIA YRKEYDETPS ETLAR