Gene BURPS668_0547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0547 
SymbolrpoH 
ID4884666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp519518 
End bp520453 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content66% 
IMG OID640126475 
ProductRNA polymerase factor sigma-32 
Protein accessionYP_001057600 
Protein GI126441976 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.118318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCAACG CATTGACCCT TCCGAACACA CTGCGCCCGG CGTCCGCTAA GGCCGTATCG 
GCGGGCTCGC TGACGCTCGC CTCTCATTCG ATGCTGCCCG GCCATCTGGG CAACATCGAC
GCCTATATCC AGGCTGTGAA CCGGATTCCG CTGCTAAGCG CGGAGGAAGA GCGTCAATAC
GCGACCGAAT ACCGCGAGCA AAACAATCTC GACGCCGCGC GCCGGCTCGT GCTGTCGCAC
CTGCGGCTCG TCGTGTCGGT CGCGCGCAAC TACCTCGGCT ACGGCCTGCC GCACGGCGAT
CTGATCCAGG AAGGCAACAT CGGCCTGATG AAGGCGGTCA AGCGGTTCGA TCCCGCCCAG
AACGTGCGCC TCGTGTCGTA CGCGATCCAC TGGATCAAGG CCGAGATTCA CGAGTACATC
CTGCGCAACT GGCGCATGGT CAAGGTGGCG ACGACGAAGG CGCAGCGCAA GCTGTTCTTC
AATCTGCGCA GCCACAAGAA GGGCACGCAG GCGTTCACGC CGGAGGAAAT CGACGGCCTC
GCGCAGGAGC TGAACGTCAA GCGCGAGGAA GTGGCCGAGA TGGAAACCCG CCTGTCGGGC
GGCGACATCG CGCTCGAAGG CCAGATCGAC GACGGCGAGG AATCGTACGC GCCGATCGCC
TATCTCGCCG ATTCGCACAA CGAGCCGACC GCCGTGCTCG CCGCGCGGCA GCGCGACATG
CTGCAGACGG ACGGCATCGC GCGCGCGCTC GAATCGCTCG ACGCGCGCAG CCGCCGGATC
ATCGAGGCGC GCTGGCTGAA CGTCGACGAC GACGGCTCGG GCGGCTCGAC GCTGCACGAT
CTCGCGGCCG AATTCGGCGT GTCGGCGGAG CGCATCCGCC AGATCGAGGC AAGCGCGATG
AAGAAGATGC GCACGGCGCT CGCCGCGTAC GCATAA
 
Protein sequence
MSNALTLPNT LRPASAKAVS AGSLTLASHS MLPGHLGNID AYIQAVNRIP LLSAEEERQY 
ATEYREQNNL DAARRLVLSH LRLVVSVARN YLGYGLPHGD LIQEGNIGLM KAVKRFDPAQ
NVRLVSYAIH WIKAEIHEYI LRNWRMVKVA TTKAQRKLFF NLRSHKKGTQ AFTPEEIDGL
AQELNVKREE VAEMETRLSG GDIALEGQID DGEESYAPIA YLADSHNEPT AVLAARQRDM
LQTDGIARAL ESLDARSRRI IEARWLNVDD DGSGGSTLHD LAAEFGVSAE RIRQIEASAM
KKMRTALAAY A