Gene Bcen_4549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen_4549 
Symbol 
ID4094422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia AU 1054 
KingdomBacteria 
Replicon accessionNC_008061 
Strand
Start bp1804664 
End bp1805932 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content73% 
IMG OID638017836 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_624403 
Protein GI107026892 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGCATG AGGCGACGCA TCGTGCGATC GAAGCGGTCT GGCGAATCGA GGCGCCGAAA 
ATCATCGCGC GGGCCGCGCG GGTGGTGCGC GACGTCGGCG TGGCCGAGGA ACTGGCGCAG
GACACGCTCG TCGCGGCGCT CGAGCACTGG CCCGTCGACG GCGTGCCCGA CAACCCGGCC
GCCTGGCTGA TGACGGCCGT GAAGCGGCGC GCGCTCGACC GGGTCCGGCA GGAGTCGCTT
CATGCGGCGA AGCGCGACCA GCTCGGCCAC GAGATGGACG CGCTCGAGGC GCACGTCGTC
CCCGACATCG CGGACGCGAT CGCCGATGCG GGCGACGACG ACATCGGCGA CGACCTGCTG
CGGCTGATCT TCACGTCGTG CCACCCGGTG CTGTCGACCG ACGCGCGCGT CGCGCTGACG
CTGCGGCTGC TCGGCGGGCT GACGACGGGC GAGATCGCGC GCGCATTCCT GACGCCGGAG
CCGACGATCG CGCAGCGGAT CGTGCGCGCG AAGCGCACGC TCGCGGCGGC GCACGTGCCG
TTCGAGGTGC CAGCGGCCGA TGCGCGGCCG GCGCGGCTCG CGTCCGTGCT CGAAGTGATC
TACCTCGTGT TCAACGAAGG CCATGCGGCG ACTGCCGGCG ACGACTGGAT GCGTCCGGCG
CTGTGCGACG AGGCGTTGCG CCTCGGCCGC GTGCTGGCCG GGCTGGCGCC GGACGAAAGC
GAAGTGCTCG GGCTCGTCGC GCTGATGGAA CTGCAGGCGT CGCGCATGCA TGCCCGCGTC
GACGCGCAAG GCCGGCCCGT GCTGCTGCTC GACCAGGACC GCAGCCGCTG GGATCCGTTG
CTGATCCGGC GCGGCCTCGC GGCGCTGGAG CGGGCGACGA AGCTCGGCGG CGTGCGCGGG
CCGTATGCGC TGCAGGCCGC GCTCGCCGCG TGCCATGCGC GTGCGCGACA GGCGGCGGAC
ACGGACTGGG CGCAGATCGT CGCGCTGTAT GACGCGCTCG CCGAAGTCGC GCCGTCGCCG
GTCGTCGAAC TCAATCGCGC GGTGGCGGTG GGGATGGCGT TCGGGCCGGC CGCGGCGCTC
GAACTCGTCG ACGTGCTGCG CGACGATCCG GCGCTCGCGC GCTATCACTG GCTGCCGAGC
GTGCGCGGCG ATCTGCTCGC GAAGCTCGGC CGTGCCGACG AGGCGAAGCT GGAGTTCCGC
CGCGCGGCGG AGTTGACGCG CAACGAACGT GAGCGCGAGT TGCTGCTCAA GCGTGCGATG
GATGCGTGA
 
Protein sequence
MTHEATHRAI EAVWRIEAPK IIARAARVVR DVGVAEELAQ DTLVAALEHW PVDGVPDNPA 
AWLMTAVKRR ALDRVRQESL HAAKRDQLGH EMDALEAHVV PDIADAIADA GDDDIGDDLL
RLIFTSCHPV LSTDARVALT LRLLGGLTTG EIARAFLTPE PTIAQRIVRA KRTLAAAHVP
FEVPAADARP ARLASVLEVI YLVFNEGHAA TAGDDWMRPA LCDEALRLGR VLAGLAPDES
EVLGLVALME LQASRMHARV DAQGRPVLLL DQDRSRWDPL LIRRGLAALE RATKLGGVRG
PYALQAALAA CHARARQAAD TDWAQIVALY DALAEVAPSP VVELNRAVAV GMAFGPAAAL
ELVDVLRDDP ALARYHWLPS VRGDLLAKLG RADEAKLEFR RAAELTRNER ERELLLKRAM
DA