Gene BCZK4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK4042 
SymbolsigA 
ID3026972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp4150370 
End bp4151491 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content39% 
IMG OID637548256 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_085621 
Protein GI52141207 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000102439 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACA AACCAGCTCG TTCTAAACAA ATTGAAACTG AAATGACCCT TGAGCAAGTG 
AAAGAACAAC TCACTGAGCT CGGAAAAAAA CGTGGCGTTC TTACATATGA AGAGATTGCA
GAACGCATGA ATGGATTTGA AATTGAATCC GATCAAATGG ATGAATACTA TGAATATTTA
GGTGAACAAG GGATTGACTT AGTTGGCGAC AACGATGAAG GCCCTAATAA TCACCAAATT
ACAAAAACAG AAGAAGAGTT TGACCTGAAT GACTTAAGTG TACCACCAGG GGTTAAAATC
AACGATCCTG TTCGTATGTA TTTAAAAGAA ATTGGTCGTG TAGATTTACT ATCTGCAGAA
GAAGAAATTC GACTTGCAAC GCGTATTGAA GAAGGCGATG AAGAAGCAAA ACGTCGTCTT
GCAGAAGCAA ACTTACGTCT TGTAGTAAGT ATTGCAAAGC GCTATGTAGG CCGCGGTATG
CTTTTCTTAG ACTTAATCCA AGAAGGGAAT ATGGGTCTAA TTAAAGCGGT TGAAAAGTTC
GATTATCGTA AAGGTTTCAA ATTTAGTACG TATGCAACTT GGTGGATTCG CCAAGCAATT
ACACGTGCGA TTGCAGACCA AGCAAGAACA ATTCGTATCC CAGTTCATAT GGTTGAAACG
ATTAATAAGT TAATTCGTGT ACAACGTCAA TTATTACAAG ATTTAGGACG TGAACCATCT
CCTGAAGAGA TTGGTGAAGA AATGGATCTT GCTCCAGAAA AAGTGCGCGA AATCTTAAAA
ATTGCTCAGG AGCCAGTCTC TCTTGAAACA CCGATTGGTG AAGAAGATGA CTCCCATTTA
GGTGATTTTA TTGAAGACCA AGAAGCAACA TCGCCTGCGG ACCATGCAGC GTATGAATTG
CTAAAAGAAC AATTAGAAGA TGTGTTAGAT ACACTAACAG ATCGTGAAGA AAATGTTCTA
CGTCTTCGTT TTGGTTTAGA TGATGGACGA ACTCGTACGC TTGAAGAAGT TGGGAAAGTA
TTCGGCGTAA CGAGAGAACG TATTCGTCAA ATTGAAGCAA AAGCACTTCG TAAATTGAGA
CATCCTAGCC GTAGTAAGCG TCTTAAGGAT TTCTTAGAAT AG
 
Protein sequence
MADKPARSKQ IETEMTLEQV KEQLTELGKK RGVLTYEEIA ERMNGFEIES DQMDEYYEYL 
GEQGIDLVGD NDEGPNNHQI TKTEEEFDLN DLSVPPGVKI NDPVRMYLKE IGRVDLLSAE
EEIRLATRIE EGDEEAKRRL AEANLRLVVS IAKRYVGRGM LFLDLIQEGN MGLIKAVEKF
DYRKGFKFST YATWWIRQAI TRAIADQART IRIPVHMVET INKLIRVQRQ LLQDLGREPS
PEEIGEEMDL APEKVREILK IAQEPVSLET PIGEEDDSHL GDFIEDQEAT SPADHAAYEL
LKEQLEDVLD TLTDREENVL RLRFGLDDGR TRTLEEVGKV FGVTRERIRQ IEAKALRKLR
HPSRSKRLKD FLE