Gene B21_02887 details


Gene Information

Locus tag: B21_02887
Symbol: rpoD
ID: 8114541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3078333
End bp: 3080174
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 53%
IMG OID: 644849075
Product: hypothetical protein
Protein accession: YP_003000648
Protein GI: 251786344
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
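
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it agrees with the listed gene length) and that the last codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3078333
end_bp = 3080174

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the final codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1842 0 613
```

Both results match the Gene Length (1842 bp) and Protein Length (613 aa) fields.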


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00855324
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC 
TATCTGACCT ATGCCGAGGT CAATGACCAT CTGCCGGAAG ATATCGTCGA TTCAGATCAG
ATCGAAGACA TCATCCAAAT GATCAACGAC ATGGGCATTC AGGTGATGGA AGAAGCACCG
GATGCCGATG ATCTGATGCT GGCTGAAAAC ACCGCGGACG AAGATGCTGC CGAAGCCGCC
GCGCAGGTGC TTTCCAGCGT GGAATCTGAA ATCGGGCGCA CGACTGACCC GGTACGCATG
TACATGCGTG AAATGGGCAC CGTTGAACTG TTGACCCGCG AAGGCGAAAT TGACATCGCT
AAGCGTATTG AAGACGGGAT CAACCAGGTT CAATGCTCCG TTGCTGAATA TCCGGAAGCG
ATCACCTATC TGCTGGAACA GTACGATCGT GTTGAAGCAG AAGAAGCGCG TCTGTCCGAT
CTGATCACCG GCTTTGTTGA CCCGAACGCA GAAGAAGATC TGGCACCTAC CGCCACTCAC
GTCGGTTCTG AGCTTTCCCA GGAAGATCTG GACGATGACG AAGATGAAGA CGAAGAAGAT
GGCGATGACG ACAGCGCCGA TGATGACAAC AGCATCGACC CGGAACTGGC TCGCGAAAAA
TTTGCGGAAC TACGCGCTCA GTACGTTGTA ACGCGTGACA CCATCAAAGC GAAAGGTCGC
AGTCACGCTA CCGCTCAGGA AGAGATCCTG AAACTGTCTG AAGTATTCAA ACAGTTCCGC
CTGGTGCCGA AGCAGTTTGA CTACCTGGTC AACAGCATGC GCGTCATGAT GGACCGCGTT
CGTACGCAAG AACGTCTGAT CATGAAGCTC TGCGTTGAGC AGTGCAAAAT GCCGAAGAAA
AACTTCATTA CCCTGTTTAC CGGCAACGAA ACCAGCGATA CCTGGTTCAA CGCGGCAATT
GCGATGAACA AGCCGTGGTC GGAAAAACTG CACGATGTCT CTGAAGAAGT GCATCGCGCC
CTGCAAAAAC TGCAGCAGAT TGAAGAAGAA ACCGGCCTGA CCATCGAGCA GGTTAAAGAT
ATCAACCGTC GTATGTCCAT CGGTGAAGCG AAAGCCCGCC GTGCGAAGAA AGAGATGGTT
GAAGCGAACT TACGTCTGGT TATTTCTATC GCTAAGAAAT ACACCAACCG TGGCTTGCAG
TTCCTTGACC TGATTCAGGA AGGCAACATC GGTCTGATGA AAGCGGTTGA TAAATTCGAA
TACCGCCGTG GTTACAAGTT CTCCACCTAC GCAACCTGGT GGATCCGTCA GGCGATCACC
CGCTCTATCG CGGATCAGGC GCGCACCATC CGTATTCCGG TGCATATGAT TGAGACCATC
AACAAGCTCA ACCGTATTTC TCGCCAGATG CTGCAAGAGA TGGGCCGTGA ACCGACGCCG
GAAGAACTGG CTGAACGTAT GCTGATGCCG GAAGACAAGA TCCGCAAAGT GCTGAAGATC
GCCAAAGAGC CAATCTCCAT GGAAACGCCG ATCGGTGATG ATGAAGATTC GCATCTGGGG
GATTTCATCG AGGATACCAC CCTCGAGCTG CCGCTGGATT CTGCGACCAC CGAAAGCCTG
CGTGCGGCAA CGCACGACGT GCTGGCTGGC CTGACCGCGC GTGAAGCAAA AGTTCTGCGT
ATGCGTTTCG GTATCGATAT GAACACCGAC CACACGCTGG AAGAAGTGGG TAAACAGTTC
GACGTTACCC GCGAACGTAT CCGTCAGATC GAAGCGAAGG CGCTGCGCAA ACTGCGTCAC
CCGAGCCGTT CTGAAGTGCT GCGTAGCTTC CTGGACGATT AA
 
Protein sequence
MEQNPQSQLK LLVTRGKEQG YLTYAEVNDH LPEDIVDSDQ IEDIIQMIND MGIQVMEEAP 
DADDLMLAEN TADEDAAEAA AQVLSSVESE IGRTTDPVRM YMREMGTVEL LTREGEIDIA
KRIEDGINQV QCSVAEYPEA ITYLLEQYDR VEAEEARLSD LITGFVDPNA EEDLAPTATH
VGSELSQEDL DDDEDEDEED GDDDSADDDN SIDPELAREK FAELRAQYVV TRDTIKAKGR
SHATAQEEIL KLSEVFKQFR LVPKQFDYLV NSMRVMMDRV RTQERLIMKL CVEQCKMPKK
NFITLFTGNE TSDTWFNAAI AMNKPWSEKL HDVSEEVHRA LQKLQQIEEE TGLTIEQVKD
INRRMSIGEA KARRAKKEMV EANLRLVISI AKKYTNRGLQ FLDLIQEGNI GLMKAVDKFE
YRRGYKFSTY ATWWIRQAIT RSIADQARTI RIPVHMIETI NKLNRISRQM LQEMGREPTP
EELAERMLMP EDKIRKVLKI AKEPISMETP IGDDEDSHLG DFIEDTTLEL PLDSATTESL
RAATHDVLAG LTAREAKVLR MRFGIDMNTD HTLEEVGKQF DVTRERIRQI EAKALRKLRH
PSRSEVLRSF LDD
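
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be removed before the sequences can be compared or analysed. The following is a minimal sketch of how that might be done. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tool. The codon table holds the amino acid assignments of NCBI translation table 11, and the sketch does not special-case alternative start codons, which is sufficient here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, in TCAG order
# (first base varies slowest, third base fastest). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked):
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(blocked.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC"
last_line = "CTGGACGATT AA"

print(translate(clean(first_line)))  # MEQNPQSQLKLLVTRGKEQG
print(translate(clean(last_line)))   # LDD
```

The two printed fragments correspond to the start (MEQNPQSQLK LLVTRGKEQG) and end (LDD) of the protein sequence listed above. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked in the same way.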