Gene EcHS_A3248 details

Gene Information

Locus tag: EcHS_A3248
Symbol: rpoD
ID: 5592247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3256492
End bp: 3258333
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 53%
IMG OID: 640922365
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_001459861
Protein GI: 157162543
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
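
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical, not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3256492
end_bp = 3258333
gene_length_bp = 1842
protein_length_aa = 613

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # coordinates agree with gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with protein length
```

Under these assumptions all three checks print True: 3258333 - 3256492 + 1 = 1842, and 1842 / 3 - 1 = 613.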


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.0316286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC 
TATCTGACCT ATGCCGAGGT CAATGACCAT CTGCCGGAAG ATATCGTCGA TTCAGATCAG
ATCGAAGACA TCATCCAAAT GATCAACGAC ATGGGCATTC AGGTGATGGA AGAAGCACCG
GATGCCGATG ATCTGATGCT GGCTGAAAAC ACCGCGGACG AAGATGCTGC CGAAGCCGCC
GCGCAGGTGC TTTCCAGCGT GGAATCTGAA ATCGGGCGCA CGACTGACCC GGTACGCATG
TACATGCGTG AAATGGGCAC CGTTGAACTG TTGACCCGCG AAGGCGAAAT TGACATCGCT
AAGCGTATTG AAGACGGGAT CAACCAGGTT CAATGCTCCG TTGCTGAATA TCCGGAAGCG
ATCACCTATC TGCTGGAACA GTACGATCGT GTTGAAGCAG AAGAAGCGCG TCTGTCCGAT
CTGATCACCG GCTTTGTTGA CCCGAACGCA GAAGAAGATC TGGCACCTAC CGCCACTCAC
GTCGGTTCTG AGCTTTCCCA GGAAGATCTG GACGATGACG AAGATGAAGA CGAAGAAGAT
GGCGATGACG ACAGCGCCGA TGATGACAAC AGCATCGACC CGGAACTGGC TCGCGAAAAA
TTTGCGGAAC TACGCGCTCA GTACGTTGTA ACGCGTGACA CCATCAAAGC GAAAGGTCGC
AGTCACGCTA CCGCTCAGGA AGAGATCCTG AAACTGTCTG AAGTATTCAA ACAGTTCCGC
CTGGTGCCGA AGCAGTTTGA CTACCTGGTC AACAGCATGC GCGTCATGAT GGACCGCGTT
CGTACGCAAG AACGTCTGAT CATGAAGCTC TGCGTTGAGC AGTGCAAAAT GCCGAAGAAA
AACTTCATTA CCCTGTTTAC CGGCAACGAA ACCAGCGATA CCTGGTTCAA CGCGGCAATT
GCGATGAACA AGCCGTGGTC GGAAAAACTG CACGATGTCT CTGAAGAAGT GCATCGCGCC
CTGCAAAAAC TGCAGCAGAT TGAAGAAGAA ACCGGCCTGA CCATCGAGCA GGTTAAAGAT
ATCAACCGTC GTATGTCCAT CGGTGAAGCG AAAGCCCGCC GTGCGAAGAA AGAGATGGTT
GAAGCGAACT TACGTCTGGT TATTTCTATC GCTAAGAAAT ACACCAACCG TGGCTTGCAG
TTCCTTGACC TGATTCAGGA AGGCAACATC GGTCTGATGA AAGCGGTTGA TAAATTCGAA
TACCGCCGTG GTTACAAGTT CTCCACCTAC GCAACCTGGT GGATCCGTCA GGCGATCACC
CGCTCTATCG CGGATCAGGC GCGCACCATC CGTATTCCGG TGCATATGAT TGAGACCATC
AACAAGCTCA ACCGTATTTC TCGCCAGATG CTGCAAGAGA TGGGCCGTGA ACCGACGCCG
GAAGAACTGG CTGAACGTAT GCTGATGCCG GAAGACAAGA TCCGCAAAGT GCTGAAGATC
GCCAAAGAGC CAATCTCCAT GGAAACGCCG ATCGGTGATG ATGAAGATTC GCATCTGGGG
GATTTCATCG AGGATACCAC CCTCGAGCTG CCGCTGGATT CTGCGACCAC CGAAAGCCTG
CGTGCGGCAA CGCACGACGT GCTGGCTGGC CTGACCGCGC GTGAAGCGAA AGTTCTGCGT
ATGCGTTTCG GTATCGATAT GAACACCGAC CACACGCTGG AAGAAGTGGG TAAACAGTTC
GACGTTACCC GCGAACGTAT CCGTCAGATC GAAGCGAAGG CGCTGCGCAA ACTGCGTCAC
CCGAGCCGTT CTGAAGTGTT GCGTAGCTTC CTGGACGATT AA
 
Protein sequence
MEQNPQSQLK LLVTRGKEQG YLTYAEVNDH LPEDIVDSDQ IEDIIQMIND MGIQVMEEAP 
DADDLMLAEN TADEDAAEAA AQVLSSVESE IGRTTDPVRM YMREMGTVEL LTREGEIDIA
KRIEDGINQV QCSVAEYPEA ITYLLEQYDR VEAEEARLSD LITGFVDPNA EEDLAPTATH
VGSELSQEDL DDDEDEDEED GDDDSADDDN SIDPELAREK FAELRAQYVV TRDTIKAKGR
SHATAQEEIL KLSEVFKQFR LVPKQFDYLV NSMRVMMDRV RTQERLIMKL CVEQCKMPKK
NFITLFTGNE TSDTWFNAAI AMNKPWSEKL HDVSEEVHRA LQKLQQIEEE TGLTIEQVKD
INRRMSIGEA KARRAKKEMV EANLRLVISI AKKYTNRGLQ FLDLIQEGNI GLMKAVDKFE
YRRGYKFSTY ATWWIRQAIT RSIADQARTI RIPVHMIETI NKLNRISRQM LQEMGREPTP
EELAERMLMP EDKIRKVLKI AKEPISMETP IGDDEDSHLG DFIEDTTLEL PLDSATTESL
RAATHDVLAG LTAREAKVLR MRFGIDMNTD HTLEEVGKQF DVTRERIRQI EAKALRKLRH
PSRSEVLRSF LDD
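
The sequences above are listed in blocks of ten characters, six blocks per line. To work with them programmatically, the blocks first have to be joined into one contiguous string. The following is a minimal sketch of that step, together with a GC-fraction calculation of the kind that could be compared against the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listing, copied as displayed.
first_line = "ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC"

dna = join_blocks(first_line)
print(len(dna))                    # 60 bases: six blocks of ten
print(round(gc_fraction(dna), 2))  # GC fraction of this first line only
```

Pasting the full gene sequence listing into `join_blocks` instead of `first_line` would give the whole-gene GC fraction.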