Gene EcSMS35_3360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3360 
SymbolrpoD 
ID6145062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3438565 
End bp3440406 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content53% 
IMG OID641618189 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001745339 
Protein GI170683578 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0343174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC 
TATCTGACCT ATGCCGAGGT CAATGACCAT CTGCCGGAAG ATATCGTCGA TTCAGATCAG
ATCGAAGACA TCATCCAAAT GATCAACGAC ATGGGCATTC AGGTGATGGA AGAAGCACCG
GATGCCGATG ATCTGATGCT GGCTGAAAAC ACCGCGGACG AAGATGCTGC CGAAGCCGCC
GCGCAGGTGC TTTCCAGCGT GGAATCTGAA ATCGGGCGCA CGACTGACCC GGTACGCATG
TACATGCGTG AAATGGGCAC CGTTGAACTG TTGACCCGCG AAGGCGAAAT TGACATCGCT
AAGCGTATTG AAGACGGGAT CAACCAGGTT CAATGCTCCG TTGCTGAATA TCCGGAAGCG
ATCACCTATC TGCTGGAACA GTACGATCGT GTTGAAGCAG AAGAAGCGCG TCTGTCCGAT
CTGATCACCG GCTTTGTTGA CCCGAACGCA GAAGAAGATC TGGCACCTAC CGCCACTCAC
GTCGGTTCTG AGCTTTCCCA GGAAGATCTG GACGATGACG AAGATGAAGA CGAAGAAGAT
GGCGATGACG ACAGCGCCGA TGATGACAAC AGCATCGACC CGGAACTGGC TCGCGAAAAA
TTTGCGGAAC TGCGCGCTCA GTACGTTGTA ACGCGTGACA CCATCAAAGC GAAAGGTCGC
AGTCACGCTG CTGCTCAGGA AGAGATCCTG AAACTGTCTG AAGTATTTAA ACAGTTCCGC
CTGGTGCCGA AGCAGTTTGA CTACCTGGTC AACAGCATGC GCGTCATGAT GGACCGCGTT
CGTACGCAAG AACGTCTGAT CATGAAGCTC TGCGTTGAGC AGTGCAAAAT GCCGAAGAAA
AACTTCATTA CCCTGTTTAC CGGCAACGAA ACCAGCGATA CCTGGTTCAA CGCGGCAATT
GCGATGAACA AGCCGTGGTC GGAAAAACTG CACGATGTCT CTGAAGAAGT GCATCGCGCC
CTGCAGAAAC TGCAGCAGAT TGAAGAAGAA ACCGGCCTGA CCATCGAGCA GGTTAAAGAT
ATCAACCGTC GTATGTCCAT CGGTGAAGCA AAAGCCCGCC GTGCGAAGAA AGAGATGGTT
GAAGCGAACT TACGTCTGGT TATTTCTATC GCCAAGAAAT ACACCAACCG TGGCTTGCAG
TTCCTTGACC TGATTCAGGA AGGTAACATC GGCCTGATGA AAGCGGTTGA TAAATTCGAA
TACCGCCGTG GTTACAAGTT CTCCACCTAC GCAACCTGGT GGATCCGTCA GGCGATCACC
CGTTCTATCG CGGATCAGGC GCGCACCATC CGTATTCCGG TGCATATGAT TGAGACTATC
AACAAACTCA ACCGTATTTC TCGCCAGATG CTGCAAGAGA TGGGCCGTGA ACCGACGCCG
GAAGAACTGG CTGAACGTAT GCTGATGCCG GAAGACAAGA TCCGCAAAGT GCTGAAGATC
GCCAAAGAGC CAATCTCCAT GGAAACGCCG ATCGGTGATG ATGAAGATTC GCATCTGGGG
GATTTCATCG AGGATACCAC CCTCGAGCTG CCGCTGGATT CTGCGACCAC CGAAAGCCTG
CGTGCGGCAA CGCACGACGT GCTGGCTGGC CTGACCGCGC GTGAAGCGAA AGTTCTGCGT
ATGCGTTTCG GTATCGATAT GAACACCGAC CACACGCTGG AAGAAGTGGG TAAACAGTTC
GACGTTACCC GCGAACGTAT CCGTCAGATC GAAGCGAAGG CGCTGCGCAA ACTGCGTCAC
CCGAGCCGTT CTGAAGTGCT GCGTAGCTTC CTGGACGATT AA
 
Protein sequence
MEQNPQSQLK LLVTRGKEQG YLTYAEVNDH LPEDIVDSDQ IEDIIQMIND MGIQVMEEAP 
DADDLMLAEN TADEDAAEAA AQVLSSVESE IGRTTDPVRM YMREMGTVEL LTREGEIDIA
KRIEDGINQV QCSVAEYPEA ITYLLEQYDR VEAEEARLSD LITGFVDPNA EEDLAPTATH
VGSELSQEDL DDDEDEDEED GDDDSADDDN SIDPELAREK FAELRAQYVV TRDTIKAKGR
SHAAAQEEIL KLSEVFKQFR LVPKQFDYLV NSMRVMMDRV RTQERLIMKL CVEQCKMPKK
NFITLFTGNE TSDTWFNAAI AMNKPWSEKL HDVSEEVHRA LQKLQQIEEE TGLTIEQVKD
INRRMSIGEA KARRAKKEMV EANLRLVISI AKKYTNRGLQ FLDLIQEGNI GLMKAVDKFE
YRRGYKFSTY ATWWIRQAIT RSIADQARTI RIPVHMIETI NKLNRISRQM LQEMGREPTP
EELAERMLMP EDKIRKVLKI AKEPISMETP IGDDEDSHLG DFIEDTTLEL PLDSATTESL
RAATHDVLAG LTAREAKVLR MRFGIDMNTD HTLEEVGKQF DVTRERIRQI EAKALRKLRH
PSRSEVLRSF LDD