Gene Information |
Locus tag | EcSMS35_3360 |
Symbol | rpoD |
ID | 6145062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3438565 |
End bp | 3440406 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618189 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_001745339 |
Protein GI | 170683578 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
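The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3438565
END_BP = 3440406


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1842
print(protein_length(length_bp))  # 613
```

Under these assumptions the results agree with the reported Gene Length (1842 bp) and Protein Length (613 aa).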
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0343174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC TATCTGACCT ATGCCGAGGT CAATGACCAT CTGCCGGAAG ATATCGTCGA TTCAGATCAG ATCGAAGACA TCATCCAAAT GATCAACGAC ATGGGCATTC AGGTGATGGA AGAAGCACCG GATGCCGATG ATCTGATGCT GGCTGAAAAC ACCGCGGACG AAGATGCTGC CGAAGCCGCC GCGCAGGTGC TTTCCAGCGT GGAATCTGAA ATCGGGCGCA CGACTGACCC GGTACGCATG TACATGCGTG AAATGGGCAC CGTTGAACTG TTGACCCGCG AAGGCGAAAT TGACATCGCT AAGCGTATTG AAGACGGGAT CAACCAGGTT CAATGCTCCG TTGCTGAATA TCCGGAAGCG ATCACCTATC TGCTGGAACA GTACGATCGT GTTGAAGCAG AAGAAGCGCG TCTGTCCGAT CTGATCACCG GCTTTGTTGA CCCGAACGCA GAAGAAGATC TGGCACCTAC CGCCACTCAC GTCGGTTCTG AGCTTTCCCA GGAAGATCTG GACGATGACG AAGATGAAGA CGAAGAAGAT GGCGATGACG ACAGCGCCGA TGATGACAAC AGCATCGACC CGGAACTGGC TCGCGAAAAA TTTGCGGAAC TGCGCGCTCA GTACGTTGTA ACGCGTGACA CCATCAAAGC GAAAGGTCGC AGTCACGCTG CTGCTCAGGA AGAGATCCTG AAACTGTCTG AAGTATTTAA ACAGTTCCGC CTGGTGCCGA AGCAGTTTGA CTACCTGGTC AACAGCATGC GCGTCATGAT GGACCGCGTT CGTACGCAAG AACGTCTGAT CATGAAGCTC TGCGTTGAGC AGTGCAAAAT GCCGAAGAAA AACTTCATTA CCCTGTTTAC CGGCAACGAA ACCAGCGATA CCTGGTTCAA CGCGGCAATT GCGATGAACA AGCCGTGGTC GGAAAAACTG CACGATGTCT CTGAAGAAGT GCATCGCGCC CTGCAGAAAC TGCAGCAGAT TGAAGAAGAA ACCGGCCTGA CCATCGAGCA GGTTAAAGAT ATCAACCGTC GTATGTCCAT CGGTGAAGCA AAAGCCCGCC GTGCGAAGAA AGAGATGGTT GAAGCGAACT TACGTCTGGT TATTTCTATC GCCAAGAAAT ACACCAACCG TGGCTTGCAG TTCCTTGACC TGATTCAGGA AGGTAACATC GGCCTGATGA AAGCGGTTGA TAAATTCGAA TACCGCCGTG GTTACAAGTT CTCCACCTAC GCAACCTGGT GGATCCGTCA GGCGATCACC CGTTCTATCG CGGATCAGGC GCGCACCATC CGTATTCCGG TGCATATGAT TGAGACTATC AACAAACTCA ACCGTATTTC TCGCCAGATG CTGCAAGAGA TGGGCCGTGA ACCGACGCCG GAAGAACTGG CTGAACGTAT GCTGATGCCG GAAGACAAGA TCCGCAAAGT GCTGAAGATC GCCAAAGAGC CAATCTCCAT GGAAACGCCG ATCGGTGATG ATGAAGATTC GCATCTGGGG GATTTCATCG AGGATACCAC CCTCGAGCTG CCGCTGGATT CTGCGACCAC CGAAAGCCTG CGTGCGGCAA CGCACGACGT GCTGGCTGGC CTGACCGCGC GTGAAGCGAA AGTTCTGCGT ATGCGTTTCG GTATCGATAT GAACACCGAC CACACGCTGG AAGAAGTGGG TAAACAGTTC GACGTTACCC GCGAACGTAT CCGTCAGATC GAAGCGAAGG CGCTGCGCAA ACTGCGTCAC 
CCGAGCCGTT CTGAAGTGCT GCGTAGCTTC CTGGACGATT AA
|
Protein sequence | MEQNPQSQLK LLVTRGKEQG YLTYAEVNDH LPEDIVDSDQ IEDIIQMIND MGIQVMEEAP DADDLMLAEN TADEDAAEAA AQVLSSVESE IGRTTDPVRM YMREMGTVEL LTREGEIDIA KRIEDGINQV QCSVAEYPEA ITYLLEQYDR VEAEEARLSD LITGFVDPNA EEDLAPTATH VGSELSQEDL DDDEDEDEED GDDDSADDDN SIDPELAREK FAELRAQYVV TRDTIKAKGR SHAAAQEEIL KLSEVFKQFR LVPKQFDYLV NSMRVMMDRV RTQERLIMKL CVEQCKMPKK NFITLFTGNE TSDTWFNAAI AMNKPWSEKL HDVSEEVHRA LQKLQQIEEE TGLTIEQVKD INRRMSIGEA KARRAKKEMV EANLRLVISI AKKYTNRGLQ FLDLIQEGNI GLMKAVDKFE YRRGYKFSTY ATWWIRQAIT RSIADQARTI RIPVHMIETI NKLNRISRQM LQEMGREPTP EELAERMLMP EDKIRKVLKI AKEPISMETP IGDDEDSHLG DFIEDTTLEL PLDSATTESL RAATHDVLAG LTAREAKVLR MRFGIDMNTD HTLEEVGKQF DVTRERIRQI EAKALRKLRH PSRSEVLRSF LDD
|
| |
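The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the code below strips the whitespace, computes a GC fraction, and translates codons using the standard codon assignments, which translation table 11 shares for elongation (the two tables differ in which codons may act as start codons). All function names are hypothetical, and only the first blocks of the gene sequence are used as example input.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence for display."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as displayed.
prefix = clean_sequence("ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA")
print(translate(prefix))  # MEQNPQSQLK
```

The translated prefix matches the first block of the protein sequence field. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the reported GC content.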