Gene ECH74115_5454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5454 
SymbolrpoC 
ID6966776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5097359 
End bp5101582 
Gene Length4224 bp 
Protein Length1407 aa 
Translation table11 
GC content54% 
IMG OID643389103 
ProductDNA-directed RNA polymerase subunit beta' 
Protein accessionYP_002273504 
Protein GI209399168 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00336905 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAAAGATT TATTAAAGTT TCTGAAAGCG CAGACTAAAA CCGAAGAGTT TGATGCGATC 
AAAATTGCTC TGGCTTCGCC AGACATGATC CGTTCATGGT CTTTCGGTGA AGTTAAAAAG
CCGGAAACCA TCAACTACCG TACGTTCAAA CCAGAACGTG ACGGCCTTTT CTGCGCCCGT
ATCTTTGGGC CGGTAAAAGA TTACGAGTGC CTGTGCGGTA AGTACAAGCG CCTGAAACAC
CGTGGCGTCA TCTGTGAGAA GTGTGGCGTT GAAGTGACCC AGACTAAAGT ACGCCGTGAG
CGTATGGGCC ACATCGAACT GGCTTCCCCG ACTGCGCACA TCTGGTTCCT GAAATCGCTG
CCGTCCCGTA TCGGCCTGCT GCTCGATATG CCGCTGCGCG ATATCGAACG CGTACTGTAC
TTTGAATCCT ATGTGGTTAT CGAAGGCGGT ATGACCAACC TGGAGCGTCA GCAGATCCTG
ACTGAAGAGC AGTATCTGGA CGCGCTGGAA GAGTTCGGTG ACGAATTCGA CGCGAAAATG
GGTGCGGAAG CAATCCAGGC CCTGCTGAAG AGCATGGATC TGGAGCAAGA GTGCGAACAG
CTGCGTGAAG AGCTGAACGA AACCAACTCC GAAACCAAGC GTAAAAAGCT GACCAAGCGT
ATCAAACTGC TGGAAGCGTT CGTTCAGTCT GGTAACAAAC CAGAGTGGAT GATCCTGACC
GTTCTGCCGG TACTGCCGCC AGATCTGCGT CCGCTGGTTC CGCTGGATGG TGGTCGTTTC
GCGACTTCTG ACCTGAACGA TCTGTATCGT CGCGTCATTA ACCGTAACAA CCGTCTGAAA
CGTCTGCTGG ATCTGGCTGC GCCGGACATC ATCGTACGTA ACGAAAAACG TATGCTGCAG
GAAGCGGTAG ACGCCCTGCT GGATAACGGT CGTCGCGGTC GTGCGATCAC CGGTTCTAAC
AAGCGTCCTC TGAAATCTTT GGCCGACATG ATCAAAGGTA AACAGGGTCG TTTCCGTCAG
AACCTGCTCG GTAAGCGTGT TGACTACTCC GGTCGTTCTG TAATCACCGT GGGTCCATAC
CTGCGTCTGC ATCAGTGCGG TCTGCCGAAG AAAATGGCGC TGGAGCTGTT CAAACCGTTC
ATCTACGGCA AGCTGGAACT GCGTGGTCTT GCTACCACCA TTAAAGCTGC GAAGAAAATG
GTTGAGCGCG AAGAAGCTGT CGTTTGGGAT ATCCTGGACG AAGTTATCCG CGAACACCCG
GTACTGCTGA ACCGTGCACC GACTCTGCAC CGTCTGGGTA TCCAGGCATT TGAACCGGTA
CTGATCGAAG GTAAAGCTAT CCAGCTGCAC CCGCTGGTTT GTGCGGCATA TAACGCCGAC
TTCGATGGTG ACCAGATGGC TGTTCACGTA CCGCTGACGC TGGAAGCCCA GCTGGAAGCG
CGTGCGCTGA TGATGTCTAC CAACAACATC CTGTCCCCGG CGAACGGCGA ACCAATCATC
GTTCCGTCTC AGGACGTTGT ACTGGGTCTG TACTACATGA CCCGTGACTG TGTTAACGCC
AAAGGCGAAG GCATGGTGCT GACTGGCCCG AAAGAAGCAG AACGTCTGTA TCGCTCTGGT
CTGGCTTCTC TGCATGCGCG CGTTAAAGTG CGTATCACCG AGTATGAAAA AGATGCTAAC
GGTGAATTAG TAGCGAAAAC CAGCCTGAAA GACACGACTG TTGGCCGTGC CATTCTGTGG
ATGATTGTAC CGAAAGGTCT GCCTTACTCC ATCGTCAACC AGGCGCTGGG TAAAAAAGCA
ATCTCCAAAA TGCTGAACAC CTGCTACCGC ATTCTCGGTC TGAAACCGAC CGTTATTTTT
GCGGACCAGA TCATGTACAC CGGCTTTGCA TATGCAGCGC GTTCTGGTGC ATCTGTTGGT
ATCGATGACA TGGTCATCCC GGAGAAGAAA CACGAAATCA TCTCCGAGGC AGAAGCAGAA
GTTGCTGAAA TTCAGGAGCA GTTCCAGTCT GGTCTGGTAA CTGCGGGCGA ACGCTACAAC
AAAGTTATCG ATATCTGGGC TGCGGCGAAC GATCGTGTAT CCAAAGCGAT GATGGATAAC
CTGCAAACTG AAACCGTTAT TAACCGTGAC GGTCAGGAAG AGAAGCAGGT TTCCTTCAAC
AGCATCTACA TGATGGCCGA CTCCGGTGCG CGTGGTTCTG CGGCACAGAT TCGTCAGCTT
GCTGGTATGC GTGGTCTGAT GGCGAAGCCG GATGGCTCCA TCATCGAAAC GCCAATCACC
GCGAACTTCC GTGAAGGTCT GAACGTACTC CAGTACTTCA TCTCCACCCA CGGTGCTCGT
AAAGGTCTGG CGGATACCGC ACTGAAAACT GCGAACTCCG GTTACCTGAC TCGTCGTCTG
GTTGACGTGG CGCAGGACCT GGTGGTTACC GAAGACGATT GTGGTACCCA TGAAGGTATC
ATGATGACTC CGGTTATCGA GGGTGGTGAC GTTAAAGAGC CGCTGCGCGA TCGCGTATTG
GGTCGTGTAA CTGCTGAAGA CGTTCTGAAG CCGGGTACTG CTGATATCCT CGTTCCGCGC
AACACGCTGC TGCACGAACA GTGGTGTGAC CTGCTGGAAG AGAACTCTGT CGACGCGGTT
AAAGTACGTT CTGTTGTATC TTGTGACACC GACTTTGGTG TATGTGCGCA CTGCTACGGT
CGTGACCTGG CGCGTGGCCA CATCATCAAC AAGGGTGAAG CAATCGGTGT TATCGCGGCA
CAGTCCATCG GTGAACCGGG TACACAGCTG ACCATGCGTA CGTTCCACAT CGGTGGTGCG
GCATCTCGTG CGGCTGCTGA ATCCAGCATC CAGGTGAAAA ACAAAGGTAG CATCAAGCTC
AGCAACGTGA AGTCGGTTGT GAACTCCAGC GGTAAACTGG TTATCACTTC CCGTAACACC
GAACTGAAAC TGATCGACGA ATTCGGTCGT ACCAAAGAAA GCTACAAAGT ACCTTACGGT
GCGGTACTGG CAAAAGGCGA TGGCGAACAG GTTGCTGGCG GCGAAACCGT TGCAAACTGG
GACCCGCACA CCATGCCGGT TATCACCGAA GTAAGCGGTT TTGTACGCTT TACTGACATG
ATCGACGGCC AGACCATTAC TCGTCAGACC GACGAACTGA CCGGTCTGTC TTCGCTGGTG
GTTCTGGATT CCGCAGAACG TACCGCAGGT GGTAAAGATC TGCGTCCGGC ACTGAAAATC
GTTGATGCTC AGGGTAACGA CGTTCTGATC CCAGGTACCG ATATGCCTGC GCAGTACTTC
CTGCCGGGTA AAGCGATTGT TCAGCTGGAA GATGGCGTAC AGATCAGCTC TGGTGACACC
CTGGCGCGTA TTCCGCAGGA ATCCGGCGGT ACCAAGGACA TCACCGGTGG TCTGCCACGC
GTTGCGGACC TGTTCGAAGC ACGTCGTCCG AAAGAGCCGG CAATCCTGGC TGAAATCAGC
GGTATCGTTT CCTTCGGTAA AGAAACCAAA GGTAAACGTC GTCTGGTTAT CACCCCGGTA
GACGGTAGCG ATCCGTACGA AGAGATGATT CCGAAATGGC GTCAGCTCAA CGTGTTCGAA
GGTGAACGTG TAGAACGTGG TGACGTAATT TCCGACGGTC CGGAAGCGCC GCACGACATT
CTGCGTCTGC GTGGTGTTCA TGCTGTGACT CGTTACATCG TTAACGAAGT ACAGGACGTA
TACCGTCTGC AGGGCGTTAA GATTAACGAT AAACACATCG AAGTTATCGT TCGTCAGATG
CTGCGTAAAG CTACCATCGT TAACGCGGGC AGCTCCGACT TCCTGGAAGG CGAACAGGTT
GAATACTCTC GCGTCAAGAT CGCAAACCGC GAACTGGAAG CGAACGGCAA AGTGGGTGCA
ACTTACTCCC GCGATCTGCT GGGTATCACC AAAGCGTCTC TGGCAACCGA GTCCTTCATC
TCCGCGGCAT CGTTCCAGGA GACCACTCGC GTGCTGACCG AAGCAGCCGT TGCGGGCAAA
CGCGACGAAC TGCGCGGCCT GAAAGAGAAC GTTATCGTGG GTCGTCTGAT CCCGGCAGGT
ACCGGTTACG CGTACCACCA GGATCGTATG CGTCGCCGTG CTGCGGGTGA AGCTCCGGCT
GCACCGCAGG TGACTGCAGA AGACGCATCT GCCAGCCTGG CAGAACTGCT GAACGCAGGT
CTGGGCGGTT CTGATAACGA GTAA
 
Protein sequence
MKDLLKFLKA QTKTEEFDAI KIALASPDMI RSWSFGEVKK PETINYRTFK PERDGLFCAR 
IFGPVKDYEC LCGKYKRLKH RGVICEKCGV EVTQTKVRRE RMGHIELASP TAHIWFLKSL
PSRIGLLLDM PLRDIERVLY FESYVVIEGG MTNLERQQIL TEEQYLDALE EFGDEFDAKM
GAEAIQALLK SMDLEQECEQ LREELNETNS ETKRKKLTKR IKLLEAFVQS GNKPEWMILT
VLPVLPPDLR PLVPLDGGRF ATSDLNDLYR RVINRNNRLK RLLDLAAPDI IVRNEKRMLQ
EAVDALLDNG RRGRAITGSN KRPLKSLADM IKGKQGRFRQ NLLGKRVDYS GRSVITVGPY
LRLHQCGLPK KMALELFKPF IYGKLELRGL ATTIKAAKKM VEREEAVVWD ILDEVIREHP
VLLNRAPTLH RLGIQAFEPV LIEGKAIQLH PLVCAAYNAD FDGDQMAVHV PLTLEAQLEA
RALMMSTNNI LSPANGEPII VPSQDVVLGL YYMTRDCVNA KGEGMVLTGP KEAERLYRSG
LASLHARVKV RITEYEKDAN GELVAKTSLK DTTVGRAILW MIVPKGLPYS IVNQALGKKA
ISKMLNTCYR ILGLKPTVIF ADQIMYTGFA YAARSGASVG IDDMVIPEKK HEIISEAEAE
VAEIQEQFQS GLVTAGERYN KVIDIWAAAN DRVSKAMMDN LQTETVINRD GQEEKQVSFN
SIYMMADSGA RGSAAQIRQL AGMRGLMAKP DGSIIETPIT ANFREGLNVL QYFISTHGAR
KGLADTALKT ANSGYLTRRL VDVAQDLVVT EDDCGTHEGI MMTPVIEGGD VKEPLRDRVL
GRVTAEDVLK PGTADILVPR NTLLHEQWCD LLEENSVDAV KVRSVVSCDT DFGVCAHCYG
RDLARGHIIN KGEAIGVIAA QSIGEPGTQL TMRTFHIGGA ASRAAAESSI QVKNKGSIKL
SNVKSVVNSS GKLVITSRNT ELKLIDEFGR TKESYKVPYG AVLAKGDGEQ VAGGETVANW
DPHTMPVITE VSGFVRFTDM IDGQTITRQT DELTGLSSLV VLDSAERTAG GKDLRPALKI
VDAQGNDVLI PGTDMPAQYF LPGKAIVQLE DGVQISSGDT LARIPQESGG TKDITGGLPR
VADLFEARRP KEPAILAEIS GIVSFGKETK GKRRLVITPV DGSDPYEEMI PKWRQLNVFE
GERVERGDVI SDGPEAPHDI LRLRGVHAVT RYIVNEVQDV YRLQGVKIND KHIEVIVRQM
LRKATIVNAG SSDFLEGEQV EYSRVKIANR ELEANGKVGA TYSRDLLGIT KASLATESFI
SAASFQETTR VLTEAAVAGK RDELRGLKEN VIVGRLIPAG TGYAYHQDRM RRRAAGEAPA
APQVTAEDAS ASLAELLNAG LGGSDNE