Gene EcSMS35_4436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4436 
SymbolrpoC 
ID6144315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4530801 
End bp4535024 
Gene Length4224 bp 
Protein Length1407 aa 
Translation table11 
GC content54% 
IMG OID641619256 
ProductDNA-directed RNA polymerase subunit beta' 
Protein accessionYP_001746372 
Protein GI170682674 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0000514009 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAAGATT TATTAAAGTT TCTGAAAGCG CAGACTAAAA CCGAAGAGTT TGATGCGATC 
AAAATTGCTC TGGCTTCGCC AGACATGATC CGTTCATGGT CTTTCGGTGA AGTTAAAAAG
CCGGAAACCA TCAACTACCG TACGTTCAAA CCAGAACGTG ACGGCCTTTT CTGCGCCCGT
ATCTTTGGGC CGGTAAAAGA TTACGAGTGC CTGTGCGGTA AGTACAAGCG CCTGAAACAC
CGTGGCGTCA TCTGTGAGAA GTGCGGTGTT GAAGTGACCC AGACTAAAGT ACGCCGTGAG
CGTATGGGCC ACATCGAACT GGCTTCCCCG ACTGCGCACA TCTGGTTCCT GAAATCGCTG
CCGTCCCGTA TCGGCCTGCT GCTCGATATG CCGCTGCGCG ATATCGAACG CGTACTGTAC
TTCGAATCCT ATGTGGTTAT CGAAGGCGGT ATGACCAACC TGGAGCGTCA GCAGATCCTG
ACTGAAGAGC AGTATCTGGA CGCGCTGGAA GAGTTCGGTG ACGAATTCGA CGCGAAGATG
GGTGCGGAAG CAATCCAGGC CCTGCTGAAG AGCATGGATC TGGAGCAAGA GTGCGAACAG
CTGCGTGAAG AGCTGAACGA AACCAACTCC GAAACCAAGC GTAAAAAGCT GACCAAGCGT
ATCAAACTGC TGGAAGCGTT CGTTCAGTCT GGTAACAAAC CAGAGTGGAT GATCCTGACC
GTTCTGCCGG TACTGCCGCC AGATCTGCGT CCGCTGGTTC CGCTGGATGG TGGTCGTTTC
GCGACTTCTG ACCTGAACGA TCTGTATCGT CGCGTCATTA ACCGTAACAA CCGTCTGAAA
CGTCTGCTGG ATCTGGCTGC GCCGGACATC ATCGTACGTA ACGAAAAACG TATGCTGCAG
GAAGCGGTAG ACGCCCTGCT GGATAACGGT CGTCGCGGTC GTGCAATCAC CGGTTCTAAC
AAGCGTCCTC TGAAATCTTT GGCCGACATG ATCAAAGGTA AACAGGGTCG TTTCCGTCAG
AACCTGCTCG GTAAGCGTGT TGACTACTCC GGTCGTTCTG TAATCACCGT AGGTCCATAC
CTGCGTCTGC ATCAGTGCGG TCTGCCGAAG AAAATGGCAC TGGAGCTGTT CAAACCGTTC
ATCTACGGCA AGCTGGAACT GCGTGGTCTT GCTACCACCA TTAAAGCTGC GAAGAAAATG
GTTGAGCGCG AAGAAGCTGT CGTTTGGGAT ATCCTGGACG AAGTTATCCG CGAACACCCG
GTACTGCTGA ACCGTGCACC GACTCTGCAC CGTCTGGGTA TCCAGGCATT TGAACCGGTA
CTGATCGAAG GTAAAGCTAT CCAGCTGCAC CCGCTGGTTT GTGCGGCATA TAACGCCGAC
TTCGATGGTG ACCAGATGGC TGTTCACGTA CCGCTGACGC TGGAAGCCCA GCTGGAAGCG
CGTGCGCTGA TGATGTCTAC CAACAACATC CTGTCCCCGG CAAACGGCGA ACCAATCATC
GTTCCGTCTC AGGACGTTGT ACTGGGTCTG TACTACATGA CCCGCGACTG TGTTAACGCC
AAAGGCGAAG GCATGGTGCT GACTGGCCCG AAAGAAGCAG AACGTCTGTA TCGCTCTGGT
CTGGCTTCTC TGCATGCGCG CGTTAAAGTG CGTATCACCG AGTATGAAAA AGATGCTAAC
GGTGAATTAG TGGCGAAAAC CAGCCTGAAA GACACGACTG TTGGCCGTGC CATTCTGTGG
ATGATTGTAC CGAAAGGTCT GCCTTACACC ATCGTCAACC AGGCGCTGGG TAAAAAAGCA
ATCTCCAAAA TGCTGAACAC CTGCTACCGC ATTCTCGGTC TGAAACCGAC CGTTATTTTT
GCGGACCAGA TCATGTATAC CGGCTTTGCC TATGCAGCGC GTTCTGGTGC ATCTGTTGGT
ATCGATGACA TGGTCATCCC GGAGAAGAAA CACGAAATCA TCTCCGAGGC AGAAGCAGAA
GTTGCTGAAA TTCAGGAGCA GTTCCAGTCT GGTCTGGTAA CTGCGGGCGA ACGCTACAAC
AAAGTTATCG ATATCTGGGC TGCGGCGAAC GATCGTGTAT CCAAAGCGAT GATGGATAAC
CTGCAAACTG AAACCGTTAT TAACCGTGAC GGTCAGGAAG AGAAGCAGGT TTCCTTCAAC
AGCATCTACA TGATGGCCGA CTCCGGTGCG CGTGGTTCTG CGGCACAGAT TCGTCAGCTG
GCGGGTATGC GTGGTCTGAT GGCGAAGCCG GATGGCTCCA TCATCGAAAC GCCAATCACC
GCGAACTTCC GTGAAGGTCT GAACGTACTC CAGTACTTCA TCTCCACCCA CGGTGCTCGT
AAAGGTCTGG CGGATACCGC ACTGAAAACG GCGAACTCCG GTTACCTGAC TCGTCGTCTG
GTTGACGTGG CGCAGGACCT GGTGGTTACC GAAGACGATT GTGGTACCCA TGAAGGTATC
ATGATGACTC CGGTTATCGA GGGTGGTGAC GTTAAAGAGC CGCTGCGCGA TCGCGTATTG
GGTCGTGTAA CTGCTGAAGA CGTTCTGAAG CCGGGTACTG CTGATATCCT CGTTCCGCGC
AACACGCTGC TGCACGAACA GTGGTGTGAC CTGCTGGAAG AGAACTCTGT CGACGCGGTT
AAAGTACGTT CTGTTGTATC TTGTGACACC GACTTTGGTG TATGTGCGCA CTGCTACGGT
CGTGACCTGG CGCGTGGCCA CCTCATCAAC AAAGGTGAAG CAATCGGTGT AATCGCGGCA
CAGTCCATCG GTGAACCGGG TACACAGCTG ACCATGCGTA CGTTCCACAT CGGTGGTGCG
GCATCTCGTG CGGCTGCTGA ATCCAGCATC CAGGTGAAAA ACAAAGGTAG CATCAAGCTC
AGCAACGTGA AGTCGGTTGT GAACTCCAGC GGTAAACTGG TTATCACTTC CCGTAACACC
GAACTGAAAC TGATCGACGA ATTCGGTCGT ACCAAAGAAA GCTACAAAGT ACCTTACGGT
GCGGTACTGG CGAAAGGCGA TGGCGAACAG GTTGCTGGCG GCGAAACCGT TGCAAACTGG
GACCCGCACA CCATGCCGGT TATCACCGAA GTAAGCGGTT TTGTACGCTT TACTGACATG
ATCGACGGCC AGACCATTAC TCGTCAGACC GACGAATTGA CCGGTCTGTC TTCGCTGGTG
GTTCTGGATT CCGCAGAACG TACCGCAGGT GGTAAAGATC TGCGTCCGGC ACTGAAAATC
GTTGATGCTC AGGGTAACGA CGTTCTGATC CCGGGTACCG ATATGCCTGC GCAGTACTTC
CTGCCGGGTA AAGCGATTGT TCAGCTGGAA GATGGCGTAC AGATCAGCTC TGGTGACACC
CTGGCGCGTA TTCCGCAGGA ATCCGGCGGT ACCAAGGACA TCACCGGTGG TCTGCCGCGC
GTTGCGGACC TGTTCGAAGC ACGTCGTCCG AAAGAGCCGG CAATCCTGGC TGAAATCAGC
GGTATCGTTT CCTTCGGTAA AGAAACCAAA GGTAAACGTC GTCTGGTTAT CACCCCGGTA
GACGGTAGCG ATCCGTACGA AGAGATGATT CCGAAATGGC GTCAGCTCAA CGTGTTCGAA
GGTGAACGTG TAGAACGTGG TGACGTAATT TCCGACGGTC CGGAAGCGCC GCACGACATT
CTGCGTCTGC GTGGTGTTCA TGCTGTTACT CGTTACATCG TTAACGAAGT ACAGGACGTA
TACCGTCTGC AGGGCGTTAA GATTAACGAT AAACACATCG AAGTTATCGT TCGTCAGATG
CTGCGTAAAG CTACCATCGT TAACGCGGGC AGCTCCGACT TCCTGGAAGG TGAACAGGTT
GAATACTCTC GCGTCAAGAT CGCAAACCGC GAACTGGAAG CGAACGGCAA AGTGGGGGCA
ACTTACTCCC GCGATCTGCT GGGTATCACC AAAGCGTCTC TGGCAACCGA GTCCTTCATC
TCCGCGGCAT CGTTCCAGGA GACCACTCGC GTGCTGACCG AAGCAGCCGT TGCGGGCAAA
CGCGACGAAT TGCGCGGCCT GAAAGAGAAC GTTATCGTGG GTCGTCTGAT CCCGGCAGGT
ACCGGTTACG CGTACCACCA GGATCGTATG CGTCGCCGTG CTGCGGGTGA AGCTCCGGCT
GCACCGCAGG TGACTGCAGA AGACGCATCT GCCAGCCTGG CAGAACTGCT GAACGCAGGT
CTGGGCGGTT CTGATAACGA GTAA
 
Protein sequence
MKDLLKFLKA QTKTEEFDAI KIALASPDMI RSWSFGEVKK PETINYRTFK PERDGLFCAR 
IFGPVKDYEC LCGKYKRLKH RGVICEKCGV EVTQTKVRRE RMGHIELASP TAHIWFLKSL
PSRIGLLLDM PLRDIERVLY FESYVVIEGG MTNLERQQIL TEEQYLDALE EFGDEFDAKM
GAEAIQALLK SMDLEQECEQ LREELNETNS ETKRKKLTKR IKLLEAFVQS GNKPEWMILT
VLPVLPPDLR PLVPLDGGRF ATSDLNDLYR RVINRNNRLK RLLDLAAPDI IVRNEKRMLQ
EAVDALLDNG RRGRAITGSN KRPLKSLADM IKGKQGRFRQ NLLGKRVDYS GRSVITVGPY
LRLHQCGLPK KMALELFKPF IYGKLELRGL ATTIKAAKKM VEREEAVVWD ILDEVIREHP
VLLNRAPTLH RLGIQAFEPV LIEGKAIQLH PLVCAAYNAD FDGDQMAVHV PLTLEAQLEA
RALMMSTNNI LSPANGEPII VPSQDVVLGL YYMTRDCVNA KGEGMVLTGP KEAERLYRSG
LASLHARVKV RITEYEKDAN GELVAKTSLK DTTVGRAILW MIVPKGLPYT IVNQALGKKA
ISKMLNTCYR ILGLKPTVIF ADQIMYTGFA YAARSGASVG IDDMVIPEKK HEIISEAEAE
VAEIQEQFQS GLVTAGERYN KVIDIWAAAN DRVSKAMMDN LQTETVINRD GQEEKQVSFN
SIYMMADSGA RGSAAQIRQL AGMRGLMAKP DGSIIETPIT ANFREGLNVL QYFISTHGAR
KGLADTALKT ANSGYLTRRL VDVAQDLVVT EDDCGTHEGI MMTPVIEGGD VKEPLRDRVL
GRVTAEDVLK PGTADILVPR NTLLHEQWCD LLEENSVDAV KVRSVVSCDT DFGVCAHCYG
RDLARGHLIN KGEAIGVIAA QSIGEPGTQL TMRTFHIGGA ASRAAAESSI QVKNKGSIKL
SNVKSVVNSS GKLVITSRNT ELKLIDEFGR TKESYKVPYG AVLAKGDGEQ VAGGETVANW
DPHTMPVITE VSGFVRFTDM IDGQTITRQT DELTGLSSLV VLDSAERTAG GKDLRPALKI
VDAQGNDVLI PGTDMPAQYF LPGKAIVQLE DGVQISSGDT LARIPQESGG TKDITGGLPR
VADLFEARRP KEPAILAEIS GIVSFGKETK GKRRLVITPV DGSDPYEEMI PKWRQLNVFE
GERVERGDVI SDGPEAPHDI LRLRGVHAVT RYIVNEVQDV YRLQGVKIND KHIEVIVRQM
LRKATIVNAG SSDFLEGEQV EYSRVKIANR ELEANGKVGA TYSRDLLGIT KASLATESFI
SAASFQETTR VLTEAAVAGK RDELRGLKEN VIVGRLIPAG TGYAYHQDRM RRRAAGEAPA
APQVTAEDAS ASLAELLNAG LGGSDNE