Gene EcSMS35_0124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0124 
SymbolaceE 
ID6142768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp133439 
End bp136102 
Gene Length2664 bp 
Protein Length887 aa 
Translation table11 
GC content53% 
IMG OID641615025 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_001742241 
Protein GI170680617 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.594782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.632458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAAC GTTTCCCAAA TGACGTGGAT CCGATCGAAA CTCGCGACTG GCTCCAGGCG 
ATCGAATCGG TCATCCGTGA AGAAGGTGTT GAGCGTGCTC AGTATCTGAT CGACCAACTG
CTTGCTGAAG CCCGCAAAGG CGGTGTCAAC GTAGCCGCAG GCACAGGTAT CAGCAACTAC
ATCAACACCA TCCCCGTTGA AGAACAACCG GAGTATCCGG GTAATCTGGA ACTGGAACGC
CGTATTCGTT CAGCTATCCG CTGGAACGCC ATCATGACGG TTCTGCGTGC GTCGAAAAAA
GACCTCGAAC TGGGCGGCCA CATGGCGTCC TTCCAGTCTT CCGCAACCAT TTATGATGTG
TGCTTTAACC ACTTCTTCCG TGCACGCAAC GAGCAGGATG GCGGCGACCT GGTTTACTTC
CAGGGCCACA TCTCCCCGGG CGTTTACGCA CGTGCTTTCC TGGAAGGTCG TCTGACTCAG
GAGCAGCTGG ATAACTTCCG TCAGGAAGTT CACGGCAATG GCCTCTCTTC CTATCCGCAC
CCGAAACTGA TGCCGGAATT CTGGCAGTTC CCGACCGTAT CTATGGGTCT GGGTCCGATT
GGTGCTATTT ACCAGGCTAA GTTCCTGAAA TATCTGGAAC ACCGTGGCCT GAAAGATACC
TCTAAACAGA CCGTTTACGC ATTCCTCGGC GACGGTGAAA TGGACGAACC GGAATCCAAA
GGTGCGATCA CCATCGCAAC CCGTGAAAAA CTGGATAACC TGGTCTTCGT TATCAACTGT
AACCTGCAAC GTCTTGACGG CCCGGTCACC GGTAACGGCA AGATCATCAA CGAACTGGAA
GGCATCTTCG AAGGTGCTGG CTGGAACGTG ATCAAAGTGA TGTGGGGTAG CCGTTGGGAT
GAACTGCTGC GTAAAGATAC CAGCGGTAAA CTGATCCAGC TGATGAACGA AACCGTTGAC
GGCGACTACC AGACCTTCAA ATCGAAAGAT GGTGCGTACG TTCGTGAACA CTTCTTCGGT
AAATATCCTG AAACCGCAGC ACTGGTTGCA GACTGGACTG ACGAGCAGAT CTGGGCACTG
AACCGTGGCG GTCACGATCC GAAGAAAATC TACGCTGCAT TCAAGAAAGC GCAGGAAACC
AAAGGCAAAG CGACAGTAAT CCTTGCTCAT ACCATTAAAG GTTACGGCAT GGGCGACGCG
GCTGAAGGTA AAAACATCGC GCACCAGGTT AAGAAAATGA ACATGGACGG CGTGCGTCAC
ATCCGCGACC GTTTCAATGT GCCGGTGTCT GATGCCGATA TCGAAAAACT GCCGTACATC
ACCTTCCCGG AAGGTTCTGA AGAGCATACC TATCTGCACG CGCAGCGTCA GAAACTGCAC
GGTTATCTGC CAAGCCGTCA GCCGAACTTC ACCGAGAAGC TTGAGCTGCC GAGCCTGCAA
GATTTCGGCG CGCTGCTGGA AGAGCAGAGC AAAGAGATCT CTACCACTAT CGCTTTCGTT
CGTGCTCTGA ACGTGATGCT GAAGAACAAG TCGATCAAAG ACCGACTGGT GCCGATCATC
GCCGACGAAG CGCGTACTTT CGGTATGGAA GGTCTGTTCC GTCAGATTGG TATTTACAGC
CCGAACGGTC AGCAGTACAC CCCGCAGGAC CGCGAGCAGG TTGCTTACTA TAAAGAAGAC
GAGAAAGGTC AGATTCTGCA GGAAGGGATC AACGAGCTGG GCGCAGGTTG TTCCTGGCTG
GCAGCGGCGA CCTCTTACAG CACCAACAAT CTGCCGATGA TTCCGTTCTA CATCTATTAC
TCGATGTTCG GCTTCCAGCG TATCGGCGAT CTGTGCTGGG CGGCTGGTGA CCAGCAAGCG
CGTGGCTTCC TGATCGGCGG TACTTCCGGT CGTACCACCC TGAACGGCGA AGGTCTGCAG
CACGAAGATG GTCACAGCCA CATTCAGTCG CTGACTATCC CGAACTGTAT CTCTTACGAC
CCGGCTTACG CTTACGAAGT TGCTGTCATC ATGCATGACG GTCTGGAGCG TATGTACGGT
GAAAAACAAG AGAACGTTTA CTACTACATC ACCACGCTGA ACGAAAACTA CCACATGCCG
GCAATGCCCG AAGGTGCTGA GGAAGGTATC CGTAAAGGTA TCTACAAACT CGAAACCATT
GAAGGTAGCA AAGGTAAAGT TCAGCTGCTC GGCTCCGGTT CTATCCTGCG TCACGTCCGT
GAAGCAGCTG AGATCCTGGC GAAAGATTAC GGCGTAGGTT CTGACGTTTA TAGCGTGACA
TCCTTCACCG AACTGGCGCG TGATGGTCAG GATTGTGAAC GCTGGAACAT GCTGCACCCG
CTGGAAACTC CGCGCGTTCC GTATATCGCT CAGGTGATGA ATGACGCTCC GGCAGTGGCA
TCTACCGACT ATATGAAACT GTTCGCTGAG CAGGTCCGTA CTTACGTACC GGCTGACGAC
TACCGCGTAC TGGGTACTGA TGGCTTCGGT CGTTCCGACA GCCGTGAGAA CCTGCGTCAC
CACTTCGAAG TTGATGCTTC TTATGTCGTG GTTGCGGCGC TGGGCGAACT GGCTAAACGT
GGCGAAATCG ATAAGAAAGT GGTTGCTGAC GCAATCGCCA AATTCAACAT CGATGCAGAT
AAAGTTAACC CGCGTCTGGC GTAA
 
Protein sequence
MSERFPNDVD PIETRDWLQA IESVIREEGV ERAQYLIDQL LAEARKGGVN VAAGTGISNY 
INTIPVEEQP EYPGNLELER RIRSAIRWNA IMTVLRASKK DLELGGHMAS FQSSATIYDV
CFNHFFRARN EQDGGDLVYF QGHISPGVYA RAFLEGRLTQ EQLDNFRQEV HGNGLSSYPH
PKLMPEFWQF PTVSMGLGPI GAIYQAKFLK YLEHRGLKDT SKQTVYAFLG DGEMDEPESK
GAITIATREK LDNLVFVINC NLQRLDGPVT GNGKIINELE GIFEGAGWNV IKVMWGSRWD
ELLRKDTSGK LIQLMNETVD GDYQTFKSKD GAYVREHFFG KYPETAALVA DWTDEQIWAL
NRGGHDPKKI YAAFKKAQET KGKATVILAH TIKGYGMGDA AEGKNIAHQV KKMNMDGVRH
IRDRFNVPVS DADIEKLPYI TFPEGSEEHT YLHAQRQKLH GYLPSRQPNF TEKLELPSLQ
DFGALLEEQS KEISTTIAFV RALNVMLKNK SIKDRLVPII ADEARTFGME GLFRQIGIYS
PNGQQYTPQD REQVAYYKED EKGQILQEGI NELGAGCSWL AAATSYSTNN LPMIPFYIYY
SMFGFQRIGD LCWAAGDQQA RGFLIGGTSG RTTLNGEGLQ HEDGHSHIQS LTIPNCISYD
PAYAYEVAVI MHDGLERMYG EKQENVYYYI TTLNENYHMP AMPEGAEEGI RKGIYKLETI
EGSKGKVQLL GSGSILRHVR EAAEILAKDY GVGSDVYSVT SFTELARDGQ DCERWNMLHP
LETPRVPYIA QVMNDAPAVA STDYMKLFAE QVRTYVPADD YRVLGTDGFG RSDSRENLRH
HFEVDASYVV VAALGELAKR GEIDKKVVAD AIAKFNIDAD KVNPRLA