Gene EcSMS35_4403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4403 
Symbolppc 
ID6145036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4496566 
End bp4499217 
Gene Length2652 bp 
Protein Length883 aa 
Translation table11 
GC content56% 
IMG OID641619224 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_001746348 
Protein GI170683041 
COG category[C] Energy production and conversion 
COG ID[COG2352] Phosphoenolpyruvate carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.494951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAC AATATTCCGC ATTGCGTAGT AATGTCAGTA TGCTCGGCAA AGTGCTGGGA 
GAAACCATCA AGGATGCGTT AGGAGAACAC ATTCTTGAAC GCGTAGAAAC TATCCGTAAG
TTGTCCAAAT CTTCACGCGC TGGCAATGAT GCTAACCGCC AGGAGTTGCT CACCACCTTA
CAAAATTTGT CGAACGACGA GCTGCTGCCC GTTGCGCGTG CGTTTAGTCA GTTTCTGAAC
CTGGCCAACA CCGCCGAGCA ATACCACAGC ATTTCGCCGA AAGGCGAAGC TGCCAGCAAC
CCGGAAGTGA TCGCCCGCAC CCTGCGCAAA CTGAAAAACC AGCCGGAACT GAGCGAAGAC
ACCATAAAAA AAGCAGTGGA ATCGCTGTCG CTGGAACTGG TCCTCACGGC TCACCCAACC
GAAATTACCC GTCGTACACT GATCCACAAA ATGGTGGAAG TGAACGCCTG CTTAAAACAG
CTCGATAACA AAGATATCGC TGACTACGAA CACAACCAGC TGATGCGCCG CCTGCGCCAG
TTGATCGCCC AGTCATGGCA TACCGATGAA ATCCGTAAGC TGCGTCCAAG CCCGGTAGAT
GAAGCCAAAT GGGGCTTTGC CGTAGTAGAA AACAGCCTGT GGCAAGGCGT ACCGAATTAC
CTGCGCGAAC TGAACGAACA ACTGGAAGAG AATCTCGGCT ACAAACTGCC CGTCGAATTT
GTTCCGGTCC GTTTTACCTC GTGGATGGGC GGTGACCGCG ACGGCAACCC GAACGTCACT
GCCGATATCA CCCGCCACGT CCTGCTGCTC AGTCGCTGGA AAGCCACCGA TCTGTTCCTG
AAAGATATTC AGGTGCTGGT TTCTGAACTG TCGATGGTTG AAGCGACCCC TGAACTGCTG
GCGCTGGTTG GCGAAGAAGG TGCCGCAGAA CCGTATCGCT ATCTGATGAA AAACCTGCGT
TCTCGCCTGA TGGCGACACA GGCATGGCTG GAAGCGCGCC TGAAAGGCGA AGAACTGCCA
AAACCAGAAG GCCTGCTGAC ACAAAACGAA GAACTGTGGG AACCGCTCTA CGCTTGCTAC
CAGTCACTTC AGGCGTGTGG CATGGGTATT ATCGCCAACG GCGATCTGCT CGACACCCTG
CGCCGCGTGA AATGTTTCGG CGTACCGCTG GTCCGTATTG ATATCCGTCA GGAGAGCACC
CGCCATACCG AAGCGCTGGG CGAGCTGACC CGCTACCTCG GTATCGGCGA CTACGAAAGC
TGGTCAGAGG CCGACAAACA GGCGTTCCTG ATCCGCGAAC TGAACTCCAA ACGTCCGCTT
CTGCCGCGCA ACTGGCAACC AAGCGCCGAA ACGCGCGAAG TGCTCGATAC CTGCCAGGTG
ATTGCCGAAG CACCGCAAGG TTCCATTGCC GCCTACGTGA TCTCGATGGC AAAAACGCCG
TCCGACGTAC TGGCTGTCCA CCTGCTGCTG AAAGAAGCGG GTATCGGGTT TGCAATGCCG
GTTGCTCCGC TGTTTGAAAC CCTCGATGAC CTGAACAACG CCAACGATGT CATGACCCAG
CTGCTCAATA TCGACTGGTA TCGCGGCCTG ATTCAGGGCA AACAGATGGT GATGATTGGC
TATTCCGACT CAGCAAAAGA TGCGGGCGTG ATGGCAGCTT CCTGGGCGCA ATATCAGGCA
CAGGATGCAT TAATCAAAAC CTGCGAAAAA GCGGGTATTG AGCTGACGTT GTTCCACGGT
CGCGGCGGTT CCATTGGTCG TGGCGGCGCA CCTGCTCATG CGGCGCTGCT GTCACAACCG
CCAGGTAGCC TGAAAGGTGG CCTGCGCGTG ACTGAACAGG GCGAGATGAT CCGCTTTAAA
TATGGTCTGC CAGAAATCAC CGTCAGCAGC CTGTCGCTGT ACACCGGGGC GATTCTGGAA
GCCAACCTGC TGCCACCGCC TGAGCCGAAA GAGAGCTGGC GTCGCATTAT GGATGAACTG
TCAGTCATCT CCTGCGATCT CTACCGTGGC TACGTACGTG AAAACAAAGA TTTTGTGCCT
TACTTCCGCT CCGCTACGCC GGAACAAGAA TTGGGCAAAC TGCCGTTGGG TTCACGTCCG
GCGAAACGTC GCCCAACCGG CGGCGTCGAG TCGCTGCGCG CCATTCCGTG GATTTTTGCC
TGGACGCAAA ACCGCCTGAT GCTCCCCGCC TGGCTGGGTG CAGGTACGGC GCTGCAAAAA
GTGGTCGAAG ATGGCAAGCA GAGCGAACTG GAAGCCATGT GCCGCGATTG GCCATTCTTC
TCGACGCGTC TCGGCATGCT GGAGATGGTC TTCGCCAAAG CAGACCTGTG GCTGGCGGAA
TACTATGATC AACGTCTGGT AGACAAAGCA CTGTGGCCGT TAGGTAAAGA GTTACGCAAT
CTGCAAGAAG AAGACATCAA AGTGGTGCTG GCGATTGCCA ACGATTCCCA TCTGATGGCC
GATCTGCCGT GGATTGCAGA GTCTATTCAG CTACGGAATA TTTACACCGA CCCGCTGAAC
GTATTGCAGG CCGAGTTGCT GCACCGCTCC CGCCAGGCAG AAAAAGAAGG CCATGAGCCG
GATCCTCGCG TCGAGCAGGC GTTAATGGTC ACTATTGCCG GGATTGCGGC AGGTATGCGT
AATACCGGCT AA
 
Protein sequence
MNEQYSALRS NVSMLGKVLG ETIKDALGEH ILERVETIRK LSKSSRAGND ANRQELLTTL 
QNLSNDELLP VARAFSQFLN LANTAEQYHS ISPKGEAASN PEVIARTLRK LKNQPELSED
TIKKAVESLS LELVLTAHPT EITRRTLIHK MVEVNACLKQ LDNKDIADYE HNQLMRRLRQ
LIAQSWHTDE IRKLRPSPVD EAKWGFAVVE NSLWQGVPNY LRELNEQLEE NLGYKLPVEF
VPVRFTSWMG GDRDGNPNVT ADITRHVLLL SRWKATDLFL KDIQVLVSEL SMVEATPELL
ALVGEEGAAE PYRYLMKNLR SRLMATQAWL EARLKGEELP KPEGLLTQNE ELWEPLYACY
QSLQACGMGI IANGDLLDTL RRVKCFGVPL VRIDIRQEST RHTEALGELT RYLGIGDYES
WSEADKQAFL IRELNSKRPL LPRNWQPSAE TREVLDTCQV IAEAPQGSIA AYVISMAKTP
SDVLAVHLLL KEAGIGFAMP VAPLFETLDD LNNANDVMTQ LLNIDWYRGL IQGKQMVMIG
YSDSAKDAGV MAASWAQYQA QDALIKTCEK AGIELTLFHG RGGSIGRGGA PAHAALLSQP
PGSLKGGLRV TEQGEMIRFK YGLPEITVSS LSLYTGAILE ANLLPPPEPK ESWRRIMDEL
SVISCDLYRG YVRENKDFVP YFRSATPEQE LGKLPLGSRP AKRRPTGGVE SLRAIPWIFA
WTQNRLMLPA WLGAGTALQK VVEDGKQSEL EAMCRDWPFF STRLGMLEMV FAKADLWLAE
YYDQRLVDKA LWPLGKELRN LQEEDIKVVL AIANDSHLMA DLPWIAESIQ LRNIYTDPLN
VLQAELLHRS RQAEKEGHEP DPRVEQALMV TIAGIAAGMR NTG