Gene Information |
Locus tag | EcSMS35_0363 |
Symbol | prpB |
ID | 6143411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 373606 |
End bp | 374496 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641615259 |
Product | 2-methylisocitrate lyase |
Protein accession | YP_001742466 |
Protein GI | 170679897 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2513] PEP phosphonomutase and related enzymes |
TIGRFAM ID | [TIGR02317] methylisocitrate lyase |
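The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported 891 bp) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(373606, 374496)
print(length)                  # 891
print(protein_length(length))  # 296
```

Both values agree with the Gene Length and Protein Length fields of this record.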
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTAC ACTCTCCAGG TAAAGCGTTT CGCGCTGCAC TTAGCAAAGA AACCCCGTTG CAAATTGTTG GCACCATCAA CGCTAACCAT GCGCTGCTGG CGCAGCGTGC CGGATATCAG GCGATTTATC TCTCCGGCGG TGGCGTGGCG GCAGGATCGC TGGGGCTGCC CGATCTCGGT ATTTCTACTC TTGATGATGT GCTGACTGAC ATTCGCCGTA TTACCGACGT TTGTTCGCTG CCGCTGCTGG TGGATGCGGA TATCGGTTTT GGTTCTTCGG CCTTTAACGT GGCGCGCACC GTGAAATCGA TGATTAAAGC CGGTGCGGCA GGATTGCATA TTGAAGATCA GGTTGGTGCG AAACGCTGCG GTCATCGTCC GAACAAAGCG ATCGTCTCGA AAGAGGAGAT GGTGGATCGG ATCCGCGCGG CGGTGGATGC GAAAACCGAT CCTGATTTTG TGATCATGGC GCGCACCGAT GCGCTGGCGG TAGAGGGGCT GGACGCCGCG ATCGAGCGCG CGCAGGCCTA TGTTGAAGCG GGTGCCGAGA TGTTGTTCCC GGAGGCGATT ACCGAACTCG CCATGTACCG CCAGTTTGCT GATGCGGTGC AGGTGCCAAT CCTCGCCAAC ATTACCGAAT TTGGCGCAAC ACCGCTGTTT ACCACCGACG AATTACGCAG CGCCCATGTC GCAATGGCGC TGTACCCACT TTCAGCGTTC CGTGCCATGA ACCGCGCCGC CGAACATGTC TACAACGTCC TGCGTCAGGA AGGCACGCAG AAAAGCGTCA TCGACACCAT GCAGACCCGC AACGAGCTGT ACGAAAGCAT CAACTACTAC CAGTACGAAG AGAAGCTCGA CGACCTGTTT GCCCGTAACC AGGCGAAATA A
|
Protein sequence | MSLHSPGKAF RAALSKETPL QIVGTINANH ALLAQRAGYQ AIYLSGGGVA AGSLGLPDLG ISTLDDVLTD IRRITDVCSL PLLVDADIGF GSSAFNVART VKSMIKAGAA GLHIEDQVGA KRCGHRPNKA IVSKEEMVDR IRAAVDAKTD PDFVIMARTD ALAVEGLDAA IERAQAYVEA GAEMLFPEAI TELAMYRQFA DAVQVPILAN ITEFGATPLF TTDELRSAHV AMALYPLSAF RAMNRAAEHV YNVLRQEGTQ KSVIDTMQTR NELYESINYY QYEEKLDDLF ARNQAK
|
| |
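The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It assumes only the Python standard library; the helper names are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, so the sketch does not handle alternative starts such as GTG or TTG (this gene begins with ATG).

```python
from itertools import product

# Codon assignments in NCBI order (T, C, A, G); '*' marks a stop codon.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "ATGTCTCTAC ACTCTCCAGG TAAAGCGTTT"
print(translate(prefix))  # MSLHSPGKAF
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.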