Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0404 |
Symbol | prpC |
ID | 6971924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 410307 |
End bp | 411476 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384456 |
Product | methylcitrate synthase |
Protein accession | YP_002268970 |
Protein GI | 209395959 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA CAACGATCCT GCAAAACAGT ACCCATGTCA TTAAACCGAA AAAATCTGTG GCGCTTTCTG GCGTTCCGGC GGGCAATACG GCGCTCTGCA CCGTGGGTAA AAGTGGCAAT GACTTGCATT ACCGTGGCTA CGATATTCTT GATCTGGCGG AACATTGCGA ATTTGAAGAA GTGGCGCATC TGCTGATCCA CGGCAAACTG CCGACCCGTG ACGAACTCGC CGCCTACAAA ACGAAACTGA AAGCCCTGCG CGGTTTACCG GCTAACGTGC GTACCGTGCT GGAAGCCTTA CCGGCGGCGT CACACCCGAT GGATGTTATG CGCACCGGCG TTTCCGCGCT CGGCTGCACG CTGCCAGAAA AAGAGGGGCA TACCGTCTCT GGCGCGCGGG ATATTGCCGA CAAACTGCTG GCGTCACTTA GCTCGATTCT TCTCTACTGG TATCACTACA GCCACAACGG CGAACGCATC CAGCCGGAAA CCGATGACGA CTCCATCGGT GGTCACTTCC TGCATCTGCT GCACGGCGAA AAGCCGTCGC AAAGCTGGGA AAAGGCGATG CATATTTCGC TGGTGCTGTA CGCCGAACAC GAGTTTAACG CCTCCACCTT TACCAGCCGG GTGATTGCGG GCACTGGCTC TGATATGTAT TCCGCGATTA TTGGCGCGAT TGGTGCACTG CGCGGGCCGA AGCACGGCGG GGCGAATGAA GTGTCGCTGG AGATCCAGCA ACGCTACGAA ACGCCGGACG AAGCCGAAGC CGATATCCGC AAGCGGGTGG AAAGCAAAGA AGTGGTCATT GGTTTTGGTC ATCCGGTTTA TACCATCGCC GACCCGCGCC ACCAGGTGAT TAAACGTGTG GCGAAGCAGC TCTCGCAGGA AGGCGGCTCG CTGAAGATGT ACAACATCGC CGATCGCCTG GAAACGGTGA TGTGGGAGAG CAAAAAGATG TTCCCCAATC TCGACTGGTT CTCCGCTGTT TCCTACAACA TGATGGGCGT TCCCACCGAG ATGTTCACAC CACTGTTTGT TATCGCCCGC GTCACCGGCT GGGCGGCGCA CATTATCGAA CAACGTCAGG ACAACAAAAT TATCCGTCCT TCCGCCAATT ATGTTGGACC GGAAGACCGC CAGTTTGTCG CGCTGGATAA GCGCCAGTAA
|
Protein sequence | MSDTTILQNS THVIKPKKSV ALSGVPAGNT ALCTVGKSGN DLHYRGYDIL DLAEHCEFEE VAHLLIHGKL PTRDELAAYK TKLKALRGLP ANVRTVLEAL PAASHPMDVM RTGVSALGCT LPEKEGHTVS GARDIADKLL ASLSSILLYW YHYSHNGERI QPETDDDSIG GHFLHLLHGE KPSQSWEKAM HISLVLYAEH EFNASTFTSR VIAGTGSDMY SAIIGAIGAL RGPKHGGANE VSLEIQQRYE TPDEAEADIR KRVESKEVVI GFGHPVYTIA DPRHQVIKRV AKQLSQEGGS LKMYNIADRL ETVMWESKKM FPNLDWFSAV SYNMMGVPTE MFTPLFVIAR VTGWAAHIIE QRQDNKIIRP SANYVGPEDR QFVALDKRQ
|
| |