Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0398 |
Symbol | prpC |
ID | 5591599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 415608 |
End bp | 416777 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640919583 |
Product | methylcitrate synthase |
Protein accession | YP_001457168 |
Protein GI | 157159850 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACA CAACGATCCT GCAAAACAGT ACCCATGTCA TTAAACCGAA AAAATCGGTG GCACTTTCCG GCGTTCCGGC GGGCAATACG GCGCTCTGCA CCGTGGGTAA AAGTGGCAAT GACCTGCATT ACCGTGGCTA CGATATTCTT GATCTGGCGG AACATTGCGA ATTTGAAGAA GTGGCTCATC TGCTGATCCA CGGCAAACTG CCGACCCGTG ACGAACTCGC CGCTTACAAA ACGAAACTGA AAGCCCTGCG TGGTTTACCG GCTAACGTGC GTACCGTGCT GGAAGCCTTA CCGGCGGCGT CACACCCGAT GGATGTTATG CGCACCGGTG TTTCCGCGCT CGGCTGCACG CTGCCAGAAA AAGAGGGGCA TACCGTCTCT GGCGCGCGGG ATATTGCCGA CAAACTGCTG GCGTCGCTTA GCTCGATTCT CCTTTATTGG TATCACTACA GCCACAACGG CGAACGCATC CAACCGGAAA CCGATGACGA CTCCATCGGC GGTCACTTCC TGCATCTGCT GCACGGCGAA AAGCCATCGC AAAGCTGGGA AAAGGCGATG CATATCTCGC TGGTGCTGTA CGCCGAACAC GAGTTTAACG CCTCCACCTT TACCAGTCGG GTGATTGCGG GCACCGGCTC TGATATGTAT TCCGCGATTA TTGGCGCGAT TGGCGCACTG CGCGGGCCAA AACACGGCGG GGCGAATGAA GTGTCGCTGG AGATCCAGCA ACGCTACGAA ACGCCGGACG AAGCCGAAGC AGATATCCGC AAGCGCGTGG AAAACAAAGA AGTGGTCATT GGTTTTGGTC ATCCGGTTTA CACCATCGCT GACCCGCGCC ACCAGGTGAT TAAACGTGTG GCGAAGCAGC TCTCGCAGGA AGGCGGCTCG CTGAAGATGT ACAACATCGC CGATCGCCTG GAAACGGTGA TGTGGGAGAG CAAAAAGATG TTCCCCAATC TCGACTGGTT CTCTGCTGTT TCCTACAACA TGATGGGCGT TCCCACCGAG ATGTTCACAC CACTGTTTGT TATCGCCCGC GTCACCGGCT GGGCGGCGCA CATTATCGAA CAACGTCAGG ACAACAAAAT TATCCGTCCT TCCGCCAATT ATGTTGGACC GGAAGACCGC CCGTTTGTCG CGCTGGATAA GCGCCAGTAA
|
Protein sequence | MSDTTILQNS THVIKPKKSV ALSGVPAGNT ALCTVGKSGN DLHYRGYDIL DLAEHCEFEE VAHLLIHGKL PTRDELAAYK TKLKALRGLP ANVRTVLEAL PAASHPMDVM RTGVSALGCT LPEKEGHTVS GARDIADKLL ASLSSILLYW YHYSHNGERI QPETDDDSIG GHFLHLLHGE KPSQSWEKAM HISLVLYAEH EFNASTFTSR VIAGTGSDMY SAIIGAIGAL RGPKHGGANE VSLEIQQRYE TPDEAEADIR KRVENKEVVI GFGHPVYTIA DPRHQVIKRV AKQLSQEGGS LKMYNIADRL ETVMWESKKM FPNLDWFSAV SYNMMGVPTE MFTPLFVIAR VTGWAAHIIE QRQDNKIIRP SANYVGPEDR PFVALDKRQ
|
| |