Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3292 |
Symbol | |
ID | 6067028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3605953 |
End bp | 3607122 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641602707 |
Product | methylcitrate synthase |
Protein accession | YP_001726241 |
Protein GI | 170021287 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA CAACGATCCT GCAAAACAGT ACCCATGTCA TTAAACCGAA AAAATCTGTG GCACTTTCTG GCGTTCCGGC GGGCAATACG GCGCTCTGCA CCGTGGGTAA AAGTGGCAAT GACCTGCATT ACCGCGGCTA CGATATTCTT GATCTGGCGA AACATTGCGA ATTTGAAGAA GTGGCGCATC TGCTGATCCA CGGCAAACTG CCGACCCGTG ACGAACTCGC CGCTTACAAA ACGAAACTGA AAGCCCTGCG CGGTTTACCG GCTAACGTGC GTACCGTGCT GGAAGCCTTA CCGGCGGCGT CACACCCGAT GGATGTTATG CGCACCGGTG TTTCCGCGCT CGGCTGCACG CTGCCAGAAA AAGAGGGGCA TACCGTCTCT GGCGCGCGGG ATATTGCCGA CAAACTGCTG GCGTCGCTTA GCTCGATTCT CCTTTATTGG TATCACTACA GCCACAACGG CGAACGCATC CAACCGGAAA CCGATGACGA CTCCATCGGC GGTCACTTCC TGCATCTGCT GCACGGCGAA AAGCCATCGC AAAGCTGGGA AAAGGCGATG CATATCTCGC TGGTGCTGTA CGCCGAACAC GAGTTTAACG CCTCCACCTT TACCAGTCGG GTGATTGCGG GCACCGGCTC TGATATGTAT TCCGCGATTA TTGGCGCGAT TGGCGCACTG CGCGGGCCAA AACACGGCGG GGCGAATGAA GTGTCGCTGG AGATCCAGCA ACGCTACGAA ACGCCGGACG AAGCCGAAGC AGATATCCGC AAGCGCGTGG AAAACAAAGA AGTGGTCATT GGTTTTGGTC ATCCGGTTTA CACCATCGCT GACCCGCGCC ACCAGGTGAT TAAACGTGTG GCGAAGCAGC TCTCGCAGGA AGGCGGCTCG CTGAAGATGT ACAACATCGC CGATCGCCTG GAAACGGTGA TGTGGGAGAG CAAAAAGATG TTCCCCAATC TCGACTGGTT CTCTGCTGTT TCCTACAACA TGATGGGCGT TCCCACCGAG ATGTTCACAC CACTGTTTGT TATCGCCCGC GTCACCGGCT GGGCGGCGCA CATTATCGAA CAACGTCAGG ACAACAAAAT TATCCGTCCT TCCGCCAATT ATGTTGGACC GGAAGACCGC CCGTTTGTCG CGCTGGATAA GCGCCAGTAA
|
Protein sequence | MSDTTILQNS THVIKPKKSV ALSGVPAGNT ALCTVGKSGN DLHYRGYDIL DLAKHCEFEE VAHLLIHGKL PTRDELAAYK TKLKALRGLP ANVRTVLEAL PAASHPMDVM RTGVSALGCT LPEKEGHTVS GARDIADKLL ASLSSILLYW YHYSHNGERI QPETDDDSIG GHFLHLLHGE KPSQSWEKAM HISLVLYAEH EFNASTFTSR VIAGTGSDMY SAIIGAIGAL RGPKHGGANE VSLEIQQRYE TPDEAEADIR KRVENKEVVI GFGHPVYTIA DPRHQVIKRV AKQLSQEGGS LKMYNIADRL ETVMWESKKM FPNLDWFSAV SYNMMGVPTE MFTPLFVIAR VTGWAAHIIE QRQDNKIIRP SANYVGPEDR PFVALDKRQ
|
| |