Gene Information |
Locus tag | EcSMS35_1504 |
Symbol | |
ID | 6144662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1488208 |
End bp | 1489074 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641616382 |
Product | quinate/shikimate dehydrogenase |
Protein accession | YP_001743562 |
Protein GI | 170679975 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
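The coordinate and length fields in the table above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 1488208
END_BP = 1489074


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 867 288
```

Under these assumptions the computed values agree with the listed gene length (867 bp) and protein length (288 aa).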
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0010931 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAATGTTA CCGCAAAATA CGAATTGATT GGGTTGATGG CCTATCCCAT CCGCCACAGT TTATCGCCCG AAATGCAGAA TAAAGCCTTA GAAAAAGCGG GATTGCCATT TACCTATATG GCCTTTGAAG TGGATAACGA TAGTTTTCCC GCAGCAATTG AAGGATTAAA AGCCCTCAAA ATGCGTGGAA CTGGTGTGTC GATGCCGAAC AAACAACTGG CGTGTGAATA TGTTGATGAA TTAACGCCAG CGGCCAAACT GGTGGGTGCC ATCAACACCA TCGTTAATGA TGATGGCTAT CTGCGTGGCT ATAACACCGA CGGCACGGGT CATATTCGCG CCATTAAAGA GAGCGGTTTT GATATCAAAG GCAAAACGAT GGTGCTATTA GGGGCCGGTG GTGCCTCAAC GGCAATTGGC GCGCAGGGGG CAATTGAAGG TTTAAAAGAA ATTAAACTCT TTAACCGTCG GGATGAGTTC TTCGATAAAG CCCTCGCCTT CGCGCAGCGG GTTAACGAAA ACACCGATTG TGTCGTCACG GTTACCGATC TCGCCGATCA GCAAGCCTTT GCTGAAGCCC TGGCTTCCGC CGACATTTTA ACCAATGGCA CAAAAGTGGG TATGAAACCC CTTGAGAATG AATCATTGGT TCATGATATC AGTCTGTTAC ATCCGGGACT TCTGGTCACT GAATGCGTGT ATAACCCGCA TATGACGAAG TTATTGCAGC AGGCGCAACA AGCGGGTTGC AAAACGATTG ATGGATACGG CATGTTGTTG TGGCAAGGGG CTGAACAGTT CACGTTATGG ACTGGCAAAG ATTTCCCTCT GGAATATGTT AAACAGGTCA TGGGGTTCGG TGCCTGA
|
Protein sequence | MNVTAKYELI GLMAYPIRHS LSPEMQNKAL EKAGLPFTYM AFEVDNDSFP AAIEGLKALK MRGTGVSMPN KQLACEYVDE LTPAAKLVGA INTIVNDDGY LRGYNTDGTG HIRAIKESGF DIKGKTMVLL GAGGASTAIG AQGAIEGLKE IKLFNRRDEF FDKALAFAQR VNENTDCVVT VTDLADQQAF AEALASADIL TNGTKVGMKP LENESLVHDI SLLHPGLLVT ECVYNPHMTK LLQQAQQAGC KTIDGYGMLL WQGAEQFTLW TGKDFPLEYV KQVMGFGA
|
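The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch using only the Python standard library, not the tool the database itself uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; because this gene starts with ATG, that difference does not matter here. The helper name `translate` is hypothetical, and the sequence is assumed to be given in coding orientation, as it is displayed in the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Characters other than A, C, G, T raise KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
print(translate("ATGAATGTTA CCGCAAAATA CGAATTGATT"))  # MNVTAKYELI
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, after removing the spaces from both, is one way to confirm that the two fields are consistent.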