Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2688 |
Symbol | |
ID | 6066710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2950263 |
End bp | 2951546 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641602094 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001725644 |
Protein GI | 170020690 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00014062 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.815677 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCCC TGACGTTACA ACCCATCGCT CGTGTCGATG GCACTATTAA TCTGCCCGGT TCCAAGAGCG TTTCTAACCG CGCTTTATTG CTGGCGGCAT TAGCACACGG CAAAACAGTA TTAACCAATC TGCTGGATAG CGATGACGTG CGCCATATGC TGAATGCATT AACAGGGTTA GGGGTAAGCT ATACGCTTTC AGCCGATCGT ACGCGTTGCG AAATTATCGG TAACGGCGGT CCATTACACG CAGAAGGTGC CCTGGAGTTG TTCCTCGGTA ACGCCGGAAC GGCAATGCGT CCGCTGGCGG CAGCTCTTTG TCTGGGTAGC AATGATATTG TGCTGACCGG TGAGCCGCGT ATGAAAGAAC GCCCGATTGG TCATCTGGTG GATGCTCTGC GCCTGGGCGG GGCGAAGATC ACTTACCTGG AACAAGAAAA TTATCCGCCG TTGCGTTTAC AGGGCGGCTT TACCGGCGGC AACGTTGACG TTGATGGCTC CGTTTCCAGC CAATTCCTCA CCGCACTGTT AATGACTGCG CCTCTTGCGC CGGAAGATAC GGTGATTCGT ATTAAAGGCG ATCTGGTTTC TAAACCTTAT ATCGACATCA CACTCAATCT GATGAAGACG TTTGGTGTTG AAATTGAAAA TCAGCACTAT CAACAATTTG TCGTAAAAGG CGGGCAGTCT TATCAGTCTC CGGGTACTTA TTTGGTCGAA GGCGATGCAT CTTCGGCTTC TTACTTTCTG GCAGCAGCAG CAATCAAAGG CGGCACTGTA AAAGTGACCG GTATTGGACG TAACAGTATG CAGGGTGATA TTCGCTTTGC TGATGTGCTG GAAAAAATGG GCGCGACCAT TTGCTGGGGC GATGATTATA TTTCCTGCAC GCGTGGTGAA CTGAACGCTA TTGATATGGA TATGAACCAT ATTCCCGATG CGGCGATGAC CATTGCCACG GCGGCGTTAT TTGCAAAAGG CACCACCACG CTGCGCAATA TCTATAACTG GCGTGTTAAA GAAACCGATC GCCTGTTTGC GATGGCAACA GAACTGCGTA AAGTCGGTGC GGAAGTAGAA GAGGGGCACG ATTACATTCG TATCACTCCA CCGGAAAAAC TGAACTTTGC CGAGATCGCG ACATACAATG ATCACCGGAT GGCGATGTGT TTCTCGCTGG TGGCGTTGTC AGATACACCA GTGACGATTC TTGATCCCAA ATGCACGGCC AAAACATTTC CGGATTATTT CGAGCAGCTG GCGCGGATTA GCCAGGCAGC CTGA
|
Protein sequence | MESLTLQPIA RVDGTINLPG SKSVSNRALL LAALAHGKTV LTNLLDSDDV RHMLNALTGL GVSYTLSADR TRCEIIGNGG PLHAEGALEL FLGNAGTAMR PLAAALCLGS NDIVLTGEPR MKERPIGHLV DALRLGGAKI TYLEQENYPP LRLQGGFTGG NVDVDGSVSS QFLTALLMTA PLAPEDTVIR IKGDLVSKPY IDITLNLMKT FGVEIENQHY QQFVVKGGQS YQSPGTYLVE GDASSASYFL AAAAIKGGTV KVTGIGRNSM QGDIRFADVL EKMGATICWG DDYISCTRGE LNAIDMDMNH IPDAAMTIAT AALFAKGTTT LRNIYNWRVK ETDRLFAMAT ELRKVGAEVE EGHDYIRITP PEKLNFAEIA TYNDHRMAMC FSLVALSDTP VTILDPKCTA KTFPDYFEQL ARISQAA
|
| |