Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2823 |
Symbol | shiA |
ID | 6970221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2630742 |
End bp | 2632058 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643386673 |
Product | shikimate transporter |
Protein accession | YP_002271147 |
Protein GI | 209399883 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.347269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.535086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTCCA CGCTCATCTC CACTCGTCCC GATGAAAGGA CGGTTTCGTT AAGTCGCGCC CGACGAGCTG CGTTAGGCAG TTTCGCTGGT GCCGTCGTCG ACTGGTATGA TTTTTTACTC TATGGCATTA CCGCCGCACT GGTGTTTAAT CGCGAGTTTT TCCCGCAAGT AAGCCCGGCT ATGGGAACGC TCGCCGCATT TGCCACCTTT GGCGTCGGAT TCCTTTTCCG TCCGCTCGGC GGTGTCATTT TCGGTCACTT TGGCGACCGA CTGGGACGTA AGCGCATGTT AATGCTGACC GTCTGGATGA TGGGCATCGC GACAGCCTTG ATTGGTATTC TTCCTTCATT CTCGACCATT GGGTGGTGGG CACCTATTTT ACTGGTGACG CTGCGTGCCA TTCAGGGATT TGCCGTCGGC GGCGAATGGG GAGGCGCGGC GTTGCTTTCC GTTGAAAGTG CACCGAAAAA TAAAAAAGCC TTTTACAGTA GCGGTGTACA AGTTGGCTAC GGTGTAGGTT TACTGCTTTC AACCGGACTG GTTTCATTGA TCAGTATGAT GACGACTGAC GAACAGTTTT TAAGCTGGGG CTGGCGCATT CCTTTCCTGT TTAGCATCGT ACTGGTACTG GGAGCATTGT GGGTGCGCAA TGGCATGGAA GAGTCCGCGG AATTTGAACA ACAGCAATAT AATCAAGCGG CCGCGAAAAA ACGCATCCCG GTTATCGAAG CGCTGTTACG ACATCCCGGT GCTTTCCTGA AGATTATTGC GCTACGACTG TGCGAGTTGC TGACAATGTA CATCGTTACC GCCTTTGCAC TTAATTATTC AACTCAGAAT ATGGGGTTAC CGCGCGAACT TTTCCTTAAT ATTGGTTTGC TGGTAGGTGG ATTAAGCTGC CTGACAATTC CCTGTTTTGC CTGGCTTGCC GATCGTTTTG GTCGCCGCAG GGTTTATATC ACAGGCGCGT TGATCGGAAC GTTGAGCGCA TTTCCTTTCT TTATGGCGCT TGAAGCACAA TCTATTTTCT GGATAGTTTT CTTCTCCATA ATGCTGGCAA ACATTGCGCA TGACATGGTG GTGTGTGTGC AACAACCGAT GTTTACCGAA ATGTTTGGTG CCAGTTATCG CTATAGTGGT GCTGGAGTCG GTTATCAGGT TGCCAGTGTG GTTGGCGGTG GATTTACACC TTTTATTGCC GCTGCGCTCA TCACTTACTT TGCCGGGAAC TGGCATAGCG TCGCCATTTA TTTGCTGGCT GGATGTCTGA TTTCCGCAAT GACCGCTTTG TTGATGAAAG ACAATCAACG CGCTTGA
|
Protein sequence | MDSTLISTRP DERTVSLSRA RRAALGSFAG AVVDWYDFLL YGITAALVFN REFFPQVSPA MGTLAAFATF GVGFLFRPLG GVIFGHFGDR LGRKRMLMLT VWMMGIATAL IGILPSFSTI GWWAPILLVT LRAIQGFAVG GEWGGAALLS VESAPKNKKA FYSSGVQVGY GVGLLLSTGL VSLISMMTTD EQFLSWGWRI PFLFSIVLVL GALWVRNGME ESAEFEQQQY NQAAAKKRIP VIEALLRHPG AFLKIIALRL CELLTMYIVT AFALNYSTQN MGLPRELFLN IGLLVGGLSC LTIPCFAWLA DRFGRRRVYI TGALIGTLSA FPFFMALEAQ SIFWIVFFSI MLANIAHDMV VCVQQPMFTE MFGASYRYSG AGVGYQVASV VGGGFTPFIA AALITYFAGN WHSVAIYLLA GCLISAMTAL LMKDNQRA
|
| |