Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1144 |
Symbol | shiA |
ID | 6144011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1162007 |
End bp | 1163323 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616022 |
Product | shikimate transporter |
Protein accession | YP_001743211 |
Protein GI | 170680503 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000733491 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.000000475287 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACTCCA CGCTCATCTC CACTCGTCCC GATGAAGGGA CGCTTTCGTT AAGTCGCGCC CGACGAGCTG CGTTAGGCAG CTTCGCTGGT GCCGTCGTCG ACTGGTATGA TTTTTTACTC TATGGCATCA CCGCCGCACT GGTGTTTAAT CGCGAGTTTT TCCCGCAAGT AAGCCCGGCG ATGGGAACGC TCGCCGCATT TGCCACCTTT GGCGTCGGAT TCCTTTTCCG TCCGCTCGGC GGTGTCATTT TCGGTCACTT TGGCGACCGA CTGGGACGTA AGCGCATGTT AATGCTGACC GTCTGGATGA TGGGCATCGC GACAGCCTTG ATTGGTATTC TTCCTTCATT CTCGACCATT GGGTGGTGGG CACCTATTTT GCTGGTGACA CTGCGTGCCA TTCAGGGATT TGCCGTCGGC GGCGAATGGG GAGGCGCGGC GTTGCTTTCC GTTGAAAGTG CACCGAAAAA TAAAAAAGCC TTTTACAGTA GCGGTGTACA AGTTGGCTAC GGTGTAGGTT TACTGCTTTC AACCGGACTG GTTTCATTGA TCAGTATGAT GACGACTGAC GAACAGTTTT TAAGCTGGGG CTGGCGCATT CCTTTCCTGT TTAGCATCGT ACTGGTACTG GGAGCATTGT GGGTGCGCAA TGGCATGGAG GAGTCCGCGG AATTTGAACA ACAGCAACAT TATCAAGCTG CCGCGAAAAA ACGCATCCCG GTTATCGAAG CGCTGTTACG ACATCCCGGT GCTTTCCTGA AGATTATTGC GCTACGACTG TGCGAATTGC TGACGATGTA CATCGTTACT GCCTTTGCAC TTAATTATTC AACCCAGAAT ATGGGGTTAC CGCGCGAACT TTTCCTCAAT ATTGGTTTGC TGGTAGGTGG ATTAAGCTGC TTGACAATTC CCTGTTTTGC CTGGCTTGCC GATCGTTTTG GTCGCCGCAG GGTTTATATC ACAGGTGCGT TAATCGGAAC GTTGAGCGCA TTTCCTTTCT TTATGGCGCT TGAAGCACAA TCTATTTTCT GGATAGTTTT CTTCTCCATA ATGCTGGCAA ACATTGCGCA TGACATGGTG GTGTGTGTGC AACAACCGAT GTTTACCGAA ATGTTTGGTG CCAGTTATCG CTATAGTGGT GCTGGAGTCG GTTATCAGGT CGCCAGTGTG GTTGGCGGTG GATTTACACC TTTTATTGCC GCTGCACTCA TCACTTACTT TGCCGGGAAC TGGCATAGCG TCGCCATTTA TTTGCTGGCT GGATGTCTGA TTTCCGCAAT GACCGCTTTG TTGATGAAAG ACAATCAACG CGCTTGA
|
Protein sequence | MDSTLISTRP DEGTLSLSRA RRAALGSFAG AVVDWYDFLL YGITAALVFN REFFPQVSPA MGTLAAFATF GVGFLFRPLG GVIFGHFGDR LGRKRMLMLT VWMMGIATAL IGILPSFSTI GWWAPILLVT LRAIQGFAVG GEWGGAALLS VESAPKNKKA FYSSGVQVGY GVGLLLSTGL VSLISMMTTD EQFLSWGWRI PFLFSIVLVL GALWVRNGME ESAEFEQQQH YQAAAKKRIP VIEALLRHPG AFLKIIALRL CELLTMYIVT AFALNYSTQN MGLPRELFLN IGLLVGGLSC LTIPCFAWLA DRFGRRRVYI TGALIGTLSA FPFFMALEAQ SIFWIVFFSI MLANIAHDMV VCVQQPMFTE MFGASYRYSG AGVGYQVASV VGGGFTPFIA AALITYFAGN WHSVAIYLLA GCLISAMTAL LMKDNQRA
|
| |