Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4951 |
Symbol | |
ID | 6968228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4592055 |
End bp | 4593455 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388634 |
Product | sugar transporter, glycoside-pentoside-hexuronide family |
Protein accession | YP_002273061 |
Protein GI | 209400085 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.841107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCTA CACCGATTAC TACCGCTGAT ATCGCTAAAG GTAAAATTGA CGATGCGTTA TCTGTACGGG AAAAAATAGG CTACGGCCTG GGTGACGCAG GCGGCACCGT AATAACTTGC CTGATCATGA ATTTTCTCAC CTTTTTCTAC ACCGACGTTT TTGGATTAAC TCCGGCGCTG GTTGGCACGC TGTTTATTGC ACTGCGCGTG TTTGATGCCA TCTCCGACCC GGTGATGGGC GTCATTGCCG ACCGGACGCA AAGCCGATGG GGGCGCTTTC GTCCGTGGCA GCTATGGATT GCCATTCCCA TCGGCATTAT CGGCATCCTG ACGTTCACCG TGCCAGATGC CAGCATGGGA GTAAAAATCG CCTGGGCGTT CGGTACTTAC CTGCTCCTTT CAGTCGGTTA TACCGCCATC AACGTACCGT ACTGCGCGCG GATCAACACC ATGACCACCC GCCACAATGA AGTGATCTCC TGCCAGTCCT GGCGATTCGT TCTCTGCGGC GTAGCGGGAT TTCTGGTTTC GGTAGGCTTA CCGTGGATGG TAGCTCTCTT CGGTCAGGGC AACGCTGCAC GCGGCTATCA ACTGGGCGTC GGGGTATTGT GCGCCATTGC CGTGGTGATG TTCCTGTGCT GTTTCTTCTG GGTTCGTGAA CGGGTGCCGC TCTCCACAAT GGGGAAATTT ACCCTGCGCG AACATCTTGC CGGGCTGCGG AACAACGACC AACTGCTGCT GATGCTGGTC ATGTCTTTCC TGCTGATTAA CGTCTTTAAC ATTCGCGGCG GTGGGTATAT GTACTTCATT ACCTACGTCT TACAAGGCAG CACGGGCTAC ACGTCGCTGT TCTTCACCAT GGTCACCTTC GCCTCCATTA TCGGCTCGGT GATTGTCAGC CCGTTAACGC GGCGTTTCGA TACCGTCAAA ATTTATTACT ACACCAACCT GCTCCTCGCT GCACTGGCGG TGTTGATGTG GTTCCTGCCC TCCGGCCCGG CTTATCAAAC GCTGTGGCTG GCGGTGATCC TCGGTAATGG CGTGATTCTT GGCTTCACAT TGCCACTGCA CTTCTCATTG ATGGCCTTTG CCGATGACTA CGGCGAGTGG AAAACCCACG TACGTTCTTC CGGCATGAAC TTCGCCTTCA ATCTGTTTTT CATCAAGCTG GCCTGGGCCT CCAGCGCCGG GATCATCAGC CTGCTGTTTA TTTTTGTCGC CTACCAGCCT GGCGTGGAAA ACCAGACCGC CAGTTCGCTT GGCGGGATCG CGGCGATGGA AACATTACTG CCTGCGCTAT TCCACCTGCT GCTGGCAGGG GCGATCCGCT TTTGCAAACT CAATAATCCT ATGATGTCAC GCATAGCTAC CGACCTGCGT CAGCGTCATG TACAGCCTTA A
|
Protein sequence | MTSTPITTAD IAKGKIDDAL SVREKIGYGL GDAGGTVITC LIMNFLTFFY TDVFGLTPAL VGTLFIALRV FDAISDPVMG VIADRTQSRW GRFRPWQLWI AIPIGIIGIL TFTVPDASMG VKIAWAFGTY LLLSVGYTAI NVPYCARINT MTTRHNEVIS CQSWRFVLCG VAGFLVSVGL PWMVALFGQG NAARGYQLGV GVLCAIAVVM FLCCFFWVRE RVPLSTMGKF TLREHLAGLR NNDQLLLMLV MSFLLINVFN IRGGGYMYFI TYVLQGSTGY TSLFFTMVTF ASIIGSVIVS PLTRRFDTVK IYYYTNLLLA ALAVLMWFLP SGPAYQTLWL AVILGNGVIL GFTLPLHFSL MAFADDYGEW KTHVRSSGMN FAFNLFFIKL AWASSAGIIS LLFIFVAYQP GVENQTASSL GGIAAMETLL PALFHLLLAG AIRFCKLNNP MMSRIATDLR QRHVQP
|
| |