Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3908 |
Symbol | |
ID | 6145753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3977925 |
End bp | 3979325 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618734 |
Product | sugar glycoside-pentoside-hexuronide family protein |
Protein accession | YP_001745873 |
Protein GI | 170680906 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.845629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCTA CACCGATTAC TACCGCTGAT ATCGCTAAAG GTAAAATTGA CGATGCGTTA TCTGTACGGG AAAAAATAGG CTACGGCCTG GGTGACGCAG GCGGCACCGT AATAACTTGC CTGATCATGA ACTTTCTCAC CTTTTTCTAC ACCGACGTTT TTGGCTTAAC TCCGGCGCTG GTCGGCACGC TGTTTATTGC ACTGCGCGTG TTTGATGCCA TCTCCGACCC GGTGATGGGC GTCATTGCCG ACCGGACGCA AAGTCGCTGG GGGCGCTTTC GTCCGTGGCA GTTGTGGATT GCCATTCCCA TCGGCATTAT CGGCATCCTG ACGTTCACCG TGCCAGATGC CAGCATGGGA GTAAAAATCG CCTGGGCGTT CGGCACTTAC CTGCTCCTTT CAGTCGGTTA TACCGCCATC AACGTGCCGT ACTGCGCGCT GATCAACACC ATGACCACCC GCCACAATGA AGTGATCTCC TGCCAGTCCT GGCGATTCGT TCTCTGCGGC GTGGCGGGAT TTCTGGTTTC GGTAGGCTTA CCGTGGATGG TAGATCTCTT CGGTCAGGGT AATGCCGCGC GCGGCTATCA ACTGGGCGTC GGCGTGCTGT GCGCCATCGC TGTGGTGATG TTCCTGTGCT GTTTCTTCTG GGTTCGTGAA CGGGTGCCAC TCTCCACAAT GGGGAAATTT ACCCTGCGCG AACATCTTGC CGGGCTGCGG AACAACGACC AACTGCTGCT GATGCTGGTC ATGTCTTTCC TGCTGATTAA CGTCTTTAAT ATTCGCGGCG GTGGGTATAT GTACTTCATT ACCTACGTCT TACAAGGCAG TACGGGCTAC ACGTCGCTGT TCTTCACTAT GGTCACCTTC GCCTCCATTA TCGGCTCGGT GATTGTCAGC CCGTTAACGC GGCGTTTCGA TACCGTCAAA ATTTATTACT ACACCAACCT GCTCCTCGCT GCGCTGGCGG TGTTGATGTG GTTCCTGCCC TCCAGCCCGG CTTATCAAAC GCTGTGGCTG GCGGTGATCC TCGGTAATGG CGTGATTCTT GGCTTCACAT TGCCATTGCA CTTCTCATTG ATGGCCTTTG CCGATGACTA CGGCGAGTGG AAAACCCGCG TACGTTCTTC CGGCATGAAC TTCGCCTTCA ATCTGTTTTT CATCAAGCTG GCCTGGGCCT CCAGCGCCGG GATCATCAGC CTGCTGTTTA TTTTTGTCGC CTACCAGCCA GGCGTGGAAA ACCAGACCGC CAGTTCGCTT GGCGGGATCA CAGCAATGGA AACATTGCTG CCTGCGCTGT TCCACCTGCT GCTGGCGATG GCGATCCGCT TTTGCAAACT CAATAACCCT ATGATGTCAC GCATTGCTAG CGACCTGCGT CAGCGTCATG TACAGCCTTA A
|
Protein sequence | MTSTPITTAD IAKGKIDDAL SVREKIGYGL GDAGGTVITC LIMNFLTFFY TDVFGLTPAL VGTLFIALRV FDAISDPVMG VIADRTQSRW GRFRPWQLWI AIPIGIIGIL TFTVPDASMG VKIAWAFGTY LLLSVGYTAI NVPYCALINT MTTRHNEVIS CQSWRFVLCG VAGFLVSVGL PWMVDLFGQG NAARGYQLGV GVLCAIAVVM FLCCFFWVRE RVPLSTMGKF TLREHLAGLR NNDQLLLMLV MSFLLINVFN IRGGGYMYFI TYVLQGSTGY TSLFFTMVTF ASIIGSVIVS PLTRRFDTVK IYYYTNLLLA ALAVLMWFLP SSPAYQTLWL AVILGNGVIL GFTLPLHFSL MAFADDYGEW KTRVRSSGMN FAFNLFFIKL AWASSAGIIS LLFIFVAYQP GVENQTASSL GGITAMETLL PALFHLLLAM AIRFCKLNNP MMSRIASDLR QRHVQP
|
| |