Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4262 |
Symbol | |
ID | 6146474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4360441 |
End bp | 4361826 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619083 |
Product | sugar transporter family protein |
Protein accession | YP_001746207 |
Protein GI | 170680543 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCACA TCACAACGGA AGATCCAGCA ACTTTACGCC TGCCCTTTAA AGAGAAACTC TCTTACGGTA TTGGCGACCT GGCCTCTAAC ATCCTGCTGG ATATTGGTAC GCTTTATCTT TTGAAGTTTT ATACCGACGT TCTGGGGCTG CCTGGCACCT ATGGTGGCAT TATCTTTTTG ATTTCAAAAT TCTTTACTGC GTTTACCGAT ATGGGAACAG GCATCATGCT GGATTCCCGA CGCAAGATCG GCCCAAAAGG CAAGTTCCGT CCTTTTATTC TGTATGCGTC ATTCCCGGTC ACCCTGCTGG CGATCGCCAA CTTTATCGGT ACGCCGTTCG ATGTCACCGG TAAAACGGTG ATGGCCACCA TCCTGTTTAT GCTTTACGGA CTGTTTTTCA GCATGATGAA CTGCTCGTAT GGCGCGATGG TCCCCGCTAT TACTAAAAAC CCCAACGAAC GTGCATCGCT GGCAGCATGG CGTCAGGGTG GCGCTACATT AGGCCTGCTG CTGTGTACGG TGGGATTCGT GCCGGTTATG AATCTTATCG AAGGTAATCA GCAACTTGGC TATATCTTCG CCGCCACGCT GTTTTCACTG TTCGGCCTGC TGTTTATGTG GATCTGCTAC TCGGGCGTGA AAGAGCGTTA TGTCGAAACC CAACCTACCA ATCCGGCGCA AAAGCCTGGC TTGTTGCAGT CTTTCCGCGC GATTGCTGGT AACCGCCCAC TGTTCATTCT GTGTATTGCC AACCTCTGCA CTTTAGGGGC GTTTAACGTC AAGCTCGCCA TTCAGGTCTA TTACACCCAG TACGTGCTCA ACGATCCCAT CCTGTTGTCA TATATGGGAT TTTTCAGCAT GGGCTGTATT TTCATCGGCG TGTTCCTGAT GCCCGGCGCA GTCAGACGTT TTGGTAAGAA GAAGGTCTAT ATCGGCGGCC TGCTGATTTG GGTGCTTGGC GATCTGCTCA ACTATTTCTT TGGCGGCGGT TCGGTCAGCT TCGTGGCGTT CTCCTGCCTG GCATTCTTCG GCTCAGCGTT TGTTAACAGC CTGAACTGGG CGCTGGTTTC CGACACCGTC GAGTACGGCG AGTGGCGCAC CGGTGTGCGT TCGGAAGGTA CGGTCTACAC CGGTTTCACC TTCTTTCGCA AAGTATCTCA GGCGCTGGCA GGTTTCTTCC CAGGCTGGAT GCTGACGCAA ATCGGCTATG TGCCAAACGT CGCCCAGGCT GACCACACTA TCGAAGGGTT GCGCCAACTG ATCTTCATCT ACCCAAGCGC ACTGGCAGTA GTCACCATCG TGGCAATGGG TTGCTTCTAC AGTCTGAACG AGAAGATGTA CGTCCGCATT GTTGAAGAGA TAGAAGCCCG TAAACGCACG GCGTAA
|
Protein sequence | MSHITTEDPA TLRLPFKEKL SYGIGDLASN ILLDIGTLYL LKFYTDVLGL PGTYGGIIFL ISKFFTAFTD MGTGIMLDSR RKIGPKGKFR PFILYASFPV TLLAIANFIG TPFDVTGKTV MATILFMLYG LFFSMMNCSY GAMVPAITKN PNERASLAAW RQGGATLGLL LCTVGFVPVM NLIEGNQQLG YIFAATLFSL FGLLFMWICY SGVKERYVET QPTNPAQKPG LLQSFRAIAG NRPLFILCIA NLCTLGAFNV KLAIQVYYTQ YVLNDPILLS YMGFFSMGCI FIGVFLMPGA VRRFGKKKVY IGGLLIWVLG DLLNYFFGGG SVSFVAFSCL AFFGSAFVNS LNWALVSDTV EYGEWRTGVR SEGTVYTGFT FFRKVSQALA GFFPGWMLTQ IGYVPNVAQA DHTIEGLRQL IFIYPSALAV VTIVAMGCFY SLNEKMYVRI VEEIEARKRT A
|
| |