Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4108 |
Symbol | |
ID | 5591406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4100094 |
End bp | 4101443 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640923212 |
Product | major facilitator family transporter |
Protein accession | YP_001460671 |
Protein GI | 157163353 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.073539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAA AGTTACCCGC ACCGCGTGAA GGGCTTTCCG GTAAAGCCAT GAGACGTGTC GTTATGGGCA GTTTCGCAGG CGCGTTAATG GAATGGTATG ACTTCTTTAT TTTTGGCACG GCAGCAGGAC TGGTATTTGC ACCGCTGTTT TATCCAGACA GCGATCCTTT TATAGGTCTT ATAGCCGCCT TTGCTACCTT TGGCGTTGGT TTTTTGACCC GTCCTTTAGG TGGTATCGTC TTTGGCCATT TTGGCGATAA AATTGGGCGA AAAATAACAC TTATCTGGAC ACTGGCTATT GTCGGTTGCT CCACATTTTT GATTGGTTTT ATTCCGACGT ATCAGGAAAT TGGGATCTGG GCACCGCTCA TATTAATGGC GCTGCGTTTA ATTCAGGGCT TTGGTCTTGG TGGCGAGTAT GGCGGAGCGG CATTGATGAC CATAGAATCA GCGCCCGAAT CCCGACGCGG ATTTTTAGGT TCACTCCCGC AAACGGCTGC GTCTGTGGGT ATTATGCTGG CAACCGGTAT TTTTGCTCTC TGTAACCATT TTCTTACTTC GGAGCAGTTT CTCTCCTGGG GATGGCGCAT TCCGTTTTGG CTTTCTGCGG TGATGTTGAT CGTTGGGCTG TTTATCCGCC TGCATACCGA AGAAACGCTG GATTTTCAAA AGCAAAAAAC GACGAATAAT AAAGAAAAGT CCGTTCCTCC GTTGATTGAA TTATTCAAAA AACATCCACG AAATATTTTA TTGGCACTGG GTGCGAGGCT GGCGGAAAGT GTCTCCTCTA ATATTATTAA CGCCTTTGGT ATTGTCTATA TTTCCAGCCA ATTAGCATTG TCGCGAGATA TTCCCCTGAC GGGTATGTTG ATTGCCTCGG CGATCGGTAT TTTCAGTTGT CCATTGGTTG GATGGTTATC AGATCGTATA GGTCAAAAAA GTCTGTATTT GTCGGGAGCA GGATTTTGTG TCCTGTTCGC GTTTCCTTTC TTCTTATTAC TGGATAGCAA AAGTACGCTC ATCATCTGGT GCAGTATGAT CCTCGGCTAC AACTTAGGTC CGACGATGAT GTTTGCCGTG CAGCCAACAC TTTTCACCCG CATGTTCGGT ACCAAAGTAC GATATACCGG CCTTTCTTTT GCTTATCAAT TTTCCGCGAT TCTTGGCGGA CTAAGCCCCC TTATCGCCTC AAGTTTGCTG GCGCTAGGGG GAGGAAAACC GTGGTATGTC GCTCTGTTTT TATTTGCTGT CTCAGTTCTT TCCTTTGTTT GCGTCTGGCT GATTGAGCCG ACCGATGAAC AAGAAACCGC TTCTTACCGC TACATCAGGG AACAATCTCA TGAAAACTGA
|
Protein sequence | MSEKLPAPRE GLSGKAMRRV VMGSFAGALM EWYDFFIFGT AAGLVFAPLF YPDSDPFIGL IAAFATFGVG FLTRPLGGIV FGHFGDKIGR KITLIWTLAI VGCSTFLIGF IPTYQEIGIW APLILMALRL IQGFGLGGEY GGAALMTIES APESRRGFLG SLPQTAASVG IMLATGIFAL CNHFLTSEQF LSWGWRIPFW LSAVMLIVGL FIRLHTEETL DFQKQKTTNN KEKSVPPLIE LFKKHPRNIL LALGARLAES VSSNIINAFG IVYISSQLAL SRDIPLTGML IASAIGIFSC PLVGWLSDRI GQKSLYLSGA GFCVLFAFPF FLLLDSKSTL IIWCSMILGY NLGPTMMFAV QPTLFTRMFG TKVRYTGLSF AYQFSAILGG LSPLIASSLL ALGGGKPWYV ALFLFAVSVL SFVCVWLIEP TDEQETASYR YIREQSHEN
|
| |