Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4246 |
Symbol | galP |
ID | 6971256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3934180 |
End bp | 3935574 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387984 |
Product | galactose-proton symporter |
Protein accession | YP_002272423 |
Protein GI | 209400814 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.487781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.587718 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGACG CTAAAAAACA GGGGCAGTCA AACAAGGCAA TGACGTTTTT CGTCTGCTTC CTTGCCGCTC TGGCGGGATT ACTCTTTGGC CTGGATATCG GTGTAATTGC TGGCGCACTG CCGTTTATTG CAGATGAATT CCAGATTACT TCGCACACGC AAGAATGGGT CGTAAGCTCC ATGATGTTCG GTGCGGCAGT CGGTGCGGTG GGCAGCGGCT GGCTCTCCTT TAAACTCGGG CGCAAAAAGA GCCTGATGAT CGGCGCAATT TTGTTTGTTG CCGGTTCGCT GTTCTCTGCG GCTGCGCCAA ACGTTGAAGT ACTGATTCTT TCCCGCGTTC TGCTGGGGCT GGCGGTAGGT GTGGCCTCTT ATACCGCACC ACTGTACCTC TCTGAAATTG CGCCGGAAAA AATTCGCGGC AGTATGATCT CGATGTATCA GTTAATGATC ACAATCGGGA TCCTCGGTGC TTATCTTTCT GATACCGCCT TCAGCTACAC CGGTGCATGG CGCTGGATGC TGGGTGTGAT TATCATCCCG GCAATTTTGC TGCTGATTGG TGTCTTCTTC CTGCCAGACA GCCCACGTTG GTTTGCCGCC AAACGCCGTT TTGTTGATGC CGAACGCGTG CTGCTACGCC TGCGTGACAC CAGCGCGGAA GCGAAACGCG AACTGGATGA AATCCGTGAA AGTTTGCAGG TTAAACAGAG TGGCTGGGCG CTGTTTAAAG AGAATAGCAA CTTCCGCCGC GCGGTGTTCC TTGGCGTACT GTTACAGGTA ATGCAGCAAT TCACCGGGAT GAACGTCATC ATGTATTACG CGCCGAAAAT CTTCGAACTG GCGGGTTATA CCAACACCAC CGAGCAAATG TGGGGGACAG TGATTGTCGG CCTGACCAAC GTACTTGCCA CCTTTATCGC AATCGGCCTT GTTGACCGCT GGGGACGTAA ACCAACGCTA ACGCTGGGCT TCCTGGTGAT GGCTGCTGGT ATGGGCGTAC TCGGTACAAT GATGCATATC GGTATCCACT CTCCGTCGGC GCAGTATTTC GCCATCGCCA TGCTGCTGAT GTTTATTGTC GGTTTTGCCA TGAGTGCCGG TCCGCTGATT TGGGTACTGT GCTCCGAAAT TCAGCCGCTG AAAGGCCGCG ATTTTGGCAT CACCTGCTCC ACCGCCACCA ACTGGATTGC CAACATGATC GTTGGCGCAA CGTTCCTGAC CATGCTCAAC ACGCTGGGCA ACGCTAATAC CTTCTGGGTT TACGCGGGTC TGAACGTCCT GTTTATCCTG CTGACACTGT GGCTGGTGCC AGAAACCAAA CACGTTTCGC TGGAACATAT TGAACGTAAT CTGATGAAAG GTCGTAAACT GCGCGAAATC GGCGCTCACG ATTAA
|
Protein sequence | MPDAKKQGQS NKAMTFFVCF LAALAGLLFG LDIGVIAGAL PFIADEFQIT SHTQEWVVSS MMFGAAVGAV GSGWLSFKLG RKKSLMIGAI LFVAGSLFSA AAPNVEVLIL SRVLLGLAVG VASYTAPLYL SEIAPEKIRG SMISMYQLMI TIGILGAYLS DTAFSYTGAW RWMLGVIIIP AILLLIGVFF LPDSPRWFAA KRRFVDAERV LLRLRDTSAE AKRELDEIRE SLQVKQSGWA LFKENSNFRR AVFLGVLLQV MQQFTGMNVI MYYAPKIFEL AGYTNTTEQM WGTVIVGLTN VLATFIAIGL VDRWGRKPTL TLGFLVMAAG MGVLGTMMHI GIHSPSAQYF AIAMLLMFIV GFAMSAGPLI WVLCSEIQPL KGRDFGITCS TATNWIANMI VGATFLTMLN TLGNANTFWV YAGLNVLFIL LTLWLVPETK HVSLEHIERN LMKGRKLREI GAHD
|
| |