Gene Information |
Locus tag | EcSMS35_3085 |
Symbol | galP |
ID | 6145909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3175182 |
End bp | 3176576 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617953 |
Product | galactose-proton symporter |
Protein accession | YP_001745104 |
Protein GI | 170681091 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
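The coordinate and length fields above are mutually consistent, and a minimal sketch can show the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions are inferred from the numbers in this record rather than stated by it, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length(3175182, 3176576)   # 1395
residues = protein_length(length)        # 464
print(length, residues)
```

Under these assumptions the computed values match the Gene Length (1395 bp) and Protein Length (464 aa) fields.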
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0156709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000187462 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTGACG CTAAAAAACA GGGGCGGTCA AACAAGGCAA TGACGTTTTT CGTCTGCTTC CTTGCCGCTC TGGCGGGATT ACTCTTTGGC CTGGATATCG GTGTAATTGC TGGCGCACTG CCGTTTATTG CAGATGAATT CCAGATTACT TCGCACACGC AAGAATGGGT CGTAAGCTCC ATGATGTTCG GTGCGGCAGT CGGTGCGGTG GGCAGCGGCT GGCTCTCCTT TAAACTCGGG CGCAAAAAGA GCCTGATGAT CGGCGCAATC CTGTTTGTTG CCGGTTCACT GTTCTCTGCG GCTGCGCCAA ACGTTGAAGT ACTGATTCTT TCCCGCGTTC TGCTGGGGCT GGCGGTGGGT GTGGCCTCTT ATACCGCACC GCTTTACCTC TCTGAAATTG CGCCGGAAAA AATTCGCGGC AGTATGATCT CGATGTATCA GTTGATGATC ACTATCGGGA TCCTCGGTGC TTATCTTTCT GATACCGCCT TCAGCTACAC CGGTGCATGG CGCTGGATGC TGGGTGTGAT TATCATCCCG GCAATTTTGC TGCTGATTGG TGTCTTCTTC CTGCCAGACA GCCCTCGTTG GTTTGCCGCC AAACGCCGTT TTGTTGATGC CGAACGCGTG CTGCTACGCC TGCGTGACAC CAGCGCGGAA GCGAAACGCG AACTGGATGA AATCCGTGAA AGTTTGCAGG TTAAACAGAG TGGCTGGGCG CTGTTTAAAG AGAACAGCAA CTTCCGCCGC GCGGTGTTCC TTGGCGTACT GTTGCAGGTA ATGCAGCAAT TCACCGGGAT GAACGTCATC ATGTATTACG CGCCGAAAAT CTTCGAACTG GCGGGTTATA CCAACACCAC CGAGCAAATG TGGGGGACAG TGATTGTCGG CCTGACCAAC GTACTTGCCA CCTTTATCGC AATCGGCCTT GTTGACCGCT GGGGACGCAA ACCAACGCTA ACGCTGGGCT TCCTGGTGAT GGCTGCTGGC ATGGGCGTAC TCGGTACAAT GATGCATATC GGTATCCACT CTCCGTCGGC GCAGTATTTC GCTATCGCCA TGCTGCTGAT GTTTATTGTC GGTTTTGCCA TGAGTGCCGG TCCGCTGATT TGGGTACTGT GTTCCGAAAT TCAGCCGCTG AAAGGCCGCG ATTTTGGCAT CACCTGCTCC ACCGCCACCA ACTGGATTGC CAACATGATC GTTGGCGCAA CGTTCCTGAC CATGCTCAAC ACGCTGGGTA ACGCCAACAC CTTCTGGGTG TATGCGGCTC TGAACGTACT GTTTATCCTG CTGACATTGT GGCTGGTACC GGAAACCAAA CACGTTTCGC TGGAACATAT TGAACGTAAT CTGATGAAAG GTCGTAAACT GCGCGAAATC GGCGCTCACG ATTAA
|
Protein sequence | MPDAKKQGRS NKAMTFFVCF LAALAGLLFG LDIGVIAGAL PFIADEFQIT SHTQEWVVSS MMFGAAVGAV GSGWLSFKLG RKKSLMIGAI LFVAGSLFSA AAPNVEVLIL SRVLLGLAVG VASYTAPLYL SEIAPEKIRG SMISMYQLMI TIGILGAYLS DTAFSYTGAW RWMLGVIIIP AILLLIGVFF LPDSPRWFAA KRRFVDAERV LLRLRDTSAE AKRELDEIRE SLQVKQSGWA LFKENSNFRR AVFLGVLLQV MQQFTGMNVI MYYAPKIFEL AGYTNTTEQM WGTVIVGLTN VLATFIAIGL VDRWGRKPTL TLGFLVMAAG MGVLGTMMHI GIHSPSAQYF AIAMLLMFIV GFAMSAGPLI WVLCSEIQPL KGRDFGITCS TATNWIANMI VGATFLTMLN TLGNANTFWV YAALNVLFIL LTLWLVPETK HVSLEHIERN LMKGRKLREI GAHD
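The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is an illustration only: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), strips the 10-base spacing used in this record, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon -> amino acid map, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases of the gene sequence above.
print(translate("ATGCCTGACG CTAAAAAACA G"))  # MPDAKKQ
```

Passing the full gene sequence to `translate` should give the protein sequence listed in the record, and `gc_fraction` can be compared against the GC content field.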