Gene EcSMS35_3085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3085 
SymbolgalP 
ID6145909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3175182 
End bp3176576 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content53% 
IMG OID641617953 
Productgalactose-proton symporter 
Protein accessionYP_001745104 
Protein GI170681091 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0156709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.000187462 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGACG CTAAAAAACA GGGGCGGTCA AACAAGGCAA TGACGTTTTT CGTCTGCTTC 
CTTGCCGCTC TGGCGGGATT ACTCTTTGGC CTGGATATCG GTGTAATTGC TGGCGCACTG
CCGTTTATTG CAGATGAATT CCAGATTACT TCGCACACGC AAGAATGGGT CGTAAGCTCC
ATGATGTTCG GTGCGGCAGT CGGTGCGGTG GGCAGCGGCT GGCTCTCCTT TAAACTCGGG
CGCAAAAAGA GCCTGATGAT CGGCGCAATC CTGTTTGTTG CCGGTTCACT GTTCTCTGCG
GCTGCGCCAA ACGTTGAAGT ACTGATTCTT TCCCGCGTTC TGCTGGGGCT GGCGGTGGGT
GTGGCCTCTT ATACCGCACC GCTTTACCTC TCTGAAATTG CGCCGGAAAA AATTCGCGGC
AGTATGATCT CGATGTATCA GTTGATGATC ACTATCGGGA TCCTCGGTGC TTATCTTTCT
GATACCGCCT TCAGCTACAC CGGTGCATGG CGCTGGATGC TGGGTGTGAT TATCATCCCG
GCAATTTTGC TGCTGATTGG TGTCTTCTTC CTGCCAGACA GCCCTCGTTG GTTTGCCGCC
AAACGCCGTT TTGTTGATGC CGAACGCGTG CTGCTACGCC TGCGTGACAC CAGCGCGGAA
GCGAAACGCG AACTGGATGA AATCCGTGAA AGTTTGCAGG TTAAACAGAG TGGCTGGGCG
CTGTTTAAAG AGAACAGCAA CTTCCGCCGC GCGGTGTTCC TTGGCGTACT GTTGCAGGTA
ATGCAGCAAT TCACCGGGAT GAACGTCATC ATGTATTACG CGCCGAAAAT CTTCGAACTG
GCGGGTTATA CCAACACCAC CGAGCAAATG TGGGGGACAG TGATTGTCGG CCTGACCAAC
GTACTTGCCA CCTTTATCGC AATCGGCCTT GTTGACCGCT GGGGACGCAA ACCAACGCTA
ACGCTGGGCT TCCTGGTGAT GGCTGCTGGC ATGGGCGTAC TCGGTACAAT GATGCATATC
GGTATCCACT CTCCGTCGGC GCAGTATTTC GCTATCGCCA TGCTGCTGAT GTTTATTGTC
GGTTTTGCCA TGAGTGCCGG TCCGCTGATT TGGGTACTGT GTTCCGAAAT TCAGCCGCTG
AAAGGCCGCG ATTTTGGCAT CACCTGCTCC ACCGCCACCA ACTGGATTGC CAACATGATC
GTTGGCGCAA CGTTCCTGAC CATGCTCAAC ACGCTGGGTA ACGCCAACAC CTTCTGGGTG
TATGCGGCTC TGAACGTACT GTTTATCCTG CTGACATTGT GGCTGGTACC GGAAACCAAA
CACGTTTCGC TGGAACATAT TGAACGTAAT CTGATGAAAG GTCGTAAACT GCGCGAAATC
GGCGCTCACG ATTAA
 
Protein sequence
MPDAKKQGRS NKAMTFFVCF LAALAGLLFG LDIGVIAGAL PFIADEFQIT SHTQEWVVSS 
MMFGAAVGAV GSGWLSFKLG RKKSLMIGAI LFVAGSLFSA AAPNVEVLIL SRVLLGLAVG
VASYTAPLYL SEIAPEKIRG SMISMYQLMI TIGILGAYLS DTAFSYTGAW RWMLGVIIIP
AILLLIGVFF LPDSPRWFAA KRRFVDAERV LLRLRDTSAE AKRELDEIRE SLQVKQSGWA
LFKENSNFRR AVFLGVLLQV MQQFTGMNVI MYYAPKIFEL AGYTNTTEQM WGTVIVGLTN
VLATFIAIGL VDRWGRKPTL TLGFLVMAAG MGVLGTMMHI GIHSPSAQYF AIAMLLMFIV
GFAMSAGPLI WVLCSEIQPL KGRDFGITCS TATNWIANMI VGATFLTMLN TLGNANTFWV
YAALNVLFIL LTLWLVPETK HVSLEHIERN LMKGRKLREI GAHD