Gene EcSMS35_0047 details


Gene Information

Locus tag: EcSMS35_0047
Symbol:
ID: 6145129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 50989
End bp: 52320
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 53%
IMG OID: 641614948
Product: major facilitator family transporter
Protein accession: YP_001742164
Protein GI: 170681587
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
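
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(50989, 52320)
print(length)                     # 1332
print(protein_length_aa(length))  # 443
```

Both results agree with the Gene Length and Protein Length fields of this record.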


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.454742
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACCGT CCAGAAACTT TGACGATCTT AAATTCTCCT CTATTCACCG CCGCATTTTG 
CTGTGGGGAA GCGGTGGTCC GTTTCTGGAT GGTTATGTAC TGGTAATGAT TGGCGTGGCG
CTGGAGCAAC TGACTCCGGC GCTGAAACTG GACGCTGACT GGATTGGCTT GCTGGGCGCG
GGAACGCTCG CCGGGCTGTT CGTTGGCACA TCGCTGTTTG GCTATATCTC CGATAAAGTC
GGACGGCGCA AAATGTTCCT CATTGATATC ATCGCCATCG GCGTGATATC TGTGGCGACG
ATGTTTGTCT CATCCCCCGT CGAACTGTTG GTGATGCGGG TACTTATCGG CATTGTCATC
GGTGCGGATT ATCCCATCGC CACCTCAATG ATCACCGAGT TCTCCAGTAC CCGTCAGCGG
GCGTTTTCCA TCAGCTTTAT TGCCGCGATG TGGTATGTCG GCGCGACCTG CGCCGATCTG
GTCGGTTACT GGCTTTATGA TGTGGAAGGT GGTTGGCGCT GGATGCTGGG TAGCGCAGCG
ATCCCCTGTT TGTTGATTTT GATTGGTCGA TTCGAACTGC CTGAATCTCC CCGCTGGTTG
TTACGCAAAG GGCGAGTAAA AGAGTGCGAA GAGATGATGA TAAAACTGTT TGGCGAACCG
GTGGCTTTCG ATGAAGAGCA GCCGCAGCAA ACCCGTTTTC GCGATCTGTT TAATCGCCGC
CATTTTCCTT TTGTTCTGTT TGTTGCCGCC ATCTGGACCT GCCAGGTGAT CCCCATGTTC
GCCATTTACA CCTTTGGCCC GCAAATCGTT GGTTTGTTGG GATTGGGAGT TGGCAAAAAC
GCGGCGTTGG GGAATGTGGT GATTAGCCTG TTCTTTATGC TTGGCTGTAT TCCGCCGATG
CTGTGGTTAA ACACCGCCGG ACGGCGTCCA TTGTTGATTG GCAGTTTTGC CATGATGACG
CTGGCGCTGG CGGTTTTGGG GCTGATCCCG GATATGGGGA TCTGGCTGGT AGTGATGGCA
TTTGCGGTGT ATGCCTTTTT CTCTGGCGGG CCGGGTAATT TGCAGTGGCT CTATCCTAAT
GAACTCTTCC CGACGGATAT CCGCGCCTCT GCCGTGGGCG TGATTATGTC CTTAAGCCGT
ATTGGCACCA TTGTTTCGAC CTGGGCACTG CCAATCTTTA TCAATAATTA CGGCATCAGT
AGCACCATGC TGATGGGGGC GGGTATCTCG CTATTTGGCT TGTTGATTTC CGTAGCGTTT
GCTCCGGAGA CTCGAGGGAT GTCACTGGCG CAGACCAGCA ATATGACGAT CCGCGGGCAG
AGAATGGGGT AA
 
Protein sequence
MQPSRNFDDL KFSSIHRRIL LWGSGGPFLD GYVLVMIGVA LEQLTPALKL DADWIGLLGA 
GTLAGLFVGT SLFGYISDKV GRRKMFLIDI IAIGVISVAT MFVSSPVELL VMRVLIGIVI
GADYPIATSM ITEFSSTRQR AFSISFIAAM WYVGATCADL VGYWLYDVEG GWRWMLGSAA
IPCLLILIGR FELPESPRWL LRKGRVKECE EMMIKLFGEP VAFDEEQPQQ TRFRDLFNRR
HFPFVLFVAA IWTCQVIPMF AIYTFGPQIV GLLGLGVGKN AALGNVVISL FFMLGCIPPM
LWLNTAGRRP LLIGSFAMMT LALAVLGLIP DMGIWLVVMA FAVYAFFSGG PGNLQWLYPN
ELFPTDIRAS AVGVIMSLSR IGTIVSTWAL PIFINNYGIS STMLMGAGIS LFGLLISVAF
APETRGMSLA QTSNMTIRGQ RMG
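
The gene and protein sequences above are shown in space-separated blocks of ten characters. The Python sketch below joins such blocks, computes a GC fraction, and translates the DNA using the codon assignments of NCBI translation table 11, which are the same as the standard code for every codon. It is a minimal sketch, not the database's own method: the helper names are hypothetical, and it assumes an uppercase A/C/G/T sequence that starts with ATG, so the alternative start codons that table 11 allows are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGCAACCGT CCAGAAACTT TGACGATCTT"
print(translate(join_blocks(first_blocks)))  # MQPSRNFDDL
```

The translated prefix matches the first block of the protein sequence. Applying `join_blocks` to the full gene sequence and passing the result to `translate` and `gc_fraction` would allow the whole record to be checked in the same way.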