Gene ECH74115_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0049 
Symbol 
ID6968532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp50217 
End bp51548 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content53% 
IMG OID643384130 
Productmajor facilitator family transporter 
Protein accessionYP_002268653 
Protein GI209398741 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCGT CCAGAAACTT TGACGATCTT AAATTTTCCT CTATTCACCG CCGCATTTTG 
CTGTGGGGAA GCGGTGGTCC GTTTCTGGAT GGTTATATAC TGGTAATGAT TGGCGTGGCG
CTGGAGCAAC TGACGCCGGC GCTAAAACTG GACGCTGACT GGATTGGCTT GCTGGGCGCG
GGAACGCTCG CCGGGCTGTT CGTTGGCACA TCGCTGTTTG GTTATATTTC CGATAAAGTC
GGACGGCGCA AAATGTTCCT CATTGATATC ATCGCCATCG GCGTGATATC GGTGGCGACG
ATGTTTGTTT CATCCCCCGT CGAACTGTTG GTGATGCGGG TACTTATCGG CATTGTCATC
GGTGCAGACT ATCCCATCGC CACTTCGATG ATCACTGAGT TCTCCAGTAC CCGTCAGCGG
GCGTTTTCCA TCAGCTTTAT CGCCGCCATG TGGTATGTCG GCGCGACCTG TGCCGATCTG
GTCGGCTACT GGCTTTATGA TGTGGAAGGC GGCTGGCGCT GGATGCTGGG TAGCGCGGCG
ATCCCCTGTT TGTTGATTTT GATTGGTCGA TTCGAACTGC CTGAATCTCC CCGCTGGTTA
TTACGCAAAG GGCGAGTAAA AGAGTGCGAA GAGATGATGA TCAAACTGTT TGGCGAACCG
GTGGCTTTCG ATGAAGAGCA GCCGCAGCAA ACCCGTTTTC GCGATCTGTT TAATCGCCGC
CATTTTCCTT TTGTTCTGTT TGTTGCCGCC ATCTGGACCT GCCAGGTGAT CCCAATGTTC
GCCATTTACA CCTTTGGCCC GCAAATCGTT GGTTTGTTGG GATTGGGTGT TGGCAAAAAC
GCGGCACTGG GGAACGTGGT GATTAGCCTG TTCTTTATGC TCGGCTGTAT TCCGCCGATG
CTGTGGCTAA ACACTGCCGG ACGGCGTCCA TTGTTGATTG GCAGCTTTGC CATGATGACG
CTGGCGCTGG CGGTTTTGGG GCTGATCCCG GATATGGGGA TCTGGCTGGT AGTGATGGCC
TTTGCGGTGT ATGCCTTTTT CTCTGGCGGG CCGGGTAATT TGCAGTGGCT CTATCCTAAT
GAACTCTTCC CGACGGATAT CCGCGCCTCT GCCGTGGGCG TGATTATGTC CTTAAGTCGT
ATTGGCACCA TTGTTTCGAC CTGGGCACTA CCGATCTTTA TCAATAATTA CGGTATCAGT
AACACGATGC TAATGGGGGC GGGTATCTCG CTGTTTGGCT TGTTGATTTC CGTAGCGTTT
GCCCCGGAGA CTCGAGGGAT GTCACTGGCG CAGACCAGCA ATATGACGAT CCGCGGGCAG
AGAATGGGGT AA
 
Protein sequence
MQPSRNFDDL KFSSIHRRIL LWGSGGPFLD GYILVMIGVA LEQLTPALKL DADWIGLLGA 
GTLAGLFVGT SLFGYISDKV GRRKMFLIDI IAIGVISVAT MFVSSPVELL VMRVLIGIVI
GADYPIATSM ITEFSSTRQR AFSISFIAAM WYVGATCADL VGYWLYDVEG GWRWMLGSAA
IPCLLILIGR FELPESPRWL LRKGRVKECE EMMIKLFGEP VAFDEEQPQQ TRFRDLFNRR
HFPFVLFVAA IWTCQVIPMF AIYTFGPQIV GLLGLGVGKN AALGNVVISL FFMLGCIPPM
LWLNTAGRRP LLIGSFAMMT LALAVLGLIP DMGIWLVVMA FAVYAFFSGG PGNLQWLYPN
ELFPTDIRAS AVGVIMSLSR IGTIVSTWAL PIFINNYGIS NTMLMGAGIS LFGLLISVAF
APETRGMSLA QTSNMTIRGQ RMG