Gene ECH74115_4050 details


Gene Information

Locus tag: ECH74115_4050
Symbol: gudP
ID: 6967524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3747545
End bp: 3748897
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 52%
IMG OID: 643387810
Product: glucarate permease
Protein accession: YP_002272253
Protein GI: 209400750
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR00893] d-galactonate transporter
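
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3747545
end_bp = 3748897

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1353 bp) and Protein Length (450 aa).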


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCTT TAAGTCAGGC TGCGAGCAGT GTGGAAAAAC GCACGAATGC TCGTTACTGG 
ATAGTGGTGA TGTTGTTTAT CGTCACATCC TTCAACTACG GCGACCGCGC TACGCTTTCT
ATTGCCGGTT CGGAAATGGC CAAAGATATC GGCCTTGATC CCGTGGGAAT GGGCTATGTG
TTCTCTGCCT TCTCATGGGC TTATGTTATC GGGCAGATCC CTGGTGGTTG GTTGCTGGAT
CGTTTTGGTT CAAAACGCGT CTACTTCTGG TCGATCTTTA TCTGGTCGAT GTTTACCTTG
CTGCAAGGCT TCGTCGATAT CTTTAGTGGA TTCGGCATTA TCGTTGCCCT GTTTACTCTG
CGCTTCCTGG TCGGGCTTGC TGAAGCGCCA TCTTTCCCCG GCAACAGTCG CATTGTTGCG
GCCTGGTTTC CGGCGCAGGA AAGGGGAACG GCGGTGTCGA TTTTTAACTC CGCTCAATAC
TTCGCAACGG TGATCTTCGC GCCGATTATG GGCTGGCTGA CGCATGAAGT GGGCTGGTCA
CACGTCTTCT TCTTTATGGG CGGTCTGGGC ATTGCCATCA GCTTTATCTG GTTGAAAGTC
ATCCACGAGC CAAATCAACA TCCGGGGGTA AATCAGAAAG AGCTGGAGTA CATCGCCGCG
GGTGGCGCGC TGATCAATAT GGATCAGCAA AACACCAAAG TTAAAGTGCC GTTCAGCGTG
AAGTGGGGGC AGATCAAACA ACTGCTCGGG TCACGGATGA TGATCGGCGT TTATATCGGT
CAGTACTGTA TCAACGCCCT GACTTACTTT TTCATTACCT GGTTTCCGGT TTATCTGGTG
CAGGCGCGTG GGATGTCGAT TCTGAAAGCG GGCTTTGTGG CTTCCGTTCC GGCGGTTTGT
GGTTTTATCG GTGGTGTGCT GGGTGGGATT ATTTCCGACT GGCTGATGCG CCGCACGGGA
TCGCTGAACA TTGCGCGTAA AACACCGATC GTAATGGGCA TGTTGCTGTC GATGGTGATG
GTGTTCTGCA ACTACGTCAA CGTTGAGTGG ATGATCATCG GCTTTATGGC GCTGGCCTTC
TTCGGTAAGG GCATTGGGGC GCTGGGTTGG GCAGTAATGG CAGATACCGC GCCAAAAGAG
ATCAGCGGTC TTTCCGGTGG CCTGTTCAAC ATGTTCGGTA ACATTTCTGG CATCGTCACG
CCAATCGCAA TTGGTTATAT CGTTGGCACG ACTGGCTCGT TTAATGGGGC GCTGATTTAT
GTTGGTGTTC ATGCCTTAAT CGCGGTACTG AGCTACCTGG TGCTGGTGGG CGATATCAAG
CGTATCGAGC TGAAACCTGT CGTGGGGCAA TAA
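
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as text in the same layout as above (blocks of 10 bases separated by spaces and line breaks). The function name `gc_fraction` is hypothetical and not part of any database tool. The sample covers only the first 60 bases, so its value is not expected to match the whole-gene figure of 52%.

```python
def gc_fraction(seq_text):
    """Return the fraction of G and C bases, ignoring all whitespace."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)

# First line of the gene sequence above, used as a short sample.
sample = "ATGAGTTCTT TAAGTCAGGC TGCGAGCAGT GTGGAAAAAC GCACGAATGC TCGTTACTGG"
print(round(gc_fraction(sample), 3))
```

Passing the full 1353-base sequence to the same function would give the value to compare against the reported GC content.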
 
Protein sequence
MSSLSQAASS VEKRTNARYW IVVMLFIVTS FNYGDRATLS IAGSEMAKDI GLDPVGMGYV 
FSAFSWAYVI GQIPGGWLLD RFGSKRVYFW SIFIWSMFTL LQGFVDIFSG FGIIVALFTL
RFLVGLAEAP SFPGNSRIVA AWFPAQERGT AVSIFNSAQY FATVIFAPIM GWLTHEVGWS
HVFFFMGGLG IAISFIWLKV IHEPNQHPGV NQKELEYIAA GGALINMDQQ NTKVKVPFSV
KWGQIKQLLG SRMMIGVYIG QYCINALTYF FITWFPVYLV QARGMSILKA GFVASVPAVC
GFIGGVLGGI ISDWLMRRTG SLNIARKTPI VMGMLLSMVM VFCNYVNVEW MIIGFMALAF
FGKGIGALGW AVMADTAPKE ISGLSGGLFN MFGNISGIVT PIAIGYIVGT TGSFNGALIY
VGVHALIAVL SYLVLVGDIK RIELKPVVGQ
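
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below shows this for the first and last stretches of the gene. It assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons, and it does not handle alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(seq_text):
    """Translate in-frame DNA text, ignoring whitespace, up to the first stop codon."""
    bases = "".join(seq_text.split()).upper()
    protein = []
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 bases and the final 33 bases of the gene sequence above (both in frame).
first_bases = "ATGAGTTCTT TAAGTCAGGC TGCGAGCAGT"
last_bases = "CGTATCGAGC TGAAACCTGT CGTGGGGCAA TAA"
print(translate(first_bases))
print(translate(last_bases))
```

The two results can be compared with the start (MSSLSQAASS) and end (RIELKPVVGQ) of the protein sequence listed above.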