Gene ECH74115_4826 details

Gene Information

Locus tag: ECH74115_4826
Symbol:
ID: 6968182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4459820
End bp: 4461328
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 54%
IMG OID: 643388518
Product: carbohydrate kinase, FGGY family
Protein accession: YP_002272946
Protein GI: 209398014
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID:
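
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length), and the helper names gene_length and protein_length are hypothetical, not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 4459820
END_BP = 4461328


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: one residue per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1509 502
```

Both results match the Gene Length and Protein Length fields, which is expected for an unspliced CDS that ends in a single stop codon.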


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGATA ACAGCGCAGC TATCGTTATC GATATTGGCA CCACCAATTG CAAAGTCACC 
TGCTTTTCCT GCCTGGACGC AACGACGTTG GGCGCGCATA AATTCGTGAC GGCAAAGCAG
ATCTCCCCAC AGGGCGATGT CGATTTCGAT ATCGACGCCC TCTGGCAGGA GGTCCGCCAG
GCGATAGCGC AACTGAACGC CGCTTCGCCG CTGCCAGTCA GACGGATCAG CATTGCCAGT
TTTGGCGAAT CAGGCGTGTT CCTTGACGAG CATGGCGAGA TCCTGACGCC AATGCTGGCA
TGGTATGACC GTCGCGGTGA AGAGTATCTG GCAACGCTTA GCGAGGCAGA CAGTGCGGCA
CTTTATGACA TCTGCGGTCT ACCACTACAC AGCAATTACT CTGCCTTCAA AATGCGCTGG
TTGCTGGAAC ATTACCCGCT GCGTAATCGC CGCGGCCTGC GCTGGCTACA TGCGCCGGAA
GTGCTGCTCT GGCGGCTGAC TGGCGAACAG CACACGGATA TCACCTTAGC CAGCCGCACG
CTGTGTCTGG ACGTGCGCAA AGGCGAATGG TCAGCGAAAG CGGCGGCGTT GTTACACGTT
CCCTGTACGG CATTTGCGCC ATTGGTGCAG CCAGGCGAGC ACGCCGGATG GGTCAGCGAG
TCACTTTGCA AGACGCTTGG GTTCTCGCAA CCGGTCAGCG TGACGCTGGC CGGACATGAC
CATATGGTGG GTGCGCGAGC GTTGCAGATG ATGCCAGGCG ATATCCTTAA CTCGACGGGG
ACCACGGAAG GCATTCTGCA ACTGGATACA CAACCGACGC TGGATGAACA GGCCAAACGT
GACAAGCTGG CAAACGGCTG TTACTCACTT GCCAACCAGT TCACCCTGTT TGCGTCGCTG
CCCGTGGGCG GTTTCGCGCT GGAGTGGCTG CGCAACACGT TCCGGCTAAC CGATGAGGAG
ATCGCCGCAT CACTTACTCG CGGACATGCT GATTATCTGG CGGGGAATTG GTTGCTCGAT
GACATTCCCG TCTTTATTCC ACATCTTCGC GGTTCGGGTT CGCCCTATAA AAATCGCCAT
ACCCGTGGAT TATTTTATGG GCTTGGCGAT ACGTTAAGTA TCGACATGTT AATTGCCAGC
GTATCACTGG GATTAACCAT GGAATTTGCC AACTGCTTCG CCTGTTTTAA CGTGCCTGGC
ACCAGCGCGT TAAAAGTGAT CGGTCCGGCA ACCCATAATC CTCTTTGGCT GCAATTAAAG
GCGGATATTT TACAGCGTCC GGTTGAAGCA ATTGCATTTA ACGAGGCGGT TTCTGTCGGA
GCATTATTAA CCGCCGCACC GGATATTCCA CCGCCGCCAG TCGCTATAGC CCAACGTTTG
TTACCGAATC GGGCGAGATA CCATCAATTA CAGCGTTATC AGCACAAATG GAAAAGCTGG
TATCAGTTGA AATTACAACA AGAAGGCGTG ATGCCATTAC ATCATCGAGA GGAACACTAT
GTTGAATAA
 
Protein sequence
MPDNSAAIVI DIGTTNCKVT CFSCLDATTL GAHKFVTAKQ ISPQGDVDFD IDALWQEVRQ 
AIAQLNAASP LPVRRISIAS FGESGVFLDE HGEILTPMLA WYDRRGEEYL ATLSEADSAA
LYDICGLPLH SNYSAFKMRW LLEHYPLRNR RGLRWLHAPE VLLWRLTGEQ HTDITLASRT
LCLDVRKGEW SAKAAALLHV PCTAFAPLVQ PGEHAGWVSE SLCKTLGFSQ PVSVTLAGHD
HMVGARALQM MPGDILNSTG TTEGILQLDT QPTLDEQAKR DKLANGCYSL ANQFTLFASL
PVGGFALEWL RNTFRLTDEE IAASLTRGHA DYLAGNWLLD DIPVFIPHLR GSGSPYKNRH
TRGLFYGLGD TLSIDMLIAS VSLGLTMEFA NCFACFNVPG TSALKVIGPA THNPLWLQLK
ADILQRPVEA IAFNEAVSVG ALLTAAPDIP PPPVAIAQRL LPNRARYHQL QRYQHKWKSW
YQLKLQQEGV MPLHHREEHY VE
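
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names (clean, translate, gc_fraction) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon layout is used here. Only the first and last codons of the gene sequence are shown; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid mapping (it differs in which codons may start translation).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence, as formatted in the record.
first_block = "ATGCCTGATA ACAGCGCAGC TATCGTTATC"
last_block = "GTTGAATAA"

print(translate(first_block))  # prints: MPDNSAAIVI
print(translate(last_block))   # prints: VE
```

The two fragments reproduce the first ten and last two residues of the listed protein sequence. Calling gc_fraction on the full gene sequence would give the value to compare with the GC content field.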