Gene ECH74115_5349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5349 
Symbol 
ID6968569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4991604 
End bp4992674 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content52% 
IMG OID643389005 
Productputative fructose-specific phosphotransferase system protein FrvX 
Protein accessionYP_002273414 
Protein GI209396256 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTG AGTTACTGCA ACAGTTGTGC GAAGCCAGCG CCGTCAGCGG CGATGAACAG 
GAAGTTCGCG ACATTCTGAT AAACACGCTG GAACCTTGCG TTAATGAGAT CACCTTTGAT
GGTCTGGGCA GCTTTGTTGC CCGTAAGGGA AATAAAGGTC CAAAAGTTGC CGTTGTCGGG
CATATGGATG AAGTCGGCTT TATGGTCACC CACATCGACG AGAGCGGTTT TCTGCGCTTT
ACCACCATTG GCGGCTGGTG GAATCAGTCG ATGCTCAACC ACCGGGTAAC AATACGCACA
CACAAGGGAG TGAAAATCCC TGGTGTGATT GGTTCCGTAG CGCCTCATGC GTTAACGGAA
AAGCAAAAGC AACAACCGCT GTCATTTGAT GAGATGTTCA TTGATATTGG CGCGAACAGT
CGCGAAGAAG CGAAAAAACG CGGTGCGGAA ATTGGCGATT TTATTAGCCC GGAAGCCAAT
TTTGCCTGCT GGGGCGAAGA TAAAATAGTC GGCAAGGCGC TGGATAATCG CATCGGCTGC
GCGATGATGG CTGAGCTACT ACAGACAGTA AATAACCCAG GAATTACGCT GTACGGCGTC
GGCAGCGTGG AAGAAGAAGT TGGGCTACGC GGGGCACAAA CCTCGGCTGA ACACACTAAA
CCGGATGTGG TGATCGTGCT GGATACCGCC GTCGCGGGCG ATGTTCCGGG CATTGATAAC
ATTAAATACC CGCTGAAACT GGGCAACGGG CCGGGGCTGA TGCTGTTTGA CAAGCGCTAC
TTACCCAACC AGAAACTGGT GGCGGCGTTA AAAAACTGTG CCGCACATAA CGGTTTACCG
CTGCAATTTT CCACCATGAA AACCGGAGCG ACGGATGGCG GGCGCTACAA CGTAATGGGC
GGAGGGCGTC CGGTTGTCGC GCTGTGTCTG CCAACTCGTT ATCTGCACGC TAACAGCGGT
ATGATTTCAA AAGCCGATTA TGATGCTCTG CTCACGCTGA TACGGGATTT TCTGACGACC
TTAACTGCGG AGAAAGTCAA CGCGTTTAGC CAGTTCCGTC AGGTGGATTA A
 
Protein sequence
MNIELLQQLC EASAVSGDEQ EVRDILINTL EPCVNEITFD GLGSFVARKG NKGPKVAVVG 
HMDEVGFMVT HIDESGFLRF TTIGGWWNQS MLNHRVTIRT HKGVKIPGVI GSVAPHALTE
KQKQQPLSFD EMFIDIGANS REEAKKRGAE IGDFISPEAN FACWGEDKIV GKALDNRIGC
AMMAELLQTV NNPGITLYGV GSVEEEVGLR GAQTSAEHTK PDVVIVLDTA VAGDVPGIDN
IKYPLKLGNG PGLMLFDKRY LPNQKLVAAL KNCAAHNGLP LQFSTMKTGA TDGGRYNVMG
GGRPVVALCL PTRYLHANSG MISKADYDAL LTLIRDFLTT LTAEKVNAFS QFRQVD