Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5349 |
Symbol | |
ID | 6968569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4991604 |
End bp | 4992674 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643389005 |
Product | putative fructose-specific phosphotransferase system protein FrvX |
Protein accession | YP_002273414 |
Protein GI | 209396256 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTG AGTTACTGCA ACAGTTGTGC GAAGCCAGCG CCGTCAGCGG CGATGAACAG GAAGTTCGCG ACATTCTGAT AAACACGCTG GAACCTTGCG TTAATGAGAT CACCTTTGAT GGTCTGGGCA GCTTTGTTGC CCGTAAGGGA AATAAAGGTC CAAAAGTTGC CGTTGTCGGG CATATGGATG AAGTCGGCTT TATGGTCACC CACATCGACG AGAGCGGTTT TCTGCGCTTT ACCACCATTG GCGGCTGGTG GAATCAGTCG ATGCTCAACC ACCGGGTAAC AATACGCACA CACAAGGGAG TGAAAATCCC TGGTGTGATT GGTTCCGTAG CGCCTCATGC GTTAACGGAA AAGCAAAAGC AACAACCGCT GTCATTTGAT GAGATGTTCA TTGATATTGG CGCGAACAGT CGCGAAGAAG CGAAAAAACG CGGTGCGGAA ATTGGCGATT TTATTAGCCC GGAAGCCAAT TTTGCCTGCT GGGGCGAAGA TAAAATAGTC GGCAAGGCGC TGGATAATCG CATCGGCTGC GCGATGATGG CTGAGCTACT ACAGACAGTA AATAACCCAG GAATTACGCT GTACGGCGTC GGCAGCGTGG AAGAAGAAGT TGGGCTACGC GGGGCACAAA CCTCGGCTGA ACACACTAAA CCGGATGTGG TGATCGTGCT GGATACCGCC GTCGCGGGCG ATGTTCCGGG CATTGATAAC ATTAAATACC CGCTGAAACT GGGCAACGGG CCGGGGCTGA TGCTGTTTGA CAAGCGCTAC TTACCCAACC AGAAACTGGT GGCGGCGTTA AAAAACTGTG CCGCACATAA CGGTTTACCG CTGCAATTTT CCACCATGAA AACCGGAGCG ACGGATGGCG GGCGCTACAA CGTAATGGGC GGAGGGCGTC CGGTTGTCGC GCTGTGTCTG CCAACTCGTT ATCTGCACGC TAACAGCGGT ATGATTTCAA AAGCCGATTA TGATGCTCTG CTCACGCTGA TACGGGATTT TCTGACGACC TTAACTGCGG AGAAAGTCAA CGCGTTTAGC CAGTTCCGTC AGGTGGATTA A
|
Protein sequence | MNIELLQQLC EASAVSGDEQ EVRDILINTL EPCVNEITFD GLGSFVARKG NKGPKVAVVG HMDEVGFMVT HIDESGFLRF TTIGGWWNQS MLNHRVTIRT HKGVKIPGVI GSVAPHALTE KQKQQPLSFD EMFIDIGANS REEAKKRGAE IGDFISPEAN FACWGEDKIV GKALDNRIGC AMMAELLQTV NNPGITLYGV GSVEEEVGLR GAQTSAEHTK PDVVIVLDTA VAGDVPGIDN IKYPLKLGNG PGLMLFDKRY LPNQKLVAAL KNCAAHNGLP LQFSTMKTGA TDGGRYNVMG GGRPVVALCL PTRYLHANSG MISKADYDAL LTLIRDFLTT LTAEKVNAFS QFRQVD
|
| |