Gene Information |
Locus tag | ECH74115_3594 |
Symbol | |
ID | 6968510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3308531 |
End bp | 3309973 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387391 |
Product | sucrose-6-phosphate hydrolase |
Protein accession | YP_002271850 |
Protein GI | 209399695 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | [TIGR01322] sucrose-6-phosphate hydrolase |
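The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, although it agrees with the listed 1443 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3308531, 3309973)
print(length, protein_length(length))  # prints "1443 480"
```

Both values match the Gene Length and Protein Length rows of this record.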
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.000217681 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGATAAAAA TGACGCAATC TCGATTGCAT GCGGCGCAAA ACGCACTAGC AAAACTTCAC GAGCGCCGAG GTAACACTTT CTATCCCCAT TTTCACCTCG CGCCTCCTGC CGGGTGGATG AACGATCCAA ACGGCCTGAT CTGGTTTAAC GATCGTTATC ACGCGTTTTA TCAACATCAC CCAATGAGTG AACACTGGGG GCCAATGCAC TGGGGACATG CCACCAGCGA CGATATGATC CACTGGCAGC ATGAGCCTAT TGCGCTAGCG CCAGGAGACG AGAATGACAA AGATGGGTGT TTTTCAGGTA GTGCTGTCGA TGACAATGGT GTCCTCTCAC TTATCTACAC CGGACACGTC TGGCTCGATG GTGCAGGTAA TGACGATGCA ATTCGCGAAG TACAATGTCT GGCTACCAGT CGGGATGGTA TTCATTTCGA GAAACAGGGT GTGATCCTCA CTCCACCAGA AGGAATCATG CACTTCCGCG ATCCTAAAGT GTGGCGTGAA GCCGACACAT GGTGGATGGT AGTCGGGGCG AAAGACCCAG GCAACACTGG GCAGATCCTG CTTTATCGCG GCAGTTCATT ACGTGAATGG ACTTTCGATC GCGTACTGGC CCACGCTGAT GCGGGTGAAA GCTATATGTG GGAATGTCCG GACTTTTTCA GCCTTGGCGA TCAGCATTAT CTGATGTTTT CCCCGCAGGG AATGAATGCC GAGGGATACA GTTACCGAAA TCGCTTTCAA AGTGGCGTAA TACCCGGAAT GTGGTCGCCA GGACGACTTT TTGCACAATC CGGGCATTTT ACTGAACTTG ATAACGGGCA TGACTTTTAT GCACCACAAA GCTTTGTAGC GAAGGATGGT CGGCGTATTG TTATCGGCTG GATGGATATG TGGGAATCGC CAATGCCCTC AAAACGTGAA GGCTGGGCAG GCTGCATGAC GCTGGCGCGC GAGCTATCAG AGAGCAATGG CAAACTTCTA CAACGCCCGG TACACGAAGC TGAGTCGTTA CGCCAGCAGC ATCAATCTAT CTCTCCCCGC ACAATCAGCA ATAAATATGT TTTGCAGGAA AACGCGCAAG CAGTTGAGAT TCAGTTGCAG TGGGCGCTGA AGAACAGTGA TGCCGAACAT TACGGATTAC AGCTCGGCAC TGGAATGCGG CTGTATATTG ATAACCAATC TGAGCGACTT GTTTTGTGGC GGTATTACCC ACACGAGAAT TTAGACGGCT ACCGTAGTAT TCCCCTCCCG CAGCGTGACA CGCTCGCCCT AAGGATATTT ATCGATACAT CATCCGTGGA AGTATTTATT AACGACGGGG AAGCGGTGAT GAGTAGTCGA ATCTATCCGC AGCCAGAAGA ACGGGAACTG TCGCTTTATG CCTCCCACGG AGTGGCTGTG GTGCAACATG GAGCACTCTG GCAACTGGGT TAA
|
Protein sequence | MIKMTQSRLH AAQNALAKLH ERRGNTFYPH FHLAPPAGWM NDPNGLIWFN DRYHAFYQHH PMSEHWGPMH WGHATSDDMI HWQHEPIALA PGDENDKDGC FSGSAVDDNG VLSLIYTGHV WLDGAGNDDA IREVQCLATS RDGIHFEKQG VILTPPEGIM HFRDPKVWRE ADTWWMVVGA KDPGNTGQIL LYRGSSLREW TFDRVLAHAD AGESYMWECP DFFSLGDQHY LMFSPQGMNA EGYSYRNRFQ SGVIPGMWSP GRLFAQSGHF TELDNGHDFY APQSFVAKDG RRIVIGWMDM WESPMPSKRE GWAGCMTLAR ELSESNGKLL QRPVHEAESL RQQHQSISPR TISNKYVLQE NAQAVEIQLQ WALKNSDAEH YGLQLGTGMR LYIDNQSERL VLWRYYPHEN LDGYRSIPLP QRDTLALRIF IDTSSVEVFI NDGEAVMSSR IYPQPEEREL SLYASHGVAV VQHGALWQLG
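The two sequences above can be related to each other, and to the GC content row, with a short script. This is a minimal sketch and not the pipeline that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it assumes the sequence is pasted as shown here, in whitespace-separated blocks. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence in this record.
print(translate("ATGATAAAAA TGACGCAATC"))  # prints "MIKMTQ"
```

Passing the full Gene sequence field to `translate` should give the Protein sequence field with the block spacing removed, and `gc_fraction` on the same input can be compared with the GC content row.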