Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3296 |
Symbol | |
ID | 6971839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3028041 |
End bp | 3029129 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643387108 |
Product | hypothetical protein |
Protein accession | YP_002271572 |
Protein GI | 209400797 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.170772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.250376 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAACC GGGAAAAGGA GATCCTTGCA ATTTTACGGC GTAACCCGCT GATTCAGCAG AACGAAATTG CGGACATGCT GCAAATCAGC CGTTCGCGCG TTGCGGCGCA TATTATGGAT TTAATGCGCA AAGGCCGGAT TAAAGGCAAA GGTTACATTC TCACCGAGCA GGAATACTGC GTAGTGGTGG GGACAATCAA TATGGATATT CGCGGGATGG CGGATATCCG TTACCCGCAA TCGGCTTCTC ATCCCGGTAC AATTCATTGC TCAGCGGGCG GCGTGGGACG CAACATCGCC CACAATCTGG CGCTGTTAGG CCGTGACGTC CATTTGCTTT CAGTGATTGG CGATGACTTT TATGGCGAAA TGCTCCTGGA AGAAACGCGC CGCGCCGGCG TGAATGTCTC CGGCTGCGTT CGTTTACATG GTCAAAGCAC ATCGACGTAT CTGGCAATTG CCAATCGAGA CGATGAAACC GTGCTGGCGA TTAACGATAC CCATCTGCTG GAACAGTTAT CGCCGCAATT ATTGAACGGG TCGCGCGATT TACTTCGTCA TGCGGGCGTG GTACTGGCTG ATTGCAACCT GACAGCCGAG GCGCTGGAAT GGGTCTTTAC CCTCGCTGGT GAGATCCCGG TGTTTGTCGA TACCGTTTCA GAATTCAAAG CGGGCAAAAT CAAACACTGG CTGGCGCATA TTCACACCCT GAAACCCACT TTACCGGAGC TGGAAATTTT ATGGGGACAG GCGATCACCA GCGATGCTGA CCGTAATGCT GCAGTGAATG CGTTGCATCA GCAAGGTGTT CAGCAACTGT TTGTTTATTT GCCCGATGAG TCAGTTTATT GCAGCGAAAA GGATGGAGAA CAATTTTTGC TGACCGCACC AGCGCATACG ACAGTAGACA GTTTTGGTGC TGACGATGGT TTTATGGCGG GCCTGGTATA TAGCTTTCTG GAAGGAAACA ATTTCCGTGA CAGCGCCCGT TTTGCGATGG CCTGCGCGGC AATTTCACGC GCCAGCGGCA GCTTAAACAA CCCTACCCTG TCTGCCGATA ACGCGCTTTC ATTAGTGCCA ATGGTGTAA
|
Protein sequence | MNNREKEILA ILRRNPLIQQ NEIADMLQIS RSRVAAHIMD LMRKGRIKGK GYILTEQEYC VVVGTINMDI RGMADIRYPQ SASHPGTIHC SAGGVGRNIA HNLALLGRDV HLLSVIGDDF YGEMLLEETR RAGVNVSGCV RLHGQSTSTY LAIANRDDET VLAINDTHLL EQLSPQLLNG SRDLLRHAGV VLADCNLTAE ALEWVFTLAG EIPVFVDTVS EFKAGKIKHW LAHIHTLKPT LPELEILWGQ AITSDADRNA AVNALHQQGV QQLFVYLPDE SVYCSEKDGE QFLLTAPAHT TVDSFGADDG FMAGLVYSFL EGNNFRDSAR FAMACAAISR ASGSLNNPTL SADNALSLVP MV
|
| |