Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3302 |
Symbol | |
ID | 6970355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3034645 |
End bp | 3035586 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643387114 |
Product | hypothetical protein |
Protein accession | YP_002271578 |
Protein GI | 209398647 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0135155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0377447 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAAA AGGATTATGT CGTAATTATA GGCTCGGCGA ATATTGATGT CGCCGGATAT TCACATGAAT CATTAAATTA TGCGGATTCA AATCCAGGTA AAATAAAATT TACGCCTGGT GGGGTAGGGC GCAATATTGC ACAAAACCTG GCGTTGCTGG GTAATAAAGC CTGGCTACTG AGCGCCGTAG GCAGTGATTT TTATGGCCAA TCGCTGCTAA CGCAAACCAA TCAATCTGGC GTTTATGTCG ATAAATGCCT GATTGTGCCG GGAGAAAATA CGTCGAGCTA TTTATCATTA CTCGATAATA CCGGTGAAAT GCTGGTTGCT ATAAATGACA TGAATATTAG CAACGCTATT ACAGCTGAAT ATCTCGCACA GCACCGTGAA TTTATTCAGA GGGCAAAGGT CATTGTCGCG GACTGTAATA TCAGTGAAGA GGCACTGGCA TGGATTCTGG ATAATGCTGC CAACGTACCC GTATTTGTCG ATCCGGTTTC CGCGTGGAAA TGCGTCAAAG TACGCGAGCG ATTAAGTCAA ATCCATACTC TCAAGCCAAA CCGCCTTGAA GCGGAAACCC TGAGTGGGAT TGCGCTGTCA GGGCGTGAAG ATGTGGCAAA AGTTGCTGCC TGGTTCCATC AACATGGCCT GAACCGACTG GTATTGAGCA TGGGCGGCGA CGGCGTTTAT TACAGCGATA TCAGCGGTGA AAGTGGCTGG TCTGCGCCGA TCAAAACCAA TGTTATTAAT GTTACCGGAG CGGGCGATGC CATGATGGCG GGACTTGCTT CGTGTTGGGT AGACGGAATG CCGTTTGCCG AATCTGTTCG TTTCGCACAG GGATGTTCGT CAATGGCGCT CTCCTGTGAA TACACCAATA ACCCCGATTT ATCGATTGCC AACGTTATAT CGTTAGTGGA GAACGCAGAA TGTCTGAATT AA
|
Protein sequence | MREKDYVVII GSANIDVAGY SHESLNYADS NPGKIKFTPG GVGRNIAQNL ALLGNKAWLL SAVGSDFYGQ SLLTQTNQSG VYVDKCLIVP GENTSSYLSL LDNTGEMLVA INDMNISNAI TAEYLAQHRE FIQRAKVIVA DCNISEEALA WILDNAANVP VFVDPVSAWK CVKVRERLSQ IHTLKPNRLE AETLSGIALS GREDVAKVAA WFHQHGLNRL VLSMGGDGVY YSDISGESGW SAPIKTNVIN VTGAGDAMMA GLASCWVDGM PFAESVRFAQ GCSSMALSCE YTNNPDLSIA NVISLVENAE CLN
|
| |