Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1066 |
Symbol | |
ID | 6967958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1087871 |
End bp | 1089631 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385078 |
Product | hypothetical protein |
Protein accession | YP_002269577 |
Protein GI | 209395737 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0179082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAA CATTTATCCC CGGCAAAGAT GCCGCACTGG AAGATTCCAT CGCTCGCTTC CAGCAAAAAC TTTCAGACCT CGGCTTTCAG ATTGAAGAGG CCTCCTGGCT GAATCCCGTG CCTAACGTCT GGTCTGTACA TATTCGCGAC AAAGAGTGCG CACTGTGTTT TACCAACGGT AAAGGCGCAA CCAAGAAAGC GGCGCTGGCT TCTGCACTCG GTGAATATTT CGAGCGTCTC TCAACCAACT ACTTTTTTGC TGATTTCTGG CTGGGCGAAA CCATCGCCAA CGGTCCATTT GTGCATTATC CCAACGAAAA ATGGTTCCCA CTGACCGAAA ATGACGATGT GCCAGAAGGG CTGCTCGATG AACGTCTGCG CGCATTTTAC GATCCGGAGA ATGAATTGAC CGGCAGTATG CTGATTGACC TACAATCCGG TAACGAAGAT CGTGGTATTT GCGGTCTGCC GTTTACACGT CAGTCTGATA ATCAGACCGT TTATATTCCG ATGAATATCA TTGGTAACCT GTACGTTTCT AACGGTATGT CTGCTGGCAA TACCCGTAAC GAAGCACGCG TTCAGGGGTT GTCCGAAGTT TTCGAACGCT ACGTGAAAAA CCGCATTATT GCTGAAAGCA TCAGCTTGCC GGAAATCCCG GCAGACGTGC TGGCGCGTTA CCCAGCGGTG GTTGAAGCGA TCGAAACACT GGAAGCAGAA GGTTTCCCGA TCTTCGCATA TGATGGTTCC CTTGGCGGCC AGTATCCGGT GATTTGCGTG GTACTGTTTA ATCCTGCTAA CGGCACCTGC TTTGCCTCTT TCGGTGCGCA TCCTGATTTT GGCGTAGCAC TGGAACGTAC CGTGACCGAG CTGCTGCAAG GTCGTGGCCT GAAAGATTTG GATGTGTTTA CTCCGCCAAC CTTCGATGAT GAAGAAGTCG CTGAACATAC CAACCTCGAA ACGCACTTTA TCGATTCCAG CGGTTTAATC TCCTGGGACC TGTTCAAGCA GGATGCCGAT TATCCGTTTG TGGACTGGAA TTTCTCCGGC ACCACGGAAG AAGAGTTTGC TACGCTGATG GCTATCTTCA ACAAAGAAGA TAAAGAAGTT TATATTGCCG ATTACGAGCA TCTGGGCGTT TATGCTTGCC GTATTATCGT GCCTGGCATG TCCGATATTT ATCCGGCTGA AGATCTGTGG CTCGCGAATA ACAGTATGGG CAGCCATTTA CGTGAAACGA TTCTTTCGCT ACCAGGCAGC GAGTGGGAAA AAGAAGATTA CCTGAACCTC ATCGAGCAAC TGGATGAAGA AGGTTTTGAT GACTTTACCC GCGTGCGTGA GCTGTTGGGT CTGGCGACCG GGTCGGATAA CGGTTGGTAC ACCCTGCGTA TTGGTGAATT AAAAGCCATG CTGGCGCTGG CTGGTGGCGA TCTGGAACAG GCTCTGGTCT GGACCGAATG GACGATGGAG TTTAACTCAT CGGTATTCAG CCCGGAACGC GCCAACTATT ATCGCTGCCT GCAAACGTTG TTATTACTGG CGCAGGAAGA AGATCGCCAG CCGCTGCAAT ATCTGAATGC GTTTGTTCGC ATGTACGGCG CAGATGCCGT GGAAGCCGCC AGTGCGGCAA TGAGCGGCGA AGCGGCGTTT TATGGCTTGC AACCAGTAGA TAGCGATCTG CACGCGTTTG CTGCACATCA GTCGCTGTTG AAGGCCTACG AAAAGCTGCA GCGCGCCAAA GCAGCATTCT GGGCAAAATA A
|
Protein sequence | MTQTFIPGKD AALEDSIARF QQKLSDLGFQ IEEASWLNPV PNVWSVHIRD KECALCFTNG KGATKKAALA SALGEYFERL STNYFFADFW LGETIANGPF VHYPNEKWFP LTENDDVPEG LLDERLRAFY DPENELTGSM LIDLQSGNED RGICGLPFTR QSDNQTVYIP MNIIGNLYVS NGMSAGNTRN EARVQGLSEV FERYVKNRII AESISLPEIP ADVLARYPAV VEAIETLEAE GFPIFAYDGS LGGQYPVICV VLFNPANGTC FASFGAHPDF GVALERTVTE LLQGRGLKDL DVFTPPTFDD EEVAEHTNLE THFIDSSGLI SWDLFKQDAD YPFVDWNFSG TTEEEFATLM AIFNKEDKEV YIADYEHLGV YACRIIVPGM SDIYPAEDLW LANNSMGSHL RETILSLPGS EWEKEDYLNL IEQLDEEGFD DFTRVRELLG LATGSDNGWY TLRIGELKAM LALAGGDLEQ ALVWTEWTME FNSSVFSPER ANYYRCLQTL LLLAQEEDRQ PLQYLNAFVR MYGADAVEAA SAAMSGEAAF YGLQPVDSDL HAFAAHQSLL KAYEKLQRAK AAFWAK
|
| |