Gene Information |
Locus tag | ECH74115_0141 |
Symbol | |
ID | 6968457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 151321 |
End bp | 152217 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643384218 |
Product | hypothetical protein |
Protein accession | YP_002268741 |
Protein GI | 209398235 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
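The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in Protein Length. Neither convention is stated in the record, but both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(151321, 152217)
print(length, protein_length_aa(length))  # prints: 897 298
```

Both values agree with the Gene Length (897 bp) and Protein Length (298 aa) fields listed in the record.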
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCAC CGAGTACCAC ACCGCATGAC GCGGTATTTA AACAATTTTT AATGCATGCG GAGACGGCTC GCGACTTTCT GGAGATACAT TTGCCAGTGG AATTACGCGA ACTTTGTGAC CTCAACACGC TTCATTTAGA GTCGGGGAGT TTCATTGAAG AGAGCCTTAA AGGACACAGC ACGGACGTGC TCTATTCCGT GCAAATGCAG GGCAATCCCG GTTATCTGCA TGTTGTGATT GAACACCAAA GCAAGCCGGA TAAGAAAATG GCCTTTCGCA TGATGCGTTA TTCTATAGCC GCCATGCATC GGCATCTGGA GGCGGATCAC GACAAACTAC CGCTGGTAGT GCCGATTCTC TTTTATCAGG GCGAGGCCAC ACCTTATCCG CTCTCAATGT GCTGGTTTGA TATGTTTTAC TCGCCGGAGC TGGCGCGACG CGTCTATAAC AGTCCTTTCC CGCTGGTGGA TATCACCATC ACACCGGATG ACGAAATCAT GCAACATCGG CGGATTGCGA TTCTCGAACT ACTGCAAAAA CATATTCGCC AGCGCGACTT AATGTTATTG CTGGAGCAAC TGGTCACGCT GATTGACGAA GGGTACACTA GCGAAAGTCA GTTAGTTGCC ATGCAAAACT ATATGCTGCA ACGCGGTCAT ACTGAACAAG CGGATTTGTT TTATGGTGTG CTGAGAGACA GGGAAACGGG AGGGAAGTCT ATGATGACGC TGGCACAGTG GTTTGAAGAG AAAGGGATTG AGAAGGGGAT TCAGCAGGGA AGACAGGAGG AGAGGCAAGA ATTCGCCCAG CGTTTTCTGA GTAAAGGGAT GTCTCGGGAA GACGTTGCAG AGATGACAAA TTTATCTCTT GCTGAGATTG ATAGGCTGAT TAACTAA
|
Protein sequence | MDAPSTTPHD AVFKQFLMHA ETARDFLEIH LPVELRELCD LNTLHLESGS FIEESLKGHS TDVLYSVQMQ GNPGYLHVVI EHQSKPDKKM AFRMMRYSIA AMHRHLEADH DKLPLVVPIL FYQGEATPYP LSMCWFDMFY SPELARRVYN SPFPLVDITI TPDDEIMQHR RIAILELLQK HIRQRDLMLL LEQLVTLIDE GYTSESQLVA MQNYMLQRGH TEQADLFYGV LRDRETGGKS MMTLAQWFEE KGIEKGIQQG RQEERQEFAQ RFLSKGMSRE DVAEMTNLSL AEIDRLIN
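As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content field, the following sketch translates a nucleotide string codon by codon and computes the percentage of G and C bases. It assumes the sequence is given on the coding strand in frame 1, and it ignores the spaces that separate the 10-base blocks. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate frame 1 of a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGGATGCAC CGAGTACCAC"))  # prints: MDAPST
```

The output matches the first six residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields.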