Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2028 |
Symbol | |
ID | 6966838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1927387 |
End bp | 1928730 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385944 |
Product | hypothetical protein |
Protein accession | YP_002270433 |
Protein GI | 209396192 |
COG category | [S] Function unknown |
COG ID | [COG5383] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.279125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0000000000691698 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAACA GCATCACGGC GGATGAGATT CGGGAACAGT TTTCGCAGGC AATGTCAGCC ATGTACCAGC AAGAAGTTCC GCAATATGGC ACGCTGCTGG AACTGGTAGC TGATGTGAAT CTGGCTGTGC TGGAAAACAA TCCTCAACTG CACGAAAAAA TGGTAAATGC AGACGAGCTG GCGCGACTGA ATGTTGAACG TCATGGGGCG ATTCGCGTTG GGACTGCACA AGAGCTTGCT ACTCTTCGGC GGATGTTTGC CATTATGGGG ATGTACCCGG TGAGCTATTA CGATCTCTCG CAGGCAGGGG TGCCGGTACA TTCGACAGCA TTTCGGCCCA TTGATGATGC TTCTCTGGCG CGTAATCCCT TCCGCGTTTT TACCTCCTTA CTCCGCCTTG AGCTTATCGA GAACGAAATT TTGCGCCAGA AAGCGGCGGA GATTCTGCGT CAGCGCGATA TCTTCACCCC ACGTTGTCGA CAACTGTTAG AGGAGTATGA CCAGCGGGGC GGTTTTAACG AAACACAGGC ACAGGAGTTT GTGCAGGAGG CCCTGGAAAC GTTCCGTTGG CACCAGTCAG CAACGGTAGA TGAAGAAACC TATCGCGCCT TGCACAACGA ACATCGGTTG ATTGCTGATG TGGTCTGTTT TCCTGGATGC CATATCAACC ATCTGACGCC ACGTACGCTG GATATTGACC GGGTGCAGTC GATGATGCCT GAATGCGGAA TTGAACCAAA AATTCTGATC GAAGGGCCGC CGCGCCGCGA GGTACCGATT TTACTACGCC AGACCAGCTT TAAAGCACTG GAAGAGACGG TGTTGTTTGC GGGGCAGAAA CAGGGCACGC ATACTGCGCG CTTTGGTGAA ATTGAGCAGC GTGGCGTGGC ATTAACGCCG AAAGGTCGAC AACTGTATGA TGATCTTCTG CGTAACGCTG GAACCGGGCA GGATAATCTC ACTCACCAAA TGCATTTACA GGAAACCTTC CGCACTTTTC CTGACAGTGA GTTTTTAATG CGTCAGCAAG GGCTGGCATG GTTCCGGTAC CGTCTGACGC CTTCAGGTGA GGCGCATCGT CAGGCGATTC ATCCCGGAGA CGATCCACAG CCCTTAATTG AACGTGGTTG GGTCGCGGCG CAACCCATTA CCTATGAAGA TTTCTTGCCC GTTAGCGCGG CGGGGATCTT CCAGTCAAAT CTGGGTAATG AAACGCAGGC ACGCAGCCAC GGTAATGCCA GTCGCGAAGC ATTTGAGCAG GCGTTGGGTT GTCCGGTTTT GGATGAGTTC CAGCTTTATC AGGAAGCGGA AGAACGCAGT AAACGTCGCT GTGGTTTGCT TTAA
|
Protein sequence | MANSITADEI REQFSQAMSA MYQQEVPQYG TLLELVADVN LAVLENNPQL HEKMVNADEL ARLNVERHGA IRVGTAQELA TLRRMFAIMG MYPVSYYDLS QAGVPVHSTA FRPIDDASLA RNPFRVFTSL LRLELIENEI LRQKAAEILR QRDIFTPRCR QLLEEYDQRG GFNETQAQEF VQEALETFRW HQSATVDEET YRALHNEHRL IADVVCFPGC HINHLTPRTL DIDRVQSMMP ECGIEPKILI EGPPRREVPI LLRQTSFKAL EETVLFAGQK QGTHTARFGE IEQRGVALTP KGRQLYDDLL RNAGTGQDNL THQMHLQETF RTFPDSEFLM RQQGLAWFRY RLTPSGEAHR QAIHPGDDPQ PLIERGWVAA QPITYEDFLP VSAAGIFQSN LGNETQARSH GNASREAFEQ ALGCPVLDEF QLYQEAEERS KRRCGLL
|
| |