Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1077 |
Symbol | |
ID | 6971131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1102387 |
End bp | 1103619 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385089 |
Product | hypothetical protein |
Protein accession | YP_002269588 |
Protein GI | 209398698 |
COG category | [S] Function unknown |
COG ID | [COG3214] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00717119 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTGC CGCACCTCTC CCTTGCTGAT GCGCGTAATC TTCACCTTGC CGCGCAAGGC CTGTTAAACA AACCCCGCCG TCGAGCGTCG TTGGAGGATA TTCCGGCAAC GATCTCCCGC ATGTCCTTGC TGCAAATCGA TACCATCAAT ATTGTTGCCC GTAGCCCATA TCTGGTGCTT TTCAGTCGTC TGGGAAATTA TCCTGCCCAG TGGCTGGATG AGTCTCTGGC GCGTGGCGAA TTAATGGAAT ACTGGGCGCA TGAAGCCTGC TTTATGCCAC GCAGTGACTT TCGTCTTATT CGCCACCGCA TGCTGGCACC TGAAAAAATG GGCTGGAAAT ACAAAGACGC CTGGATGCAG GAACATGAGG CGGAAATTGC ACAGTTAATT CAGCATATTC ATGATAAGGG GCCGGTACGT TCAGCCGATT TTGAGTATCC TCGTAAAGGT GCAAGCGGCT GGTGGGAATG GAAGCCGCAT AAACGGCATC TGGAAGGTTT ATTTACTGCC GGAAAGGTGA TGGTGATTGA ACGGCGCAAC TTCCAGCGCG TTTATGATTT AACCCACCGT GTCATGCCTG ACTGGGATGA TGAGCGCGAT CTCGTTTCGC AAACAGAAGC AGAAATCATC ATGCTGGATA ACAGTGCGCG TAGCCAGGGA ATATTCCGCG AACAGTGGCT GGCAGATTAC TATCGGCTGA AACGTCCGGC ACTGGCAGCG TGGCGCGAAG CGAGGGCTGA ACAGCGGCAA ATCATTGCTG TGCATGTTGA AAAATTGGGC AATCTTTGGC TGCATGCTGA TTTGCTGCCG CTACTCGAGC GAGCGCTGGC CGGAAAGCTC ACTGCAACGC ACAGCGCGGT ACTTTCGCCT TTTGATCCTG TTGTCTGGGA TCGCAAACGC GCAGAGCAGC TTTTTGATTT TAGCTACCGG CTGGAGTGCT ATACCCCAGC GCCGAAACGC CAGTATGGCT ATTTTGTTCT GCCGTTATTA CATCGTGGGC AATTAGTTGG GCGAATGGAT GCCAAAATGC ATCGCCAGAC AGGCATCCTT GAAGTTATCT CTCTGTGGTT ACAGGAAGGT ATTAAACCAA CGACAACGCT GCAAAAAGGG TTACGTCAGG CGATTACTGA TTTCGCTAAC TGGCAGCAGG CAACGCGGGT GACATTAGGA CACTGCCCGC AAGGTCTCTT TACGGATTGC CGCACCGGCT GGGAAATAGA CCCCGTCGCA TAA
|
Protein sequence | MSLPHLSLAD ARNLHLAAQG LLNKPRRRAS LEDIPATISR MSLLQIDTIN IVARSPYLVL FSRLGNYPAQ WLDESLARGE LMEYWAHEAC FMPRSDFRLI RHRMLAPEKM GWKYKDAWMQ EHEAEIAQLI QHIHDKGPVR SADFEYPRKG ASGWWEWKPH KRHLEGLFTA GKVMVIERRN FQRVYDLTHR VMPDWDDERD LVSQTEAEII MLDNSARSQG IFREQWLADY YRLKRPALAA WREARAEQRQ IIAVHVEKLG NLWLHADLLP LLERALAGKL TATHSAVLSP FDPVVWDRKR AEQLFDFSYR LECYTPAPKR QYGYFVLPLL HRGQLVGRMD AKMHRQTGIL EVISLWLQEG IKPTTTLQKG LRQAITDFAN WQQATRVTLG HCPQGLFTDC RTGWEIDPVA
|
| |