Gene Information |
Locus tag | ECH74115_2808 |
Symbol | |
ID | 6967688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2611967 |
End bp | 2613004 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386659 |
Product | hypothetical protein |
Protein accession | YP_002271135 |
Protein GI | 209397526 |
COG category | [R] General function prediction only |
COG ID | [COG5529] Pyocin large subunit |
TIGRFAM ID | |
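The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2611967, 2613004)
print(length)                  # 1038
print(protein_length(length))  # 345
```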
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAATG CCTGGCTCAG ATTGTGGCAT GACATGCCAA ATGATCCCAA ATGGCGAACC ATTGCCAGGG TCTCAGGACA GCCAATCGCA ACAGTGATGG CGGTGTATAT CCACCTTCTG GTGAGTGCGT CACGAAATGT CACGACATGT CACGGCGTGT CACTACGTGG TCACATTGAT GTCACGACGG AAGATTTAGC AAGTGCGCTT GATGTGACGG AAGACGTGAT TGATTCAATT TTGCATGCAA TGCAGGGGCG AGTTCTGGAT GGTGATCTTA TTTCCGGATG GGAAAAACGC CAGGTGATGA AGGAGGATAA CGGTAATGTT TCGCAAACCG CAAAATCCCC GGCAGAGCGC AAGAGAGCGC AGCGGGAGCG GGAAAGACTG CGGAAGCAGA ACACTGATTG TCACGATGAG TCACGACGCG TCACGCATAT GTCACGACAA ATCACGACAG ATACAGATAC AGATAAAGAA TTAAACCCCA CACATAACGC GCGCATGCGC GAGAGTGCTA CGACCGGTGA GTTGAATGAC GCGTCGTTGC AGACAGCCGA ACCTGAATAC CTGGACGGCC TGAGCGAACC GATCGGGAAA TTTCCGATGA CTGGAGTCTG GCAACCGTCG CCGGATTTTC GACAGCGGGC AGCAGTGTGG GGTATGGCTC TGCCTGAGCC TGAGTTTACA CCTGCTGAGC TTGCCGCATT CCGGGATTAC TGGATGGCGG AGGGGAAGGT TTTCACGCAG GTTCAGTGGG AGCAGAAATT TGCCCGTCAC GTGCAGCACG TCAGGGCACA GGTAAAACCA GTCAGCAAGG GGGTAAGCCA TGCAGCACCA GGTGGCACCG CATCACGGGC AGTTCAGGAA ATTCGGGCAG CACGTGAACA GTGGGAACGT GAAAACGGAT TTATCAGCAA CGGAAACGGC CTGGAAGCTG TGGGAACTTA TGGGGGAGGT GTATTCGAAC CGCTGGACCC AGAAGAACGG GGCTGCGCCG TCGAAGCTCT GGATTGCTCA GATTGGCGCG ATGACTGA
|
Protein sequence | MANAWLRLWH DMPNDPKWRT IARVSGQPIA TVMAVYIHLL VSASRNVTTC HGVSLRGHID VTTEDLASAL DVTEDVIDSI LHAMQGRVLD GDLISGWEKR QVMKEDNGNV SQTAKSPAER KRAQRERERL RKQNTDCHDE SRRVTHMSRQ ITTDTDTDKE LNPTHNARMR ESATTGELND ASLQTAEPEY LDGLSEPIGK FPMTGVWQPS PDFRQRAAVW GMALPEPEFT PAELAAFRDY WMAEGKVFTQ VQWEQKFARH VQHVRAQVKP VSKGVSHAAP GGTASRAVQE IRAAREQWER ENGFISNGNG LEAVGTYGGG VFEPLDPEER GCAVEALDCS DWRDD
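The relationship between the gene and protein sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is written 5' to 3' on the coding strand (it begins with ATG and ends with TGA), and it uses the sense-codon assignments of translation table 11, which match those of the standard genetic code. The helper names are hypothetical, and only the first 30 bases of the gene are used to keep the example brief.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the record's sequences into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_30 = "ATGGCTAATG CCTGGCTCAG ATTGTGGCAT"
print(translate(first_30))  # MANAWLRLWH
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.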