Gene Information |
Locus tag | ECH74115_3024 |
Symbol | |
ID | 6970611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2807872 |
End bp | 2809035 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643386858 |
Product | phage late control gene D protein |
Protein accession | YP_002271326 |
Protein GI | 209398874 |
COG category | [R] General function prediction only |
COG ID | [COG3500] Phage protein D |
TIGRFAM ID | |
|
|
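The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the CDS length includes the terminal stop codon; both assumptions match the values listed in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(2807872, 2809035)
print(length_bp)                  # 1164
print(protein_length(length_bp))  # 387
```

Both results agree with the Gene Length (1164 bp) and Protein Length (387 aa) fields in this record.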
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.051519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATG CACTGACATT TGATGCAGGC AGTACGCTGA CGCCGGATTA CATGCTGATG CTCGACAGCA GGGATATTAC CGGCAATATC AGCGACCGTC TGATGAGCAT GACTCTGACG GATAACCGGG GCTTTGAGGC TGACCAGCTT GATATTGAAC TGAACGATGC CGACGGGCAG GTCGGGCTGC CGATTCGTGG CGCTGTCCTG ACGGTGTATA TCGGCTGGAA AGGTTTTGCC CTGGTATGCA AAGGGAAATT TACCGTTGAT GAGGTTGAAC ACCGGGGCGC GCCGGATGTG GTCACCATCC GCGCCCGGAG TGCAGATTTT CGCGGGACGC TCAATTCCCG CCGTGAAGAC TCCTGGCATG ACACCACGCT CGGTGCGATT GTTGAGGCAA TAGCCTCCCG TAACAGGCTG GAAGCCAGTG TCGCGCCGTC ACTGGCCGGA ATTAAAATCC CGCACATCGA CCAGTCGCAG GAGTCTGATG CAAAATTCCT GACCCGCCTT GCTGAACGCA ACGGCGGTGA GGTCTCGGTA AAAATGGGAA AACTGTTGTT TCTCAAAGCG GGGCAGGGGG TGACGGCCAG CGGTAAAAAA ATCCCGCAGG TCACCATAAC CCGCAGCGAC GGCGACCGCC ATCATTTTGC GATTGCTGAC CGTGGAGCCT ATACCGGTGT AACGGCAAAG TGGTTACACA CCAAAGACCC GAAACCGCAA AAGCAGAAGG TAAAACTGAA ACGCAAAAAG AAAGAGAAAC ACCTGCGCGC ACTGGAGCAC CCGAAAGCGA AACCGGTCAG GCAGAAGAAA GCGCCAAAAG TACCGGAAGC GCGCGAAGGT GAATACATGG CCGGTGAGGC TGACAACGTT TTTGCCCTGA CCACGGTATA TGCCACGAAA GCACAGGCCA TGCGTGCTGC TCAGGCGAAG TGGGACAAAC TGCAACGGGG CGTTGCGGAG TTCTCCATCA TCCTGGCTAC CGGTCGTGCA GATATTTACA CGGAAACACC GGTCAGAGTG TCAGGCTTTA AGCGCGTCAT AGACGAGCAG GACTGGACCA TCACTAAGGT GACACATTTT CTGAATAATA GCGGCTTCAC GACGTCCTTA GAGCTTGAGG TCAGGCTTTC TGATGTGGAA TACGAAACAG AAGATGATGA ATGA
|
Protein sequence | MLDALTFDAG STLTPDYMLM LDSRDITGNI SDRLMSMTLT DNRGFEADQL DIELNDADGQ VGLPIRGAVL TVYIGWKGFA LVCKGKFTVD EVEHRGAPDV VTIRARSADF RGTLNSRRED SWHDTTLGAI VEAIASRNRL EASVAPSLAG IKIPHIDQSQ ESDAKFLTRL AERNGGEVSV KMGKLLFLKA GQGVTASGKK IPQVTITRSD GDRHHFAIAD RGAYTGVTAK WLHTKDPKPQ KQKVKLKRKK KEKHLRALEH PKAKPVRQKK APKVPEAREG EYMAGEADNV FALTTVYATK AQAMRAAQAK WDKLQRGVAE FSIILATGRA DIYTETPVRV SGFKRVIDEQ DWTITKVTHF LNNSGFTTSL ELEVRLSDVE YETEDDE
|
| |
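The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch, not part of the original record, of how the blocks might be joined and how GC content and the terminal stop codon could be checked. The helper names are hypothetical; the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). The example string is only the first two blocks of the gene sequence, so its GC value is not the whole-gene figure.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code).
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases are a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in TABLE_11_STOPS


# First two blocks of the gene sequence listed in this record.
first_blocks = clean_sequence("ATGCTGGATG CACTGACATT")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 45.0

# Last two blocks of the gene sequence listed in this record.
last_blocks = clean_sequence("AAGATGATGA ATGA")
print(ends_with_stop(last_blocks))  # True
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_percent` would give a value to compare against the GC content field; the listed sequence starts with ATG and ends with TGA, which suggests it is given in the coding orientation even though the gene lies on the minus strand.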