Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1643 |
Symbol | |
ID | 6970828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1587551 |
End bp | 1589101 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385603 |
Product | prophage tail fibre domain protein |
Protein accession | YP_002270097 |
Protein GI | 209400441 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCTTG AGGATGCGAG CACGACGAAA AAGGGGATAG TACAGCTCAG CAGTGCGACT AACAGCACTT CCGAGTCACT GGCGGCAACG CCAAAAGCCG TTAAGGCCGC GTATGAGCTG GCTAACGGGA AATACACCGC ACAGGATGCA ACGACAGCAC AGAAAGGGAT AGTTCAGCTT AGCAACGCGA CCAACAGCAC ATCTGAAATG CTGGCGGCAA CGCCAAAGTC GGTAAAGGCA GCCTATGACC TTGCTAACGG GAAATATACT GCTCAGGACG CTACGACAGC ACAAAAAGGA ATTGTCCAGC TCAGTAGTGC AACCAACAGC GCATCTGAAA CGCTTGCCGC GACACCGAAA GCAGTGAAAG CAGCTAATGA TAATGCGAAT GGTCGGGTAC CTTCTGCCCG TAAGGTGAAT GGTAAGGCGC TTTCATCGGA TATAACACTG ACGCCGAAAG ATATTGGTAC GCTTAACTCA ACAACAATGT CATTCAGCGG TGGTGCTGGT TGGTTCAAAT TAGCAACGGT AACCATGCCA CAGGCGAGTT CTGTTGTTTC AATTACGTTG ATTGGTGGCG CGGGATTTAA CGTGGGGTCA CCTCAACAGG CAGGTATATC TGAACTTGTT TTGCGTGCAG GTAATGGTAA TCCGAAGGGG ATTACTGGTG CTTTATGGCA GCGCACATCG ACAGGGTTTA CAAATTTTGC CTGGGTCAAT ACATCTGGTG ATACTTACGA TATTTACGTT GCAATCGGAA ATTATGCGAC TGGTGTAAAT ATTCAATGGG ATTATACCAG TAATGCCAGC GTGACGATTC ATACGTCACC AGCATATTCT GCTAATAAGC CGGAAGGGTT AACGGACGGT ACAGTTTATT CACTCTATAC GCCATCAGAG CAGTTTTATC CGCCTGGCGC ACCAATCCCG TGGCCATCAG ATACCGTTCC GTCTGGCTAT GCCCTGATGC AGGGGCAGAC TTTTGACAAA TCTGCATACC CGAAACTTGC AGCCGCTTAT CCGTCAGGCG TGATCCCTGA TATGCGTGGC TGGACGATTA AGGGCAAACC TGCCAGTGGT CGGGCCGTAT TGTCTCAGGA ACAGGACGGC ATTAAATCGC ACACCCACAG CGCCAGCGCA TCCAGTACGG ATTTGGGGAC GAAAACCACA TCGTCGTTTG ATTACGGCAC TAAATCCACG AATAACACCG GGGCGCACAC GCACAGTGTG AGCGGTACAG CCGCAAGTGC CGGAAACCAT ACTCATAGTG TCACAGGCGC ATCAGCAGTC AGCCAGTGGT CACAAAATGG GTCAGTACAT AAGGTAGTGT CTGCGGCCAG TGTGAATACA AGTGCTGCAG GAGCGCACAC TCATAGTGTC AGCGGCACAG CTGCATCTGC AGGTGCTCAC GCACATACTG TCGGTATTGG TGCTCATACG CACTCTGTTG CGATTGGCTC ACATGGACAC ACCATCACCG TTAACGCTGC GGGTAACGCG GAAAACACTG TCAAAAACAT CGCATTTAAC TACATTGTGA GGCTTGCATA A
|
Protein sequence | MALEDASTTK KGIVQLSSAT NSTSESLAAT PKAVKAAYEL ANGKYTAQDA TTAQKGIVQL SNATNSTSEM LAATPKSVKA AYDLANGKYT AQDATTAQKG IVQLSSATNS ASETLAATPK AVKAANDNAN GRVPSARKVN GKALSSDITL TPKDIGTLNS TTMSFSGGAG WFKLATVTMP QASSVVSITL IGGAGFNVGS PQQAGISELV LRAGNGNPKG ITGALWQRTS TGFTNFAWVN TSGDTYDIYV AIGNYATGVN IQWDYTSNAS VTIHTSPAYS ANKPEGLTDG TVYSLYTPSE QFYPPGAPIP WPSDTVPSGY ALMQGQTFDK SAYPKLAAAY PSGVIPDMRG WTIKGKPASG RAVLSQEQDG IKSHTHSASA SSTDLGTKTT SSFDYGTKST NNTGAHTHSV SGTAASAGNH THSVTGASAV SQWSQNGSVH KVVSAASVNT SAAGAHTHSV SGTAASAGAH AHTVGIGAHT HSVAIGSHGH TITVNAAGNA ENTVKNIAFN YIVRLA
|
| |