Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3035 |
Symbol | |
ID | 6969184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2816426 |
End bp | 2817625 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643386867 |
Product | phage Tail Collar domain protein |
Protein accession | YP_002271335 |
Protein GI | 209395954 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00110884 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGTGA AATACTACGC CATTCTGACT AATCAGGGCG CGGCACGGCT GGCTAACGCG ACGATGCTCG GCAGTAAGCT GAATCTGACG CAAATGGCCG TTGGTGATGC AAATGGTGTC TTGCCGACAC CTGACCCGGC ACAGACAAAA CTTATTAACC AGAAACGCAT CGCGCCGCTG AATCTTCTGA GTGTTGACCC GAACAACCAG AGCCAGATTA TTGCGGAGCA AATCATCCCT GAGAACGAGG GCGGATTCTG GATCCGTGAG ATTGGGCTTT ATGATGATGA AGGCGTACTC ATTGCGGTGG CGAACTGCCC GGAAACGTAC AAACCGCAGT TACAGGAAGG CAGCGGTCGT ACCCAGACTA TCCGCATGAT TCTGGTTGTC ACGAATACCG AAGCCATCAC GCTGAAAATC GACCCGTCGG TGGTACTGGC GACCCGTAAA TACGTGGATG ATAAAGTCCT GGAATTAAGG CTGTATGTGG ATGACCAGAT GAGAAACCAC ATTGCCGCAC AGGATCCTCA TACCCAGTAT GCGCAGAAAC ATAATCCGAC ATTTACCGGA GAACCAAAAG CGCCAACGCC TGCCGCAGGA AATAACACCA CGCGGATTGC GACCACTGAG TTTGTTCAGA CCGCTATTAC CGCTCTGATT AACGGCGCGC CAGACACGCT GGACACACTG AAAGAAATTG CCGCGGCCAT TAACAATGAC CCGAAATTCA GCACCACCAT TAACAATGCG CTGTCAGGTA AGCAGCCACT GGATGAGACG CTGACTCATT TGAGTGGAAA GGATGTTGCC GGTCTTCTCG CATACCTTGG TTTGGGAGAA GCGGCAAAAC GGGATGTGGG GACAGGGGAA AATCAGATAC CGGACATGGC CTCTTTTGCC AGTGGTGATG GATGGATGAA ATTACCCAAC GGTAAAATCC TGCAATATGG TCGTGGTGCG GTTACGCCGA CATTATCGAC GCAAACAATG AGAATTACAT TCAGCATCCC TTTCCCCAAA AAAGCGGACT GCGCCATGCT TACTCATTCT GGTGATGGCG GTGCGCCTTT AGGCGCTGGG CGAGGGTTCG TGATGACTGC AGAAGGCCCA ACGTTAACCG GCTTTAATTC TGCTTACAGA ACGTCATCAA CCAGCGACAC GGTATCGATG AATTACAGTT GGTGGGCTGT TGGTGAGTAA
|
Protein sequence | MTVKYYAILT NQGAARLANA TMLGSKLNLT QMAVGDANGV LPTPDPAQTK LINQKRIAPL NLLSVDPNNQ SQIIAEQIIP ENEGGFWIRE IGLYDDEGVL IAVANCPETY KPQLQEGSGR TQTIRMILVV TNTEAITLKI DPSVVLATRK YVDDKVLELR LYVDDQMRNH IAAQDPHTQY AQKHNPTFTG EPKAPTPAAG NNTTRIATTE FVQTAITALI NGAPDTLDTL KEIAAAINND PKFSTTINNA LSGKQPLDET LTHLSGKDVA GLLAYLGLGE AAKRDVGTGE NQIPDMASFA SGDGWMKLPN GKILQYGRGA VTPTLSTQTM RITFSIPFPK KADCAMLTHS GDGGAPLGAG RGFVMTAEGP TLTGFNSAYR TSSTSDTVSM NYSWWAVGE
|
| |