Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0915 |
Symbol | |
ID | 6966921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 926222 |
End bp | 927538 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643384937 |
Product | tail fiber protein |
Protein accession | YP_002269437 |
Protein GI | 209398128 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCAG TACAAATATC AGGCGTGCTG AAAGATGGTG CGGGAAAACC AATACAGAAC TGCACCATTC AACTGAAAGC CAGACGTAAC AGCACCACGG TGGTGGTGAA CACGGTGGCC TCTGAAAATC CGGATGAGGC AGGGCGTTAC AGCATGGACG TCGAGTATGG TCAGTACAGC GTCACTCTGT TGGTGGAGGG ATTCCCGCCA TCACATGCCG GGACCATCAC CGTGTATGAA GATTCTCAAC CGGGGACGCT GAATGATTTT CTCGGTGCCA TGTCGGAGGA TGACGTCCGG CCGGAGGCAC TGCGTCGTTT TGAACTGATG GTGGAAGAAG CGGCGCGTCA CGCTGAGGAG GCGAAGAAGA ATGCCGGAGA GGCGGAGACG TCCGCGAGGA ATGCCGGCAT ATCAGCCAGT CAGGCAGAAG AGAGCGCGGC AAATGCTGAC ACTTCAGCAG GGGATGCATC GGAGTCAGCC CGGCAGGCGG CAGAAAGTGC AGCCGCTGCA AAGCAGTCAG AGGAGGCGTC CTCGTCCTCG GCTTCTGCGG CCGCTCAAAA AGCCAGTGAG TCATCACAAA GTGCAGCAGA AGCTGAATTG TCAAGAAAGA CGGCAGAAAG TGCAGCCGGT AATGCAGCCA GGGATGCAAC GACCGCAACA GAAAAAGCCC GGGAATCAGC AGAAAGCGCA CAGTCAGCGG AACAAAGCAG AATAGCGGCG GAAGACGCCG TAAACAGAAT TCCCACCGTG GTGGGGCCTC CCGGACCAAA GGGGGAACCG GGTCCCGCGG GTCCTCAGGG GCCGAAGGGA GATAAAGGAG AGCGTGGAGA CACCGGTCCG GCAGGGGCAA CCGGTGAACG GGGACCGGGA GGAGATACAG GTCCGGCAGG TCCGCAGGGG CCGAAAGGCG ACAGGGGAGA GCGGGGAGAG ACCGGTCTGA CAGGAAGTAC AGGTCCACAG GGGCCAAAGG GAGATACCGG GGCAACAGGT CCGGCAGGAC CGCAGGGACC GAAAGGGGAA ACAGGTGCGG CTGGCCCGGT GGGGGCTACC GGACCTCAGG GGGCGAAGGG CGACCCGGGG GAGACACAAA TACGGTTCCG TCTGGGGCCG ATGAGAATTA TTGAGACAAA CAGCTATGGC TGGTTCCCGG GTACAGATGG TGCGCTCATC ACCGGACTGA CCTTTCTTGA CCCCAAAGAT GCCACACAGG TTCAGGGGAT GTTTCAGCAT TTGCAGGTCA GATTTGGTGA CGGGCCATGG CAGGATGTTA AGGGACTGGA TGAAGTGGGC AGTGATACAG GCAGAACTGG AGAATGA
|
Protein sequence | MAAVQISGVL KDGAGKPIQN CTIQLKARRN STTVVVNTVA SENPDEAGRY SMDVEYGQYS VTLLVEGFPP SHAGTITVYE DSQPGTLNDF LGAMSEDDVR PEALRRFELM VEEAARHAEE AKKNAGEAET SARNAGISAS QAEESAANAD TSAGDASESA RQAAESAAAA KQSEEASSSS ASAAAQKASE SSQSAAEAEL SRKTAESAAG NAARDATTAT EKARESAESA QSAEQSRIAA EDAVNRIPTV VGPPGPKGEP GPAGPQGPKG DKGERGDTGP AGATGERGPG GDTGPAGPQG PKGDRGERGE TGLTGSTGPQ GPKGDTGATG PAGPQGPKGE TGAAGPVGAT GPQGAKGDPG ETQIRFRLGP MRIIETNSYG WFPGTDGALI TGLTFLDPKD ATQVQGMFQH LQVRFGDGPW QDVKGLDEVG SDTGRTGE
|
| |