Gene Information |
Locus tag | ECH74115_1203 |
Symbol | |
ID | 6970093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1212392 |
End bp | 1213747 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643385200 |
Product | tail fiber protein |
Protein accession | YP_002269696 |
Protein GI | 209400674 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
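The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(1212392, 1213747)
print(length)                  # 1356, matching the Gene Length field
print(protein_length(length))  # 451, matching the Protein Length field
```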
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCCC GCCGGTTCAG GCGGCTTTTT TGTGGGGTGA ATATGGCAGT AAAGATTTCA GGTGTACTGA AAGACGGCAC AGGAAAACCG GTAGAGAACT GCACCATTCA ACTGAAAGCC AGACGGACCA GCAGCACGGT GGTGGTGAAC ACGGTGGCCT CTGAAAATCC GGATGAAGCC GGTCGTTACA GCATGGACGT TGAGTACGGT CAGTACAGCG TCATTCTGTT GGTGGAAGGA TTCCCGCCGT CACATGCCGG GACCATCACC GTGTATGAAG ATTCTCAACC GGGGACGCTG AATGATTTTC TCGGTGCCAT GTCGGAGGAT GACGTCCGGC CGGAGGCACT GCGTCGTTTT GAACTGATGG TGGAAGAAGC GGCGCGTCAC GCTGAGGAGG CGAAGAAGAA TGCCGGAGAG GCGGAGACGT CCGCGAGGAA TGCCGGCATA TCAGCCAGTC AGGCAGAAGA GAGCGCGGCA AATGCTGACA CTTCAGCAGG GGATGCATCG GAGTCAGCCC GGCAGGCGGC AGAAAGTGCA GCCGCTGCAA AGCAGTCAGA GGAGGCGTCC TCGTCCTCGG CTTCTGCGGC CGCTCAAAAA GCCAGTGAGT CATCACAAAG TGCAGCAGAA GCTGAATTGT CAAGAAAGAC GGCAGAAAGT GCAGCCGGTA ATGCAGCCAG GGATGCAACG ACCGCAACAG AAAAAGCCCG GGAGTCAGCA GAAAGCGCAC AGTCAGCGGA ACAAAGCAGG ATAGCGGCGG AAGAAGCCGT AAACCGAATC CCCACCGTGG TGGGACCTCC CGGGCCAAAG GGGGAACCGG GTCCCGCGGG TCCTCAGGGG CCGAAGGGAG ATAAAGGAGA GCGTGGCGAC ACCGGCCCGG CAGGGGCAAC CGGCGAACGG GGACCGGCAG GTGATGCTGG TCCGGCAGGC CCGCAGGGGC CGAAAGGTGA CAGGGGAGAG CGGGGAGAGA CCGGTCTGAC GGGAAATGCA GGTCCACAGG GTCCAAAGGG AGACACCGGG GCAGCAGGCC CGGCAGGCCC ACAGGGACCG AAAGGAGAAA CAGGTGCGGC TGGCCCGGTG GGGGCAACCG GACCTCAGGG ACCGAAGGGC GACCCGGGGG AGACACAAAT CCGTTTTCGT CTGGGGCCGG CGAGCATTAT TGAGACAAAC AGCCATGGCT GGTTCCCGGG TACAGATGGT GCGCTCATCA CCGGACTGAC CTTTCTTGCC CCCAAAGATG CCACACGGGT TCAGGTTTTT TTTCAGCATT TGCAGGTCAG GTTTGGTGAC GGGCCGTGGC AGGATGTTAA GGGGCTGGAT GAAGTGGGCA GTGATACAGG CAGAACAGGA GAATGA
|
Protein sequence | MTARRFRRLF CGVNMAVKIS GVLKDGTGKP VENCTIQLKA RRTSSTVVVN TVASENPDEA GRYSMDVEYG QYSVILLVEG FPPSHAGTIT VYEDSQPGTL NDFLGAMSED DVRPEALRRF ELMVEEAARH AEEAKKNAGE AETSARNAGI SASQAEESAA NADTSAGDAS ESARQAAESA AAAKQSEEAS SSSASAAAQK ASESSQSAAE AELSRKTAES AAGNAARDAT TATEKARESA ESAQSAEQSR IAAEEAVNRI PTVVGPPGPK GEPGPAGPQG PKGDKGERGD TGPAGATGER GPAGDAGPAG PQGPKGDRGE RGETGLTGNA GPQGPKGDTG AAGPAGPQGP KGETGAAGPV GATGPQGPKG DPGETQIRFR LGPASIIETN SHGWFPGTDG ALITGLTFLA PKDATRVQVF FQHLQVRFGD GPWQDVKGLD EVGSDTGRTG E
|
| |
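The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC fraction calculation, which could be compared against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only the unambiguous bases A, C, G and T.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(displayed.split()).upper()


def gc_fraction(displayed: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring display whitespace."""
    seq = clean_sequence(displayed)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two display blocks of the gene sequence, used as a small example.
example = "ATGACAGCCC GCCGGTTCAG"
print(clean_sequence(example))  # ATGACAGCCCGCCGGTTCAG
print(len(clean_sequence(example)))  # 20
```

Passing the full Gene sequence value to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the rounded value in the record.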