Gene Information |
Locus tag | ECH74115_2871 |
Symbol | |
ID | 6968461 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2666162 |
End bp | 2667475 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643386717 |
Product | tail fiber protein |
Protein accession | YP_002271188 |
Protein GI | 209396826 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
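The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions (an assumption, although it agrees with the listed gene length) and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above; names are illustrative only.
START_BP = 2666162
END_BP = 2667475


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1314 437
```

Both numbers match the Gene Length and Protein Length rows of the record.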
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0235495 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTAA AGATTTCAGG TGTACTGAAA GACGGCACAG GAAAACCGGT AGAGAACTGC ACCATTCAAC TGAAAGCCAG ACGGACCAGC AGCACGGTGG TGGTGAACAC GGTGGCCTCT GAAAATCCGG ATGAAGCCGG TCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTC ATTCTGTTGG TGGAGGGCTT CCCGCCGTCA CATGCCGGGA CCATCACCGT GTATGAAGAT TCTCAACCGG GGACGCTGAA TGATTTTCTC GGTGCCATGT CGGAGGATGA CGTCCGGCCG GAGGCACTGC GTCGTTTTGA ACTGATGGTG GAAGAAGCGG CGCGTCACGC TGAGGAGGCG AAGAAGAATG CCGGAGAGGC GGAGACGTCC GCGAGGAATG CCGGCATATC AGCCAGTCAG GCAGAAGAGA GCGCGGCAAA TGCTGACACT TCAGCAGGGG ATGCATCGGA GTCAGCCCGG CAGGCGGCAG AAAGTGCAGC CGCTGCAAAG CAGTCAGAGG AGGCGTCCTC GTCCTCGGCC TCTGCGGCCG CTCAAAAAGC CAGTGAGTCA TTACAAAGTG CAACAGATGC TGAGTTGTCA AAAAAGACGG CAGAAAGTGC AGCCGGTAAT GCAGCCAGGG ATGCAACGAC CGCAGCAGAA AAAGCCCGGG AGTCAGCAGA AAGCGCACAG TCAGCGGAAC AAAGCAGGAT AGCGGCGGAA GAAGCCGTAA ACCGAATCCC CACCGTGGTG GGGCCTCCCG GGCCAAAGGG GGAACCGGGG CCCGCGGGTC CTCAGGGGCC GAAGGGAGAT AAAGGAGAGC GTGGAGACAC CGGCCCGGCA GGGGCAACCG GCGAACGGGG ACCGGCAGGT GATGCTGGTC CGGCAGGCCC GCAGGGGCCG AAAGGTGACA GGGGAGAGCG GGGAGAGACC GGTCTGACGG GAAATGCAGG TCCACAGGGT CCAAAGGGAG ATACCGGTGC GGCAGGCCCG GCAGGCCCAC AGGGACCGAA AGGAGAAACA GGTGCGGCTG GCCCGGTGGG GGCAACCGGA CCTCAGGGGC CGAAGGGCGA CCCGGGGGAG ACGCAAATAC GGTTCCGTCT GGGGCCGGGA AACATTATTG AGACAAACAG CAATGGCTGG TTCCCGGATA CAGATGGTGC GCTCATCACC GGACTGACCT TTCTTGACCC CAAAGATACC ACACAGGTTC AGGGGCTGTT TCGGCATTTG CAGGTCAGGT TTGGTGACGG GCCGTGGCAG GATGTTAAGG GGCTGAATGA AGTGGGCAGT GATACAGGCA GAACAGGAGA ATGA
Protein sequence | MAVKISGVLK DGTGKPVENC TIQLKARRTS STVVVNTVAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMSEDDVRP EALRRFELMV EEAARHAEEA KKNAGEAETS ARNAGISASQ AEESAANADT SAGDASESAR QAAESAAAAK QSEEASSSSA SAAAQKASES LQSATDAELS KKTAESAAGN AARDATTAAE KARESAESAQ SAEQSRIAAE EAVNRIPTVV GPPGPKGEPG PAGPQGPKGD KGERGDTGPA GATGERGPAG DAGPAGPQGP KGDRGERGET GLTGNAGPQG PKGDTGAAGP AGPQGPKGET GAAGPVGATG PQGPKGDPGE TQIRFRLGPG NIIETNSNGW FPDTDGALIT GLTFLDPKDT TQVQGLFRHL QVRFGDGPWQ DVKGLNEVGS DTGRTGE
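The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates a coding sequence. It relies on translation table 11 assigning the same amino acids to codons as the standard table; alternative start codons are not handled here. All function names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order. Table 11 uses the same codon-to-amino-acid
# assignments as the standard code (it differs in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
prefix = "ATGGCAGTAA AGATTTCAGG T"
print(translate(prefix))  # MAVKISG
```

The printed residues agree with the start of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` would allow the GC content field to be checked in the same way.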