Gene Information |
Locus tag | ECH74115_2165 |
Symbol | |
ID | 6970659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2075703 |
End bp | 2077016 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643386060 |
Product | tail fiber protein |
Protein accession | YP_002270549 |
Protein GI | 209398870 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
| |
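The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon; both are common conventions, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2075703
end_bp = 2077016

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)       # 1314, matching "Gene Length | 1314 bp"
print(protein_length)    # 437, matching "Protein Length | 437 aa"
```

Under these assumptions the reported gene length (1314 bp) and protein length (437 aa) agree with the coordinates.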
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.468721 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTAA AGATTTCAGG TGTACTGAAA GACGGCACAG GAAAACCGGT AGAGAACTGC ACCATTCAAC TGAAAGCCAG ACGTAACAGC GCCACGGTGG TGGTGAACAC GGTGGCCTCT GAAAATCCGG ATGAAGCCGG TCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTT ATTCTGTTGG TGGAAGGGTT CCCGCCGTCA CATGCCGGGA CCATCACCGT GTATGAAGAT TCTCAACCGG GGACGCTGAA TGATTTTCTC GGTGCCATGT CGGAGGATGA CGTCCGGCCG GAGGCACTGC GCCGTTTTGA ACTGATGGTG GAAGAAGCGG CGCGTCACGC TGAGGAGGCG AAGAAGAATG CCGGAGAGGC GGAGACGTCC GCGAGGAATG CCGGCATATC AGCCAGTCAG GCAGAAGAGA GCGCGGCAAA TGCTGACACT TCAGCAGGGG ATGCATCGGA GTCAGCCCGG CAGGCGGCAG AAAGTGCAGC CGCTGCAAAG CAGTCAGAGG AGGCGTCCTC GTCCTCGGCC TCTGCGGCCG CTCAAAAAGC CAGTGAGTCA TTACAAAGTG CAACAGATGC TGAGTTGTCA AAAAAGACGG CAGAAAGTGC AGCCGGTAAT GCAGCCAGGG ATGCAACGAC CGCAGCAGAA AAAGCCCGGG AGTCAGCAGA AAGCGCACAG TCAGCGGAAC AAAGCAGGAT AGCGGCGGAA GAAGCCGTAA ACCGAATCCC CACCGTGGTG GGGCCTCCCG GGCCAAAGGG GGAACCGGGG CCCGCGGGTC CTCAGGGGCC GAAGGGAGAT AAAGGAGAGC GTGGAGACAC CGGCCCGGCA GGGGCAACCG GCGAACGGGG ACCGGCAGGT GATGCTGGTC CGGCAGGCCC GCAGGGGCCG AAAGGTGACA GGGGAGAGCG GGGAGAGACC GGTCTGACGG GAAATGCAGG TCCACAGGGT CCAAAGGGAG ATACCGGTGC GGCAGGCCCG GCAGGCCCAC AGGGACCGAA AGGAGAAACA GGTGCGGCTG GCCCGGTGGG GGCAACCGGA CCTCAGGGAC CGAAGGGCGA CCCGGGGGAG ACGCAAATAC GGTTCCGTCT GGGGCCGGGA AACATTATTG AGACAAACAG CCATGGCTGG TTCCCGGATA CAGATGGCGC ACTCATCACC GGACTGACCT TTCTTGACCC CAAAGATGCC ACACGGGTTC AGGGTTTTTT TCAGCATTTG CAGGTCAGGT TTGGTGACGG GCCGTGGCAG GATGTCAAGG GGCTGGATGA AGTGGGCAGT GATACAGGCA GAACAGGAGA ATGA
|
Protein sequence | MAVKISGVLK DGTGKPVENC TIQLKARRNS ATVVVNTVAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMSEDDVRP EALRRFELMV EEAARHAEEA KKNAGEAETS ARNAGISASQ AEESAANADT SAGDASESAR QAAESAAAAK QSEEASSSSA SAAAQKASES LQSATDAELS KKTAESAAGN AARDATTAAE KARESAESAQ SAEQSRIAAE EAVNRIPTVV GPPGPKGEPG PAGPQGPKGD KGERGDTGPA GATGERGPAG DAGPAGPQGP KGDRGERGET GLTGNAGPQG PKGDTGAAGP AGPQGPKGET GAAGPVGATG PQGPKGDPGE TQIRFRLGPG NIIETNSHGW FPDTDGALIT GLTFLDPKDA TRVQGFFQHL QVRFGDGPWQ DVKGLDEVGS DTGRTGE
|
| |
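The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the two fields could be checked against each other: it strips the spacing, translates the sense-strand sequence codon by codon, and computes a GC fraction. The helper names `translate` and `gc_fraction` are hypothetical. The codon table is built from the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Amino-acid assignment for codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid mapping).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the Gene sequence field above.
print(translate("ATGGCAGTAA AGATTTCAGG TGTACTGAAA"))  # MAVKISGVLK
```

The printed peptide matches the first block of the Protein sequence field. Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field with its spacing removed, provided the record is internally consistent, and `gc_fraction` can be compared with the reported GC content.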