Gene Information |
Locus tag | ECH74115_1804 |
Symbol | |
ID | 6969034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1720530 |
End bp | 1721834 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643385749 |
Product | tail fiber protein |
Protein accession | YP_002270239 |
Protein GI | 209397061 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
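The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1720530
end_bp = 1721834
gene_length_bp = 1305
protein_length_aa = 434


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, the listed gene length matches the coordinate span, and the protein length matches the codon count without the stop codon.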
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.982324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.00650595 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGTAA AAATTTCTGG CGTGCTTAAA GATGGCACAG GAAAACCAGT ACAGAACTGC ACCATTGTGC TGAAGGCCAG ACGAACCAGC AGCACGGTGG TGGTGAACAC GGTGGCCTCT GAAAATCCGG ATGAAGCCGG ACGTTACAGC ATGGATGTTG AGTACGGTCA GTACAGCGTC ATTCTGTTGG TGGAGGGATT TCCTCCGTCA CATGCCGGGA CCATCACCGT GTATGAAGAT TCTCAACCCG GTACGCTGAA TGATTTTCTC GGTGCCATGT CGGAGGATGA CGTCCGGCCG GAGGCACTGC GTCGTTTTGA ACTGATGGTG GAAGAAGCGG CGCGTCACGC TGAGGAGGCG AAGAAGAATG CCGGAGAGGC GGAGACGTCC GCGAGGAATG CCGGCATATC AGCCAGTCAG GCAGAAGAGA ACGCTGCAAA TGCTGACACT TCAGCAGGGG ATGCATCGGA GTCAGCCCGG CAGGCGGCAG AAAGTGCAGC CGCTGCAAAG CAGTCAGAGG AGGCGTCCTC GTCCTCGGCC TCTGCGGCCG CTCAAAAAGC CAGTGAGTCA TTACAAAGTG CAACAGATGC TGAGTTGTCA AAAAAGACGG CAGAAAGTGC AGCCGGTAAT GCAGCCAGGG ATGCAACGAC CGCAGCAGAA AAAGCCCGGG AGTCAGCAGA AAGCGCACAG TCAGCGGAAC AAAGCAGGAT AGCGGCGGAA GAAGCCGTAA ACCGAATCCC CACCGTGGTG GGGCCTCCCG GGCCAAAGGG GGAACCGGGG CCCGCGGGTC CTCAGGGGCC GAAGGGAGAT AAAGGAGAGC GTGGAGACAC CGGCCCGGCA GGGGCAACCG GCGAACGGGG ACCGGCAGGT GATGCTGGTC CGGCAGGCCC GCAGGGGCCG AAAGGTGACA GGGGAGAGAC CGGTCTGACG GGAAATGCAG GTCCACAGGG TCCAAAGGGA GACACCGGGG CAGCAGGCCC GGCAGGCCCA CAGGGACCGA AAGGAGAAAC AGGTGCGGCT GGCCCGGTGG GGGCAACCGG ACCTCAGGGA CCGAAGGGCG ACCCGGGGGA GACACAAATC CGTTTTCGTC TGGGGCCGGC GAGCATTATT GAGACAAACA GCCATGGCTG GTTCCCGGGT ACAGATGGTG CGCTCATCAC CGGACTGACC TTTCTTGCCC CCAAAGATAC CACACGGGTT CAGGGTTTTT TTCAGCATTT GCAGGTCAGG TTTGGTGACG GGCCGTGGCA GGATGTTAAG GGGCTTGATG AAGTGGGCAG TGATACAGGC AGAACAGGAG AATGA
Protein sequence | MTVKISGVLK DGTGKPVQNC TIVLKARRTS STVVVNTVAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMSEDDVRP EALRRFELMV EEAARHAEEA KKNAGEAETS ARNAGISASQ AEENAANADT SAGDASESAR QAAESAAAAK QSEEASSSSA SAAAQKASES LQSATDAELS KKTAESAAGN AARDATTAAE KARESAESAQ SAEQSRIAAE EAVNRIPTVV GPPGPKGEPG PAGPQGPKGD KGERGDTGPA GATGERGPAG DAGPAGPQGP KGDRGETGLT GNAGPQGPKG DTGAAGPAGP QGPKGETGAA GPVGATGPQG PKGDPGETQI RFRLGPASII ETNSHGWFPG TDGALITGLT FLAPKDTTRV QGFFQHLQVR FGDGPWQDVK GLDEVGSDTG RTGE