Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2230 |
Symbol | |
ID | 6967152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2118550 |
End bp | 2119773 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643386118 |
Product | tail fiber protein |
Protein accession | YP_002270605 |
Protein GI | 209399699 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.000000293165 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGTAA AGATTTCAGG TGTACTGAAA GACGGCACAG GAAAACCGGT AGAGAACTGC ACCATTCAAC TGAAAGCCAG ACGTAACAGC GCCACGGTGG TGGTGAACAC GGTGGCCTCT GAAAATCCGG ATGAAGCCGG TCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTT ATTCTGTTGG TGGAAGGGTT CCCGCCGTCA CATGCCGGGA CCATCACCGT GTATGAAGAT TCTCAACCGG GGACGCTGAA TGATTTTCTC GGTGCCATGT CGGAGGATGA CGTCCGGCCG GAGGCACTGC GTCGTTTTGA ACTGATGGTG GAAGAAGCGG CGCGTCACGC TGAGGAGGCG AAGAAGAATG CCGGAGAGGC GGAGACGTCC GCGAGGAATG CCGGCATATC AGCCAGTCAG GCAGAAGAGA GCGCGGCAAA TGCTGACACT TCAGCAGGGG ATGCATCGGA GTCAGCCCGG CAGGCGGCAG AAAGTGCAGC CGCTGCAAAG CAGTCAGAGG AGGCGTCCTC GTCCTCGGCC TCTGCGGCCG CTCAAAAAGC CAGTGAGTCA TCACAAAGTG CAGCAGATGC TGAGTTGTCA AAAAAGACGG CAGAAAGTGC AGCCGGTAAT GCAGCCAGGG ATGCAACGAC CGCAACAGAA AAAGCCCGGG AGTCAGCAGA AAGCGCACAG TCAGCGGAAC AAAGCAGGAT AGCGGCGGAA GAGGCCGTAA ACCGAATCCC CACGGTGGTG GGGCCTCCCG GGCCAAAGGG GGAACCGGGT CCCGCGGGTC CTCAGGGGCC GAAGGGAGAT AAAGGAGAGC GTGGAGACAC CGGTCCGGCA GGGGCAACCG GTGAAAGGGG GCCGGCAGGT GATGCTGGTC CGGCAGGCCC GGCAGGCCCG GCAGGCCCAC AGGGACCGAA AGGAGAAACA GGTGCGGCTG GCCCGGTGGG GGCAACCGGA CCTCAGGGGC CGAAGGGCGA CCCGGGGGAG ACGCAAATAC GGTTCCGTCT GGGGCCGGGA AACATTATTG AGACAAACAG CCATGGCTGG TTCCCGGATA CAGATGGCGC ACTCATCACC GGACTGACCT TTCTTGACCC CAAAGATGCC ACACGGGTTC AGGGTTTTTT TCAGCATTTG CAGGTCAGGT TTGGTGACGG GCCGTGGCAG GATGTTAAGG GGCTGGATGA AGTGGGCAGT GATACAGGCA GAACAGGAGA ATGA
|
Protein sequence | MAVKISGVLK DGTGKPVENC TIQLKARRNS ATVVVNTVAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMSEDDVRP EALRRFELMV EEAARHAEEA KKNAGEAETS ARNAGISASQ AEESAANADT SAGDASESAR QAAESAAAAK QSEEASSSSA SAAAQKASES SQSAADAELS KKTAESAAGN AARDATTATE KARESAESAQ SAEQSRIAAE EAVNRIPTVV GPPGPKGEPG PAGPQGPKGD KGERGDTGPA GATGERGPAG DAGPAGPAGP AGPQGPKGET GAAGPVGATG PQGPKGDPGE TQIRFRLGPG NIIETNSHGW FPDTDGALIT GLTFLDPKDA TRVQGFFQHL QVRFGDGPWQ DVKGLDEVGS DTGRTGE
|
| |