Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1381 |
Symbol | |
ID | 6968450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1376901 |
End bp | 1377965 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385359 |
Product | potra domain, shlb-type family |
Protein accession | YP_002269854 |
Protein GI | 209399951 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2831] Hemolysin activation/secretion protein |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.163797 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGTCAC TGACACTGAT GTCTGCTTTA TTATCGCCTT TATCTCTTCA GGCAGCGGAT GTCCGGCGTA GCGGAGATGA AGCATTTATC ATTCAGCAGC AGCGTCAGGA AGCCCTTGAG CAACAACTGA CGCCTTCAGC CCCTGATGTT CGCCTTTCTG CACCTGGCTC TTTTGCCCAT AAGATTAATT TTCCTGTTGA AACGCCCTGT TTTCAGATTA AACAGACGGA ACTGAAGGGG GCTGATGCGT TACCACACTG GCTGCCTTTA CAAAAAATCG CCAACGGGGC GGTCGGGCAT TGCCTGGGGG CGAAAGGAAT TAATCTGCTG ATGAGTACAT TGCAGAACCG TCTGGTCGAT CATGGTTATG TCACCACCCG TGTTCTGGCA CCTTCGCAGG ATTTAAAAAG CGGTATCCTC CGGCTGGTTA TTATTCCCGG TGTTGTGCGA CATGTGCGTC TGACACCGGA CAGTGATGAC TATATTCAGT TGTATTCCTC ATTCCCGGCA CACGAAGGTT CTCTGCTGGA TTTACGGGAC ATTGAGCAGG GGCTGGATTT AGGTAACAGC CGGATACAGG GACAACATAC TGAGCTGAAT GCAACCAGTG GAAATCTGTC TACACAGAAT GCGCAACTGA GTGCCGATAC GCTTTCCGCC CGGACTGCCG GGCAGTTCAG CAGTAATGGC GGTACGATAA ATGCCGACAC ACTGCAGATA TCGGCACAAA GCCTGTCAAA TCGTAAAGGC AGTCTGATTC AGACGGGAAC AGGGGATTTT TCGCTGAGTC TGCCGGGAAG CGTGGATAAC CGGGAAGGGC TGCTTGCGGC AAATGGCGCG GTGCGTCTGG ATGCACTGAG CCTTGATAAT CGCAAGGGGA AAGTGCAGGC GGAGCAGTCA CCCTCCCTTC AGAAATCCCC GCCCACGTTT CTGAAACCGT TTGTGGCTGG TGTCTGTGCG GCATTGCTGG CGGTCAGCGT GGCTATTCCG GGATGGCAGT TTCTGACACA GCCATCACCG GAGGAGCAGC ATTTTACCTG GGGGAATGGT TGTAAAAAGC AGTGA
|
Protein sequence | MLSLTLMSAL LSPLSLQAAD VRRSGDEAFI IQQQRQEALE QQLTPSAPDV RLSAPGSFAH KINFPVETPC FQIKQTELKG ADALPHWLPL QKIANGAVGH CLGAKGINLL MSTLQNRLVD HGYVTTRVLA PSQDLKSGIL RLVIIPGVVR HVRLTPDSDD YIQLYSSFPA HEGSLLDLRD IEQGLDLGNS RIQGQHTELN ATSGNLSTQN AQLSADTLSA RTAGQFSSNG GTINADTLQI SAQSLSNRKG SLIQTGTGDF SLSLPGSVDN REGLLAANGA VRLDALSLDN RKGKVQAEQS PSLQKSPPTF LKPFVAGVCA ALLAVSVAIP GWQFLTQPSP EEQHFTWGNG CKKQ
|
| |