Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1967 |
Symbol | |
ID | 6968217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1860340 |
End bp | 1861401 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643385893 |
Product | hypothetical protein |
Protein accession | YP_002270382 |
Protein GI | 209397939 |
COG category | [S] Function unknown |
COG ID | [COG3768] Predicted membrane protein |
TIGRFAM ID | [TIGR01620] conserved hypothetical protein, TIGR01620 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.650951 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAC CGTTAAAACC ACGTATTGAT TTCGACGGTC CGCTGGAGGT CGAACAGAAT CCAAAATTCA GGGCGCAGCA GACCTTTGAC GAAAATCAGG CGCAAAATTT TGCCCCGGCC ACGCTCGACG AAGCGCAGGA AGAAGAGGGG CAAGTCGAAG CGGTAATGGA CGCAGCGTTA CGTCCGAAAC GCAGCCTGTG GCGCAAAATG GTGATGGGCG GGCTGGCTCT GTTTGGCGCA AGCGTTGTCG GGCAGGGTGT ACAGTGGACA ATGAATGCCT GGCAAACTCA GGACTGGGTG GCGCTGGGTG GATGTGCTGC TGGGGCATTG ATTATCGGCG CTGGCGTAGG TTCTGTGGTA ACAGAGTGGC GGCGCTTATG GCGCTTGCGA CAGCGCGCCC ATGAACGCGA CGAAGCGCGC GATTTGTTGC ACAGCCACGG CACGGGCAAA GGCCGCGCAT TTTGCGAAAA ACTGGCGCAG CAGGCGGGTA TTGATCAGTC TCATCCAGCG CTGCAACGCT GGTATGCCTC AATCCATGAA ACGCAGAACG ATCGTGAAGT GGTCAGCCTG TATGCTCATC TGGTCCAGCC GGTTTTAGAT GCCCAGGCGC GGCGCGAAAT CAGCCGCTCA GCAGCTGAAT CAACGTTGAT GATTGCGGTC AGCCCGCTGG CGCTGGTGGA TATGGCATTT ATCGCCTGGC GCAATCTGCG TTTGATTAAT CGCATCGCCA CGCTGTATGG CATTGAACTG GGATATTACA GCCGTTTGCG CCTGTTCAAG CTGGTATTGC TGAATATCGC TTTCGCCGGA GCCAGCGAAT TGGTGCGCGA AGTGGGAATG GACTGGATGT CGCAAGATCT CGCTGCTCGT TTGTCTACCC GCGCAGCTCA GGGGATTGGT GCAGGACTTC TGACGGCACG ACTGGGGATT AAAGCTATGG AGCTTTGCCG CCCGCTGCCG TGGATTGACG ATGACAAACC TCGCCTCGGG GATTTTCGTC GTCAGCTTAT CGGTCAGGTG AAAGAAACGC TGCAAAAAGG CAAAACGCCC AGCGAAAAAT AA
|
Protein sequence | MTEPLKPRID FDGPLEVEQN PKFRAQQTFD ENQAQNFAPA TLDEAQEEEG QVEAVMDAAL RPKRSLWRKM VMGGLALFGA SVVGQGVQWT MNAWQTQDWV ALGGCAAGAL IIGAGVGSVV TEWRRLWRLR QRAHERDEAR DLLHSHGTGK GRAFCEKLAQ QAGIDQSHPA LQRWYASIHE TQNDREVVSL YAHLVQPVLD AQARREISRS AAESTLMIAV SPLALVDMAF IAWRNLRLIN RIATLYGIEL GYYSRLRLFK LVLLNIAFAG ASELVREVGM DWMSQDLAAR LSTRAAQGIG AGLLTARLGI KAMELCRPLP WIDDDKPRLG DFRRQLIGQV KETLQKGKTP SEK
|
| |