Gene Information |
Locus tag | ECH74115_3503 |
Symbol | |
ID | 6971922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3248705 |
End bp | 3249919 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643387305 |
Product | hypothetical protein |
Protein accession | YP_002271768 |
Protein GI | 209400413 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
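The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3248705, 3249919)
print(length, protein_length(length))  # prints: 1215 404
```

Both values agree with the "Gene Length" and "Protein Length" fields in the record.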
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAAATTG GCACCGGGCA GCGTGGTGAC GGACGCCACG CATTTGTGAC CCGTGAGGAA CTGGTTGGTC TTAAACTCGC CCGGCGTCGA ACATCGGGTG GTGCCTCATA TGCACTGAAT CCGGGTATTG AGATTGACAG TACTTTAATG ACTGTTGATT TTCCCACAAA ACCGCTGAAT TTTAAGGCGA CAGGAGGATT TGGCTCGGTT CTTCTTGAAT GGGATATGCC TAATTATCGC GGACATTCAC TGACTGAAAT CTGGCGGGGT ACGGAGGATG ACCTTGCTGA TGCAGTGCTG GTTGCCACGA CGCCGGGGCA GGTTTACGGC GATCCGGTTG ACCCTGGCTG GTCGGGATTT TACTGGATAC GTTTTGTTAA CGCGGCAGGA GTGAAAGGTC CATGGAATGC TGAAAAAGGC ACTCAGGCAC AAACACAGAT CGGCGTGAAG GCCATCATTG ACCAGATCCG CGATGAGGCT GCAAAGTCGC CGGTTGTGTC CGAGCTGCGT AAAGAAATAA AAAACGCGCA GGGGCAGGCT GTAAAGGATG CTGCAATTAA GACAACCGAA GTTGTGGGGA CTCTCAGGGA AGAAACGACA AGAACGATTG GTGGTATTGA AACCCGCATT AGCACACTGG ATTCGTCAAC CAGTGAATCG CTTAATGAGG TCGACAAGCG CATCACTAAA CTGGATAAAG AAGGCGGTGA GGCGTTTCTG GCAATGTGGT CAAAAAAAGC GGGAGTTGAT GGTATCACTG CGGGGATCGG GATTGTCGCC GGAAAAGACA GTGAAGGCAG GCCTGTAAGT CAGGTTGCAA TTTCTGCGTC GCAGTTGTTT GTCTTTGACC CGAACAACCC GGATAACACC GCCTATCCGT TTGCGGTATC AGGTGGCAAG GTTGTGATCC CGAAAGCGAT GATTTATGAC GCGGTGATTG AAACACTGGT GTCGCGGAAG GTTGTGGCGG ATGAGGTAAA AGCCGGGGTA AGTATCACTT CGCCAGTTAT CCGGAGTGCC GTTATTCAGA ACGGAAACTT TCAGGTTGAT TCTCAGGGTA ACCTGAATAT TGGAGGCCTT TTCAGTGTTA CGTCACAAGG GCAACTGACA ATTCGTTACT CTAATCAGAA TGTAGGACTG GTGATCCGCA ATGATAAAAT TGAGGTTTAT GATCAGAATG GACGACTGGC TGTTCGCATA GGCAGATTAC GCTGA
Protein sequence | MEIGTGQRGD GRHAFVTREE LVGLKLARRR TSGGASYALN PGIEIDSTLM TVDFPTKPLN FKATGGFGSV LLEWDMPNYR GHSLTEIWRG TEDDLADAVL VATTPGQVYG DPVDPGWSGF YWIRFVNAAG VKGPWNAEKG TQAQTQIGVK AIIDQIRDEA AKSPVVSELR KEIKNAQGQA VKDAAIKTTE VVGTLREETT RTIGGIETRI STLDSSTSES LNEVDKRITK LDKEGGEAFL AMWSKKAGVD GITAGIGIVA GKDSEGRPVS QVAISASQLF VFDPNNPDNT AYPFAVSGGK VVIPKAMIYD AVIETLVSRK VVADEVKAGV SITSPVIRSA VIQNGNFQVD SQGNLNIGGL FSVTSQGQLT IRYSNQNVGL VIRNDKIEVY DQNGRLAVRI GRLR
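The relationship between the two sequences can be checked by translating the gene sequence with translation table 11. The following is a minimal sketch rather than a reference implementation: it assumes the gene sequence is already given in coding orientation (it begins with ATG, although the gene lies on the minus strand), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The amino-acid string is the standard NCBI assignment for table 11, listed in TCAG base order.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for translation table 11, in NCBI TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGAAATTG GCACCGGGCA GCGTGGTGAC"))  # prints: MEIGTGQRGD
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) checks the whole record the same way, and `gc_fraction` can be compared against the "GC content" field.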