Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1684 |
Symbol | engD |
ID | 6970747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1624099 |
End bp | 1625190 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385643 |
Product | GTP-dependent nucleic acid-binding protein EngD |
Protein accession | YP_002270137 |
Protein GI | 209398838 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0012] Predicted GTPase, probable translation factor |
TIGRFAM ID | [TIGR00092] GTP-binding protein YchF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000467912 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.464774 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTCA AATGCGGTAT CGTCGGTTTG CCCAACGTCG GGAAATCTAC CCTGTTCAAC GCGCTGACCA AAGCCGGTAT TGAAGCGGCC AACTTTCCAT TCTGCACCAT TGAGCCGAAC ACAGGCGTCG TACCAATGCC TGATCCTCGC CTGGATCAAC TGGCTGAAAT CGTAAAACCG CAACGTACGC TTCCCACGAC CATGGAATTT GTCGATATCG CCGGTCTGGT AAAAGGCGCA TCGAAAGGCG AAGGTCTGGG TAACCAGTTC CTGACCAACA TCCGTGAAAC CGAAGCGATC GGTCACGTTG TTCGCTGCTT TGAAAATGAC AACATCATTC ACGTTTCCGG CAAAGTTAAC CCGGCTGACG ATATTGAAGT TATCAACACC GAACTGGCGC TGGCAGACCT CGACACCTGC GAACGTGCGA TTCATCGCGT ACAGAAGAAA GCCAAAGGTG GCGATAAAGA CGCGAAAGCT GAGCTGGCGG TCCTGGAAAA ATGCCTGCCC CAGCTGGAAA ACGCAGGTAT GCTGCGCGCG CTGGATTTAA GCGCTGAAGA GAAAGCGGCT ATTCGTTACC TGAGCTTCCT GACGCTGAAA CCAACAATGT ACATCGCCAA CGTCAACGAA GACGGTTTTG AAAACAACCC ATATCTTGAC CAGGTGCGTG AAATCGCGGC GAAAGAAGGT TCTGTTGTGG TTCCGGTTTG TGCTGCTGTT GAAGCAGACA TTGCCGAACT GGACGACGAA GAACGTGACG AGTTTATGCA GGAGCTTGGG CTGGAAGAGC CGGGTCTGAA CCGTGTGATC CGTGCCGGTT ATAAGCTGCT GAACCTGCAA ACTTACTTCA CCGCTGGGGT GAAAGAAGTG CGTGCATGGA CCATTCCGGT TGGAGCAACC GCGCCGCAGG CAGCGGGCAA AATCCATACT GATTTTGAAA AAGGCTTTAT CCGTGCACAA ACCATCTCGT TTGAAGATTT CATCACTTAC AAAGGTGAAC AAGGCGCGAA AGAAGCAGGC AAAATGCGTG CAGAAGGTAA AGATTACATC GTGAAAGATG GCGATGTGAT GAACTTCCTT TTCAACGTCT AA
|
Protein sequence | MGFKCGIVGL PNVGKSTLFN ALTKAGIEAA NFPFCTIEPN TGVVPMPDPR LDQLAEIVKP QRTLPTTMEF VDIAGLVKGA SKGEGLGNQF LTNIRETEAI GHVVRCFEND NIIHVSGKVN PADDIEVINT ELALADLDTC ERAIHRVQKK AKGGDKDAKA ELAVLEKCLP QLENAGMLRA LDLSAEEKAA IRYLSFLTLK PTMYIANVNE DGFENNPYLD QVREIAAKEG SVVVPVCAAV EADIAELDDE ERDEFMQELG LEEPGLNRVI RAGYKLLNLQ TYFTAGVKEV RAWTIPVGAT APQAAGKIHT DFEKGFIRAQ TISFEDFITY KGEQGAKEAG KMRAEGKDYI VKDGDVMNFL FNV
|
| |