Gene Information |
Locus tag | EcSMS35_1939 |
Symbol | engD |
ID | 6145658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1959283 |
End bp | 1960374 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616815 |
Product | GTP-dependent nucleic acid-binding protein EngD |
Protein accession | YP_001743991 |
Protein GI | 170680985 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0012] Predicted GTPase, probable translation factor |
TIGRFAM ID | [TIGR00092] GTP-binding protein YchF |
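
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The Python sketch below shows that relationship. The function names are illustrative and do not belong to any database API. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon, which is consistent with the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1959283, 1960374)   # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1092 363
```

Both results agree with the Gene Length and Protein Length fields of the record.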
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.256808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0918473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTCA AATGCGGTAT CGTCGGTTTG CCCAACGTCG GGAAATCTAC CCTGTTCAAC GCGCTGACCA AAGCCGGTAT TGAAGCGGCC AACTTTCCAT TCTGTACCAT TGAGCCGAAC ACAGGCGTCG TACCAATGCC TGACCCTCGC CTGGATCAAC TGGCTGAAAT CGTAAAACCG CAGCGTACGC TTCCCACGAC CATGGAATTT GTCGATATCG CCGGTCTGGT AAAAGGCGCA TCGAAAGGCG AAGGTCTGGG TAACCAGTTC CTGACCAACA TCCGTGAAAC CGAAGCGATC GGTCACGTTG TTCGCTGCTT TGAAAATGAC AACATCATTC ACGTTTCCGG CAAAGTTAAC CCGGCGGATG ATATTGAAGT TATCAACACC GAACTGGCGC TGGCGGATCT CGACACCTGT GAACGTGCGA TTCATCGCGT ACAGAAGAAA GCCAAAGGTG GCGATAAAGA CGCGAAAGCT GAGCTGGCGG TCCTGGAAAA ATGCCTGCCC CAGCTGGAAA ACGCAGGTAT GCTGCGCGCG CTGGATTTAA GCGCTGAAGA GAAAGCGGCT ATTCGTTACC TGAGCTTCCT GACGCTGAAA CCAACAATGT ACATCGCCAA CGTCAACGAA GACGGTTTTG AAAACAACCC GTATCTTGAC CAGGTGCGTG AAATCGCGGC GAAAGAAGGT TCTGTTGTGG TTCCGGTTTG TGCTGCTGTT GAAGCAGACA TTGCCGAACT GGACGACGAA GAACGTGACG AGTTTATGCA GGAGCTTGGG CTGGAAGAGC CGGGCCTGAA CCGTGTGATC CGTGCCGGTT ATAAGCTGCT GAACCTGCAA ACTTACTTCA CCGCTGGGGT GAAAGAAGTA CGTGCATGGA CCATTCCGGT TGGTGCAACC GCGCCGCAGG CCGCTGGCAA AATCCATACT GATTTTGAAA AAGGCTTTAT CCGTGCACAA ACCATCTCGT TTGAAGATTT CATCACTTAC AAAGGTGAAC AAGGCGCGAA AGAAGCAGGC AAAATGCGTG CAGAAGGCAA AGATTACATC GTTAAAGATG GCGATGTAAT GAACTTCCTG TTCAACGTCT AA
|
Protein sequence | MGFKCGIVGL PNVGKSTLFN ALTKAGIEAA NFPFCTIEPN TGVVPMPDPR LDQLAEIVKP QRTLPTTMEF VDIAGLVKGA SKGEGLGNQF LTNIRETEAI GHVVRCFEND NIIHVSGKVN PADDIEVINT ELALADLDTC ERAIHRVQKK AKGGDKDAKA ELAVLEKCLP QLENAGMLRA LDLSAEEKAA IRYLSFLTLK PTMYIANVNE DGFENNPYLD QVREIAAKEG SVVVPVCAAV EADIAELDDE ERDEFMQELG LEEPGLNRVI RAGYKLLNLQ TYFTAGVKEV RAWTIPVGAT APQAAGKIHT DFEKGFIRAQ TISFEDFITY KGEQGAKEAG KMRAEGKDYI VKDGDVMNFL FNV
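
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The Python sketch below is a minimal illustration, not the pipeline used to build this record. The names `CODON_TABLE` and `translate` are hypothetical. It uses the standard codon assignments, which translation table 11 shares; table 11 differs only in the start codons it permits, and this gene starts with ATG. Whitespace is stripped so the 10-base groups shown above can be pasted in directly.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGGATTCA AATGCGGTAT CGTCGGTTTG"))  # MGFKCGIVGL
```

The output matches the first ten residues of the protein sequence in this record. Passing the full gene sequence in the same way should yield the full 363-residue protein.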