Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4921 |
Symbol | yjjG |
ID | 6146869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 5037012 |
End bp | 5037689 |
Gene Length | 678 bp |
Protein Length | 225 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619724 |
Product | nucleotidase |
Protein accession | YP_001746828 |
Protein GI | 170683179 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E [TIGR02254] HAD superfamily (subfamily IA) hydrolase, TIGR02254 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0815818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTGGG ACTGGATTTT CTTTGATGCC GATGAAACGC TGTTTACCTT TGACTCGTTC ACCGGACTGC AGCGGATGTT TCTTGATTAC AGCGTCACTT TTACCGCTGA AGATTTTCAG GACTATCAGG CCGTTAACAA GCCCCTGTGG GTAGATTATC AAAACGGCGC GATCACTTCA TTACAGCTTC AGCACGGGCG TTTTGAGAGC TGGGCCGAAC GGCTGAACGT CGAGCCAGGT AAACTCAACG AGGCCTTTAT TAATGCGATG GCGGAAATCT GTACGCCGTT GCCGGGCGCG GTTTCTCTGC TTAACGCCAT TCGTGGCAAT GCCAAAATCG GCATCATCAC CAACGGTTTT AGCGCCTTGC AACAGGTGCG TCTGGAACGC ACGGGCCTGC GTGATTATTT CGATTTGCTG GTGATTTCCG AAGAAGTTGG CGTTGCCAAA CCGAATAAGA AAATTTTCGA TTATGCGCTG GAACAGGCGG GCAATCCTGA CCGTTCACGC GTGCTGATGG TTGGCGACAC TGCCGAGTCC GATATTCTCG GTGGCATCAA CGCCGGGCTT GCGACCTGCT GGCTGAATGC GCACAATCGC GAGCAACCAG AAGGCATCGC GCCCACCTGG ACCGTTTCTT CGTTGCACGA ACTGGAGCAG CTCCTGTGTA AACACTGA
|
Protein sequence | MKWDWIFFDA DETLFTFDSF TGLQRMFLDY SVTFTAEDFQ DYQAVNKPLW VDYQNGAITS LQLQHGRFES WAERLNVEPG KLNEAFINAM AEICTPLPGA VSLLNAIRGN AKIGIITNGF SALQQVRLER TGLRDYFDLL VISEEVGVAK PNKKIFDYAL EQAGNPDRSR VLMVGDTAES DILGGINAGL ATCWLNAHNR EQPEGIAPTW TVSSLHELEQ LLCKH
|
| |