Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4917 |
Symbol | yjjG |
ID | 6271074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4583075 |
End bp | 4583752 |
Gene Length | 678 bp |
Protein Length | 225 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641728645 |
Product | nucleotidase |
Protein accession | YP_001883036 |
Protein GI | 187730762 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E [TIGR02254] HAD superfamily (subfamily IA) hydrolase, TIGR02254 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.000116377 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTGGG ATTGGATTTT CTTTGATGCC GATGAAACGC TGTTTACCTT TGACTCATTC ACCGGCCTGC AGCGGATGTT TCTTGATTAC AGCGTCACCT TTACCGCTGA AGATTTTCAG GACTATCAGG CCGTTAACAA GCCACTGTGG GTGGATTATC AAAACGGCGC GATCACTTCA TTACAGCTTC AGCACGGGCG TTTCGAGAGC TGGGCCGAAC GGCTGAAAGT TGAAGCAGGC TTGCTTAACG ATGCCTTTAT TAATGCGATG GCGGAAATCT GCACGCCGCT GCCGGGCGCG GTTTCTCTGC TTAACGCCAT TCGTGGCAAC GCTAAAATCG GCATCATCAC CAACGGTTTT AGCGCCTTGC AGCAAGTGCG TCTGGAACGG ACGGGCCTGC GTGATTACTT TGATTTGCTG GTGATTTCCG AAGAAGTTGG CATTGCCAAA CCGAATAAGA AAATTTTCGA TTATGCGCTG GAACTGGCGG GCAATCCTGA CCGTTCACGC GTGCTGATGG TTGGCGACAC TGCCGAGTCC GATATTCTCG GTGGCATCAA CGCCGGGCTT GCGACTTGCT GGCTGAATGC GCACCATCGC GAGCAACCAG AAGGCATCGC GCCCACCTGG ACCGTTTCAT CGTTGCACGA ACTGGAGCAG CTCCTGTGTA AACACTGA
|
Protein sequence | MKWDWIFFDA DETLFTFDSF TGLQRMFLDY SVTFTAEDFQ DYQAVNKPLW VDYQNGAITS LQLQHGRFES WAERLKVEAG LLNDAFINAM AEICTPLPGA VSLLNAIRGN AKIGIITNGF SALQQVRLER TGLRDYFDLL VISEEVGIAK PNKKIFDYAL ELAGNPDRSR VLMVGDTAES DILGGINAGL ATCWLNAHHR EQPEGIAPTW TVSSLHELEQ LLCKH
|
| |