Gene Information |
Locus tag | SeHA_C4967 |
Symbol | yjjG |
ID | 6490124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 4844465 |
End bp | 4845142 |
Gene Length | 678 bp |
Protein Length | 225 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642745010 |
Product | nucleotidase |
Protein accession | YP_002048579 |
Protein GI | 194449902 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E; [TIGR02254] HAD superfamily (subfamily IA) hydrolase, TIGR02254
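The Gene Length and Protein Length fields follow from the Start bp and End bp values. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive (which is consistent with the 678 bp reported here) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4844465, 4845142)
print(length, protein_length_aa(length))  # 678 225
```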
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.081412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGTGGG ACTGGATTTT CTTTGATGCC GATGAAACGC TGTTTACGTT TGATTCTTTC ACCGGCTTAC AGCGGATGTT CCTTGACTAT AGCGTCACCT TTACCGCTGA GGATTTCCAG GATTACCAGG CCGTGAATAA GCCGCTATGG GTGGATTATC AGAACGGCGC GATTACTTCA TTACAATTGC AGCACGCGCG CTTTCAAAGT TGGGCTGAAC GGCTAAACGT TGCGCCGGGG CTGCTGAATG ACGCTTTTAT TAGTGCGATG GCGGAGATCT GTTCTCCTTT GCCGGGCGCC GTTTCGCTAC TTAATGCGAT TCGCGGGCAG GCTAAAATCG GTATTATTAC TAACGGTTTT ACCGCGCTAC AACAAACTCG TCTGGAGCGC ACCGGGCTGC GCGAGTATTT CGATCTGCTG GTGATTTCCG AGCAGGTTGG CGTCGCGAAG CCCGATCCGA AAATCTTTAA CTACGCCCTG GAGCAGGCGG GGAATCCTGA CCGCTCGCGC GTATTAATGG TTGGCGATAC CGCGGAATCC GATATTCTTG GCGGCATTAA CGCCGGGCTG TCGACCTGCT GGCTTAACGC GCATCATCGC GAGCAGCCCG CGGGTATTCA TCCAACCTGG ACTGTGGCGT CATTAAGCGA ACTGGAGCAG CTCCTGTGTA AACACTGA
Protein sequence | MKWDWIFFDA DETLFTFDSF TGLQRMFLDY SVTFTAEDFQ DYQAVNKPLW VDYQNGAITS LQLQHARFQS WAERLNVAPG LLNDAFISAM AEICSPLPGA VSLLNAIRGQ AKIGIITNGF TALQQTRLER TGLREYFDLL VISEQVGVAK PDPKIFNYAL EQAGNPDRSR VLMVGDTAES DILGGINAGL STCWLNAHHR EQPAGIHPTW TVASLSELEQ LLCKH
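The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up together with a GC percentage calculation that could be compared against the GC content field. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing (any whitespace) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First three ten-base blocks of the gene sequence in this record.
first_blocks = "ATGAAGTGGG ACTGGATTTT CTTTGATGCC"
print(len(clean_sequence(first_blocks)))  # 30
```

Applying `clean_sequence` to the full Gene sequence field and taking `len` of the result gives a quick check against the Gene Length field.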