Gene EcSMS35_4921 details

Gene Information

Locus tag: EcSMS35_4921
Symbol: yjjG
ID: 6146869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5037012
End bp: 5037689
Gene Length: 678 bp
Protein Length: 225 aa
Translation table: 11
GC content: 53%
IMG OID: 641619724
Product: nucleotidase
Protein accession: YP_001746828
Protein GI: 170683179
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
            [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
            [TIGR02254] HAD superfamily (subfamily IA) hydrolase, TIGR02254
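
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon (the function names are illustrative and not part of this record):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5037012, 5037689)
print(length)                     # 678
print(protein_length_aa(length))  # 225
```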


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0815818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTGGG ACTGGATTTT CTTTGATGCC GATGAAACGC TGTTTACCTT TGACTCGTTC 
ACCGGACTGC AGCGGATGTT TCTTGATTAC AGCGTCACTT TTACCGCTGA AGATTTTCAG
GACTATCAGG CCGTTAACAA GCCCCTGTGG GTAGATTATC AAAACGGCGC GATCACTTCA
TTACAGCTTC AGCACGGGCG TTTTGAGAGC TGGGCCGAAC GGCTGAACGT CGAGCCAGGT
AAACTCAACG AGGCCTTTAT TAATGCGATG GCGGAAATCT GTACGCCGTT GCCGGGCGCG
GTTTCTCTGC TTAACGCCAT TCGTGGCAAT GCCAAAATCG GCATCATCAC CAACGGTTTT
AGCGCCTTGC AACAGGTGCG TCTGGAACGC ACGGGCCTGC GTGATTATTT CGATTTGCTG
GTGATTTCCG AAGAAGTTGG CGTTGCCAAA CCGAATAAGA AAATTTTCGA TTATGCGCTG
GAACAGGCGG GCAATCCTGA CCGTTCACGC GTGCTGATGG TTGGCGACAC TGCCGAGTCC
GATATTCTCG GTGGCATCAA CGCCGGGCTT GCGACCTGCT GGCTGAATGC GCACAATCGC
GAGCAACCAG AAGGCATCGC GCCCACCTGG ACCGTTTCTT CGTTGCACGA ACTGGAGCAG
CTCCTGTGTA AACACTGA
 
Protein sequence
MKWDWIFFDA DETLFTFDSF TGLQRMFLDY SVTFTAEDFQ DYQAVNKPLW VDYQNGAITS 
LQLQHGRFES WAERLNVEPG KLNEAFINAM AEICTPLPGA VSLLNAIRGN AKIGIITNGF
SALQQVRLER TGLRDYFDLL VISEEVGVAK PNKKIFDYAL EQAGNPDRSR VLMVGDTAES
DILGGINAGL ATCWLNAHNR EQPEGIAPTW TVSSLHELEQ LLCKH