Gene ECH74115_4577 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4577 
SymboldusB 
ID6970713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4247007 
End bp4247972 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content52% 
IMG OID643388287 
ProducttRNA-dihydrouridine synthase B 
Protein accessionYP_002272722 
Protein GI209396246 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000029753 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.0792821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG GACAATATCA GCTCAGAAAT CGCCTGATCG CAGCGCCCAT GGCTGGCATT 
ACAGACAGAC CTTTTCGGAC GTTGTGCTAC GAGATGGGAG CCGGATTGAC AGTATCCGAG
ATGATGTCTT CTAACCCACA GGTTTGGGAA AGCGACAAAT CTCGTTTACG GATGGTGCAC
ATTGATGAAC CCGGTATTCG CACCGTGCAA ATTGCTGGTA GCGATCCGAA AGAAATGGCA
GATGCAGCAC GTATTAACGT GGAAAGCGGT GCCCAGATTA TTGATATCAA TATGGGTTGC
CCGGCTAAAA AAGTGAATCG CAAGCTCGCA GGTTCAGCCC TCTTGCAGTA CCCGGATGTC
GTTAAATCGA TCCTTACCGA GGTCGTCAAT GCAGTGGACG TTCCTGTTAC CCTGAAGATT
CGCACCGGCT GGGCACCGGA ACACCGTAAC TGCGAAGAGA TTGCCCAACT GGCTGAAGAC
TGTGGCATTC AGGCTCTGAC CATTCATGGC CGTACACGCG CCTGTTTGTT CAATGGAGAA
GCTGAGTACG ACAGTATTCG GGCAGTTAAG CAGAAAGTTT CCATTCCGGT TATCGCGAAT
GGCGACATTA CTGACCCGCT TAAAGCCAGA GCTGTGCTCG ACTATACAGG GGCGGATGCC
CTGATGATAG GCCGCGCAGC TCAGGGAAGA CCCTGGATCT TTCGGGAAAT CCAGCATTAT
CTGGACACTG GGGAGTTGCT GCCCCCGCTG CCTTTGGCAG AGGTTAAGCG CTTGCTTTGC
GCGCACGTTC GGGAACTGCA TGACTTTTAT GGTCCGGCAA AAGGGTACCG AATTGCACGT
AAACACGTTT CCTGGTATCT CCAGGAACAC GCTCCAAATG ACCAGTTTCG GCGCACATTC
AACGCCATTG AGGATGCCAG CGAACAGCTG GAGGCGTTGG AGGCATACTT CGAAAATTTT
GCGTAA
 
Protein sequence
MRIGQYQLRN RLIAAPMAGI TDRPFRTLCY EMGAGLTVSE MMSSNPQVWE SDKSRLRMVH 
IDEPGIRTVQ IAGSDPKEMA DAARINVESG AQIIDINMGC PAKKVNRKLA GSALLQYPDV
VKSILTEVVN AVDVPVTLKI RTGWAPEHRN CEEIAQLAED CGIQALTIHG RTRACLFNGE
AEYDSIRAVK QKVSIPVIAN GDITDPLKAR AVLDYTGADA LMIGRAAQGR PWIFREIQHY
LDTGELLPPL PLAEVKRLLC AHVRELHDFY GPAKGYRIAR KHVSWYLQEH APNDQFRRTF
NAIEDASEQL EALEAYFENF A