Gene Information |
Locus tag | ECH74115_4577 |
Symbol | dusB |
ID | 6970713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4247007 |
End bp | 4247972 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388287 |
Product | tRNA-dihydrouridine synthase B |
Protein accession | YP_002272722 |
Protein GI | 209396246 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
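
The length fields in the record above can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive start and end positions and a CDS that ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record: Start bp, End bp
length_bp = gene_length(4247007, 4247972)
print(length_bp, protein_length(length_bp))
```

With the record's coordinates this yields 966 bp and 321 aa, which agrees with the Gene Length and Protein Length fields.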
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000029753 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.0792821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCG GACAATATCA GCTCAGAAAT CGCCTGATCG CAGCGCCCAT GGCTGGCATT ACAGACAGAC CTTTTCGGAC GTTGTGCTAC GAGATGGGAG CCGGATTGAC AGTATCCGAG ATGATGTCTT CTAACCCACA GGTTTGGGAA AGCGACAAAT CTCGTTTACG GATGGTGCAC ATTGATGAAC CCGGTATTCG CACCGTGCAA ATTGCTGGTA GCGATCCGAA AGAAATGGCA GATGCAGCAC GTATTAACGT GGAAAGCGGT GCCCAGATTA TTGATATCAA TATGGGTTGC CCGGCTAAAA AAGTGAATCG CAAGCTCGCA GGTTCAGCCC TCTTGCAGTA CCCGGATGTC GTTAAATCGA TCCTTACCGA GGTCGTCAAT GCAGTGGACG TTCCTGTTAC CCTGAAGATT CGCACCGGCT GGGCACCGGA ACACCGTAAC TGCGAAGAGA TTGCCCAACT GGCTGAAGAC TGTGGCATTC AGGCTCTGAC CATTCATGGC CGTACACGCG CCTGTTTGTT CAATGGAGAA GCTGAGTACG ACAGTATTCG GGCAGTTAAG CAGAAAGTTT CCATTCCGGT TATCGCGAAT GGCGACATTA CTGACCCGCT TAAAGCCAGA GCTGTGCTCG ACTATACAGG GGCGGATGCC CTGATGATAG GCCGCGCAGC TCAGGGAAGA CCCTGGATCT TTCGGGAAAT CCAGCATTAT CTGGACACTG GGGAGTTGCT GCCCCCGCTG CCTTTGGCAG AGGTTAAGCG CTTGCTTTGC GCGCACGTTC GGGAACTGCA TGACTTTTAT GGTCCGGCAA AAGGGTACCG AATTGCACGT AAACACGTTT CCTGGTATCT CCAGGAACAC GCTCCAAATG ACCAGTTTCG GCGCACATTC AACGCCATTG AGGATGCCAG CGAACAGCTG GAGGCGTTGG AGGCATACTT CGAAAATTTT GCGTAA
|
Protein sequence | MRIGQYQLRN RLIAAPMAGI TDRPFRTLCY EMGAGLTVSE MMSSNPQVWE SDKSRLRMVH IDEPGIRTVQ IAGSDPKEMA DAARINVESG AQIIDINMGC PAKKVNRKLA GSALLQYPDV VKSILTEVVN AVDVPVTLKI RTGWAPEHRN CEEIAQLAED CGIQALTIHG RTRACLFNGE AEYDSIRAVK QKVSIPVIAN GDITDPLKAR AVLDYTGADA LMIGRAAQGR PWIFREIQHY LDTGELLPPL PLAEVKRLLC AHVRELHDFY GPAKGYRIAR KHVSWYLQEH APNDQFRRTF NAIEDASEQL EALEAYFENF A
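
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be joined back into a contiguous string and used to compute a GC fraction follows; the helper names are hypothetical, and the short input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated blocks into one upper-case sequence string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used as a small example
example = clean_sequence("ATGCGCATCG GACAATATCA")
print(len(example), gc_fraction(example))
```

Applying the same two helpers to the full Gene sequence field would allow the GC content and Gene Length fields to be checked against the sequence itself.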