Gene EcSMS35_3555 details


Gene Information

Locus tag: EcSMS35_3555
Symbol: dusB
ID: 6146960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3637680
End bp: 3638645
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 52%
IMG OID: 641618384
Product: tRNA-dihydrouridine synthase B
Protein accession: YP_001745531
Protein GI: 170680881
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
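
The start, end, gene length, and protein length listed above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption, since the record does not state it. A minimal Python sketch of the arithmetic, with hypothetical helper names:

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive (the record does not say so).
START_BP = 3637680
END_BP = 3638645


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1  # the stop codon is not translated


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 966 321
```

Both values match the Gene Length and Protein Length fields of the record.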


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000446962
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATCG GACAATATCA GCTCAGAAAT CGCCTGATCG CAGCGCCCAT GGCTGGCATT 
ACAGACAGAC CTTTTCGGAC GTTGTGCTAC GAGATGGGAG CCGGATTGAC AGTATCCGAG
ATGATGTCTT CTAACCCACA GGTTTGGGAA AGCGACAAAT CTCGTTTACG GATGGTGCAC
ATTGATGAAC CCGGTATTCG CACCGTGCAA ATTGCTGGTA GCGATCCGAA AGAAATGGCA
GATGCAGCAC GTATTAACGT GGAAAGCGGT GCCCAGATTA TTGATATCAA TATGGGTTGC
CCGGCTAAAA AAGTGAATCG CAAGCTCGCA GGTTCAGCCC TTTTGCAGTA CCCGGATGTC
GTTAAATCGA TCCTTACCGA GGTCGTCAAT GCAGTGGACG TTCCTGTTAC CCTGAAGATT
CGCACCGGCT GGGCGCCGGA ACACCGTAAC TGCGAAGAGA TTGCCCAACT GGCTGAAGAC
TGTGGCATTC AGGCTCTGAC CATTCATGGC CGTACACGCG CCTGTTTGTT CAATGGAGAA
GCTGAGTACG ACAGTATTCG GGCAGTTAAG CAGAAAGTTT CCATTCCGGT TATCGCGAAT
GGCGACATTA CTGACCCGCT TAAAGCCAGA GCTGTGCTCG ACTATACAGG GGCCGATGCC
CTGATGATAG GCCGCGCAGC TCAGGGAAGA CCCTGGATCT TTCGGGAAAT CCAGCATTAT
CTGGACACTG GGGAGTTGCT GCCCCCGCTG CCTTTGGCAG AGGTTAAGCG CTTGCTTTGC
GCGCACGTTC GGGAACTGCA TGACTTTTAT GGTCCGGCAA AAGGGTACCG AATTGCACGT
AAACACGTTT CCTGGTATCT CCAGGAACAC GCTCCAAATG ACCAGTTTCG GCGCACATTC
AACGCCATTG AGGATGCCAG CGAACAGCTG GAGGCGTTGG AGGCATACTT CGAAAATTTT
GCGTAA
 
Protein sequence
MRIGQYQLRN RLIAAPMAGI TDRPFRTLCY EMGAGLTVSE MMSSNPQVWE SDKSRLRMVH 
IDEPGIRTVQ IAGSDPKEMA DAARINVESG AQIIDINMGC PAKKVNRKLA GSALLQYPDV
VKSILTEVVN AVDVPVTLKI RTGWAPEHRN CEEIAQLAED CGIQALTIHG RTRACLFNGE
AEYDSIRAVK QKVSIPVIAN GDITDPLKAR AVLDYTGADA LMIGRAAQGR PWIFREIQHY
LDTGELLPPL PLAEVKRLLC AHVRELHDFY GPAKGYRIAR KHVSWYLQEH APNDQFRRTF
NAIEDASEQL EALEAYFENF A
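
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is one way to check that relationship and to compute GC content. It is not the pipeline that produced this record. The function names are hypothetical, and the amino-acid assignments are those of the standard code, which table 11 shares (the two tables differ only in their permitted start codons).

```python
# Standard amino-acid assignments, codons ordered by base in T, C, A, G order.
# Table 11 uses these same assignments; only its start-codon set differs.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base layout above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, first 30 bases.
print(translate("ATGCGCATCG GACAATATCA GCTCAGAAAT"))  # MRIGQYQLRN
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared against the GC content field.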