Gene EcSMS35_2287 details


Gene Information

Locus tag: EcSMS35_2287
Symbol: dusC
ID: 6146561
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2315324
End bp: 2316271
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 53%
IMG OID: 641617161
Product: tRNA-dihydrouridine synthase C
Protein accession: YP_001744334
Protein GI: 170680883
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID:
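
The Start bp, End bp, Gene Length and Protein Length fields above can be checked against each other with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based and inclusive.
start_bp = 2315324
end_bp = 2316271

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # ASSUMPTION: the stop codon is not translated

print(gene_length, protein_length)    # prints "948 315"
```

Both numbers agree with the Gene Length and Protein Length fields of the record.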


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0065633
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGTGTGT TACTGGCACC GATGGAGGGT GTACTCGACT CTCTGGTGCG TGAATTGCTG 
ACCGAAGTTA ACGACTACGA TCTGTGCATC ACCGAGTTTG TCCGCGTGGT GGATCAACTG
CTGCCGGTAA AAGTCTTTCA TCGCATTTGC CCTGAGCTAC AAAACGCCAG CCGGACACCA
TCCGGTACGC TGGTGCGCGT GCAGTTGTTA GGTCAGTTCC CACAATGGCT GGCAGAGAAT
GCCGCCCGTG CGGTCGAGTT AGGTTCCTGG GGCGTGGATC TCAATTGCGG CTGCCCGTCG
AAAACGGTTA ACGGTAGCGG CGGTGGGGCG ACGTTACTCA AAGATCCTGA ACTCATCTAT
CAGGGTGCAA AAGCGATGCG CGAAGCTGTA CCGGCGCATT TGCCCGTCAG CGTAAAAGTG
CGTCTGGGCT GGGACAGCGG TGAGAAGAAA TTTGAAATCG CCGATGCGGT TCAACAGGCT
GGTGCTACAG AGCTGGTGGT GCATGGGCGG ACGAAAGAGC AGGGTTACCG CGCGGAGCAT
ATTGACTGGC AGGCGATTGG CGAGATTCGC CAGCGGCTGA ATATTCCGGT GATTGCCAAC
GGTGAAATCT GGAACTGGCA GAGTGCGCAA CAATGCATGA CGATCAGCGG ATGCGACGCA
GTGATGATTG GTCGCGGGGC GCTCAATATT CCCAACCTGA GCCGGGTGGT AAAATATAAC
GAACCGCGAA TGCCGTGGCC GGAGGTAGTT GCTTTGCTGC AAAAATATAC CCGTCTGGAA
AAGCAGGGCG ATACCGGGTT ATATCACGTA GCGCGGATTA AACAGTGGTT GAGTTATTTG
CGTAAAGAAT ACGATGAAGC AACAGAATTA TTTCAGCATG TTCGGATGTT GAATAATTCC
CCTGATATTG CAAGGGCTAT TCAGGCAATT GATATCGAGA AACTCTAA
 
Protein sequence
MRVLLAPMEG VLDSLVRELL TEVNDYDLCI TEFVRVVDQL LPVKVFHRIC PELQNASRTP 
SGTLVRVQLL GQFPQWLAEN AARAVELGSW GVDLNCGCPS KTVNGSGGGA TLLKDPELIY
QGAKAMREAV PAHLPVSVKV RLGWDSGEKK FEIADAVQQA GATELVVHGR TKEQGYRAEH
IDWQAIGEIR QRLNIPVIAN GEIWNWQSAQ QCMTISGCDA VMIGRGALNI PNLSRVVKYN
EPRMPWPEVV ALLQKYTRLE KQGDTGLYHV ARIKQWLSYL RKEYDEATEL FQHVRMLNNS
PDIARAIQAI DIEKL
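
The protein sequence corresponds to the gene sequence read in codons under translation table 11. A minimal sketch of that correspondence follows, using only the first 30 and last 48 bases of the gene sequence. It assumes that table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
# ASSUMPTION: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()      # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                       # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and final 48 bases of the gene sequence in the record.
first_bases = "ATGCGTGTGT TACTGGCACC GATGGAGGGT"
last_bases = "CCTGATATTG CAAGGGCTAT TCAGGCAATT GATATCGAGA AACTCTAA"

print(translate(first_bases))  # prints "MRVLLAPMEG"
print(translate(last_bases))   # prints "PDIARAIQAIDIEKL"
```

The two outputs match the first 10 and last 15 residues of the protein sequence. The last line can be translated on its own because the 900 bases that precede it are a multiple of three, so it starts on a codon boundary.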