Gene Information |
Locus tag | EcSMS35_2287 |
Symbol | dusC |
ID | 6146561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2315324 |
End bp | 2316271 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617161 |
Product | tRNA-dihydrouridine synthase C |
Protein accession | YP_001744334 |
Protein GI | 170680883 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | |
|
|
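The start and end positions, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2315324, 2316271)
print(length, protein_length_aa(length))  # 948 315
```

Under these assumptions the result matches the listed Gene Length (948 bp) and Protein Length (315 aa).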
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0065633 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTGTGT TACTGGCACC GATGGAGGGT GTACTCGACT CTCTGGTGCG TGAATTGCTG ACCGAAGTTA ACGACTACGA TCTGTGCATC ACCGAGTTTG TCCGCGTGGT GGATCAACTG CTGCCGGTAA AAGTCTTTCA TCGCATTTGC CCTGAGCTAC AAAACGCCAG CCGGACACCA TCCGGTACGC TGGTGCGCGT GCAGTTGTTA GGTCAGTTCC CACAATGGCT GGCAGAGAAT GCCGCCCGTG CGGTCGAGTT AGGTTCCTGG GGCGTGGATC TCAATTGCGG CTGCCCGTCG AAAACGGTTA ACGGTAGCGG CGGTGGGGCG ACGTTACTCA AAGATCCTGA ACTCATCTAT CAGGGTGCAA AAGCGATGCG CGAAGCTGTA CCGGCGCATT TGCCCGTCAG CGTAAAAGTG CGTCTGGGCT GGGACAGCGG TGAGAAGAAA TTTGAAATCG CCGATGCGGT TCAACAGGCT GGTGCTACAG AGCTGGTGGT GCATGGGCGG ACGAAAGAGC AGGGTTACCG CGCGGAGCAT ATTGACTGGC AGGCGATTGG CGAGATTCGC CAGCGGCTGA ATATTCCGGT GATTGCCAAC GGTGAAATCT GGAACTGGCA GAGTGCGCAA CAATGCATGA CGATCAGCGG ATGCGACGCA GTGATGATTG GTCGCGGGGC GCTCAATATT CCCAACCTGA GCCGGGTGGT AAAATATAAC GAACCGCGAA TGCCGTGGCC GGAGGTAGTT GCTTTGCTGC AAAAATATAC CCGTCTGGAA AAGCAGGGCG ATACCGGGTT ATATCACGTA GCGCGGATTA AACAGTGGTT GAGTTATTTG CGTAAAGAAT ACGATGAAGC AACAGAATTA TTTCAGCATG TTCGGATGTT GAATAATTCC CCTGATATTG CAAGGGCTAT TCAGGCAATT GATATCGAGA AACTCTAA
|
Protein sequence | MRVLLAPMEG VLDSLVRELL TEVNDYDLCI TEFVRVVDQL LPVKVFHRIC PELQNASRTP SGTLVRVQLL GQFPQWLAEN AARAVELGSW GVDLNCGCPS KTVNGSGGGA TLLKDPELIY QGAKAMREAV PAHLPVSVKV RLGWDSGEKK FEIADAVQQA GATELVVHGR TKEQGYRAEH IDWQAIGEIR QRLNIPVIAN GEIWNWQSAQ QCMTISGCDA VMIGRGALNI PNLSRVVKYN EPRMPWPEVV ALLQKYTRLE KQGDTGLYHV ARIKQWLSYL RKEYDEATEL FQHVRMLNNS PDIARAIQAI DIEKL
|
| |
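The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG), and it uses the standard codon assignments, which translation table 11 shares for non-initiator codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for elongation codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty input)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
first_blocks = "ATGCGTGTGT TACTGGCACC GATGGAGGGT"
print(translate(first_blocks))  # MRVLLAPMEG
```

The translated prefix agrees with the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.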