Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4510 |
Symbol | dusA |
ID | 6145911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4609254 |
End bp | 4610273 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619326 |
Product | tRNA-dihydrouridine synthase A |
Protein accession | YP_001746438 |
Protein GI | 170682797 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00742] tRNA dihydrouridine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAA TCAACCAAAC CAGCGCAATG CCTGAAAAAA CTGACGTTCA CTGGAGTGGT CGGTTTAGCG TTGCACCAAT GCTCGACTGG ACGGACAGAC ATTGCCGCTA TTTCTTGCGT CTGCTTTCCC GCAATACGTT GCTGTATACC GAAATGGTGA CTACAGGGGC GATTATTCAC GGTAAAGGTG ATTACCTGGC GTACAGTGAA GAAGAACATC CGGTAGCATT GCAACTGGGC GGTAGCGATC CGGCGGCGCT GGCGCAGTGT GCAAAGCTGG CAGAAGCGCG CGGATATGAT GAGATCAACC TGAATGTCGG CTGCCCGTCT GACCGGGTGC AGAACGGCAT GTTTGGGGCG TGTCTGATGG GTAATGCGCA GCTGGTTGCC GACTGCGTGA AAGCGATGCG CGATGTGGTG TCGATTCCGG TGACGGTGAA AACGCGTATT GGCATTGACG ACCAGGACAG CTATGAATTT CTCTGCGATT TCATCAACAC CGTTTCCGGC AAAGGCGAGT GTGAGATGTT CATCATCCAC GCACGTAAAG CCTGGCTTTC GGGGTTAAGC CCGAAAGAAA ACCGTGAAAT CCCGCCGCTC GATTATCCGC GTGTGTATCA ACTGAAGCGT GACTTTCCGC ATCTGACAAT GTCGATTAAC GGTGGTATCA AGTCGCTGGA AGAGGCTAAA GCGCATTTGC AACATATGGA TGGCGTGATG GTCGGGCGCG AAGCGTATCA GAATCCGGGG ATTCTGGCGG CGGTAGACCG AGAGATTTTT GGTTCCTCGG ATACCGATGC CGATCCGGTG GCGGTAGTGC GCGCCATGTA TCCGTACATT GAGCGTGAAC TCAGCCAGGG GACGTATCTC GGCCATATTA CCCGGCATAT GCTGGGCTTG TTCCAGGGTA TTCCTGGCGC GCGGCAGTGG CGGCGTTATT TAAGTGAAAA TGCCCATAAA GCGGGTGCAG ACATTAATGT GCTGGAACAC GCGCTCAAAC TGGTGGCGGA TAAGCGTTAA
|
Protein sequence | MQKINQTSAM PEKTDVHWSG RFSVAPMLDW TDRHCRYFLR LLSRNTLLYT EMVTTGAIIH GKGDYLAYSE EEHPVALQLG GSDPAALAQC AKLAEARGYD EINLNVGCPS DRVQNGMFGA CLMGNAQLVA DCVKAMRDVV SIPVTVKTRI GIDDQDSYEF LCDFINTVSG KGECEMFIIH ARKAWLSGLS PKENREIPPL DYPRVYQLKR DFPHLTMSIN GGIKSLEEAK AHLQHMDGVM VGREAYQNPG ILAAVDREIF GSSDTDADPV AVVRAMYPYI ERELSQGTYL GHITRHMLGL FQGIPGARQW RRYLSENAHK AGADINVLEH ALKLVADKR
|
| |