Gene Information |
Locus tag | EcSMS35_4412 |
Symbol | trmA |
ID | 6145990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4507994 |
End bp | 4509094 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641619233 |
Product | tRNA (uracil-5-)-methyltransferase |
Protein accession | YP_001746357 |
Protein GI | 170681986 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase |
TIGRFAM ID | [TIGR02143] tRNA (uracil-5-)-methyltransferase |
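
The length fields above follow from the coordinates by simple arithmetic. The sketch below reproduces them, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4507994, 4509094)
print(length_bp)                  # 1101
print(protein_length(length_bp))  # 366
```

Both results agree with the Gene Length (1101 bp) and Protein Length (366 aa) rows of this record.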
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00710465 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0138898 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCCG AACACCTTCC AACAGAACAG TATGAAGCGC AGTTAGCCGA AAAAGTGGTA CGTTTGCAAA GTATGATGGC ACCGTTTTCT GACCTGGTTC CGGAAGTGTT TCGCTCGCCG GTCAGTCATT ACCGGATGCG TGCGGAGTTC CGCATCTGGC ACGATGGCGA TGACCTGTAT CACATCATTT TCGATCAACA AACCAAAAGC CGCATCCGCG TGGATAGCTT CCCCGCCGCC AGTGAACTTA TCAACCAGTT GATGACGGCG ATGATTGCGG GTGTGCGTAA TAATCCCGTT CTGCGCCACA AGTTGTTCCA GATTGATTAC CTCACCACGC TGAGTAATCA GGCGGTGGTT TCCCTGCTGT ACCATAAGAA GCTGGATGAT GAGTGGCGTC AGGAAGCGGA AGCCCTGCGC GACGCACTGC GAGCGCAGAA TCTGAATGTG CATCTGATTG GTCGGGCAAC GAAAACCAAA ATCGAGCTGG ATCAAGATTA CATCGATGAA CGTCTGCCGG TCGCAGGGAA AGAGATGATC TACCGTCAGG TAGAAAACAG CTTTACCCAG CCGAACGCGG CGATGAATAT TCAGATGCTG GAATGGGCGC TGGACGTAAC CAAAGGCTCA AAAGGCGATT TACTGGAGCT GTACTGCGGC AACGGTAACT TTTCATTAGC ACTGGCGCGC AATTTTGATC GGGTATTAGC CACCGAAATC GCTAAGCCGT CGGTCGCTGC CGCGCAATAC AACATTGCCG CTAACCATAT TGATAACGTA CAAATTATTC GTATGGCGGC AGAAGAATTT ACTCAGGCGA TGAATGGCGT ACGCGAGTTT AACCGCCTGC AAGGGATCGA TTTAAAGAGT TATCAGTGCG AAACCATTTT TGTCGACCCA CCACGCAGCG GTCTGGACGG TGAAACCGAG AAAATGGTGC AGGCGTATCC ACGTATTCTG TATATCTCCT GCAATCCGGA AACGTTATGC AAGAATCTGG AAACATTAAG TCAGACGCAC AAGGTCGAAC GTCTGGCTCT GTTTGATCAG TTCCCCTACA CGCACCATAT GGAGTGCGGC GTATTACTGA CCGCGAAGTA A
Protein sequence | MTPEHLPTEQ YEAQLAEKVV RLQSMMAPFS DLVPEVFRSP VSHYRMRAEF RIWHDGDDLY HIIFDQQTKS RIRVDSFPAA SELINQLMTA MIAGVRNNPV LRHKLFQIDY LTTLSNQAVV SLLYHKKLDD EWRQEAEALR DALRAQNLNV HLIGRATKTK IELDQDYIDE RLPVAGKEMI YRQVENSFTQ PNAAMNIQML EWALDVTKGS KGDLLELYCG NGNFSLALAR NFDRVLATEI AKPSVAAAQY NIAANHIDNV QIIRMAAEEF TQAMNGVREF NRLQGIDLKS YQCETIFVDP PRSGLDGETE KMVQAYPRIL YISCNPETLC KNLETLSQTH KVERLALFDQ FPYTHHMECG VLLTAK
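
As a consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal illustration, not the pipeline that produced this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares. Table 11 also permits alternative start codons; these are not handled here and are not needed because this gene starts with ATG. All function names are hypothetical. Only the first 30 and last 21 bases of the gene sequence are used, to keep the example short.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases (expects a non-empty sequence)."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


first_30 = "ATGACCCCCG AACACCTTCC AACAGAACAG"
last_21 = "GTATTACTGA CCGCGAAGTA A"
print(translate(first_30))  # MTPEHLPTEQ
print(translate(last_21))   # VLLTAK
```

The two outputs match the first ten and last six residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content row.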