Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03799 |
Symbol | trmA |
ID | 8114965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4068551 |
End bp | 4069651 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644849959 |
Product | hypothetical protein |
Protein accession | YP_003001532 |
Protein GI | 251787228 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase |
TIGRFAM ID | [TIGR02143] tRNA (uracil-5-)-methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000143968 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCCG AACACCTTCC AACAGAACAG TATGAAGCGC AGTTAGCCGA AAAAGTGGTA CGTTTGCAAA GTATGATGGC ACCGTTTTCT GACCTGGTTC CGGAAGTGTT TCGCTCGCCG GTCAGTCATT ACCGGATGCG CGCGGAGTTC CGCATCTGGC ACGATGGCGA TGACCTGTAT CACATCATTT TCGATCAACA AACCAAAAGC CGCATCCGCG TGGATAGCTT CCCCGCCGCC AGTGAACTTA TCAACCAGTT GATGACGGCG ATGATTGCGG GTGTGCGTAA TAATCCCGTT CTGCGCCACA AGTTGTTCCA GATTGATTAC CTCACTACGC TGAGTAATCA GGCGGTGGTT TCCCTGCTAT ACCATAAGAA GCTGGATGAT GAGTGGCGTC AGGAAGCGGA GGCCCTGCGC GATGCACTGC GCGCGCAGAA TCTGAATGTG CATCTGATTG GTCGGGCAAC GAAAACCAAA ATCGAGCTGG ATCAGGATTA CATCGATGAA CGTCTGCCGG TCGCAGGGAA AGAGATGATC TACCGTCAGG TAGAAAACAG CTTTACCCAG CCGAACGCGG CGATGAATAT TCAGATGCTG GAATGGGCGC TGGACGTAAC CAAAGGCTCA AAAGGCGATT TACTGGAGCT GTACTGCGGC AACGGTAACT TTTCATTAGC GCTGGCGCGT AATTTTGATC GGGTATTAGC CACCGAAATC GCTAAGCCGT CGGTTGCTGC TGCGCAATAC AACATCGCAG CTAACCATAT TGATAACGTA CAAATTATTC GTATGGCGGC AGAAGAATTT ACTCAGGCGA TGAATGGTGT GCGCGAGTTT AACCGCCTGC AAGGGATCGA CTTAAAGAGT TATCAGTGCG AAACCATTTT TGTCGACCCA CCGCGCAGCG GTCTGGACAG TGAAACCGAG AAAATGGTGC AGGCGTATCC GCGTATTTTG TACATCTCCT GTAACCCGGA AACGTTATGC AAGAATCTGG AAACATTAAG CCAGACGCAC AAGGTCGAAC GTCTGGCTCT GTTTGATCAG TTCCCCTACA CGCACCATAT GGAGTGCGGC GTATTACTGA CCGCGAAGTA A
|
Protein sequence | MTPEHLPTEQ YEAQLAEKVV RLQSMMAPFS DLVPEVFRSP VSHYRMRAEF RIWHDGDDLY HIIFDQQTKS RIRVDSFPAA SELINQLMTA MIAGVRNNPV LRHKLFQIDY LTTLSNQAVV SLLYHKKLDD EWRQEAEALR DALRAQNLNV HLIGRATKTK IELDQDYIDE RLPVAGKEMI YRQVENSFTQ PNAAMNIQML EWALDVTKGS KGDLLELYCG NGNFSLALAR NFDRVLATEI AKPSVAAAQY NIAANHIDNV QIIRMAAEEF TQAMNGVREF NRLQGIDLKS YQCETIFVDP PRSGLDSETE KMVQAYPRIL YISCNPETLC KNLETLSQTH KVERLALFDQ FPYTHHMECG VLLTAK
|
| |