Gene EcHS_A4199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4199 
SymboltrmA 
ID5594462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4191836 
End bp4192936 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content50% 
IMG OID640923302 
ProducttRNA (uracil-5-)-methyltransferase 
Protein accessionYP_001460760 
Protein GI157163442 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR02143] tRNA (uracil-5-)-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0000773955 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCCG AACACCTTCC AACAGAACAG TATGAAGCGC AGTTAGCCGA AAAAGTGGTA 
CGTTTGCAAA GTATGATGGC ACCGTTTTCT GACCTGGTTC CGGAAGTGTT TCGCTCGCCG
GTCAGTCATT ACCGGATGCG TGCGGAGTTC CGCATCTGGC ACGATGGCGA TGACCTGTAT
CACATCATTT TCGATCAACA AACCAAAAGC CGCATCCGCG TGGATAGCTT CCCCGCTGCC
AGTGAACTCA TCAACCAGTT GATGACGGCG ATGATTGCGG GTGTGCGTAA TAATCCCGTT
CTGCGCCACA AGTTGTTCCA GATTGATTAC CTCACCACGC TGAGTAATCA GGCGGTGGTT
TCCCTGTTGT ACCATAAAAA GCTGGATGAT GAGTGGCGTC AGGAAGCGGA AGCCCTGCGC
GATGCACTGC GCGCGCAGAA TCTGAATGTG CATCTGATTG GTCGGGCAAC GAAAACCAAA
ATCGAGCTGG ATCAGGATTA CATCGATGAA CGTCTGCCGG TCGCAGGGAA AGAGATGATC
TACCGTCAGG TAGAAAACAG TTTTACCCAG CCGAACGCGG CGATGAATAT TCAGATGCTG
GAATGGGCGC TGGACGTAAC CAAAGGCTCA AAAGGCGATT TACTGGAGCT GTACTGCGGC
AACGGTAACT TTTCATTAGC ACTGGCGCGC AATTTTGATC GGGTATTAGC CACCGAAATC
GCTAAGCCGT CGGTTGCTGC TGCGCAATAC AACATCGCAG CTAACCATAT TGATAACGTA
CAAATTATTC GTATGGCGGC AGAAGAATTT ACTCAGGCGA TGAATGGTGT ACGCGAGTTT
AATCGCCTGC AAGGGATCGA CTTAAAGAGT TATCAGTGCG AAACCATTTT TGTCGACCCG
CCGCGCAGCG GTCTGGACAG TGAAACCGAG AAAATGGTGC AGGCGTATCC ACGTATTCTG
TATATCTCCT GCAATCCGGA AACGTTATGC AAGAATCTGG AAACATTAAG CCAGACGCAC
AAGGTCGAAC GTCTGGCTCT GTTTGATCAG TTCCCCTACA CGCACCATAT GGAGTGCGGC
GTATTACTGA CCGCGAAGTA A
 
Protein sequence
MTPEHLPTEQ YEAQLAEKVV RLQSMMAPFS DLVPEVFRSP VSHYRMRAEF RIWHDGDDLY 
HIIFDQQTKS RIRVDSFPAA SELINQLMTA MIAGVRNNPV LRHKLFQIDY LTTLSNQAVV
SLLYHKKLDD EWRQEAEALR DALRAQNLNV HLIGRATKTK IELDQDYIDE RLPVAGKEMI
YRQVENSFTQ PNAAMNIQML EWALDVTKGS KGDLLELYCG NGNFSLALAR NFDRVLATEI
AKPSVAAAQY NIAANHIDNV QIIRMAAEEF TQAMNGVREF NRLQGIDLKS YQCETIFVDP
PRSGLDSETE KMVQAYPRIL YISCNPETLC KNLETLSQTH KVERLALFDQ FPYTHHMECG
VLLTAK