Gene ECH74115_4398 details


Gene Information

Locus tag: ECH74115_4398
Symbol:
ID: 6968385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4077453
End bp: 4078589
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 54%
IMG OID: 643388120
Product: methyltransferase family protein
Protein accession: YP_002272557
Protein GI: 209399561
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2813] 16S RNA G1207 methylase RsmC
TIGRFAM ID:
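
The gene and protein lengths listed above are consistent with the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS length includes a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(4077453, 4078589)
print(length_bp, protein_length(length_bp))  # prints: 1137 378
```

Under these assumptions the result matches the listed Gene Length (1137 bp) and Protein Length (378 aa).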


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.468721
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCACT TAGACAACGG TTTCCGTTCA CTGACACTAC AACGTTTTCC GGCGACGGAT 
GACGTTAACC CGCTACAGGC GTGGGAAGCG GCGGATGAAT ATTTGCTGCA ACAGTTGGAC
GACACAGAAA TCCGCGGCCC GGTGTTGATC CTGAATGATG CCTTTGGTGC GTTAAGCTGC
GCGCTGGCGG AACATAAGCC GTACAGCATT GGCGACTCAT ACATCAGTGA ACTGGCGACG
CGCGAGAATT TACGCCTCAA CGGGATTGAT GAGTCGAGCG TGAAGTTTCT CGACAGCACC
GCCGACTACC CGCAACAGCC GGGTGTGGTG CTGATCAAAG TGCCGAAAAC ACTGGCATTG
CTGGAACAGC AACTGCGTGC GCTGCGCAAA GTGGTCACGC CACAAACACG TATTATTGCC
GGTGCCAAAG CCCGTGACAT CCACACTTCC ACGCTGGAAC TATTCGAAAA AGTGCTCGGC
CCGACCACCA CCACGCTGGC ATGGAAGAAA GCGCGCCTGA TTAACTGCAC TTTCAATGAA
CCACCGCTGG CCGATGCGCC GCAGACCGTT AGCTGGAAGC TGGAAGGTAC TGACTGGACT
ATCCACAACC ATGCGAATGT CTTCTCCCGC ACCGGGCTGG ATATCGGCGC GCGCTTCTTT
ATGCAACATC TGCCAGAGAA TCTCGAAGGG GAGATTGTCG ATCTCGGTTG CGGTAATGGC
GTTATTGGTC TGACGCTGCT TGATAAAAAC CCGCAGGCGA AAGTGGTGTT TGTCGATGAA
TCGCCGATGG CGGTTGCTTC CAGCCGTTTG AACGTTGAAA CCAACATGCC AGAGGCGTTG
GATCGCTGCG AATTTATGAT CAACAACGCG CTCTCCGGCG TGGAGCCTTT CCGCTTTAAT
GCTGTGCTCT GCAACCCGCC GTTTCATCAG CAACATGCGC TGACCGATAA CGTCGCCTGG
GAGATGTTCC ACCACGCCCG CCGCTGCCTG AAAATCAACG GCGAGCTGTA TATCGTTGCC
AACCGTCACC TGGATTACTT CCATAAACTG AAGAAAATTT TCGGCAACTG CACCACCATC
GCCACGAATA ATAAATTTGT GGTGCTGAAA GCAGTGAAGC TGGGGCGTCG TCGGTAA
 
Protein sequence
MSHLDNGFRS LTLQRFPATD DVNPLQAWEA ADEYLLQQLD DTEIRGPVLI LNDAFGALSC 
ALAEHKPYSI GDSYISELAT RENLRLNGID ESSVKFLDST ADYPQQPGVV LIKVPKTLAL
LEQQLRALRK VVTPQTRIIA GAKARDIHTS TLELFEKVLG PTTTTLAWKK ARLINCTFNE
PPLADAPQTV SWKLEGTDWT IHNHANVFSR TGLDIGARFF MQHLPENLEG EIVDLGCGNG
VIGLTLLDKN PQAKVVFVDE SPMAVASSRL NVETNMPEAL DRCEFMINNA LSGVEPFRFN
AVLCNPPFHQ QHALTDNVAW EMFHHARRCL KINGELYIVA NRHLDYFHKL KKIFGNCTTI
ATNNKFVVLK AVKLGRRR
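
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be cleaned, translated, and checked for GC content follows. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any library.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated as TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCCACT TAGACAACGG TTTCCGTTCA"))  # prints: MSHLDNGFRS
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree; `gc_fraction` can be compared against the listed GC content in the same way.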