Gene ECH74115_3818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3818 
Symbol 
ID6969365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3541805 
End bp3542842 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content56% 
IMG OID643387603 
Productputative methyltransferase 
Protein accessionYP_002272056 
Protein GI209397290 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0566] rRNA methylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00192845 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATG AAATGAAAGG TAAAAGCGGC AAGGTCAAAG TGATGTATGT CCGCAGTGAT 
GATGATTCTG ATAAACGTAC CCACAACCCG CGTACCGGGA AAGGGGGCGG GCGTCCAGGA
AAATCTCGTG CTGACGGTGG CCGTCGCCCC GCCCGCGATG ACAAACAGAG TCAGCCCCGT
GACCGCAAGT GGGAAGATTC GCCGTGGCGC ACGGTTTCCC GCGCGCCGGG TGATGAGACG
CCGGAAAAGG CCGATCACGG TGGCATCAGT GGTAAAAGTT TTATTGATCC GGAAGTGTTG
CGTCGTCAGC GTGCGGAAGA AACCCGCGTC TACGGCGAAA ACGCCTGTCA GGCACTGTTC
CAGAGCCGTC CGGAAGCGAT TGTTCGCGCC TGGTTTATCC AGAGTGTAAC GCCGCGTTTT
AAAGAAGCCT TGCGCTGGAT GGCAGCAAAC CGCAAAGCGT ACCATGTGGT AGATGAAGCG
GAATTGACAA AAGCGTCAGG CACGGAACAT CACGGTGGCG TTTGCTTCCT GATCAAAAAG
CGTAATGGCA CAACCGTGCA GCAGTGGGTA AGCCAGGCAG GCGCGCAGGA TTGTGTTCTG
GCACTGGAAA ACGAATCTAA CCCGCATAAC CTGGGTGGCA TGATGCGCAG CTGCGCGCAC
TTTGGCGTGA AAGGTGTTGT GGTGCAGGAT GCGGCACTGC TGGAGTCGGG GGCGGCTATC
CGTACCGCAG AAGGCGGCGC AGAGCACGTT CAGCCGATTA CTGGCGACAA CATTGTTAAC
GTGCTGGATG ATTTCCGTCA GGCGGGTTAC ACCGTAGTGA CAACTTCCAG CGAGCAGGGT
AAACCGCTGT TCAAAACCAG TCTGCCAGCG AAAATGGTAC TGGTGCTGGG TCAGGAATAT
GAAGGGTTAC CGGATGCCGC ACGCGATCCG AACGATCTGC GCGTGAAGAT TGATGGTACT
GGCAACGTTG CCGGGCTGAA TATCTCTGTC GCAACCGGCG TTCTTCTTGG CGAATGGTGG
CGTCAGAATA AAGCCTGA
 
Protein sequence
MNDEMKGKSG KVKVMYVRSD DDSDKRTHNP RTGKGGGRPG KSRADGGRRP ARDDKQSQPR 
DRKWEDSPWR TVSRAPGDET PEKADHGGIS GKSFIDPEVL RRQRAEETRV YGENACQALF
QSRPEAIVRA WFIQSVTPRF KEALRWMAAN RKAYHVVDEA ELTKASGTEH HGGVCFLIKK
RNGTTVQQWV SQAGAQDCVL ALENESNPHN LGGMMRSCAH FGVKGVVVQD AALLESGAAI
RTAEGGAEHV QPITGDNIVN VLDDFRQAGY TVVTTSSEQG KPLFKTSLPA KMVLVLGQEY
EGLPDAARDP NDLRVKIDGT GNVAGLNISV ATGVLLGEWW RQNKA