Gene ECH74115_1767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1767 
Symbol 
ID6971334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1694423 
End bp1695481 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content53% 
IMG OID643385717 
ProductDNA methylase 
Protein accessionYP_002270209 
Protein GI209399852 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000189808 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.00347478 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTAATA CTGTAAAAAT ATCCAGTTGT GAGTTAATCA ACGCCGACTG CCTGGAATTT 
ATGCGGTCGT TACCCGAAAA TTCTGTTGAC CTGATAGTCA CGGACCCGCC GTACTTCAAA
GTGAAACCCG AGGGCTGGGA TAACCAGTGG GCGGGTGATG AAGATTACCT GAAGTGGCTG
GACCAGTGTC TTGCGCAGTT CTGGCGGGTG CTGAAACCTG CCGGAAGTCT TTACCTGTTC
TGTGGCCATC GTCTGGCATC TGACACCGAA ATCATGATGC GTGAGCGGTT TAACGTGCTG
AACCATATCA TCTGGGCAAA GCCGTCCGGA CGCTGGAACG GGTGCAACAA GGAAAGCCTG
CGGGCGTATT TCCCCGCCAC AGAGCGCATT CTGTTCGCAG AGCATTATCA GGGGCCGTAT
CGTCCGAAAG ATGCCGGGTA TGAGGCGAAG GGTAGGACAC TGAAACAGCA TGTGATGGCC
CCGCTGATTG CTTACTTTCG TGATGCGCGC GCTGTCCTGG GGATAACGGC AAAACAGATT
GCAGATGCCA CAGGAAAGAA AAACATGGTG TCGCACTGGT TCAGTGCCGG TCAGTGGCAG
CTGCCGAACG AAAGCGATTA TCTGAAATTA CAGGCACTGT TTGCCCGGGT GGCAGAAGAG
AAGCATCAGC GGGGTGAACT GGAAAAGCCC CACCACCAGC TGGTGGATAC GTATGCCTCT
CTGAACCGAC AGTATGCGGA GCTGCAGAGT GAATATAAGC ATCTGCGGCG GTATTTCGGT
GTGACGGTGC AGGTGCCGTA CACCGATGTG TGGACGTATA AACCGGTGCA GTACTATCCA
GGGAAACATC CGTGCGAAAA ACCGGCAGAA ATGTTGCAGC AGATAATCAG CGCAAGCAGT
CGTCCGGGAG ACCTGGTTGC AGATTTCTTC ATGGGGTCGG GGTCGACAGT GAAAGCAGCG
ATGGCGCTGG GACGTCGTGC AACTGGCGTT GAACTGGAGA CTGAACGTTT TGAGCAGACG
GTGCGGGAAG TACAGGATTT AATCATTCGT AACGGATGA
 
Protein sequence
MLNTVKISSC ELINADCLEF MRSLPENSVD LIVTDPPYFK VKPEGWDNQW AGDEDYLKWL 
DQCLAQFWRV LKPAGSLYLF CGHRLASDTE IMMRERFNVL NHIIWAKPSG RWNGCNKESL
RAYFPATERI LFAEHYQGPY RPKDAGYEAK GRTLKQHVMA PLIAYFRDAR AVLGITAKQI
ADATGKKNMV SHWFSAGQWQ LPNESDYLKL QALFARVAEE KHQRGELEKP HHQLVDTYAS
LNRQYAELQS EYKHLRRYFG VTVQVPYTDV WTYKPVQYYP GKHPCEKPAE MLQQIISASS
RPGDLVADFF MGSGSTVKAA MALGRRATGV ELETERFEQT VREVQDLIIR NG