Gene EcE24377A_0286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0286 
Symbol 
ID5586832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp307639 
End bp309348 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content54% 
IMG OID640924011 
ProductN4/N6-methyltransferase family protein 
Protein accessionYP_001461440 
Protein GI157157373 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAACG CTGAACAGCT ATTTCTGAAC GAGCTGGATA ACAAATTCTG GAAGGCCGCC 
GACAAACTGC GCGCCAATAT GGATGCCGCC AACTACAAGC ATGTGGTGCT GGGGCTAATC
TTCCTGAAGT ATGTTTCTGA TGCCTTCGAG GCGCGTCAGC AGGAGCTGAC GACCCTGTTC
CGCGATGTCG GTAATCCCGA CAACATCTAC GCCATGTCGC GCGATGATTA CGGTTCCGAC
GAAGAATACG CTCAGGCTAT CCAGGAAGAG CTGGAAGTTG AAGATTACTA CACCGAAAAG
AACATCTTCT GGGTGCCAAA AGCCGCGCGC TGGGACACGC TGAAAAACAA AGCCATGTTG
CCGACCGGCA CCGTGCTGTG GGTGGATGAA ACCACCGGCA AGGATGTGAC GCTGCGCTCT
GTGTCCTGGC TGGTGGATAA CGCGCTCGAT GAAATCGAAA AAACCAACCC GAAGCTGAAA
GGTATTCTGA ACCGTATCAG CCAGTATCAA TTGGGCAACG AAGTGTTGAC CGGGCTGATT
AATACTTTCT CTGACGCCAA CTTCAGCAAC CCGGAATATA ACGGCGAGAA GCTCAACTTA
AAGAGCAAAG ATATTCTCGG TCACGTGTAC GAATATTTCC TCGGTCAGTT CGCGCTGGCG
GAAGGTAAGC AGGGCGGCCA GTATTACACG CCAAAAAGTA TCGTCACCCT GATTGTTGAA
ATGCTGCAAC CGTATAACGG GCGCGTGTAT GACCCGGCGA TGGGTTCCGG CGGGTTCTTT
GTTTCCAGCG ACCGTTTTAT CGAAGAGCAC GCGGGCGAGA AGCAGTACAA CGCCGCCGAG
CAGAAGCGCA ATATCTCTGT TTACGGCCAG GAGTCGAACC CGACTACCTG GAAGCTGGCG
GCAATGAATA TGGCGATCCG GGGTATCGAC TTTAACTTCG GCAGCAAAAA CGCCGACACC
CTGCTGGACG ACCAGCACCC GGATCTGCGA GCTGACTTCG TGATGGCGAA CCCGCCGTTC
AACATGAAGG AGTGGTGGAA CGCCAAGCTG GAAAACGACG TGCGCTGGAA ATACGGCACA
CCGCCGCAGG GCAACGCCAA CTTTGCGTGG ATGCAGCACA TGATCCATCA CCTTGCGCCA
AAAGGTTCGA TGGCGCTGCT GCTGGCGAAC GGTTCGATGA GCTCCAACAC CAACAACGAA
GGCGAAATCC GCCGTAACCT GATCAAAGCC GATTTGGTCG AGTGCATGGT GGCGCTACCG
GGCCAGCTCT TTACCAACAC CCAAATCCCG GCCTGTATCT GGTTCCTGAC CAAAGACAAA
TCCAGCGGCA ACGGCAAAGC GCACCGCAAA GGCGAAGTGC TGTTTATCGA CGCCCGCAAG
ATTGGCTTTA TGAAAGACCG CGTGCTGCGT GACTTTACTC GTGAAGATAT CGCCAGAATT
GCCGACACCT TCCACAAATG GCAGGCAGAT AAAGAGTACG AAGACGAAGC CGGATTCTGC
TTCTCAGCAA CGCTGGAGGA TATCCAGAAA AACGACTTTG TGCTGACCCC TGGGCGCTAC
GTTGGTGCCG CCGAGCAAGC TGAAGATGAT GAACCGTTTG CCGAGAAGAT GGCGCGCCTG
ACGGCGCAGC TTAAAGGTCA GCTTGAAGAG AGCGCGAAGT TGGAAGCGCA GATTAAGGCG
AATCTGGGGG GGCTGGGTTA TGAGTTCTGA
 
Protein sequence
MNNAEQLFLN ELDNKFWKAA DKLRANMDAA NYKHVVLGLI FLKYVSDAFE ARQQELTTLF 
RDVGNPDNIY AMSRDDYGSD EEYAQAIQEE LEVEDYYTEK NIFWVPKAAR WDTLKNKAML
PTGTVLWVDE TTGKDVTLRS VSWLVDNALD EIEKTNPKLK GILNRISQYQ LGNEVLTGLI
NTFSDANFSN PEYNGEKLNL KSKDILGHVY EYFLGQFALA EGKQGGQYYT PKSIVTLIVE
MLQPYNGRVY DPAMGSGGFF VSSDRFIEEH AGEKQYNAAE QKRNISVYGQ ESNPTTWKLA
AMNMAIRGID FNFGSKNADT LLDDQHPDLR ADFVMANPPF NMKEWWNAKL ENDVRWKYGT
PPQGNANFAW MQHMIHHLAP KGSMALLLAN GSMSSNTNNE GEIRRNLIKA DLVECMVALP
GQLFTNTQIP ACIWFLTKDK SSGNGKAHRK GEVLFIDARK IGFMKDRVLR DFTREDIARI
ADTFHKWQAD KEYEDEAGFC FSATLEDIQK NDFVLTPGRY VGAAEQAEDD EPFAEKMARL
TAQLKGQLEE SAKLEAQIKA NLGGLGYEF