Gene EcE24377A_2909 details

Gene Information

Locus tag: EcE24377A_2909
Symbol:
ID: 5586861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2912713
End bp: 2914260
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 46%
IMG OID: 640926562
Product: N4/N6-methyltransferase family protein
Protein accession: YP_001463944
Protein GI: 157159201
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
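
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; neither assumption is stated in the record, and the helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2912713
END_BP = 2914260
GENE_LENGTH_BP = 1548
PROTEIN_LENGTH_AA = 515


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codons in the CDS minus one, assuming the CDS ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the listed gene length and protein length agree with the coordinates: 1548 bp is 516 codons, of which the last is the stop codon.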


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAGAA AACCTAAAGA AATCAAAACA GATCCGTTAG AAGTCATCCT GTGGAAAGCG 
GCAGACAAGC TGCGTAAAAA CATTGATGCA GCCGAGTATA AGCATGTCGT GCTGGGCCTC
ATTTTCCTTA AGTATATTTC TGATTCTTTT GAATCTCATT ATGAGTTGCT GAAGGCAGGT
GAAGGCGAGT TCGCAGGCGC TGACCCGGAA GATAAAGACG AGTACACCGC TTACAACATT
TTCTTTGTCC CTGAGCTTGC ACGCTGGAAC TATCTAATAT CTAAGGCCAA GCTACCTGAA
ATCGGTAAGC TGGTTGATGA TGCTATGGAG CTTATCGAAG CGGGTAACCC ACAGCTAAAA
GGTGTGCTGC CGAAAGTCTA CGCTCGCCAG AACCTCGACG CCACCGTGCT GGGTGAACTG
ATAGATTTGA TTGGCAACAT TGCACTGGGA GATGCCAAAG CGCGTTCTGC TGATGTATTA
GGCCACGTAT TCGAATACTT CCTTGGTGAA TTTGCACTGG CAGAAGGTAA ACAGGGCGGT
CAGTTCTATA CGCCAAAATC CATCGTAAGC CTGCTGGTTA ACATGCTGGA ACCCTATAAA
GGCCGAGTCT TTGACCCCTG CTGTGGTTCT GGTGGTATGT TCGTTCAGTC AGAAAAATTT
GTAGAAGCAC ATCAGGGAAA TATTGACGAT ATTTCGATCT ATGGGCAGGA GTCCAACCAG
ACCACCTGGC GTCTGGCAAA AATGAACCTG GCAATTCGTG GGATTAATTC TGAACACGTT
CGCTGGAATA ATGAAGGTTC ATTTCTTAAC GATGCTCACA AAGATTTGAA ATCTGATTTT
ATCATAGCTA ACCCACCGTT TAACGTTTCC GACTGGTCTG GTGAGCAGCT TCGTGGTGAT
GCCCGCTGGC AATATGGCAT TCCACCTGCT GGCAACGCTA ACTTTGCCTG GATGCAACAC
TTCCTGTATC ACCTGTCGCC AAAAGGTCAG GCTGGCGTTG TGCTGGCAAA AGGGGCTTTA
ACCTCTAAAA GTTCGGGTGA AGGTGATATT CGTGCAGCAC TGGTCAAAGA TGCCAATGTG
ATTGATTGTA TCGTTAACTT ACCCGCAAAA CTGTTCCTGA ATACCCAGAT CCCAGCGGCC
TTATGGTTTA TGCGCCGAGA TCGTGAAAAC AGCAGTCATT ATCGTGATCG CAGTAAAGAA
ATTCTGTTTA TTGATGCCCG TAATCTTGGT CATTTAATCA ACCGCCGTAG CAAAGTGCTT
TCTGACGAAG ATATCAAAAC TATTGCTGAC ACCTACCATA ACTGGCGTAA CAAAGGTGGC
GACTACGAAG ATGTGGCTGG TTTCTGTGCA TCTGTCGATA TCAATGAAGT CGCTAAACTT
GATTATGTGC TGACGCCTGG CCGTTATGTT GGCCTTGCTG ACGAAGAAGA CGATTTTGAC
TTTAAAGAAC GTTTTACGGC TCTTAAAGCG GAGTTTGAAG CACAATTGGA AGAAGAAGCG
CATCTGAATA AGTCTATCGC TGAGAGTCTG GCGAAGGTGG TTTTATGA
 
Protein sequence
MARKPKEIKT DPLEVILWKA ADKLRKNIDA AEYKHVVLGL IFLKYISDSF ESHYELLKAG 
EGEFAGADPE DKDEYTAYNI FFVPELARWN YLISKAKLPE IGKLVDDAME LIEAGNPQLK
GVLPKVYARQ NLDATVLGEL IDLIGNIALG DAKARSADVL GHVFEYFLGE FALAEGKQGG
QFYTPKSIVS LLVNMLEPYK GRVFDPCCGS GGMFVQSEKF VEAHQGNIDD ISIYGQESNQ
TTWRLAKMNL AIRGINSEHV RWNNEGSFLN DAHKDLKSDF IIANPPFNVS DWSGEQLRGD
ARWQYGIPPA GNANFAWMQH FLYHLSPKGQ AGVVLAKGAL TSKSSGEGDI RAALVKDANV
IDCIVNLPAK LFLNTQIPAA LWFMRRDREN SSHYRDRSKE ILFIDARNLG HLINRRSKVL
SDEDIKTIAD TYHNWRNKGG DYEDVAGFCA SVDINEVAKL DYVLTPGRYV GLADEEDDFD
FKERFTALKA EFEAQLEEEA HLNKSIAESL AKVVL
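
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the ten-base groups and spaces are display formatting only, and it uses the codon assignments of translation table 11, which match the standard code for amino acids. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced for illustration. Only the first and last lines of the gene sequence are embedded here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGGCAAGAA AACCTAAAGA AATCAAAACA GATCCGTTAG AAGTCATCCT GTGGAAAGCG"
last_line = "CATCTGAATA AGTCTATCGC TGAGAGTCTG GCGAAGGTGG TTTTATGA"

print(translate(first_line))  # MARKPKEIKTDPLEVILWKA
print(translate(last_line))   # HLNKSIAESLAKVVL
```

Because every full line of the gene sequence holds 60 bases, each line starts in frame, so the two fragments translate to the start and end of the listed protein sequence. Running `gc_percent` on the complete gene sequence should give a value comparable to the GC content field.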