Gene EcE24377A_4904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4904 
Symbol 
ID5586138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4890592 
End bp4893714 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table11 
GC content36% 
IMG OID640928505 
ProductN4/N6-methyltransferase family protein 
Protein accessionYP_001465832 
Protein GI157158703 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.658462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGT ACAACGAATT AGTTAAGAAG CTAAAAGAAA TTTTTCAGAT TGATCGACCA 
GAACTGGATT TCGGTATTTA TCGTATTTTA AATGCACGTG CTGATGAAAT TAACGACTAT
CTGGATAATA AGCTGAAGGC CAAAATCCAG TCTGCGCTGG CGGATGCAGG AAATGCCAAT
AAATCAGAAT TAGAACATCA GTTGCAACTG ACAATAAAAG CGGCAACTGA TGCGGGTGTG
GACCCTGCTG ATAGCCCGAA AGTACAGGAG CTTAAAAAGC AACTGGCGGC TATGGCATCC
GGAGCAAATG AGCATGAAAA TGCAGTATTT TCCCATCTGC TGACTTTCTT TTCGCGTTAT
TATGACAACG GCGATTTTAT CAGCAAACGC CGTTATAAAG GTAATACCTA CGCCATCCCT
TATTCTGGCG AAGAAGTGAT GTTGCACTGG GCAAATAAGG ACCAGTACTA CATTAAAAGC
GGTGAAAACT TTGCTAATTA TTCATTTAAA TTAGATGATG GTCGTAAAGT TAGCTTTAAA
TTGCTCGCAG CGGACACAGC AAAAGATAAC CGTAAAGATA ATGAATTAGA TCGCTGCTTT
GTACTTATTG AGCCGCATGT TCGTACCAAA ATTGATGAAG AAGGCGATGA GTACGAACAA
GAATATAAGC CAGTAGAAGT GGTGAAAACT TCATCTGTGG TTGACGGTAA ACTCGTTGAA
ACAGAAGAGT TAGTTATTCA CTTCGAATAT AAGGCCATGA AAAAGGGCAC TAAGCAGGAT
GCTTTAGTGC AGTCCGCTAT TTCAACAATT CTGGCTGATA AAACAGTTCA ACAGCATTGG
GTTGATCTTG CTAAACGTGC GCCAACAGAA AAAAATCCGT CACGTACAGA GTTGGAGCGT
CATCTTACAA CTTATACCCG GCGTAATACA GCAGACTACT TTATTCATAA AGATCTCGGT
GGTTTTTTAA CTAACGAACT GGATTTTTAT ATTAAAAATG AAGTAATGAA TCTCGATAAT
GTGCAAAATG CAGAAGTGTT TGCCAATATT GAGAAACAAT TGCGCATGAT TCAATGTTTG
CGTACTGTTG CGCTGGAACT CATTGCATTT TTGGCACAAA TTGAGAATTT CCAGAAGAAA
CTTTGGACTA AGAAGAAGTT TGTTGTTTCA AGTAATTACT ATATTACTAT GGACAAAATA
AGTTCAGAGT TTTACAGTGA AATAATTAAT AATGAAAAAA TAGTCCAAGA ATGGATTGAA
TTAGGCATCT TAGATGAGAA GCCATCTACC TCTGATGTGG AGTTGCTCAA GTTTGCAACA
GTTAATTTAT CTCATTTAAA CGGAGATCTA AGGAAAAAAA TATTAAATAG TATCAATGAT
ATTGACGATA ATGTTGATGG CGTTTTTATT AATGGTGATA ATTATCAAGC GTTAAATCTT
TTGAAGAAAA AATATAATTC CGTAATTGAT TGTGTCCATA TAGATCCACC ATATAACACA
GATACTAGTG GTTTTTTATA TAAAAATAGC TTTAAGCATT CAAGTTGGCT GAGTATGATG
GATAGCCGTT TGCTTTTTGT TAAAAATTTA CTCAGTGAAA ATGGTGTATT TTTTTGTCAT
ATTGATGAAA ATGAATATGA GCGACTTTAT TTAATTAATA AACAGTTAGG TCTTATCGAT
GCAGGGACAA TTATTTGGGA TAAACGCAAT CCCATGAATG GTGGTAGTGG CATTGCGATT
CAACATGAAT ATACTACTTG TTTTACAAAG AACATGATTT CTATTAATAA GAAAAATGAG
AATGTATTAG AAATACTCGA ATATGCAAAA TTAATTAAGA ATAAATATCC TGTAATTAAC
AGTGAAGCAA AGAAAAAATT TTATGATTGG GTAACAAAGA ATAAAAAACT TACTGGAGGT
GAACAAGCTT ACAAATATAT CGATGAGAAA GGTCGAGTGT ACCAGAGTGT CAGCCTCCGT
GCCCCAGAGC CAAGGACTGA CAAGAAATTC TATATACCAT TAATTCATCC AGTTACCGGG
AAACCATGCG CTATGCCACC AAATGGTTTT TCTCGTACCC CAGAAACATT AAAAGATATG
ATGGATAAGG GGGATATACT ATTTGGTGTT GATGAAACAA CGCAACCTAG ACAGAAAGCA
TATTTATATG CTGACGCTAA AAAGCAAATA ACATCAATAA TTCAGGATGC TAAAAAAGGA
AAAACTCAAA CAACTCATCT TGGAATTGAT TTTCCTTATT GTCACTCAAC TTCTTTATAT
AATTATTTAA TTGGGAGTGC ATCTCATAAT AAAAATGAGA TAATATTAGA TTTTTTTGCT
GGTTCAGGAA CAAGTGGCCA CTCAGTCATA GAGTTGAATC GTAAGGATGC GGGTTCAAGG
AAATATATAC TTGTTGAACA GGGGGAGTAT GCACAAAATG TAACTTTAAG CAGGTTGAGA
AAAGTTATAT TCAGTAGCAA TTGGAAAGAT GGTAAAGCTA TGGATTCAAC GACAGGCTCA
TCTCATATAC TTAAAGTTAT GAAAATAGAA AGTTACGAAG ATGTATTAAA TAGCCTTCGA
TTTGAACAAC AAAAAGAAAT AAAAGATCTT TTTGATACAG TGAATGAATC AGTAGCAAAT
GAGTATTTAA TAAAATATAT GCTTGAAACA GAAAGCAAAG GTTCACTACT CAGTACTGAT
AATTTCAAAA AACCATTTAA CTACGAAATG GATATTGCGA CGGATTCCGC CGGTGCCACT
GAGCGTAAAA ACATCGATTT AGTTGAAACT TTCAACTATC TCATTGGTTT ACATGTGAAA
TCCATCGAAT CCAATATCGA ACGTGGCTAT GTGCGTGTTG AAGGAATGCT ACCAACAGGC
GAGCGTACCC TTATTCTGTG GCGCGATTGC GACAAGATCG GTTATGAAGA ATTGAATAAA
TATGCCAACC GATTCGATTT ATATGCAAAA GAAAACACGT TCGACGTTAT CTATATCAAT
GGCGATCATA ACTTACCTAC CGCCTTTACT GTTGATGAAG AAGACGGCGA GATTGTTCGT
AGCCTGAAAA TCCGCCAGAT CGAACCTGAA TTCTTGAACC TGATGTTTGC GGAGGAGGTT
TAA
 
Protein sequence
MSKYNELVKK LKEIFQIDRP ELDFGIYRIL NARADEINDY LDNKLKAKIQ SALADAGNAN 
KSELEHQLQL TIKAATDAGV DPADSPKVQE LKKQLAAMAS GANEHENAVF SHLLTFFSRY
YDNGDFISKR RYKGNTYAIP YSGEEVMLHW ANKDQYYIKS GENFANYSFK LDDGRKVSFK
LLAADTAKDN RKDNELDRCF VLIEPHVRTK IDEEGDEYEQ EYKPVEVVKT SSVVDGKLVE
TEELVIHFEY KAMKKGTKQD ALVQSAISTI LADKTVQQHW VDLAKRAPTE KNPSRTELER
HLTTYTRRNT ADYFIHKDLG GFLTNELDFY IKNEVMNLDN VQNAEVFANI EKQLRMIQCL
RTVALELIAF LAQIENFQKK LWTKKKFVVS SNYYITMDKI SSEFYSEIIN NEKIVQEWIE
LGILDEKPST SDVELLKFAT VNLSHLNGDL RKKILNSIND IDDNVDGVFI NGDNYQALNL
LKKKYNSVID CVHIDPPYNT DTSGFLYKNS FKHSSWLSMM DSRLLFVKNL LSENGVFFCH
IDENEYERLY LINKQLGLID AGTIIWDKRN PMNGGSGIAI QHEYTTCFTK NMISINKKNE
NVLEILEYAK LIKNKYPVIN SEAKKKFYDW VTKNKKLTGG EQAYKYIDEK GRVYQSVSLR
APEPRTDKKF YIPLIHPVTG KPCAMPPNGF SRTPETLKDM MDKGDILFGV DETTQPRQKA
YLYADAKKQI TSIIQDAKKG KTQTTHLGID FPYCHSTSLY NYLIGSASHN KNEIILDFFA
GSGTSGHSVI ELNRKDAGSR KYILVEQGEY AQNVTLSRLR KVIFSSNWKD GKAMDSTTGS
SHILKVMKIE SYEDVLNSLR FEQQKEIKDL FDTVNESVAN EYLIKYMLET ESKGSLLSTD
NFKKPFNYEM DIATDSAGAT ERKNIDLVET FNYLIGLHVK SIESNIERGY VRVEGMLPTG
ERTLILWRDC DKIGYEELNK YANRFDLYAK ENTFDVIYIN GDHNLPTAFT VDEEDGEIVR
SLKIRQIEPE FLNLMFAEEV