Gene EcSMS35_2171 details


Gene Information

Locus tag: EcSMS35_2171
Symbol: rlmL
ID: 6145620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2177642
End bp: 2179750
Gene Length: 2109 bp
Protein Length: 702 aa
Translation table: 11
GC content: 55%
IMG OID: 641617047
Product: 23S rRNA m(2)G2445 methyltransferase
Protein accession: YP_001744221
Protein GI: 170683350
COG category: [L] Replication, recombination and repair; [R] General function prediction only
COG ID: [COG0116] Predicted N6-adenine-specific DNA methylase; [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:
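
The start and end coordinates, the gene length, and the protein length in this record are consistent with one another. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2177642, 2179750)
print(length_bp)                  # 2109
print(protein_length(length_bp))  # 702
```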


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000766949
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.466381
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCTC TGTTTGCCAG TACGGCCCGT GGGCTGGAAG AGCTGTTAAA AACTGAACTG 
GAAAACCTGG GGGCCGTTGA ATGCCAGGTG GTTCAGGGTG GGGTCCATTT CAAGGGCGAC
ACACGGCTTG TTTACCAGAG CCTGATGTGG AGCCGCCTGG CCTCGCGTAT TATGTTGCCG
CTGGGCGAGT GTAAGGTTTA CAGCGATTTA GACCTCTATC TCGGTGTTCA GGCGATCAAC
TGGACAGAGA TGTTTAATCC TGGCGCGACC TTCGCTGTCC ACTTCAGTGG TTTGAATGAC
ACCATACGCA ACAGTCAGTA CGGTGCGATG AAAGTGAAAG ACGCGATCGT CGATGCTTTC
ACGCGGAAAA ATCTGCCGCG TCCAAATGTT GATCGCGATG CGCCGGATAT CCGCGTTAAC
GTCTGGCTGC ATAAAGAAAC TGCCAGTATC GCCCTTGATC TCAGTGGTGA TGGTTTACAT
CTGCGTGGCT ATCGCGATCG TGCTGGTATT GCGCCGATCA AAGAAACCCT GGCAGCCGCG
ATTGTTATGC GATCCGGCTG GCAGCCAGGA ACACCGCTGC TCGATCCGAT GTGTGGTTCC
GGTACGTTGC TGATTGAAGC GGCGATGCTG GCGACCGATC GCGCACCAGG CTTGCACCGT
GGACGTTGGG GCTTTAGCGG CTGGGCACAG CATGATGAAG CCATCTGGCA GGAAGTGAAA
GCAGAAGCGC AAACTCGCGC CCGTAAAGGC CTGGCGGAGT ACAGCTCCCA TTTCTACGGT
TCGGACAGCG ACGCACGGGT GATTCAACGT GCGCGCACTA ACGCCCGTCT TGCGGGGATT
GGTGAACTGA TCACCTTTGA GGTGAAAGAT GTCGCGCAAC TGGCCAATCC GCTGCCGAAA
GGGCCGTACG GTACAGTGTT GAGCAACCCG CCATACGGTG AACGTCTGGA CAGCGAACCG
GCGCTGATTG CGCTGCATAG TCTGCTGGGC CGAATCATGA AAAACCAGTT CGGTGGCTGG
AATCTCTCTT TGTTTAGTGC CTCGCCGGAT CTGCTCAGCT GCCTGCAGCT GCGTGCAGAC
AAACAGTACA AGGCGAAAAA CGGCCCGCTG GACTGCGTAC AGAAAAACTA CCATGTTGCC
GAAAGCACAC CAGACAGCAA ACCGGCGATG GCAGCGGAGG ACTACGCCAA CCGTCTGCGT
AAGAACCTCA AAAAATTCGA GAAGTGGGCT CGTCAGGAAG GAATTGAATG TTACCGCCTG
TATGACGCCG ATCTGCCGGA ATATAACGTT GCCGTTGACC GTTATGCCGA CTGGGTGGTG
GTGCAGGAGT ATGCGCCGCC AAAAACTATT GATGCGCACA AAGCGCGTCA GCGTCTGTTC
GATATTATCG CTGCAACCAT TTCGGTACTG GGGATTGCGC CAAACAAACT GGTACTGAAA
ACCCGTGAAC GCCAGAAGGG CAAAAATCAG TACCAGAAAC TGGGCGAGAA GGGCGAGTTT
CTCGAAGTCA CCGAATATAA CGCTCACTTG TGGGTGAACC TGACGGATTA CCTCGATACC
GGTCTGTTCC TCGATCACCG TATCGCCCGT CGTATGCTCG GTCAGATGAG CAAAGGCAAA
GATTTCCTCA ACCTGTTCTC TTATACCGGC AGCGCCACCG TGCACGCGGG ATTAGGCGGT
GCACGCAGCA CCACCACCGT GGATATGTCG CGTACTTATC TGGAGTGGGC AGAACGCAAC
CTGCGTCTGA ATGGCTTAAC CGGGCGTGCG CATCGCCTGA TTCAGGCCGA TTGCCTGGCG
TGGTTGCGTG AGGCAAATGA ACAGTTCGAT CTGATCTTTA TCGATCCGCC AACCTTTTCT
AACTCAAAAC GAATGGAAGA TGCGTTTGAT GTTCAGCGCG ATCATCTGGT GCTGATGAAA
GATTTGAAAC GTCTGCTGCG TGCAGGTGGG ACGATCATGT TCTCGAACAA CAAACGCGGC
TTCCGTATGG ATCTCGACGG CCTGGCGAAA CTGGGACTGA AAGCACAAGA AATTACGCAA
AAAACGCTCT CCCAGGATTT CGCCCGTAAC CGCCAGATCC ACAACTGCTG GCTGATTACC
GCAGCCTGA
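
The gene sequence is displayed in space-separated blocks of ten bases. As a minimal sketch, the helpers below strip that layout and compute the G+C fraction of a sequence. They are hypothetical names introduced for illustration, and the example uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for block layout and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = (
    "ATGAATTCTC TGTTTGCCAG TACGGCCCGT GGGCTGGAAG AGCTGTTAAA AACTGAACTG"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Applying `gc_fraction` to the complete cleaned sequence would give the figure to compare against the GC content listed in the record.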
 
Protein sequence
MNSLFASTAR GLEELLKTEL ENLGAVECQV VQGGVHFKGD TRLVYQSLMW SRLASRIMLP 
LGECKVYSDL DLYLGVQAIN WTEMFNPGAT FAVHFSGLND TIRNSQYGAM KVKDAIVDAF
TRKNLPRPNV DRDAPDIRVN VWLHKETASI ALDLSGDGLH LRGYRDRAGI APIKETLAAA
IVMRSGWQPG TPLLDPMCGS GTLLIEAAML ATDRAPGLHR GRWGFSGWAQ HDEAIWQEVK
AEAQTRARKG LAEYSSHFYG SDSDARVIQR ARTNARLAGI GELITFEVKD VAQLANPLPK
GPYGTVLSNP PYGERLDSEP ALIALHSLLG RIMKNQFGGW NLSLFSASPD LLSCLQLRAD
KQYKAKNGPL DCVQKNYHVA ESTPDSKPAM AAEDYANRLR KNLKKFEKWA RQEGIECYRL
YDADLPEYNV AVDRYADWVV VQEYAPPKTI DAHKARQRLF DIIAATISVL GIAPNKLVLK
TRERQKGKNQ YQKLGEKGEF LEVTEYNAHL WVNLTDYLDT GLFLDHRIAR RMLGQMSKGK
DFLNLFSYTG SATVHAGLGG ARSTTTVDMS RTYLEWAERN LRLNGLTGRA HRLIQADCLA
WLREANEQFD LIFIDPPTFS NSKRMEDAFD VQRDHLVLMK DLKRLLRAGG TIMFSNNKRG
FRMDLDGLAK LGLKAQEITQ KTLSQDFARN RQIHNCWLIT AA
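
The protein sequence corresponds to a translation of the gene sequence under translation table 11. Table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, so a standard codon table is enough for a sketch. The code below is an illustrative implementation, not the method used by the database, and it assumes the input contains only A, C, G, and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest); '*' is stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGAATTCTC TGTTTGCCAG TACGGCCCGT"))  # MNSLFASTAR
```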