Gene EcSMS35_3584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3584 
SymbolrsmB 
ID6146958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3662442 
End bp3663731 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content50% 
IMG OID641618411 
Product16S rRNA methyltransferase B 
Protein accessionYP_001745551 
Protein GI170684100 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.287379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.154829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC AACGTAATTT ACGTAGCATG GCGGCCCAGG CCGTTGAACA AGTCGTCGAG 
CAAGGGCAAT CATTAAGCAA CATTCTGCCA CCGCTCCAGC AAAAAGTTTC TGATAAAGAC
AAAGCACTTC TTCAAGAGTT GTGCTTTGGC GTACTGCGTA CGCTTTCACA GTTAGACTGG
CTGATTAATA AGCTAATGGC CCGTCCGATG ACAGGCAAAC AGCGTACTGT TCATTACCTG
ATTATGGTTG GTTTGTATCA ACTGCTTTAT ACCCGCATTC CACCTCATGC CGCGCTGGCT
GAAACGGTTG AAGGCGCTAT CGCAATTAAG CGCCCGCAAC TTAAAGGGTT GATAAACGGT
GTATTACGCC AGTTCCAGCG TCAGCAAGAA GAGTTATTAG CCGAGTTTAA TGCCAGTGAT
GCACGTTATC TGCATCCTTC CTGGTTGCTG AAGCGTCTGC AAAAAGCGTA TCCAGAGCAG
TGGCAATCCA TCGTCGAAGC CAATAACCAG CGTCCGCCAA TGTGGCTGCG CGTTAATCGT
ACGCATCATT CCCGCGACAG GTGGCTTGCA TTGCTGGATG AAGCAGGAAT GAAAGGTTTC
CCGCATGCGG ATTACCCTGA TGCTGTACGT CTGGAAACAC CTGCACCTGT TCATGCGCTA
CCTGGTTTTG AAGACGGATG GGTTACCGTT CAGGATGCCT CAGCACAAGG TTGCATGACC
TGGCTTGCGC CACAAAACGG TGAACACATT TTGGATCTTT GTGCCGCCCC CGGCGGTAAA
ACAACGCATA TCCTTGAGGT GGCACCAGAA GCGCAGGTTG TTGCGGTTGA TATTGACGAA
CAGCGCCTCT CTCGCGTTTA CGACAATTTA AAACGCCTTG GTATGAAGGC AACCGTGAAA
CAAGGTGATG GCCGTTACCC TTCCCAATGG TGTGGCGAGC AACAGTTTGA TCGCATTTTA
TTAGATGCGC CTTGTTCAGC AACCGGTGTG ATTCGTCGCC ATCCGGATAT TAAATGGTTA
CGTCGCGATC GCGATATCCC GGAACTCGCG CAATTGCAGT CTGAAATTCT CGACGCCATT
TGGCCGCATT TAAAATCCGG TGGAACCCTG GTCTATGCCA CCTGTTCGGT GTTACCGGAA
GAGAATAGCC TGCAGATTAA AGCCTTTTTG CAACGTACTG CTGATGCCGA ACTTTGCGAA
ACAGGAACAC CAGAGCAACC GGGTAAACAA AATCTACCTG GTGCCGAAGA GGGCGACGGC
TTCTTTTACG CTAAGCTAAT CAAAAAGTGA
 
Protein sequence
MKKQRNLRSM AAQAVEQVVE QGQSLSNILP PLQQKVSDKD KALLQELCFG VLRTLSQLDW 
LINKLMARPM TGKQRTVHYL IMVGLYQLLY TRIPPHAALA ETVEGAIAIK RPQLKGLING
VLRQFQRQQE ELLAEFNASD ARYLHPSWLL KRLQKAYPEQ WQSIVEANNQ RPPMWLRVNR
THHSRDRWLA LLDEAGMKGF PHADYPDAVR LETPAPVHAL PGFEDGWVTV QDASAQGCMT
WLAPQNGEHI LDLCAAPGGK TTHILEVAPE AQVVAVDIDE QRLSRVYDNL KRLGMKATVK
QGDGRYPSQW CGEQQFDRIL LDAPCSATGV IRRHPDIKWL RRDRDIPELA QLQSEILDAI
WPHLKSGGTL VYATCSVLPE ENSLQIKAFL QRTADAELCE TGTPEQPGKQ NLPGAEEGDG
FFYAKLIKK