Gene B21_03090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03090 
SymbolrsmB 
ID8113120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3295587 
End bp3296876 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content50% 
IMG OID644849273 
Producthypothetical protein 
Protein accessionYP_003000846 
Protein GI251786542 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC AACGTAATTT ACGTAGCATG GCGGCCCAGG CCGTTGAACA AGTCGTCGAG 
CAAGGGCAAT CATTAAGCAA CATTCTGCCA CCGCTCCAGC AAAAAGTTTC CGATAAAGAC
AAAGCACTTC TTCAAGAGTT GTGCTTTGGC GTACTGCGTA CGCTTTCGCA GTTAGACTGG
CTGATTAATA AGTTAATGGC CCGTCCGATG ACCGGCAAAC AGCGGACCGT GCATTACCTG
ATTATGGTTG GTTTGTATCA ACTGCTTTAT ACCCGCATTC CACCTCATGC TGCGCTGGCT
GAAACGGTTG AAGGCGCTAT CGCAATTAAG CGTCCGCAAC TTAAAGGGTT GATAAACGGT
GTATTACGCC AGTTCCAGCG TCAGCAAGAA GAGTTATTAG CCGAGTTTAA TGCCAGTGAT
GCACGTTATC TGCATCCTTC CTGGTTGCTG AAGCGTCTGC AAAAAGCGTA TCCAGAGCAG
TGGCAATCCA TCGTCGAAGC CAATAACCAG CGTCCGCCAA TGTGGCTGCG TATTAATCGT
ACGCATCATT CCCGCGACAG CTGGCTTGCA TTGCTGGATG AAGCAGGAAT GAAAGGTTTC
CCGCATGCGG ATTACCCTGA TGCTGTACGT CTGGAAACAC CTGCACCTGT TCATGCGCTA
CCTGGTTTTG AAGACGGATG GGTTACCGTT CAGGATGCAT CAGCACAAGG TTGCATGACC
TGGCTTGCGC CACAAAACGG TGAACACATT TTGGATCTTT GTGCCGCCCC CGGCGGTAAA
ACAACGCATA TCCTTGAGGT GGCACCAGAA GCGCAGGTTG TTGCGGTTGA TATCGACGAA
CAGCGCCTCT CTCGGGTTTA CGACAATTTA AAACGCCTTG GTATGAAGGC GACCGTGAAA
CAAGGTGATG GCCGTTACCC TTCTCAATGG TGTGGCGAGC AACAGTTTGA TCGCATTTTA
TTAGATGCGC CTTGTTCAGC AACCGGTGTG ATTCGTCGCC ATCCAGATAT TAAATGGTTA
CGTCGCGATC GCGATATCCC GGAACTCGCG CAATTGCAGT CTGAAATTCT CGACGCCATT
TGGCCGCATT TAAAAACCGG TGGAACTCTG GTCTATGCCA CCTGTTCGGT GTTACCGGAA
GAGAATAGCC TGCAGATTAA AGCCTTTTTG CAACGTACCG CTGATGCCGA ACTTTGCGAA
ACAGGAACAC CAGAGCAACC GGGTAAACAA AATCTACCTG GTGCCGAAGA GGGCGACGGC
TTCTTTTACG CTAAGCTAAT CAAAAAGTGA
 
Protein sequence
MKKQRNLRSM AAQAVEQVVE QGQSLSNILP PLQQKVSDKD KALLQELCFG VLRTLSQLDW 
LINKLMARPM TGKQRTVHYL IMVGLYQLLY TRIPPHAALA ETVEGAIAIK RPQLKGLING
VLRQFQRQQE ELLAEFNASD ARYLHPSWLL KRLQKAYPEQ WQSIVEANNQ RPPMWLRINR
THHSRDSWLA LLDEAGMKGF PHADYPDAVR LETPAPVHAL PGFEDGWVTV QDASAQGCMT
WLAPQNGEHI LDLCAAPGGK TTHILEVAPE AQVVAVDIDE QRLSRVYDNL KRLGMKATVK
QGDGRYPSQW CGEQQFDRIL LDAPCSATGV IRRHPDIKWL RRDRDIPELA QLQSEILDAI
WPHLKTGGTL VYATCSVLPE ENSLQIKAFL QRTADAELCE TGTPEQPGKQ NLPGAEEGDG
FFYAKLIKK