Gene EcSMS35_3376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3376 
Symbol 
ID6143975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3459537 
End bp3460673 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content54% 
IMG OID641618205 
Productmethyltransferase family protein 
Protein accessionYP_001745354 
Protein GI170684079 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2813] 16S RNA G1207 methylase RsmC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACT TAGACAACGG TTTCCGTTCA CTGACACTAC AACGTTTTCC GGCGACGGAT 
GACGTTAACC CGCTACAGGC GTGGGAAGCG GCGGATGAAT ATTTGCTGCA ACAGTTGGAC
GACACAGAAA TCCGCGGCCC GGTGTTGATC CTGAATGATG CCTTTGGTGC GTTAAGTTGT
GCGCTGGCGG AACATAAGCC GTACAGCATT GGCGACTCAT ACATCAGTGA ACTGGCCACG
CGCGAGAATT TACGCCTCAA CGGGATTGAT GAATCGAGCG TGAAGTTTCT CGACAGCACC
GCCGACTACC CGCAACAGCC GGGCGTGGTA CTGATCAAAG TACCGAAAAC ACTGGCGTTG
CTGGAACAAC AACTGCGTGC GCTGCGCAAA GTGGTCACGC CGGATACACG TATTATTGCC
GGTGCTAAAG CCCGTGACAT TCACACCTCC ACGCTGGAAC TGTTCGAAAA AGTGCTCGGT
CCGACCACCA CCACACTGGC ATGGAAGAAA GCGCGCCTGA TTAACTGCAC TTTCAACGAG
CCGCCGCTGG TTGATGCACC GCAGACCGTT AGCTGGAAGC TGGAAGGTAC TGACTGGACT
ATCCACAACC ATGCGAATGT CTTCTCCCGC ACCGGGCTGG ATATTGGCGC GCGCTTCTTT
ATGCAGCATC TGCCAGAGAA TCTCGAAGGT GAGATTGTCG ATCTCGGTTG TGGTAATGGC
GTTATTGGTC TGACGCTGCT TGATAAAAAC CCGCAGGCGA AAGTGGTGTT TGTCGATGAA
TCGCCGATGG CGGTTGCTTC CAGCCGTTTG AACGTTGAAA CCAACATGCC AGAGGCGTTG
GATCGCTGCG AGTTTATGAT TAACAACGCG CTCTCCGGCG TGGAGCCTTT CCGCTTTAAT
GCTGTGCTCT GCAACCCGCC GTTTCATCAG CAACATGCGC TGACCGATAA CGTCGCCTGG
GAGATGTTCC ACCACGCCCG CCGCTGTCTG AAAATCAACG GCGAGCTGTA TATCGTTGCC
AACCGTCACC TGGATTACTT CCACAAACTG AAGAAAATTT TCGGCAACTG CACCACCATC
GCCACGAATA ATAAATTTGT GGTGCTGAAA GCAGTGAAGC TGGGGCGTCG TCGGTAA
 
Protein sequence
MSHLDNGFRS LTLQRFPATD DVNPLQAWEA ADEYLLQQLD DTEIRGPVLI LNDAFGALSC 
ALAEHKPYSI GDSYISELAT RENLRLNGID ESSVKFLDST ADYPQQPGVV LIKVPKTLAL
LEQQLRALRK VVTPDTRIIA GAKARDIHTS TLELFEKVLG PTTTTLAWKK ARLINCTFNE
PPLVDAPQTV SWKLEGTDWT IHNHANVFSR TGLDIGARFF MQHLPENLEG EIVDLGCGNG
VIGLTLLDKN PQAKVVFVDE SPMAVASSRL NVETNMPEAL DRCEFMINNA LSGVEPFRFN
AVLCNPPFHQ QHALTDNVAW EMFHHARRCL KINGELYIVA NRHLDYFHKL KKIFGNCTTI
ATNNKFVVLK AVKLGRRR