Gene SbBS512_E3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3520 
Symbol 
ID6268374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3272408 
End bp3273544 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content54% 
IMG OID641727395 
Productmethyltransferase family protein 
Protein accessionYP_001881841 
Protein GI187732632 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2813] 16S RNA G1207 methylase RsmC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACT TAGACAACGG TTTCCGTTCA CTGACACTAC AACGTTTTCC GGCGACGGAT 
GACGTTAACC CGCTACAGGC GTGGGAAGCG GCGGATGAAT ATTTGCTGCA ACAGTTGGAC
GACACAGAAA TCCGCGGCCC GGTGTTGATC CTGAATGATG CCTTTGGTGC GTTAAGCTGC
GCGCTGGCAG AACATAAGCC GTACAGCATT GGCGACTCAT ACATCAGTGA ACTGGCGACG
CGCGAGAATT TACGCCTCAA CGGGATTGAT GAGTCGAGCG TGAAGTTTCT CGACAGCACC
GCCGACTACC CGCAACAGCC GGGTGTGGTG CTGATCAAAG TGCCGAAAAC ACTGGCATTG
CTGGAACAAC AACTGCGTGC GCTGCGCAAA GTGGTCACGT CGGATACACG TATTATTGCC
GGTGCCAAAG CCCGTGACAT TCACACTTCC ACGCTGGAAC TGTTCGAAAA AGTGCTCGGC
CCGACCACCA CCACGCTGGC ATGGAAAAAA GCGCGCCTGA TTAATTGCAC GTTCAATGAA
CCGCCGCTGG CCGATGCGCC GCAGACCGTT AGCTGGAAGC TGGAAGGTAC TGACTGGACT
ATCCATAACC ATGCGAATGT CTTCTCCCGC ACCGGGCTTG ATATCGGCGC GCGCTTCTTT
ATGCAACATC TGCCAGAGAA TCTCGAAGGG GAGATTGTCG ATCTCGGTTG CGGTAATGGC
GTTATTGGTC TGACGCTGCT TGATAAAAAC CCGCAGGCGA AAGTGGTGTT TGTCGATGAA
TCGCCGATGG CGGTTGCTTC CAGCCGTTTG AACGTTGAAA CCAACATGCC AGAGGCGTTG
GATCGCAGCG AGTTTATGAT CAACAACGCG CTCTCCGGCG TGGAGCCTTT CCGCTTTAAT
GCTGTGCTCT GCAACCCGCC GTTTCACCAA CAACATGCGC TGACCGATAA CGTCGCCTGG
GAGATGTTCC ACCATGCCCG CCGCTGCCTG AAAATCAACA GCGAGCTGTA TATTGTTGCC
AACCGTCACT TGGATTACTT CCATAAACTG AAGAAGATTT TCGGCAACTG CACCACCATC
GCCACGAATA ATAAATTTGT GGTGCTGAAA GCAGTGAAGC TGGGGCGTCG TCGGTAA
 
Protein sequence
MSHLDNGFRS LTLQRFPATD DVNPLQAWEA ADEYLLQQLD DTEIRGPVLI LNDAFGALSC 
ALAEHKPYSI GDSYISELAT RENLRLNGID ESSVKFLDST ADYPQQPGVV LIKVPKTLAL
LEQQLRALRK VVTSDTRIIA GAKARDIHTS TLELFEKVLG PTTTTLAWKK ARLINCTFNE
PPLADAPQTV SWKLEGTDWT IHNHANVFSR TGLDIGARFF MQHLPENLEG EIVDLGCGNG
VIGLTLLDKN PQAKVVFVDE SPMAVASSRL NVETNMPEAL DRSEFMINNA LSGVEPFRFN
AVLCNPPFHQ QHALTDNVAW EMFHHARRCL KINSELYIVA NRHLDYFHKL KKIFGNCTTI
ATNNKFVVLK AVKLGRRR