Gene SbBS512_E2103 details


Gene Information

Locus tag: SbBS512_E2103
Symbol: yebU
ID: 6273013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1913037
End bp: 1914482
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 52%
IMG OID: 641726140
Product: rRNA (cytosine-C(5)-)-methyltransferase RsmF
Protein accession: YP_001880634
Protein GI: 187732363
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
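
The gene and protein lengths above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 1913037
end_bp = 1914482

# Both end positions belong to the gene, hence the +1.
gene_length = end_bp - start_bp + 1

# Every three bases form one codon; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1446 481
```

Both values match the Gene Length and Protein Length fields of the record.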


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0235271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGTGG CCCAACACAC CGTTTATTTC CCGGACGCGT TTCTGACACA AATGCGCGAA 
GCTATGCCTT CGACGCTCTC ATTTGATGAT TTTCTTGCCG CCTGTCAGCG CCCGTTGCGC
CGCAGCATTC GCGTTAATAC GCTGAAAATC TCCGTTGCTG ATTTCCTGCA ATTAACCGCT
CCTTATGGCT GGACGCTTAC GCCAATTCCG TGGTGTGAAG AAGGTTTCTG GATTGAACGC
GACAATGAAG ATGCATTGCC ATTGGGTAGT ACCGCCGAGC ATTTAAGTGG CCTGTTTTAT
ATTCAGGAAG CCAGTTCAAT GTTGCCCGTC GCCGCCTTGT TTGCTGACGG TAATGCACCA
CAGCGGGTGA TGGATGTCGC TGCTGCGCCA GGCTCAAAAA CGACGCAAAT TGCCGCGCGG
ATGAATAACG AAGGGGCAAT CCTTGCCAAT GAGTTTTCCG CCAGTCGGGT AAAAGTGTTA
CATGCCAATA TCAGCCGCTG TGGCATCAGT AATGTTGCGC TCACACATTT TGATGGCCGC
GTGTTTGGTG CGGCAGTGCC AGAAATGTTT GATGCCATTT TGCTGGACGC TCCCTGCTCC
GGCGAAGGCG TGGTGCGTAA AGATCCCGAT GCGCTAAAAA ACTGGTCACC AGAAAGCAAT
CAGGAAATCG CAGCTACACA ACGGGAGCTT ATCGACAGCG CCTTTCATGC ATTACGTCCT
GGTGGTACGC TGGTTTACTC GACCTGTACC TTAAACCAGG AAGAAAACGA AGCCGTTTGC
CTGTGGCTGA AAGAGACTTA CCCCGACGCA GTAGAGTTTT TACCACTTGG CGATCTCTTC
CCTGGTGCAA ACAAAGCGCT GACCGAAGAA GGCTTTTTGC ATGTTTTCCC ACAAATTTAC
GACTGCGAAG GCTTCTTCGT TGCTCGTCTG CGTAAAACTC AGGCGATTCC CGCCTTACCC
GCCCCCAAAT ACAAAGTCGG TAATTTTCCG TTCAGCCCGG TGAAAGATCG CGAAGCTGGA
CAAATTCGCC AGGCGGCTGC AGATGTTGGC TTAAACTGGG ATGAAAACCT GCGCCTCTGG
CAGCGTGACA AAGAACTGTG GTTGTTCCCG GTGGGCATTG AAGCCCTGAT CGGTAAAGTC
CGATTTTCTC GCTTGGGGAT TAAACTTGCC GAAACGCACA ACAAAGGTTA TCGCTGGCAG
CATGAAGCAG TTATTGCCCT TGCCACCCCC GACAATGTGA ACGCTTTTGA ACTGACACCG
CAGGAAGCGG AGGAGTGGTA TCGCGGGCGC GATGTTTACC CGCAAGCCGC GCCAGTGGCG
GATGACGTGT TGGTTACTTT CCAGCATCAA CCGATTGGTT TAGCCAAACG GATTGGTTCG
CGATTGAAAA ACAGCTATCC GCGTGAACTG GTGCGCGATG GGAAACTTTT TACCGGTAAC
GCCTGA
 
Protein sequence
MLVAQHTVYF PDAFLTQMRE AMPSTLSFDD FLAACQRPLR RSIRVNTLKI SVADFLQLTA 
PYGWTLTPIP WCEEGFWIER DNEDALPLGS TAEHLSGLFY IQEASSMLPV AALFADGNAP
QRVMDVAAAP GSKTTQIAAR MNNEGAILAN EFSASRVKVL HANISRCGIS NVALTHFDGR
VFGAAVPEMF DAILLDAPCS GEGVVRKDPD ALKNWSPESN QEIAATQREL IDSAFHALRP
GGTLVYSTCT LNQEENEAVC LWLKETYPDA VEFLPLGDLF PGANKALTEE GFLHVFPQIY
DCEGFFVARL RKTQAIPALP APKYKVGNFP FSPVKDREAG QIRQAAADVG LNWDENLRLW
QRDKELWLFP VGIEALIGKV RFSRLGIKLA ETHNKGYRWQ HEAVIALATP DNVNAFELTP
QEAEEWYRGR DVYPQAAPVA DDVLVTFQHQ PIGLAKRIGS RLKNSYPREL VRDGKLFTGN
A
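
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The following is a minimal sketch in plain Python, not the tool that produced this record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first three ten-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Amino-acid string of NCBI translation table 11, with codons enumerated
# over the bases T, C, A, G (third position varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not handled; this gene starts with ATG.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
gene_prefix = "ATGCTCGTGG CCCAACACAC CGTTTATTTC"
print(translate(gene_prefix))  # MLVAQHTVYF
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.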