Gene EcSMS35_3462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3462 
SymboltruB 
ID6146044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3538589 
End bp3539533 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content53% 
IMG OID641618291 
ProducttRNA pseudouridine synthase B 
Protein accessionYP_001745440 
Protein GI170680914 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0130] Pseudouridine synthase 
TIGRFAM ID[TIGR00431] tRNA pseudouridine 55 synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.266438 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGTC CTCGTCGTCG CGGTCGCGAC ATTAACGGCG TTTTGTTGCT GGATAAACCG 
CAGGGGATGT CCAGCAACGA TGCGCTGCAA AAAGTGAAAC GGATTTATAA CGCCAACCGT
GCCGGGCATA CCGGTGCGCT GGACCCGCTG GCGACCGGCA TGTTGCCGAT TTGCCTCGGG
GAAGCGACGA AGTTTTCCCA GTATCTGCTG GACTCCGACA AACGCTATCG GGTCATTGCG
CGTCTTGGAC AGCGTACCGA TACGTCTGAT GCCGACGGAC AAATCGTTGA AGAACGTCCG
GTAACCTTTA GTGCAGAGCA ACTGGCGGCG GCACTGGATA CTTTCCGTGG CGATATCGAA
CAGATCCCTT CGATGTATTC AGCACTGAAA TACCAGGGCA AAAAACTGTA CGAATATGCG
CGTCAGGGCA TTGAAGTTCC GCGTGAAGCG CGCCCGATTA CCGTTTATGA ATTGCTGTTT
ATTCGCCACG AAGGCAATGA GCTGGAGCTG GAAATTCACT GCTCAAAAGG CACTTATATC
CGCACCATCA TTGATGACCT GGGTGAAAAA CTCGGTTGTG GCGCGCATGT TATTTACCTG
CGTCGTCTGG CGGTAAGTAA ATATCCGGTT GAACGGATGG TGACCCTGGA ACATCTGCGT
GAACTGGTTG AACAAGCCGA ACAGCAGGAT ATTCCAGCCG CGGAGTTACT TGATCCATTA
CTGATGCCAA TGGACAGTCC AGCTTCGGAC TACCCGGTGG TGAATCTTCC GTTAACGTCT
TCTGTTTACT TCAAAAATGG TAACCCGGTT CGTACATCCG GTGCGCCGCT GGAAGGACTG
GTTCGCGTTA CAGAAGGTGA GAACGGCAAG TTTATCGGTA TGGGCGAAAT TGACGATGAA
GGCCGCGTCG CGCCTCGTCG CCTGGTGGTT GAATACCCGG CGTAA
 
Protein sequence
MSRPRRRGRD INGVLLLDKP QGMSSNDALQ KVKRIYNANR AGHTGALDPL ATGMLPICLG 
EATKFSQYLL DSDKRYRVIA RLGQRTDTSD ADGQIVEERP VTFSAEQLAA ALDTFRGDIE
QIPSMYSALK YQGKKLYEYA RQGIEVPREA RPITVYELLF IRHEGNELEL EIHCSKGTYI
RTIIDDLGEK LGCGAHVIYL RRLAVSKYPV ERMVTLEHLR ELVEQAEQQD IPAAELLDPL
LMPMDSPASD YPVVNLPLTS SVYFKNGNPV RTSGAPLEGL VRVTEGENGK FIGMGEIDDE
GRVAPRRLVV EYPA