Gene EcSMS35_2041 details


Gene Information

Locus tag: EcSMS35_2041
Symbol: rluC
ID: 6146596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2060627
End bp: 2061586
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 51%
IMG OID: 641616917
Product: 23S rRNA pseudouridylate synthase C
Protein accession: YP_001744093
Protein GI: 170684091
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
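
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. The function names below are hypothetical and not part of any database API; the sketch assumes 1-based inclusive coordinates and a coding sequence that ends with a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2060627, 2061586)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values listed in this record, the sketch reproduces the listed Gene Length and Protein Length.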


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000773418
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.000351853
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAACAG AGACTCCATC CGTAAAAATT GTTGCTATCA CCGCTGACGA AGCGGGGCAA 
CGTATCGATA ACTTTTTGCG TACCCAATTG AAAGGTGTAC CAAAAAGTAT GATTTACCGT
ATTTTGCGTA AAGGCGAAGT GCGGGTGAAC AAAAAACGTA TTAAGCCTGA ATATAAACTC
GAAGCGGGTG ATGAGGTACG TATCCCACCG GTTCGCGTTG CTGAACGGGA AGAAGAGGCG
GTTTCGCCAC ATCTGCAAAA GGTGGCAGCG CTGGCGGACG TCATCTTATA TGAAGATGAC
CATATCCTGG TGCTGAATAA ACCTTCCGGT ACGGCGGTAC ATGGCGGCAG TGGTTTAAGC
TTCGGCGTTA TCGAAGGTTT GCGGGCGTTG CGCCCGGAAG CGCGGTTCCT TGAACTGGTT
CATCGTCTCG ACCGGGACAC TTCAGGTGTG TTGCTGGTAG CGAAAAAACG CTCGGCGTTG
CGTTCTCTGC ATGAGCAATT ACGTGAAAAA GGGATGCAAA AAGATTACCT GGCGCTGGTG
CGCGGTCAGT GGCAGTCGCA TGTGAAGAGC GTTCAAGCGC CGTTATTGAA AAATATTTTG
CAAAGCGGCG AACGTATCGT GCGTGTCAGT CAGGAAGGCA AACCGTCGGA AACACGCTTT
AAAGTGGAAG AACGCTATGC ATTTGCCACC CTGGTGCGTT GTAGCCCGGT AACAGGGCGT
ACTCATCAGA TCCGTGTGCA TACACAATAT GCGGGTCATC CGATTGCCTT TGACGATCGC
TACGGTGACC GTGAATTTGA CAGACACCTC ACTGAAGCAG GCACGGGATT AAATCGCCTG
TTCCTGCACG CCGCAGCGTT GAAGTTTACT CATCCGGGGA CCGGTGAGGT GATGCGTATT
GAAGCGCCGA TGGATGAAGG TTTGAAGCGT TGTTTGCAAA AGCTGCGTAA CGCGCGCTAA
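
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the hypothetical helper below (not part of the source database) computes the GC fraction of a sequence pasted in this layout; how the record's own GC content figure was rounded is not stated, so treat any comparison as approximate.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Works directly on the blocked layout used in this record.
example = "ATGAAAACAG AGACTCCATC\nCGTAAAAATT"
print(f"{gc_content(example):.2%}")
```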
 
Protein sequence
MKTETPSVKI VAITADEAGQ RIDNFLRTQL KGVPKSMIYR ILRKGEVRVN KKRIKPEYKL 
EAGDEVRIPP VRVAEREEEA VSPHLQKVAA LADVILYEDD HILVLNKPSG TAVHGGSGLS
FGVIEGLRAL RPEARFLELV HRLDRDTSGV LLVAKKRSAL RSLHEQLREK GMQKDYLALV
RGQWQSHVKS VQAPLLKNIL QSGERIVRVS QEGKPSETRF KVEERYAFAT LVRCSPVTGR
THQIRVHTQY AGHPIAFDDR YGDREFDRHL TEAGTGLNRL FLHAAALKFT HPGTGEVMRI
EAPMDEGLKR CLQKLRNAR