Gene ECH74115_1464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1464 
SymbolrluC 
ID6972195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1447632 
End bp1448591 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content52% 
IMG OID643385437 
Product23S rRNA pseudouridylate synthase C 
Protein accessionYP_002269931 
Protein GI209400210 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000220754 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00000000172116 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACAG AGACTCCATC CGTAAAAATT GTTGCTATCA CCGCCGACGA AGCGGGGCAA 
CGTATCGACA ACTTTTTGCG TACCCAATTG AAAGGCGTAC CAAAAAGTAT GATTTACCGT
ATTTTGCGTA AAGGCGAAGT GCGGGTGAAC AAAAAACGTA TTAAGCCTGA ATATAAACTC
GAAGCGGGTG ATGAGGTGCG TATTCCACCG GTTCGCGTTG CTGAGCGGGA AGAAGAGGCG
GTTTCGCCAC ATCTGCAAAA GGTGGCGGCG CTGGCGGACG TCATCTTATA TGAAGATGAT
CACATCCTGG TGCTGAATAA ACCTTCCGGT ACGGCGGTAC ATGGCGGCAG TGGTTTAAGC
TTCGGCGTTA TTGAAGGTTT GCGGGCGTTG CGCCCGGAAG CGCGGTTCCT TGAACTGGTT
CATCGTCTTG ACCGGGACAC CTCAGGTGTG TTGCTGGTAG CGAAAAAACG CTCGGCGTTG
CGTTCTCTGC ATGAGCAATT ACGTGAAAAA GGGATGCAAA AAGATTACCT GGCGCTGGTG
CGCGGTCAGT GGCAGTCGCA TGTGAAGAGC GTTCAGGCGC CGTTATTGAA AAATATTCTG
CAAAGCGGCG AACGTATCGT GCGTGTGAGT CAGGAAGGCA AACCGTCGGA AACACGCTTT
AAAGTGGAAG AACGCTATGC ATTTGCCACC CTGGTGCGTT GTAGTCCGGT AACAGGGCGC
ACTCATCAGA TCCGTGTGCA TACACAGTAT GCAGGTCATC CGATTGCCTT TGACGATCGC
TACGGTGACC GTGAATTTGA CAGACAGCTC ACTGAAGCAG GCACGGGATT AAATCGTCTG
TTCCTGCACG CCGCAGCGTT GAAGTTTACC CATCCGGGGA CCGGTGAGGT GATGCGTATC
GAAGCGCCGA TGGATGATGG TTTGAAGCGT TGTTTGCAAA AGCTGCGTAA CGCGCGCTAA
 
Protein sequence
MKTETPSVKI VAITADEAGQ RIDNFLRTQL KGVPKSMIYR ILRKGEVRVN KKRIKPEYKL 
EAGDEVRIPP VRVAEREEEA VSPHLQKVAA LADVILYEDD HILVLNKPSG TAVHGGSGLS
FGVIEGLRAL RPEARFLELV HRLDRDTSGV LLVAKKRSAL RSLHEQLREK GMQKDYLALV
RGQWQSHVKS VQAPLLKNIL QSGERIVRVS QEGKPSETRF KVEERYAFAT LVRCSPVTGR
THQIRVHTQY AGHPIAFDDR YGDREFDRQL TEAGTGLNRL FLHAAALKFT HPGTGEVMRI
EAPMDDGLKR CLQKLRNAR