Gene Information |
Locus tag | EcSMS35_2041 |
Symbol | rluC |
ID | 6146596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2060627 |
End bp | 2061586 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616917 |
Product | 23S rRNA pseudouridylate synthase C |
Protein accession | YP_001744093 |
Protein GI | 170684091 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
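
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2060627
end_bp = 2061586


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues in the product, assuming an unspliced CDS with one stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 960 319
```

Both values agree with the Gene Length (960 bp) and Protein Length (319 aa) fields in the record.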
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000773418 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.000351853 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACAG AGACTCCATC CGTAAAAATT GTTGCTATCA CCGCTGACGA AGCGGGGCAA CGTATCGATA ACTTTTTGCG TACCCAATTG AAAGGTGTAC CAAAAAGTAT GATTTACCGT ATTTTGCGTA AAGGCGAAGT GCGGGTGAAC AAAAAACGTA TTAAGCCTGA ATATAAACTC GAAGCGGGTG ATGAGGTACG TATCCCACCG GTTCGCGTTG CTGAACGGGA AGAAGAGGCG GTTTCGCCAC ATCTGCAAAA GGTGGCAGCG CTGGCGGACG TCATCTTATA TGAAGATGAC CATATCCTGG TGCTGAATAA ACCTTCCGGT ACGGCGGTAC ATGGCGGCAG TGGTTTAAGC TTCGGCGTTA TCGAAGGTTT GCGGGCGTTG CGCCCGGAAG CGCGGTTCCT TGAACTGGTT CATCGTCTCG ACCGGGACAC TTCAGGTGTG TTGCTGGTAG CGAAAAAACG CTCGGCGTTG CGTTCTCTGC ATGAGCAATT ACGTGAAAAA GGGATGCAAA AAGATTACCT GGCGCTGGTG CGCGGTCAGT GGCAGTCGCA TGTGAAGAGC GTTCAAGCGC CGTTATTGAA AAATATTTTG CAAAGCGGCG AACGTATCGT GCGTGTCAGT CAGGAAGGCA AACCGTCGGA AACACGCTTT AAAGTGGAAG AACGCTATGC ATTTGCCACC CTGGTGCGTT GTAGCCCGGT AACAGGGCGT ACTCATCAGA TCCGTGTGCA TACACAATAT GCGGGTCATC CGATTGCCTT TGACGATCGC TACGGTGACC GTGAATTTGA CAGACACCTC ACTGAAGCAG GCACGGGATT AAATCGCCTG TTCCTGCACG CCGCAGCGTT GAAGTTTACT CATCCGGGGA CCGGTGAGGT GATGCGTATT GAAGCGCCGA TGGATGAAGG TTTGAAGCGT TGTTTGCAAA AGCTGCGTAA CGCGCGCTAA
Protein sequence | MKTETPSVKI VAITADEAGQ RIDNFLRTQL KGVPKSMIYR ILRKGEVRVN KKRIKPEYKL EAGDEVRIPP VRVAEREEEA VSPHLQKVAA LADVILYEDD HILVLNKPSG TAVHGGSGLS FGVIEGLRAL RPEARFLELV HRLDRDTSGV LLVAKKRSAL RSLHEQLREK GMQKDYLALV RGQWQSHVKS VQAPLLKNIL QSGERIVRVS QEGKPSETRF KVEERYAFAT LVRCSPVTGR THQIRVHTQY AGHPIAFDDR YGDREFDRHL TEAGTGLNRL FLHAAALKFT HPGTGEVMRI EAPMDEGLKR CLQKLRNAR
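
The gene sequence is listed in blocks of 10 nt and already reads in the coding direction (it begins with ATG, although the gene lies on the minus strand). A minimal sketch of how the protein sequence follows from it is given below. It relies on the fact that translation table 11 assigns the same amino acids to the 64 codons as the standard code; table 11 differs in the start codons it permits, which this sketch does not handle. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
# Codon table in the conventional TCAG ordering. Amino acid assignments are
# those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence in the record.
first_blocks = "ATGAAAACAG AGACTCCATC CGTAAAAATT"
print(translate(first_blocks))  # MKTETPSVKI
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.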